• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..19-Feb-2021-

README.mdH A D19-Feb-20214.3 KiB5939

cnv_somatic_funcotate_seg_workflow.wdlH A D19-Feb-20219.2 KiB189168

cnv_somatic_oncotator_workflow.wdlH A D19-Feb-20213 KiB8971

cnv_somatic_pair_workflow.wdlH A D19-Feb-202145.1 KiB910830

cnv_somatic_panel_workflow.wdlH A D19-Feb-202110.1 KiB237213

README.md

1## Running the Somatic CNV WDL
2
3### Which WDL should you use?
4
5- Building a panel of normals (PoN): ``cnv_somatic_panel_workflow.wdl``
6- Running a matched pair: ``cnv_somatic_pair_workflow.wdl``
7
8#### Setting up parameter json file for a run
9
10To get started, create the json template (using ``java -jar wdltool.jar inputs <workflow>``) for the workflow you wish to run and adjust parameters accordingly.
11
12*Please note that there are optional workflow-level and task-level parameters that do not appear in the template file.  These are set to reasonable values by default, but can also be adjusted if desired.*
13
14#### Required parameters in the somatic panel workflow
15
16The reference used must be the same between PoN and case samples.
17
18- ``CNVSomaticPanelWorkflow.gatk_docker`` -- GATK Docker image (e.g., ``broadinstitute/gatk:latest``).
19- ``CNVSomaticPanelWorkflow.intervals`` -- Picard or GATK-style interval list.  For WGS, this should typically only include the autosomal chromosomes.
20- ``CNVSomaticPanelWorkflow.normal_bais`` -- List of BAI files.  This list must correspond to `normal_bams`.  For example, `["Sample1.bai", "Sample2.bai"]`.
21- ``CNVSomaticPanelWorkflow.normal_bams`` -- List of BAM files.  This list must correspond to `normal_bais`.  For example, `["Sample1.bam", "Sample2.bam"]`.
22- ``CNVSomaticPanelWorkflow.pon_entity_id`` -- Name of the final PoN file.
23- ``CNVSomaticPanelWorkflow.ref_fasta_dict`` -- Path to reference dict file.
24- ``CNVSomaticPanelWorkflow.ref_fasta_fai`` -- Path to reference fasta fai file.
25- ``CNVSomaticPanelWorkflow.ref_fasta`` -- Path to reference fasta file.
26
27Additionally, there are optional workflow-level and task-level parameters that may be set by advanced users; for example:
28
29- ``CNVSomaticPanelWorkflow.do_explicit_gc_correction`` -- (optional) If true, perform explicit GC-bias correction when creating PoN and in subsequent denoising of case samples.  If false, rely on PCA-based denoising to correct for GC bias.
30- ``CNVSomaticPanelWorkflow.PreprocessIntervals.bin_length`` -- Size of bins (in bp) for coverage collection.  Typically, this is set to `0` for capture samples.  *This must be the same value used for all case samples.*
31- ``CNVSomaticPanelWorkflow.PreprocessIntervals.padding`` -- Amount of padding (in bp) to add to both sides of targets for WES coverage collection.  *This must be the same value used for all case samples.*
32
33Further explanation of other task-level parameters may be found by invoking the ``--help`` documentation available in the gatk.jar for each tool.
34
35#### Required parameters in the somatic pair workflow
36
37The reference and bins (if specified) must be the same between PoN and case samples.
38
39- ``CNVSomaticPairWorkflow.common_sites`` -- Picard or GATK-style interval list of common sites to use for collecting allelic counts.
40- ``CNVSomaticPairWorkflow.gatk_docker`` -- GATK Docker image (e.g., ``broadinstitute/gatk:latest``).
41- ``CNVSomaticPairWorkflow.intervals`` -- Picard or GATK-style interval list.  For WGS, this should typically only include the autosomal chromosomes.
42- ``CNVSomaticPairWorkflow.normal_bam`` -- (optional, but recommended) Path to normal BAM file.  Do not specify in order to run the tumor sample without a matched normal.
43- ``CNVSomaticPairWorkflow.normal_bam_idx`` -- (optional, but recommended) Path to normal BAM file index.  Do not specify in order to run the tumor sample without a matched normal.
44- ``CNVSomaticPairWorkflow.read_count_pon`` -- Path to read-count PoN created by the panel workflow.
45- ``CNVSomaticPairWorkflow.ref_fasta_dict`` -- Path to reference dict file.
46- ``CNVSomaticPairWorkflow.ref_fasta_fai`` -- Path to reference fasta fai file.
47- ``CNVSomaticPairWorkflow.ref_fasta`` -- Path to reference fasta file.
48- ``CNVSomaticPairWorkflow.tumor_bam`` -- Path to tumor BAM file.
49- ``CNVSomaticPairWorkflow.tumor_bam_idx`` -- Path to tumor BAM file index.
50
51Additionally, there are several task-level parameters that may be set by advanced users as above.
52
53To invoke Oncotator on the called tumor copy-ratio segments:
54
55- ``CNVSomaticPairWorkflow.is_run_oncotator`` -- (optional) If true, run Oncotator on the called copy-ratio segments.  This will generate both a simple TSV and a gene list.
56
57
58Further explanation of these task-level parameters may be found by invoking the ``--help`` documentation available in the gatk.jar for each tool.
59