-
Notifications
You must be signed in to change notification settings - Fork 20
Expecting ploidyVcfFile in CanvasPartition #89
Comments
Thanks for pointing this out. It seems like we should make --ploidy-vcf a required parameter then. If you don't care about proper sex chromosome calling for a male sample, you can provide an empty vcf file (don't forget the header lines) for this parameter and both X and Y will be treated as diploid. |
Hi Eric, I am sorry if you have explained this somewhere, but do you have an example of how the --ploidy-vcf input file should look like? Thanks |
Here is a template ploidy vcf for a female sample using Grch38 :
Here is a template ploidy vcf for a male sample using GRCh38
and a male sample using hg19:
|
So is this supposed to be the ploidy of chromosomes in the Normal sample? |
Yes, the file should contain the sex chromosome ploidy of the normal sample. Canvas estimates tumor sample purity and overall ploidy (e.g. 4 for a tetraploid tumor) from both coverage and b-allele frequencies. |
Cool, just wanted to confirm that. |
Hi Eric, If I was to use the reference genome that we use for mapping other than the following are there any other constraints/processes I should be doing:
Please let me know if I am missing any steps. Also, would the results differ with the inclusion of decoy and HLA contigs in the reference? Thank you. |
What reference genome are you using? As long as the coordinates are the same you should be able to use one of the existing kmer.fa files we provide and simply change contig names if necessary. The presence of decoys and unmapped contigs may change the resulting calls slightly, but we don't know how significant an impact it is for CNVs. The steps you have listed look correct. Note that FlagUniqueKmers can take a day to run depending on the machine and will require a high memory machine. You may also need to create a fasta index as well (i.e. genome.fa.fai) using samtools. |
We are using GRCH37 with decoy and moving forward we would be using GRCH38 (including all the contigs and HLA contigs). The other question in the email was: |
We provide normalized coverage values for each bin in bigwig format. Look for the bigwig file contains floating point values, normalized to copy number. If you need to convert them to log2 for visualization you can parse the values from the bedgraph output in the VisualizationTemp subdirectory. Look for coverage.bedgraph |
Hi,
In the latest version 1.38 when I am running Somatic-WGS workflow, it is expecting a ploidyVcfFile at the 5th step 'CanvasPartition'.
So the command has:
${DOTNET}/dotnet {$CANVAS}/CanvasPartition/CanvasPartition.dll -p ""
So instead of NULL the script is taking it as an empty string and still trying to validate the VCF and erroring out.
Can you please check if that is actually an issue?
The way I am running CANVAS is:
{$DOTNET}/dotnet {$CANVAS}/Canvas.dll Somatic-WGS
--bam=$TUMOR_BAMFILE
--output=$outdir
--reference=$CANVAS_data/kmer.fa
--genome-folder=$CANVAS_data/WholeGenomeFasta
--sample-name=$SAMPLE
--filter-bed=$CANVAS_data/filter13.bed
--sample-b-allele-vcf=$NORMAL_VCF
--somatic-vcf=$TUMOR_VCF
The text was updated successfully, but these errors were encountered: