-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Longshot on Iso-seq data #106
Comments
One way to avoid excluding these reads would be to modify the bam/cram file to split the reads containing long 'N' cigar operations into multiple reads. See for example: https://gatk.broadinstitute.org/hc/en-us/articles/360036858811-SplitNCigarReads |
@vibansal Thank you so much for your reply. Sorry for my late reply, I somehow did not receive the notice. I tried with SplitNCigarReads. For some regions, even though the reads look pretty nice as a SNP, it got filtered out at the final step. (included in 4.0.final_genotypes.vcf but not in the final result) Does it have something to do with other settings? The output in 4.0.final_genotypes.vcf is
|
Sorry for the late reply. The variant is filtered since it has very high depth (9850). The "-max_cov" option can be used to increase the threshold. |
Thank you so much for the reply. I tried with (in 4.0.final_genotypes.vcf)
|
The genotype of the variant is 0/0 and the final output contains only variants with non-ref genotypes. |
Hi, thank you so much for the great tool.
I'm trying to apply longshot on long-read RNA-seq data (Iso-seq data from PacBio HiFi reads).
I ran the pipeline with default options and got ~30% reads assigned with HP tags (Is is true to assume HP:i:1 represents reads with REF SNPs and HP:i:2 with ALT SNPs?)
Do you have any recommendations to recover more reads?
I was wondering whether I should loosen
since the RNA-seq reads CIGAR skips large introns.
Thank you so much for your kind help.
The text was updated successfully, but these errors were encountered: