Somatic error correction for low allele fractions #4868

davidbenjamin · 2018-06-08T17:03:09Z

Read error correction tends to follow the basic strategy of 1) collect kmer counts 2) replace rare kmers with their closest non-rare match. For germline calling where there is a huge gap between error rates and diploid het allele fractions this is sufficient. Mutect, however, must contend with cases where counts alone do not discriminate perfectly between errors and real mutations.

Without committing to an approach, it seems like phasing might help. That is, we could construct haplotypes of rare kmers and error correct those. This should work because sequencing errors are unphased and real variants are. There are phased artifacts, of course, but we handle those in downstream filtering.

davidbenjamin added QuixoticDream Mutect labels Jun 8, 2018

davidbenjamin added this to the Mutect 3 milestone Jun 8, 2018

davidbenjamin self-assigned this Jun 8, 2018

davidbenjamin mentioned this issue Jun 24, 2018

HaplotypeCaller makes different variant calls depending on input padding #3697

Open

davidbenjamin changed the title ~~Somatic kmer correction for low allele fractions~~ Somatic error correction for low allele fractions Feb 27, 2020

davidbenjamin mentioned this issue Feb 27, 2020

Pileup-based read error corrector #6470

Merged

davidbenjamin closed this as completed in #6470 Mar 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Somatic error correction for low allele fractions #4868

Somatic error correction for low allele fractions #4868

davidbenjamin commented Jun 8, 2018 •

edited

Loading

Somatic error correction for low allele fractions #4868

Somatic error correction for low allele fractions #4868

Comments

davidbenjamin commented Jun 8, 2018 • edited Loading

davidbenjamin commented Jun 8, 2018 •

edited

Loading