-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cram references STILL not working #50
Comments
Cram references are handled in a very confusing way, so I'll try and document this here for future reference:
This is where the problem arises: a user provided reference will always make it into the reference table and even the hash map, but because the final version of the table is populated from a hashmap, it might not be used. Ultimately, the reference table contains a reference object that htslib uses to decode crams, but entries only make it in iff the MD5 of the chromosome in the fasta matches the MD5 recorded for that chromosome in the bam header |
The new fix has RNA-SeQC preload bams when opened. It has htslib populate the reference table exactly as above, then checks to see which chromosomes (present in the bam header) ended up using the provided reference as their reference. This allows for RNA-SeQC to spit out some error messages that can help reveal this problem when it occurs. |
I was pretty sure we fixed the issue with cram references back in walaj/SeqLib#38, but apparently travis builds are still failing. Somehow, despite a reference being handed to SeqLib and SeqLib setting the reference on the HTSlib bam object we still can't read Crams unless the fasta is in the hts cache or at the location indicated in the bam header.
I unfortunately don't have time to work on this right now, so for the time being I'm adding a warning message to RNA-SeQC and disabling tests on crams
The text was updated successfully, but these errors were encountered: