Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to check the integrity of BAM and FASTQ files after conversion #890

Open
zhuchcn opened this issue Dec 14, 2023 · 4 comments
Open

How to check the integrity of BAM and FASTQ files after conversion #890

zhuchcn opened this issue Dec 14, 2023 · 4 comments

Comments

@zhuchcn
Copy link

zhuchcn commented Dec 14, 2023

Hi SRA team, I have a question of how to check the integrity of the BAM or FASTQ files. The .sra.vdbcache files were downloaded using prefetch and validated using vdb-validate. But how can I check whether the extraction of BAM or FASTQ files from .vdbcache was complete and did not fail in the middle?

@klymenko
Copy link
Contributor

Use vdb-validate to validate sra file.

@zhuchcn
Copy link
Author

zhuchcn commented Dec 15, 2023

Use vdb-validate to validate sra file.

vdb-validate validates the sra file, yes. But when I use sam-dump to extract the SAM, how can I validate the SAM? My question is how can we find out if the sam-dump stopped in the middle and yielding a truncated SAM/BAM? This can still happen even when the SRA file is valid.

@klymenko
Copy link
Contributor

Check the exit code of sam-dump or fasterq-dump.

@zhuchcn
Copy link
Author

zhuchcn commented Dec 20, 2023

It will be great if some additional information can be shown. For example, showing the total reads that sam-dump prepares to extract, then users can count the reads in the output BAM files to check if they match.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants