Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use tabix when splitting imputation server vcfs #60

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

robkar
Copy link
Contributor

@robkar robkar commented Feb 20, 2019

As discussed in emails with Stephan. Using the new --deploy option for pre- and postprocessing imputation server files, I found the conversion from chromosomal vcfs to dosage chunks too slow in some cases.

I updated the scripts to use tabix for deployed imputation files with indices, which improved performance substantially (now my problematic chunks finish before the cluster kills them).

The logic I wrote stops with an error if using --deploy and there are no index files (I think both Sanger and Michigan provide them?), but for other uses of vcf2dos (i.e. in-house imputation), the script should proceed by reading the full file if there is no index.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant