error converting a vcf #217
Comments
Thanks for the bug report @mufernando! I think the problem is with your
Your PL field leads to an inner dimension of 66! 🫢 So, each variant chunk of this array is 10000 * 746 * 66 * 4 which is more than 2147483647. The immediate bug here then is that we're not giving a usable error message. More generally, PL fields are a major problem (#185 and linked discussion) but we know how they should be dealt with. In the short term, I'd suggest creating a schema and dropping the call_PL field (assuming you're not using it):
I confirmed that it works once the PL field is removed! For future reference, you can remove it from the schema using
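The command originally posted here did not survive in this copy of the thread. As a stand-in, here is a minimal Python sketch of the idea: load the schema as JSON and filter out the unwanted field. The schema layout is an assumption (a top-level `"fields"` list whose entries carry a `"name"` key), and `drop_field` is a hypothetical helper, not part of vcf2zarr, so check it against the schema file you actually generated.

```python
import json


def drop_field(schema: dict, name: str) -> dict:
    """Return a copy of `schema` without the field called `name`.

    ASSUMPTION: the schema has a top-level "fields" list of dicts,
    each with a "name" key. Adjust if your schema is laid out differently.
    """
    remaining = [field for field in schema["fields"] if field["name"] != name]
    if len(remaining) == len(schema["fields"]):
        # Nothing was removed, so the name is probably misspelled.
        raise KeyError(f"no field named {name!r} in schema")
    return {**schema, "fields": remaining}


# Toy schema for illustration only; a real one has many more keys.
example = {"fields": [{"name": "call_genotype"}, {"name": "call_PL"}]}
trimmed = drop_field(example, "call_PL")
print(json.dumps(trimmed, indent=2))
```

For a real file you would `json.load` the schema, pass it through `drop_field`, and `json.dump` the result to a new file before encoding with it.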
I just merged a fix which should raise a more helpful error message @mufernando - would you mind trying it out on your data please? The message will need some links to as-yet unwritten docs about PL fields, but hopefully it's pointing people in the right direction.
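To make the chunk-size arithmetic from the first comment concrete, here is a rough sketch of the kind of pre-flight check that could produce a clearer error. It is not the merged fix: `chunk_nbytes`, `check_chunk` and `max_leading_chunk` are hypothetical names, and the only number taken from the thread is the 2147483647-byte limit. Note that the four factors exactly as quoted multiply to 1,969,440,000 bytes, slightly under that limit, so the array that actually failed presumably had a somewhat different shape or item size; the thread does not say.

```python
import math

MAX_BUFFER_BYTES = 2147483647  # limit quoted in the thread


def chunk_nbytes(chunk_shape, itemsize):
    """Uncompressed size in bytes of one chunk."""
    return math.prod(chunk_shape) * itemsize


def check_chunk(name, chunk_shape, itemsize, limit=MAX_BUFFER_BYTES):
    """Raise a descriptive error if one chunk would exceed `limit` bytes."""
    nbytes = chunk_nbytes(chunk_shape, itemsize)
    if nbytes > limit:
        raise ValueError(
            f"Field {name}: chunk of shape {tuple(chunk_shape)} needs {nbytes} "
            f"bytes, more than the {limit}-byte limit. Consider a smaller "
            f"chunk size or dropping the field from the schema."
        )
    return nbytes


def max_leading_chunk(chunk_shape, itemsize, limit=MAX_BUFFER_BYTES):
    """Largest first-dimension chunk length that still fits under `limit`."""
    bytes_per_row = math.prod(chunk_shape[1:]) * itemsize
    return limit // bytes_per_row


# Figures as quoted in the thread: 10000 * 746 * 66 * 4.
print(chunk_nbytes((10000, 746, 66), 4))       # 1969440000
print(max_leading_chunk((10000, 746, 66), 4))  # 10904
```

With a 746 × 66 inner shape and 4-byte values, anything above 10904 in the first dimension crosses the limit, which shows how little headroom a wide PL field leaves.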
I have a 75Mb VCF that I tried converting using the vcf2zarr CLI. Following the instructions, I first created the intermediate ICF, which worked out fine. Then I tried converting to the final vcfzarr using encode.
This is the verbose output I get: https://gist.github.com/mufernando/805a39a636c71b21c1b25a8ff6417e49
Ideas on what might be going on here? I'm happy to share the VCF.