-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bedtools getfasta error #89
Comments
Hi, I encountered the same issue using version 1.7.0 with mapped PacBio ccs reads. I was able to run a previous version of NanoVar (1.3.8) on the same data without any problem. I use the following command, which throws the 'feature beyond the length' message at the VCF generation step: The last files outputted are Installation was done with Thank you for any guidance or advice. |
Hi @C-YONG @LaurieLecomte, thanks for reporting this. I'll need some time to check on this and will revert in the next few days. Thank you for your patience. |
Hi @C-YONG and @LaurieLecomte, please try the v1.7.1-dev branch and let me know if it fixes the issue. You can clone the branch by: |
Hi @cytham, thank you very much for the quick response and the update. Unfortunately, I got the exact same error and outputs using the v1.7.1-dev version. I initially suspected that the bug might come from Inter-Ins(2) and InterTx, but after investigating a bit on my end, it looks like some calls of Inv(2) and Intra-Ins(2) also give the same error. I made a custom version of the
These start positions are also larger than the corresponding chromosome length, hence the error encountered. I got the same result with Hope this helps. |
Hi @LaurieLecomte, thanks for looking into this. Indeed, Intra-Ins and Inv could be causing the problem, my bad for missing that out. I have removed them in the latest v1.7.1-dev branch. Could you please try again? The fact that start positions are greater than end positions in some cases shouldn't affect the BED file creation. So I am thinking that it could be another bug somewhere for Intra-Ins, Inv, etc. But thanks a lot for looking into this, you have been of great help. Also, could you let me know the chromosome lengths of those you showed? (e.g. CM055703.1, CM055704.1, CM055705.1, CM055707.1, CM055708.1, CM055711.1) |
Hi @cytham, thanks for the update. I tried the latest branch and everything ran without any issue, yielding complete VCFs and no errors. Here are the chromosome lengths for the few problematic sites I shared in my previous comment (from the ASM2944872v1 assembly):
|
Thanks @LaurieLecomte. I'm trying to figure out the root cause for this. Is it possible to share the debug files with me? Specifically, subdata.tsv, detect.tsv, and parse1.tsv? Thanks |
Hello, unfortunately, I encountered a "getfasta error" when using both versions 1.7.1 and 1.7.0 of nanovar with HG002 HiFi data. Do you have any solutions for this issue?
|
hi @Flooooooooooooower, can you provide some logs please? |
Hello, thank you for your prompt response. Below is my log file.
|
@Flooooooooooooower I see your logs are from v1.7.0, can I please confirm that you get the same error for v1.7.1? thanks |
Hi,Below is the log file of 1.7.1.
Error message was: |
I also got the same error while using v1.7.0. @cytham Is this issue resolved? |
Hi @jiadong324, do you still get the error with the v1.7.1-dev branch? |
I just finished the test with v1.7.1-dev. No more errors! When will you release the v1.7.1. |
NanoVar v1.8.1 is now available from PyPI and Conda, thanks for your understanding |
Hello, software developer. I have three ONT datasets in total. The first two ran normally, but the third one encountered the following error. What could be the reason for this?
[24/07/2024 15:27:11] - NanoVar started
[24/07/2024 15:27:11] - Checking integrity of input files - Pass
[24/07/2024 15:27:12] - Analyzing read alignments and detecting SVs - Done
[24/07/2024 15:52:39] - Clustering SV breakends - Done
[24/07/2024 16:08:29] - Correcting DUP and detecting TE - Done
[24/07/2024 16:13:00] - Generating VCF files and report - Traceback (most recent call last):
File "/home/chiyong/miniconda3/envs/nanovar/bin/nanovar", line 635, in
main()
File "/home/chiyong/miniconda3/envs/nanovar/bin/nanovar", line 486, in main
run.vcf_report() #
^^^^^^^^^^^^^^^^
File "/home/chiyong/miniconda3/envs/nanovar/lib/python3.11/site-packages/nanovar/nv_characterize.py", line 205, in vcf_report
alt_seq = get_alt_seq(self.dir, self.out_nn, self.refpath)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/chiyong/miniconda3/envs/nanovar/lib/python3.11/site-packages/nanovar/nv_alt_seq.py", line 46, in get_alt_seq
fasta = bed.sequence(fi=ref_path, nameOnly=True)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/chiyong/miniconda3/envs/nanovar/lib/python3.11/site-packages/pybedtools/bedtool.py", line 907, in decorated
result = method(self, *args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/chiyong/miniconda3/envs/nanovar/lib/python3.11/site-packages/pybedtools/bedtool.py", line 388, in wrapped
stream = call_bedtools(
^^^^^^^^^^^^^^
File "/home/chiyong/miniconda3/envs/nanovar/lib/python3.11/site-packages/pybedtools/helpers.py", line 456, in call_bedtools
raise BEDToolsError(subprocess.list2cmdline(cmds), stderr)
pybedtools.helpers.BEDToolsError:
Command was:
Error message was:
Feature (LR584250.1:14620732-14620733) beyond the length of LR584250.1 size (12913240 bp). Skipping.
Writing to /tmp/bcftools.VhKjGh
The text was updated successfully, but these errors were encountered: