You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
pyani v0.2.12, processing over 2,000 genome sequences reached out segmentation fault
Description:
I have dataset containing over 5,000 sequences. When Using pyani for pairwise ANI analysis, a segmentation fault occurs if the input dataset contains more than 2,000 sequences. however, it works well if I subset the dataset to contain fewer than 1,900 sequences.
Can you confirm that you are using multiprocessing to distribute jobs? And can you also provide the relevant section of the log file or error output, so we can see where in the analysis the failure happens?
Summary:
pyani v0.2.12, processing over 2,000 genome sequences reached out segmentation fault
Description:
I have dataset containing over 5,000 sequences. When Using pyani for pairwise ANI analysis, a segmentation fault occurs if the input dataset contains more than 2,000 sequences. however, it works well if I subset the dataset to contain fewer than 1,900 sequences.
Reproducible Steps:
average_nucleotide_identity.py
-i input_dir
-o output_dir_2
-m ANIm
-g
--gformat pdf
--write_excel
--worker 10
Current Output:
pop out segmentation fault.
I guess it may be related to a deep or infinite recursion, where the stack space is exhausted.
Expected Output:
I wish it would work well with the entire dataset, but not just the subset dataset (>2,000 sequences).
pyani Version:
pyani v0.2.12
installed dependencies
If you are running a version of
pyani
v0.3 or later, then please run the commandpyani listdeps
at the command line, and enter the output below.Python Version:
Python v3.12.2
Operating System:
Debian
The text was updated successfully, but these errors were encountered: