-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue with fastq extractor when basecalled with Dorado: ValueError: dictionary update sequence element #0 has length 1; 2 is required #34
Comments
Hi @ClairePt Sorry for the delayed answer. Could you please provide the head of your FASTQ file so I can reproduce this issue? Best regards, |
Hello @alihamraoui , @429a1c46-216d-4cfd-a67f-cdcf32c77ed5 st:Z:2024-06-11T13:10:52.535+00:00 RG:Z:2712a6db1cae52ac66ba3db53192fa918793254c_dna_r10.4.1_e8.2_400bps_hac@v5.0.0 DS:Z:gpu:Quadro RTX 3000 |
Hi @ClairePt, It seems that your FASTQ sequence name format is new—it looks like a SAM tag format. ToulligQC doesn't currently support this. This issue will be fixed in the upcoming version 2.7.2, which will be available soon (hopefully next week). Thanks! |
Hello,
Whenever I try to use toulligqc on files that have been basecalled with Dorado, I get the following error:
ValueError: dictionary update sequence element #0 has length 1; 2 is required
I have encountered no issues when using the exact same command on the same sequencing data, the only difference being that the basecall was performed with Guppy.
Here is the exact command I use:
srun -p COMPUTE toulligqc --report-name TEST --fastq /home/eleonore.durand/Nanopores_set2_2024_fastq_NO_BACKUP/10_CR2_dorado_hac.fastq --html-report-path TOULLIGQC/TEST/report.html
And here is the full error message I get:
`ToulligQC version 2.7.1
Traceback (most recent call last):
File "/eep/softwares/miniconda/envs/toulligqc-2.7.1/bin/toulligqc", line 10, in
sys.exit(main())
^^^^^^
File "/eep/softwares/miniconda/envs/toulligqc-2.7.1/lib/python3.12/site-packages/toulligqc/toulligqc.py", line 426, in main
extractor.init()
File "/eep/softwares/miniconda/envs/toulligqc-2.7.1/lib/python3.12/site-packages/toulligqc/fastq_extractor.py", line 60, in init
self.dataframe_1d = self._load_fastq_data()
^^^^^^^^^^^^^^^^^^^^^^^
File "/eep/softwares/miniconda/envs/toulligqc-2.7.1/lib/python3.12/site-packages/toulligqc/fastq_extractor.py", line 253, in _load_fastq_data
run_info = self.check_fastq()
^^^^^^^^^^^^^^^^^^
File "/eep/softwares/miniconda/envs/toulligqc-2.7.1/lib/python3.12/site-packages/toulligqc/fastq_extractor.py", line 360, in check_fastq
metadata = dict(x.split("=") for x in first_line.split(" ")[1:])
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ValueError: dictionary update sequence element #0 has length 1; 2 is required
srun: error: seed: task 0: Exited with exit code 1`
I have tried several fixes, including using specifically pandas==2.1.4 and numpy==1.26.4; but I always get the same error message.
What could I do to fix this?
Thank you for your help,
Best regards,
Claire
The text was updated successfully, but these errors were encountered: