Process read_correction is applied to a subset of reads from the fasta file #73

Open
mszimmermann opened this issue Aug 9, 2022 · 0 comments

Comments

@mszimmermann

Hi, I managed to complete the NanoCLUST test run and got a single classification result in the output file. Is this expected? I am now trying to run it on my own data, and so far the result looks the same. I noticed that the read_correction process uses canu to correct reads from only a subset of the original file:

From main.nf:
Line 325: head -n$(( $count*4 )) $reads > subset.fastq
Line 326: canu -correct -p corrected_reads -nanopore-raw subset.fastq genomeSize=${params.avg_amplicon_size} stopOnLowCoverage=1 minInputCoverage=2 minReadLength=500 minOverlapLength=200
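
To illustrate what the head command on line 325 does, here is a minimal sketch. It assumes a standard FASTQ layout with four lines per read, so that taking the first count*4 lines keeps the first count reads. The file name reads.fastq, the toy reads, and the value count=2 are made up for illustration and are not taken from main.nf.

```shell
# Work in a scratch directory so nothing in the current directory is overwritten.
workdir=$(mktemp -d)

# Build a tiny four-read FASTQ file (hypothetical data, 4 lines per read).
printf '@r1\nACGT\n+\nIIII\n@r2\nACGT\n+\nIIII\n@r3\nACGT\n+\nIIII\n@r4\nACGT\n+\nIIII\n' > "$workdir/reads.fastq"

# Assumed value for illustration; in the pipeline, count is set elsewhere.
count=2

# Same pattern as line 325: keep the first count*4 lines, i.e. the first count reads.
head -n$(( count * 4 )) "$workdir/reads.fastq" > "$workdir/subset.fastq"

# Number of reads in the subset = number of lines / 4.
subset_reads=$(( $(wc -l < "$workdir/subset.fastq") / 4 ))
first_header=$(head -n1 "$workdir/subset.fastq")
echo "reads in subset: $subset_reads"
```

If that assumption holds, only the first count reads of the file ever reach canu, regardless of how many reads the input contains.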

And corrected_reads.corrected_reads.fastq contains about 50 sequences. Is it supposed to be like that, and if yes, why is only a subset of the original reads used?
