Jbrowse indexer - track3 - current progress #66

vikasguptaebi · 2024-10-16T15:40:12Z

The following are addressed for now:

Create a subworkflow that generates the required GFF and FASTA files for JBrowse2. This module should not include the jbrowse-import specifics.
Include tests for the new subworkflow.
Merge the subworkflow into nf-modules.


mberacochea

Nice stuff, folks. I think it needs a bit of tidying up before merging:

Publishing directories is the responsibility of the pipelines, so remove the bespoke processes that do it.
Don't use TAB if the input file is a GFF; it makes the code needlessly trickier to read. For the output, use _gff to make it clear that it is a modified version of the file.
I would consider merging GFF_TRIM_FASTA and the indexing process (under a flag). It will be faster, as Nextflow will have less file copying to do (ask me if there are questions around this).

mberacochea · 2024-10-18T11:20:15Z

modules/ebi-metagenomics/jbrowse/sortgff/main.nf

This module shouldn't be under jbrowse; it is a generic GFF sorting tool.

mberacochea · 2024-10-18T11:20:40Z

modules/ebi-metagenomics/jbrowse/sortgff/main.nf

+    tuple val(meta), path(tab)
+
+    output:
+    tuple val(meta), path("${meta.id}_sorted.gff"), optional: true, emit: gff


The output shouldn't be optional. Why is it optional?

mberacochea · 2024-10-18T11:20:53Z

modules/ebi-metagenomics/jbrowse/sortgff/main.nf

+    container 'quay.io/biocontainers/coreutils:8.25--0'
+
+    input:
+    tuple val(meta), path(tab)


Suggested change

tuple val(meta), path(tab)

tuple val(meta), path(gff)

mberacochea · 2024-10-18T11:21:05Z

modules/ebi-metagenomics/jbrowse/sortgff/main.nf

+    tuple val(meta), path(tab)
+
+    output:
+    tuple val(meta), path("${meta.id}_sorted.gff"), optional: true, emit: gff


Suggested change

tuple val(meta), path("${meta.id}_sorted.gff"), optional: true, emit: gff

tuple val(meta), path("${meta.id}_sorted.gff"), emit: sorted_gff

mberacochea · 2024-10-18T11:22:34Z

modules/ebi-metagenomics/jbrowse/sortgff/main.nf

+    label 'process_single'
+
+    conda "${moduleDir}/environment.yml"
+    container 'quay.io/biocontainers/coreutils:8.25--0'


This needs to consider the singularity image too, like this example:

conda "bioconda::blast=2.14.1"
container "${ workflow.containerEngine == 'singularity' && !task.ext.singularity_pull_docker_container ? 'https://depot.galaxyproject.org/singularity/blast:2.14.1--pl5321h6f7f691_0': 'biocontainers/blast:2.14.1--pl5321h6f7f691_0' }"

Was this module created with the nf-core tools?
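
For illustration, the comments on this module could be applied roughly as follows. This is only a sketch under several assumptions: the process name SORTGFF is hypothetical, the Singularity URL is guessed by mirroring the biocontainers tag and has not been verified, and the sort command is a stand-in because the module's actual script is not shown in this review.

```nextflow
process SORTGFF {
    tag "${meta.id}"
    label 'process_single'

    conda "${moduleDir}/environment.yml"
    // Assumption: this Singularity image URL mirrors the Docker tag; confirm it exists before relying on it
    container "${ workflow.containerEngine == 'singularity' && !task.ext.singularity_pull_docker_container ?
        'https://depot.galaxyproject.org/singularity/coreutils:8.25--0' :
        'quay.io/biocontainers/coreutils:8.25--0' }"

    input:
    // Input named gff rather than tab, since the file is a GFF
    tuple val(meta), path(gff)

    output:
    // No "optional: true": the process should fail if the sorted file is missing
    tuple val(meta), path("${meta.id}_sorted.gff"), emit: sorted_gff

    script:
    // Assumption: sort by sequence ID (column 1), then numerically by start (column 4);
    // handling of '#' header lines is left out of this sketch
    """
    sort -t \$'\\t' -k1,1 -k4,4n ${gff} > ${meta.id}_sorted.gff
    """
}
```

Scaffolding the module with the nf-core tools would generate the conda and container lines in this shape.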

mberacochea · 2024-10-18T11:29:30Z

subworkflows/ebi-metagenomics/geneviewer_indexer/index_fasta/index_fasta.nf

+}
+
+// PUBLISH_OUTPUT_FILES process to save the output files
+process PUBLISH_OUTPUT_FILES {


I would remove this process

mberacochea · 2024-10-18T11:30:14Z

subworkflows/ebi-metagenomics/geneviewer_indexer/index_fasta/meta.yml

@@ -0,0 +1,36 @@
+# yaml-language-server: $schema=https://raw.githubusercontent.com/nf-core/modules/master/subworkflows/yaml-schema.json
+name: "index_fasta"
+description: Generate fasta indices


Suggested change

description: Generate fasta indices

description: Generate fasta indices using BGZIP and FAIDX

mberacochea · 2024-10-18T11:30:48Z

subworkflows/ebi-metagenomics/geneviewer_indexer/index_gff/index_gff.nf

+    versions    = ch_versions                                           // Channel: [ versions.yml ]
+
+    // Call the process to publish files
+    PUBLISH_OUTPUT_FILES(gff_gz, tbi_files, output_dir)


The same as for the index_fasta one: this call should not be here.
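
A minimal sketch of how a consuming pipeline could publish these outputs itself, so the subworkflow needs no PUBLISH_OUTPUT_FILES process. The file location (conf/modules.config), the params.outdir parameter, the output subdirectory and the selector pattern are all assumptions for illustration, not something taken from this PR.

```nextflow
// conf/modules.config in the pipeline that imports the subworkflow (hypothetical location)
process {
    // Assumption: matches every process run inside the INDEX_GFF subworkflow
    withName: '.*INDEX_GFF:.*' {
        publishDir = [
            // Assumption: params.outdir is defined by the pipeline
            path: { "${params.outdir}/jbrowse/gff" },
            mode: 'copy',
            // Skip versions.yml so only the GFF and index files are published
            saveAs: { filename -> filename.equals('versions.yml') ? null : filename }
        ]
    }
}
```

With this in place the subworkflow only emits its channels (gff_gz, tbi_files, versions), and each pipeline decides where the files end up.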

mberacochea · 2024-10-18T11:31:35Z

subworkflows/ebi-metagenomics/geneviewer_indexer/index_gff/meta.yml

@@ -0,0 +1,37 @@
+# yaml-language-server: $schema=https://raw.githubusercontent.com/nf-core/modules/master/subworkflows/yaml-schema.json
+name: "index_gff"
+description: Generate gff indices


Suggested change

description: Generate gff indices

description: Create an indexed GFF without the FASTA sequence

mberacochea · 2024-10-18T11:32:05Z

subworkflows/ebi-metagenomics/geneviewer_indexer/main.nf

What is this file for?

tgurbich and others added 14 commits October 16, 2024 10:18

Initial commit

b60c973

addede tabix and samtools faidx core modules

12bee79

Mofified nf-core module faidx

dd7d4d1

Added fasta index subworkflow

3c9c88e

minor fix

f37228b

hardcoded output path working. now wil work to pass it dynamically as…

c9ac5d9

… input

updated for output path input

81fe517

refactored

ebc536a

Returned emits

b719624

Added a test for index_fasta subworkflow

8e738b8

add gff indexing modules and workflow.

9617a31

emit added

b76667d

tests aded for gff

54d03d4

Added a snap for index_gff

de88417

vikasguptaebi requested review from mberacochea, SantiagoSanchezF and tgurbich October 16, 2024 15:40

Merge branch 'main' into jbrowse_indexer

cea8508

mberacochea requested changes Oct 18, 2024
