-
Notifications
You must be signed in to change notification settings - Fork 9
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* feat: upgrade to antismash 7 * feat: upgrade gtdbtk and gtdb to release 214 * chore: move default logs location * chore: make an alias for rules/pipelines and name/pep * fix: grab ARTS from release instead of git clone * fix: correct ARTS setup * feat: upgrade bigslice for compatibility with antiSMASH7 * test: update test for 0.7.0 * fix: enable ani_screen in gtdbtk * fix: correct gtdb release versioning * chore: set ani_screen off as default for gtdbtk * docs: add quickstart video * feat: upgrade extraction from mmseqs2 and clinker * feat: process mmseqs2 cog feature * fix: handle non standard BGC genbanks * feat: enable to switch between antismash 7 and 6 * docs: mention about the WIKI
- Loading branch information
1 parent
f3fa8af
commit 1b1a5b1
Showing
127 changed files
with
2,587 additions
and
1,841 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
8 changes: 4 additions & 4 deletions
8
...kii/tables/df_regions_antismash_6.1.1.csv → ...tobacillus/df_regions_antismash_7.0.0.csv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,5 @@ | ||
bgc_id,genome_id,region,accession,start_pos,end_pos,contig_edge,product,region_length,source,gbk_path | ||
CR954253.1.region001,GCA_000056065.1,1.1,CR954253.1,17407,39909,False,['lanthipeptide-class-iii'],22502,bgcflow,data/interim/antismash/6.1.1/GCA_000056065.1/CR954253.1.region001.gbk | ||
CR954253.1.region002,GCA_000056065.1,1.2,CR954253.1,1745672,1767868,False,['lanthipeptide-class-iv'],22196,bgcflow,data/interim/antismash/6.1.1/GCA_000056065.1/CR954253.1.region002.gbk | ||
CP000156.1.region001,GCA_000191165.1,1.1,CP000156.1,1767251,1789447,False,['lanthipeptide-class-iv'],22196,bgcflow,data/interim/antismash/6.1.1/GCA_000191165.1/CP000156.1.region001.gbk | ||
CP000412.1.region001,GCA_000014405.1,1.1,CP000412.1,17283,39785,False,['lanthipeptide-class-iii'],22502,bgcflow,data/interim/antismash/6.1.1/GCA_000014405.1/CP000412.1.region001.gbk | ||
CR954253.1.region001,GCA_000056065.1,1.1,CR954253.1,17407,39909,False,['lanthipeptide-class-iii'],22502,bgcflow,data/interim/antismash/7.0.0/GCA_000056065.1/CR954253.1.region001.gbk | ||
CR954253.1.region003,GCA_000056065.1,1.3,CR954253.1,1745672,1767868,False,['lanthipeptide-class-iv'],22196,bgcflow,data/interim/antismash/7.0.0/GCA_000056065.1/CR954253.1.region003.gbk | ||
CP000156.1.region002,GCA_000191165.1,1.2,CP000156.1,1767251,1789447,False,['lanthipeptide-class-iv'],22196,bgcflow,data/interim/antismash/7.0.0/GCA_000191165.1/CP000156.1.region002.gbk | ||
CP000412.1.region001,GCA_000014405.1,1.1,CP000412.1,17283,39785,False,['lanthipeptide-class-iii'],22502,bgcflow,data/interim/antismash/7.0.0/GCA_000014405.1/CP000412.1.region001.gbk |
3 changes: 2 additions & 1 deletion
3
.examples/lanthipeptide/project_config.yaml → ...peptide_lactobacillus/project_config.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,11 +1,12 @@ | ||
name: lanthipeptide_lactobacillus | ||
pep_version: 2.1.0 | ||
description: 'A selection of lanthipeptides from Lactobacillus delbrueckii' | ||
sample_table: df_antismash_6.1.1_bgc.csv | ||
sample_table: df_regions_antismash_7.0.0.csv | ||
|
||
rules: | ||
bigslice: TRUE | ||
bigscape: TRUE | ||
query-bigslice: TRUE | ||
clinker: TRUE | ||
interproscan: TRUE | ||
mmseqs2: TRUE |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
File renamed without changes.
2 changes: 1 addition & 1 deletion
2
....1.1/GCA_000056065.1/GCA_000056065.1.json → ....0.0/GCA_000056065.1/GCA_000056065.1.json
Large diffs are not rendered by default.
Oops, something went wrong.
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
14 changes: 0 additions & 14 deletions
14
...smash_overview_gather/data/data/interim/antismash/6.1.1/GCA_000014405.1_bgc_overview.json
This file was deleted.
Oops, something went wrong.
1 change: 0 additions & 1 deletion
1
...smash_overview_gather/data/data/interim/antismash/6.1.1/GCA_000182835.1_bgc_overview.json
This file was deleted.
Oops, something went wrong.
14 changes: 0 additions & 14 deletions
14
...smash_overview_gather/data/data/interim/antismash/6.1.1/GCA_000191165.1_bgc_overview.json
This file was deleted.
Oops, something went wrong.
5 changes: 0 additions & 5 deletions
5
.tests/unit/antismash_overview_gather/data/data/interim/bgcflow_utils/samples.csv
This file was deleted.
Oops, something went wrong.
16 changes: 0 additions & 16 deletions
16
...erim/bgcs/Lactobacillus_delbrueckii/6.1.1/GCA_000014405.1/GCA_000014405.1-change_log.json
This file was deleted.
Oops, something went wrong.
16 changes: 0 additions & 16 deletions
16
...erim/bgcs/Lactobacillus_delbrueckii/6.1.1/GCA_000191165.1/GCA_000191165.1-change_log.json
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,64 @@ | ||
# This file should contain everything to configure the workflow on a global scale. | ||
|
||
#### PROJECT INFORMATION #### | ||
# This section control your project configuration. | ||
# Each project are separated by "-". | ||
# A project can be defined as (1) a yaml object or (2) a Portable Encapsulated Project (PEP) file. | ||
# (1) To define project as a yaml object, it must contain the variable "name" and "samples". | ||
# - name : name of your project | ||
# - samples : a csv file containing a list of genome ids for analysis with multiple sources mentioned. Genome ids must be unique. | ||
# - rules: a yaml file containing project rule configurations. This will override global rule configuration. | ||
# - prokka-db (optional): list of the custom accessions to use as prokka reference database. | ||
# - gtdb-tax (optional): output summary file of GTDB-tk with "user_genome" and "classification" as the two minimum columns | ||
# (2) To define project using PEP file, only variable "name" should be given that points to the location of the PEP yaml file. | ||
# - pep: path to PEP .yaml file. See project example_pep for details. | ||
# PS: the variable pep and name is an alias | ||
|
||
projects: | ||
# Project 1 (yaml object) | ||
- name: config/lactobacillus_delbruecki/project_config.yaml | ||
|
||
bgc_projects: | ||
- pep: config/lanthipeptide/project_config.yaml | ||
|
||
#### GLOBAL RULE CONFIGURATION #### | ||
# This section configures the rules to run globally. | ||
# Use project specific rule configurations if you want to run different rules for each projects. | ||
# pipelines or rules: set value to TRUE if you want to run the analysis or FALSE if you don't | ||
pipelines: | ||
seqfu: FALSE | ||
mash: FALSE | ||
fastani: FALSE | ||
checkm: FALSE | ||
gtdbtk: FALSE | ||
prokka-gbk: FALSE | ||
antismash: TRUE | ||
query-bigslice: FALSE | ||
bigscape: FALSE | ||
bigslice: FALSE | ||
automlst-wrapper: FALSE | ||
arts: FALSE | ||
roary: FALSE | ||
eggnog: FALSE | ||
eggnog-roary: FALSE | ||
deeptfactor: FALSE | ||
deeptfactor-roary: FALSE | ||
cblaster-genome: FALSE | ||
cblaster-bgc: FALSE | ||
|
||
#### RESOURCES CONFIGURATION #### | ||
# resources : the location of the resources to run the rule. | ||
# The default location is at "resources/{resource_name}". | ||
resources_path: | ||
antismash_db: resources/antismash_db | ||
eggnog_db: resources/eggnog_db | ||
BiG-SCAPE: resources/BiG-SCAPE | ||
bigslice: resources/bigslice | ||
checkm: resources/checkm | ||
gtdbtk: resources/gtdbtk | ||
#RNAmmer: resources/RNAmmer # If specified, will override Barnapp in Prokka | ||
|
||
rule_parameters: | ||
install_gtdbtk: | ||
release: 214 | ||
release_version: 214 |
30 changes: 30 additions & 0 deletions
30
.tests/unit/antismash_summary/data/config/lactobacillus_delbruecki/project_config.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,30 @@ | ||
name: Lactobacillus_delbrueckii | ||
|
||
pep_version: 2.1.0 | ||
|
||
description: "Lactobacillus delbrueckii 27 01 2023" | ||
|
||
sample_table: samples.csv | ||
|
||
#### RULE CONFIGURATION #### | ||
# rules: set value to TRUE if you want to run the analysis or FALSE if you don't | ||
rules: | ||
seqfu: TRUE | ||
mash: TRUE | ||
fastani: TRUE | ||
checkm: FALSE | ||
gtdbtk: FALSE | ||
prokka-gbk: TRUE | ||
antismash: TRUE | ||
query-bigslice: TRUE | ||
bigscape: TRUE | ||
bigslice: TRUE | ||
automlst-wrapper: TRUE | ||
arts: TRUE | ||
roary: TRUE | ||
eggnog: TRUE | ||
eggnog-roary: TRUE | ||
deeptfactor: TRUE | ||
deeptfactor-roary: TRUE | ||
cblaster-genome: TRUE | ||
cblaster-bgc: TRUE |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
12 changes: 12 additions & 0 deletions
12
.../unit/antismash_summary/data/data/interim/antismash/7.0.0/GCA_000014405.1_bgc_counts.json
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
{ | ||
"GCA_000014405.1": { | ||
"bgcs_count": 2, | ||
"bgcs_on_contig_edge": 0, | ||
"protoclusters_count": 0, | ||
"cand_clusters_count": 0, | ||
"products": { | ||
"lanthipeptide-class-iii": 1, | ||
"RiPP-like": 1 | ||
} | ||
} | ||
} |
13 changes: 13 additions & 0 deletions
13
.../unit/antismash_summary/data/data/interim/antismash/7.0.0/GCA_000056065.1_bgc_counts.json
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
{ | ||
"GCA_000056065.1": { | ||
"bgcs_count": 3, | ||
"bgcs_on_contig_edge": 0, | ||
"protoclusters_count": 0, | ||
"cand_clusters_count": 0, | ||
"products": { | ||
"lanthipeptide-class-iii": 1, | ||
"RiPP-like": 1, | ||
"lanthipeptide-class-iv": 1 | ||
} | ||
} | ||
} |
Oops, something went wrong.