Releases: zellerlab/GECCO
Releases · zellerlab/GECCO
0.4.2
0.4.1
0.4.0
Added
gecco.model.ProductType
enum to model the biosynthetic class of a BGC.
Removed
pandas
interaction from internal data model.ClusterCRF
code specific to cross-validation.
Changed
pandas
,fisher
andstatsmodels
dependencies are now optional.gecco train
command expects a cluster table in addition to the feature
table to know the types of the input BGCs.
0.3.0
Changed
- Replaced Nearest-Neighbours classifier with Random Forest to perform type
prediction for candidate BGCs. gecco.knn
module was renamed to implementation-agnostic namegecco.types
.
Fixed
- Extraction of domain composition taking a long time in
gecco train
command.
Removed
--metric
argument to thegecco run
CLI command.
0.2.2
0.2.1
0.2.0
Fixed
pandas
warning about unsorted columns ingecco run
.
Removed
Gene.probability
property, replaced byGene.maximum_probability
and
Gene.average_probability
properties to be explicit.
Changed
- Internal model now uses
Pfam
andTigrfam
with the top 35% features
selected with Fisher's Exact Test. ClusterRefiner
now removes genes onCluster
edges if they do not
contain any domain annotation.
0.1.1
Added
ClusterCRF.predict_probabilities
to annotate a list ofGene
.
Changed
- BGC probability is now stored at the
Domain
level instead of at theGene
level, independently of the feature extraction level used by the CRF. ClusterKNN
will use the model path provided togecco run
if any.
Docs
- Added this changelog file to document changes in the code.
- Added documentation to
gecco
submodules missing some. - Included the
CHANGELOG.md
file to the generated docs.