ebi-gene-expression-group/scpred-eval-workflow

scpred-workflow

Nextflow pipeline for the scpred-cli package. The workflow can be run in two modes: training and evaluating a model, or predicting labels on new data.

Evaluating the model

This mode trains the prediction model and evaluates its performance. To select it, set the method parameter to "evaluation" in the nextflow.config file.

The input is a training dataset in 10X format.

The dataset is read into a SingleCellExperiment object, which is then split into training and test subsets. Eigenvalue decomposition and feature selection are performed on the training subset to train the model. Once the model is trained, cell labels are predicted on the test subset.
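The split step can be pictured with a short sketch. This is not the workflow's own code, which operates on a SingleCellExperiment object; the function name `split_cells`, the 70/30 ratio, and the fixed seed are all assumptions made for illustration.

```python
import random


def split_cells(cell_ids, train_fraction=0.7, seed=0):
    """Randomly partition cell IDs into disjoint training and test subsets.

    train_fraction and seed are illustrative defaults, not values taken
    from the workflow.
    """
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    shuffled = list(cell_ids)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_train = int(round(len(shuffled) * train_fraction))
    return shuffled[:n_train], shuffled[n_train:]


# Hypothetical cell barcodes standing in for a real dataset.
cells = [f"cell_{i}" for i in range(10)]
train_ids, test_ids = split_cells(cells)
```

Every cell ends up in exactly one subset, so the model is evaluated only on cells it did not see during training.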

The output consists of the predicted cell labels, the prediction probabilities, and a confusion matrix comparing predicted and real annotations.
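As a rough illustration of what the confusion matrix contains, the sketch below tabulates real against predicted labels in plain Python. The function name `confusion_matrix`, the cell type labels, and the layout (rows for real labels, columns for predicted labels) are assumptions; the file the workflow actually writes may be organised differently.

```python
from collections import Counter


def confusion_matrix(true_labels, predicted_labels):
    """Count how often each real label was assigned each predicted label.

    Rows correspond to real labels, columns to predicted labels.
    """
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    counts = Counter(zip(true_labels, predicted_labels))
    row_labels = sorted(set(true_labels))
    # Columns also cover labels that only ever appear as predictions.
    col_labels = sorted(set(true_labels) | set(predicted_labels))
    matrix = [[counts[(t, p)] for p in col_labels] for t in row_labels]
    return row_labels, col_labels, matrix


# Hypothetical annotations for six test cells.
true_labels = ["B", "B", "T", "T", "T", "NK"]
predicted_labels = ["B", "T", "T", "T", "NK", "NK"]

rows, cols, matrix = confusion_matrix(true_labels, predicted_labels)
# Fraction of cells whose predicted label matches the real one.
accuracy = sum(t == p for t, p in zip(true_labels, predicted_labels)) / len(true_labels)
```

Diagonal entries count correctly labelled cells; off-diagonal entries show which cell types are confused with one another.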

Predicting cell labels

This mode predicts cell labels for query data. To select it, set the method parameter to "prediction" in the nextflow.config file.

The input is a query dataset in 10X format, provided along with the training dataset. The model is trained on the entire training dataset (following the same eigenvalue decomposition and feature selection as in the evaluation mode) and then applied to the query dataset. The output is a table with the predicted cell labels of the query dataset.
