PointNovo

The DeepNovo branch contains a PyTorch re-implementation of DeepNovo.

The PointNovo branch contains the implementation of our proposed PointNovo model. The software has been tested on Ubuntu 16.04 and 18.04.

Dependency

python >= 3.6

pytorch >= 1.0

dataclasses, biopython, pyteomics, cython

For database search, you also need to install Percolator.

data files

The ABRF DDA spectra file can be downloaded here. The PXD008844 and PXD010559 spectra for training, validation and testing, and the EThcD NIST antibody sequence data, can be found here.

The 9 species data (published with the DeepNovo paper) can be downloaded here.

Note that our implementation represents training samples in a slightly different format: peptides are stored in a csv file and spectra are stored in mgf files. We also include a script for converting between file formats (data_format_converter.py in the PointNovo branch).
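To make the csv-plus-mgf layout concrete, here is a minimal sketch that reads spectra from MGF text and pairs them with peptide annotations from a csv table. It uses only the standard library. The csv column names (`scan`, `peptide`) and the use of the `SCANS` field as the join key are assumptions made for illustration; check the files produced by data_format_converter.py for the actual columns.

```python
import csv
import io


def read_mgf(handle):
    """Yield one dict per spectrum from an MGF text stream."""
    spectrum = None
    for raw in handle:
        line = raw.strip()
        if not line:
            continue
        if line == "BEGIN IONS":
            spectrum = {"params": {}, "peaks": []}
        elif line == "END IONS":
            if spectrum is not None:
                yield spectrum
            spectrum = None
        elif spectrum is not None:
            if "=" in line and not line[0].isdigit():
                # Header line such as TITLE=..., PEPMASS=..., CHARGE=...
                key, value = line.split("=", 1)
                spectrum["params"][key.upper()] = value
            else:
                # Peak line: m/z followed by intensity
                mz, intensity = line.split()[:2]
                spectrum["peaks"].append((float(mz), float(intensity)))


# Tiny in-memory stand-ins for the real files (contents are made up).
mgf_text = """BEGIN IONS
TITLE=demo_spectrum
PEPMASS=500.25
CHARGE=2+
SCANS=1
100.1 10.0
200.2 20.5
END IONS
"""
csv_text = "scan,peptide\n1,GASGA\n"  # hypothetical column names

spectra = list(read_mgf(io.StringIO(mgf_text)))
peptide_by_scan = {row["scan"]: row["peptide"]
                   for row in csv.DictReader(io.StringIO(csv_text))}

# Attach the annotation (if any) to each spectrum via its scan id.
annotated = [(s["params"].get("SCANS"), peptide_by_scan.get(s["params"].get("SCANS")), s["peaks"])
             for s in spectra]
print(annotated)
```

For real data, replace the `io.StringIO` objects with opened files. pyteomics, already listed as a dependency, also ships an MGF reader that handles more edge cases than this sketch.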

knapsack files

Like DeepNovo, PointNovo uses the knapsack algorithm to further limit the search space. When performing de novo sequencing, the program therefore needs to either read or create a knapsack matrix based on the selected PTMs (a one-time computation). Pre-built knapsack matrix files can be found here:

You can use a symbolic link to choose which knapsack file to use. For example:

ln -s fix_C_var_NMQ_knapsack.npy knapsack.npy
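As a rough illustration of why a knapsack matrix limits the search space, the sketch below builds a small boolean table by dynamic programming over integer-rounded masses. Entry `table[i][m]` is true when mass `m` can be written as a sum of residue masses that includes residue `i`, so residues whose entry is false for the remaining mass can be discarded during decoding. This is a simplified sketch under stated assumptions, not the repository's implementation: the three-residue alphabet, the 1 Da resolution, and the function names are chosen for the example, and the real matrix layout, resolution and tolerance handling may differ.

```python
def build_knapsack(residue_masses, max_mass, resolution=1):
    """Return table[i][m]: True if integer mass m is a sum of residues that includes residue i."""
    masses = [round(mass * resolution) for mass in residue_masses]
    size = round(max_mass * resolution) + 1
    reachable = [False] * size
    reachable[0] = True  # the empty peptide has mass 0
    table = [[False] * size for _ in masses]
    for m in range(1, size):
        for i, w in enumerate(masses):
            # m is feasible with residue i if the rest (m - w) is itself feasible
            if w <= m and reachable[m - w]:
                table[i][m] = True
                reachable[m] = True
    return table


def candidate_residues(table, names, remaining_mass, resolution=1):
    """List residues that can still be part of a peptide with the given remaining mass."""
    col = round(remaining_mass * resolution)
    if col < 0 or col >= len(table[0]):
        return []
    return [name for name, row in zip(names, table) if row[col]]


# Toy alphabet: monoisotopic residue masses of glycine, alanine and serine.
names = ["G", "A", "S"]
residue_masses = [57.02146, 71.03711, 87.03203]
table = build_knapsack(residue_masses, max_mass=300)

print(candidate_residues(table, names, 128.06))  # G + A fits, S does not
print(candidate_residues(table, names, 100.0))   # no combination fits
```

Because the table depends only on the residue alphabet (including the selected PTMs) and the mass range, it can be computed once and reused, which matches the one-time computation described above.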

usage

first build cython modules

make build

train mode:

make train

On an RTX 2080 Ti GPU, it takes around 0.3 seconds to train a batch of 16 annotated spectra. By default, the trained model is saved under the ./train directory.

denovo mode:

make denovo

On an RTX 2080 Ti GPU, it takes around 0.4 seconds to process a batch of 16 annotated spectra.
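The per-batch timings quoted above can be turned into a rough run-time estimate for a whole dataset. The sketch below is back-of-the-envelope arithmetic only: the dataset size of 100,000 spectra is a made-up example, and real run times also depend on data loading, spectrum complexity and hardware.

```python
import math


def estimate_seconds(num_spectra, batch_size, seconds_per_batch):
    """Estimate wall-clock seconds, counting a final partial batch as a full one."""
    num_batches = math.ceil(num_spectra / batch_size)
    return num_batches * seconds_per_batch


# Hypothetical dataset of 100,000 spectra, batch size 16 as quoted above.
train_estimate = estimate_seconds(100_000, 16, 0.3)   # one pass over the data
denovo_estimate = estimate_seconds(100_000, 16, 0.4)

print(f"train: ~{train_estimate / 60:.0f} min per pass, "
      f"denovo: ~{denovo_estimate / 60:.0f} min")
```

With these numbers, one training pass comes to about 1875 seconds (roughly half an hour) and de novo sequencing to about 2500 seconds.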

evaluate denovo result:

make test

This script is borrowed from the original DeepNovo implementation. It will generate the metrics defined by the paper.

database search mode:

make db
