ECloudGen

Comming Soon!

ECloudGen

This figure represent: Electron Clouds; Protein-Ligand Interactions; Latent Diffusion Process

Environment

Install via conda yaml file (cuda 11.3)

conda env create -f ecloudgen.yml
conda activate ecloudgen

Install manually

This environment has been successfully tested on CUDA==11.3

conda create -n ecloudgen rdkit openbabel moleculekit scikit-learn scipy jupyter python-lmdb pytorch cudatoolkit=11.3 omegaconf einops accelerate biopython h5py wandb xtb ignite gpytorch altair python=3.9 -c conda-forge

Data

Protein-ligand pair dataset preparation.

You can download the raw data as provided in ResGen. You can also download the processed protein-ligand pair from the this link.

Note: index.pkl, split_by_name.pt. are automatically downloaded with the SurfGen code. index.pkl saves the information of each protein-ligand pair, while split_by_name.pt save the train-test split of the dataset.

tar -xzvf crossdocked_pocket10.tar.gz
# Then follow the ./dataset/readme.md for processing protein-ligand dataset from scratch.

Generation

# modify the data path and batch_size in the ./configs/eclouddiff.yml 
python generate_from_pdb.py --pdb_file pdb_file ./play_around/peptide_example/7ux5_protein.pdb --lig_file ./play_around/peptide_example/7ux5_peptide.sdf --outputs_dir results

Training

The training process is released as train.py, the following command is an example of how to train a model.

# prepare a demo data
python ./datasets/generate_pktlig_data.py
# modify the data path and batch_size in the ./configs/eclouddiff.yml 
python train_eclouddiff.py

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
configs		configs
dataset		dataset
ecloud_utils		ecloud_utils
models		models
play_around		play_around
task		task
utils		utils
figures_making.py		figures_making.py
generate_from_pdb.py		generate_from_pdb.py
readme.md		readme.md
sampling.ipynb		sampling.ipynb
train_eclouddiff.py		train_eclouddiff.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ECloudGen

Environment

Install via conda yaml file (cuda 11.3)

Install manually

Data

Protein-ligand pair dataset preparation.

Generation

Training

About

Releases

Packages

Languages

HaotianZhangAI4Science/ECloudGen_demo

Folders and files

Latest commit

History

Repository files navigation

ECloudGen

Environment

Install via conda yaml file (cuda 11.3)

Install manually

Data

Protein-ligand pair dataset preparation.

Generation

Training

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages