GrapHiC (Graph-based Hi-C)

This package provides functionality for producing bespoke graph datasets using multi-omics data and Hi-C. We provide compatibility with the standard bigwig and bed UCSC data formats and integrate with the popular Cooler format for Hi-C data.

Example usage

Creating a Hi-C graph from 3 disconnected regions of chromatin

from GrapHiC.Graph_creation import from_regions

# Specify cooler paths
coolerpath ={'WT':'tutorial_data/WT_contacts.cool',
             'KO':'tutorial_data/KO_contacts.cool'
            }

x = from_regions([coolerpath['WT'],coolerpath['KO']],         #paths to cooler files for edge featurisation
                 {'chr2':np.array([[8651256,10658971],        #Specifying region boundaries in chromosome 2, region 1
                                   [10678978,11658978],       #region 2
                                   [12678978,13658978],       #region 3
                                   [14678978,15658978],       #region 4
                                   [16678978,17658978]]),     #region 5
                  'chr3':np.array([[8651256,10658971],        #chromosome 3 region 1
                                   [10678978,11658978]])      #region 2
                 },
                 names = {'chr2': ['Ppp2r2c','reg2'],         #specifying names for the first two regions on chr2
                          'chr3': ['Ppp3r3c','reg4']          #specifying names for the first two regions on chr3
                         },
                 balance = True,                              #specifying whether to balance Hi-C matrices before extracting contact counts
                 join = False,                                #whether to join resulting dictionary of graphs into one graph object
                 backbone = True,                             #whether to force contacts to exist along the chromatin backbone
                 record_backbone_interactions = True,         #whether to explicitely record backbone interactions as an additional edge feature
                 record_cistrans_interactions = False,        #whether to calculate contacts across the interface of two regions (useful if joining region graphs)
                 record_node_chromosome_as_onehot = False     #whether to record a basic node feature as a one-hot encoding of the node chromosome
                )

Creating a DataTrack and evaluating over cooler bins

from GrapHiC.Datatrack_creation import evaluate_tracks_over_cooler_bins

#specify paths
contact_path = 'tutorial_data/WT_contacts.cool'
bed_peaks = "tutorial_data/GSEXXXXXHendrich20161026_Nanog_ESC_peaks.bed"
bigwig = "tutorial_data/GSE71932_Nanog_mESCs_treat_pileup_filter.bw"

dataframe = evaluate_tracks_over_cooler_bins(contact_path,                                #specify cooler path
                                             paths = [nanog_peaks,nanog_bigwig],          #specify bed or bigwig paths
                                             names = ['nanog_bigwig','nanog_peaks'],      #specify track names
                                             stats_types = ['mean','max','std'],          #specify bin statistics to collect
                                             allowed_chroms = ['chr1','chr2','chrX'],     #specify chromosomes
                                             value_col = 4)                               #speicfy the target column of any BED data tracks 
dataframe.to_csv("example.csv",
                 sep="\t", 
                 index = False)

Adding binned data to a list of graphs

from GrapHiC.Graph_creation import add_binned_data_to_graphlist

add_binned_data_to_graphlist(x['chr2'],
                            'example.csv')

Installation

Create env:

conda create --name graphic python=3.7
conda activate graphic

Install PyTorch:

N.B. Make sure to install appropriate version for your CUDA version

# Install PyTorch: MacOS
$ conda install pytorch torchvision -c pytorch                      # Only CPU Build

# Install PyTorch: Linux
$ conda install pytorch torchvision cpuonly -c pytorch              # For CPU Build
$ conda install pytorch torchvision cudatoolkit=9.2 -c pytorch      # For CUDA 9.2 Build
$ conda install pytorch torchvision cudatoolkit=10.1 -c pytorch     # For CUDA 10.1 Build
$ conda install pytorch torchvision cudatoolkit=10.2 -c pytorch     # For CUDA 10.2 Build

Install PyTorch Geometric:

$ pip install torch-scatter==latest+${CUDA} -f https://pytorch-geometric.com/whl/torch-${TORCH}.html
$ pip install torch-sparse==latest+${CUDA} -f https://pytorch-geometric.com/whl/torch-${TORCH}.html
$ pip install torch-cluster==latest+${CUDA} -f https://pytorch-geometric.com/whl/torch-${TORCH}.html
$ pip install torch-spline-conv==latest+${CUDA} -f https://pytorch-geometric.com/whl/torch-${TORCH}.html
$ pip install torch-geometric

Where ${CUDA} and ${TORCH} should be replaced by your specific CUDA version (cpu, cu92, cu101, cu102) and PyTorch version (1.4.0, 1.5.0, 1.6.0), respectively

N.B. Follow the instructions in the Torch-Geometric Docs to install the versions appropriate to your CUDA version.

install cython
```
pip install Cython
```
Install git-lfs
```
conda install git-lfs
```

Clone the git repo and install

git clone https://github.com/dhall1995/GrapHiC
cd GrapHiC
pip install -e .

Optionally, to run the tutorial notebooks, run the following from within the conda environment:

 conda install -c anaconda ipykernel
 python -m ipykernel install --user --name=graphic

Then when starting a jupyter notebook choose the graphic kernel

Documentation (work in progress)

For enquiries please email: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 126 Commits
GrapHiC		GrapHiC
docs		docs
examples		examples
notebooks		notebooks
tutorial_data		tutorial_data
.gitattributes		.gitattributes
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
requirements.txt		requirements.txt
setup.py		setup.py
workflow.png		workflow.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GrapHiC (Graph-based Hi-C)

Example usage

Creating a Hi-C graph from 3 disconnected regions of chromatin

Creating a DataTrack and evaluating over cooler bins

Adding binned data to a list of graphs

Installation

About

Releases

Packages

Contributors 2

Languages

License

dhall1995/GrapHiC

Folders and files

Latest commit

History

Repository files navigation

GrapHiC (Graph-based Hi-C)

Example usage

Creating a Hi-C graph from 3 disconnected regions of chromatin

Creating a DataTrack and evaluating over cooler bins

Adding binned data to a list of graphs

Installation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages