Skip to content

peakHiC method to identify 3D chromatin interactions (loops) from HiC data

Notifications You must be signed in to change notification settings

geertgeeven/peakHiC

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

63 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

peakHiC - a Hi-C peak calling pipeline

peakHiC is a method written in the R programming language to identify 3D chromatin interactions ("peaks") in Hi-C data. It works by first converting Hi-C contacts into virtual 4C profiles from loci of interest called "viewpoints" and then applying a statistical method to call peaks in these profiles, from replicate Hi-C experiments.

If you have any difficulties using the pipeline, please do not hesitate to contact us ([email protected]).

Citation

Valerio Bianchi, Geert Geeven, Nathan Tucker, Catharina R.E. Hilvering, Amelia W. Hall, Carolina Roselli, Matthew C. Hill, James F. Martin, Kenneth B. Margulies, Patrick T. Ellinor, Wouter de Laat. Detailed Regulatory Interaction Map of the Human Heart Facilitates Gene Discovery for Cardiovascular Disease. bioRxiv 705715; doi: https://doi.org/10.1101/705715

Prerequisites

Installation

First choose a folder where to install the pipeline. The R scripts will search for configuration files, HiC data and results relative to this baseFolder of install of peakHiC so it is important to remember this path and set peakHiC up correctly to use it. Here we will use /home/geert/localdev/github/ as folder where to clone the github repository. After navigating to this folder, we can clone the peakHiC repository using the following command:

git clone https://github.com/deLaatLab/peakHiC.git

Another path required to setup is the location of the pairix binary. peakHiC uses the 4DN-DCIC tool pairix (see Prerequisites) to read HiC data in the pairs format. Below we explain how to configure peakHiC to locate this tool.

Configure peakHiC to use example data

Example pairix files for a ~2Mb region on chromosome 1, from the GM12878 HiC dataset published by Rao et al. (2014) doi:10.1016/j.cell.2014.11.021 is included to test the peakHiC pipeline. pairix files for other Hi-C datasets are available through the 4DN data portal at https://data.4dnucleome.org/. Run the following commands in the R console to setup peakHiC to use the example data:

baseFolder <- "/home/geert/localdev/github/peakHiC/"
pairixBinary <- "/home/geert/localdev/prog/pairix/bin/pairix"
sourceFile <- paste0(baseFolder,"R/peakHiC_functions.R")
source(sourceFile)
initExampleData(baseFolder=baseFolder,pairixBinary)

Please be aware that you need to update the paths in the R code above to point to files and folders on your local machine.

Run the pipeline

Now navigate in a terminal to the peakHiC RUN folder, so in this example /home/geert/localdev/github/peakHiC/RUN/ and run the following commands:

Rscript createPartitionV4CsByChr.R -chr chr1 -peakHiCObj /home/geert/localdev/github/peakHiC/DATA/example_data/hg38_4DN_Rao_GM12878_peakHiC_example_peakHiCObj.rds
Rscript callPartitionPeaksbyChr.R -chr chr1 -peakHiCObj /home/geert/localdev/github/peakHiC/DATA/example_data/hg38_4DN_Rao_GM12878_peakHiC_example_peakHiCObj.rds
Rscript processLoops.R -chr chr1 -peakHiCObj /home/geert/localdev/github/peakHiC/DATA/example_data/hg38_4DN_Rao_GM12878_peakHiC_example_peakHiCObj.rds

This should generate R files with V4C profiles of some example viewpoints and a list of processed loops in /home/geert/localdev/github/peakHiC/RESULTS/Rao_4DN_GM12878_peakHiC_example/rds/loops/.

About

peakHiC method to identify 3D chromatin interactions (loops) from HiC data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • R 100.0%