diff --git a/data-representation.html b/data-representation.html
index 2086691..a8610ac 100644
--- a/data-representation.html
+++ b/data-representation.html
@@ -365,7 +365,7 @@ <h1 class="title">
 <section id="granges-class" class="level2" data-number="2.1"><h2 data-number="2.1" class="anchored" data-anchor-id="granges-class">
 <span class="header-section-number">2.1</span> <code>GRanges</code> class</h2>
 <p><code>GRanges</code> is a shorthand for <code>GenomicRanges</code>, a core class in <code>Bioconductor</code>. This class is primarily used to describe genomic ranges of any nature, e.g.&nbsp; sets of promoters, SNPs, chromatin loop anchors, ….<br>
-The data structure has been published in the seminal 2015 publication by the <code>Bioconductor</code> team (<span class="citation" data-cites="Huber2015Feb">Huber et al. (<a href="interoperability.html#ref-Huber2015Feb" role="doc-biblioref">2015</a>)</span>).</p>
+The data structure has been published in the seminal 2015 publication by the <code>Bioconductor</code> team (<span class="citation" data-cites="Huber_2015">Huber et al. (<a href="interoperability.html#ref-Huber_2015" role="doc-biblioref">2015</a>)</span>).</p>
 <section id="granges-fundamentals" class="level3" data-number="2.1.1"><h3 data-number="2.1.1" class="anchored" data-anchor-id="granges-fundamentals">
 <span class="header-section-number">2.1.1</span> <code>GRanges</code> fundamentals</h3>
 <p>The easiest way to generate a <code>GRanges</code> object is to coerce it from a vector of genomic coordinates in the UCSC format (e.g.&nbsp;<code>"chr2:2004-4853"</code>):</p>
@@ -1099,7 +1099,7 @@ <h1 class="title">
 <p>Note how close from a TSS the 8th peak was. It could be worth considering this as an overlap!</p>
 </section></section></section><section id="ginteractions-class" class="level2" data-number="2.2"><h2 data-number="2.2" class="anchored" data-anchor-id="ginteractions-class">
 <span class="header-section-number">2.2</span> <code>GInteractions</code> class</h2>
-<p><code>GRanges</code> describe genomic ranges and hence are of general use to study 1D genome organization. To study chromatin interactions, we need a way to link pairs of <code>GRanges</code>. This is exactly what the <code>GInteractions</code> class does. This data structure is defined in the <code>InteractionSet</code> package and has been published in the 2016 paper by <code>Lun et al.</code> (<span class="citation" data-cites="Lun2016May">Lun, Perry, and Ing-Simmons (<a href="interoperability.html#ref-Lun2016May" role="doc-biblioref">2016</a>)</span>).</p>
+<p><code>GRanges</code> describe genomic ranges and hence are of general use to study 1D genome organization. To study chromatin interactions, we need a way to link pairs of <code>GRanges</code>. This is exactly what the <code>GInteractions</code> class does. This data structure is defined in the <code>InteractionSet</code> package and has been published in the 2016 paper by <code>Lun et al.</code> (<span class="citation" data-cites="Lun_2016">Lun et al. (<a href="interoperability.html#ref-Lun_2016" role="doc-biblioref">2016</a>)</span>).</p>
 <p><img src="images/20230309114047.png" class="img-fluid"></p>
 <section id="building-a-ginteractions-object-from-scratch" class="level3" data-number="2.2.1"><h3 data-number="2.2.1" class="anchored" data-anchor-id="building-a-ginteractions-object-from-scratch">
 <span class="header-section-number">2.2.1</span> Building a <code>GInteractions</code> object from scratch</h3>
@@ -1597,7 +1597,7 @@ <h1 class="title">
 <div class="cell" data-layout-align="center" data-hash="data-representation_cache/html/unnamed-chunk-48_613feb304332cb6098ea31ae538c0ff9">
 <div class="sourceCode" id="cb47"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">coolf</span></span>
 <span><span class="co">##                                                   EH7702 </span></span>
-<span><span class="co">##  "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752"</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<span><span class="co">##  "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752"</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 <p>Similarly, example files are available for other file formats:</p>
 <div class="cell" data-layout-align="center" data-hash="data-representation_cache/html/unnamed-chunk-49_07b0f603f324c3454818c1410cb3cd50">
@@ -1667,7 +1667,7 @@ <h1 class="title">
 <span><span class="co"># ----- This creates a connection to a `.(m)cool` file (path stored in `coolf`)</span></span>
 <span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/CoolFile-class.html">CoolFile</a></span><span class="op">(</span><span class="va">coolf</span><span class="op">)</span></span>
 <span><span class="co">##  CoolFile object</span></span>
-<span><span class="co">##  .mcool file: /github/home/.cache/R/ExperimentHub/1a594277bd62_7752 </span></span>
+<span><span class="co">##  .mcool file: /github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752 </span></span>
 <span><span class="co">##  resolution: 1000 </span></span>
 <span><span class="co">##  pairs file: </span></span>
 <span><span class="co">##  metadata(0):</span></span>
@@ -1675,7 +1675,7 @@ <h1 class="title">
 <span><span class="co"># ----- This creates a connection to a `.hic` file (path stored in `hicf`)</span></span>
 <span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/HicFile-class.html">HicFile</a></span><span class="op">(</span><span class="va">hicf</span><span class="op">)</span></span>
 <span><span class="co">##  HicFile object</span></span>
-<span><span class="co">##  .hic file: /github/home/.cache/R/ExperimentHub/1a5939a379f0_7836 </span></span>
+<span><span class="co">##  .hic file: /github/home/.cache/R/ExperimentHub/1a9a270f71fe_7836 </span></span>
 <span><span class="co">##  resolution: 1000 </span></span>
 <span><span class="co">##  pairs file: </span></span>
 <span><span class="co">##  metadata(0):</span></span>
@@ -1684,8 +1684,8 @@ <h1 class="title">
 <span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/HicproFile-class.html">HicproFile</a></span><span class="op">(</span><span class="va">hicpromatrixf</span>, <span class="va">hicproregionsf</span><span class="op">)</span></span>
 <span><span class="co">##  HicproFile object</span></span>
 <span><span class="co">##  HiC-Pro files:</span></span>
-<span><span class="co">##    $ matrix:   /github/home/.cache/R/ExperimentHub/1a59dc812a9_7837 </span></span>
-<span><span class="co">##    $ regions:  /github/home/.cache/R/ExperimentHub/1a591fa0216e_7838 </span></span>
+<span><span class="co">##    $ matrix:   /github/home/.cache/R/ExperimentHub/1a9a6531ab2c_7837 </span></span>
+<span><span class="co">##    $ regions:  /github/home/.cache/R/ExperimentHub/1a9a3c1fca84_7838 </span></span>
 <span><span class="co">##  resolution: 1000 </span></span>
 <span><span class="co">##  pairs file: </span></span>
 <span><span class="co">##  metadata(0):</span></span>
@@ -1693,7 +1693,7 @@ <h1 class="title">
 <span><span class="co"># ----- This creates a connection to a pairs file</span></span>
 <span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/PairsFile-class.html">PairsFile</a></span><span class="op">(</span><span class="va">pairsf</span><span class="op">)</span></span>
 <span><span class="co">##  PairsFile object</span></span>
-<span><span class="co">##  resource: /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<span><span class="co">##  resource: /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 </section><section id="contactfile-slots" class="level3" data-number="2.3.3"><h3 data-number="2.3.3" class="anchored" data-anchor-id="contactfile-slots">
 <span class="header-section-number">2.3.3</span> <code>ContactFile</code> slots</h3>
@@ -1709,7 +1709,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb51"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">cf</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/CoolFile-class.html">CoolFile</a></span><span class="op">(</span><span class="va">coolf</span><span class="op">)</span></span>
 <span><span class="va">cf</span></span>
 <span><span class="co">##  CoolFile object</span></span>
-<span><span class="co">##  .mcool file: /github/home/.cache/R/ExperimentHub/1a594277bd62_7752 </span></span>
+<span><span class="co">##  .mcool file: /github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752 </span></span>
 <span><span class="co">##  resolution: 1000 </span></span>
 <span><span class="co">##  pairs file: </span></span>
 <span><span class="co">##  metadata(0):</span></span>
@@ -1807,7 +1807,7 @@ <h1 class="title">
 <span><span class="va">hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 8,757,906 contacts over 12,079 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "whole genome" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 1000 </span></span>
@@ -1849,7 +1849,7 @@ <h1 class="title">
 <p>These pieces of information are called <code>slots</code>. They can be directly accessed using <code>getter</code> functions, bearing the same name than the slot.</p>
 <div class="cell" data-layout-align="center" data-hash="data-representation_cache/html/unnamed-chunk-55_d36ff2f24162a6a16470a5d1582c04d5">
 <div class="sourceCode" id="cb55"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="fu"><a href="https://rdrr.io/pkg/BiocGenerics/man/fileName.html">fileName</a></span><span class="op">(</span><span class="va">hic</span><span class="op">)</span></span>
-<span><span class="co">##  [1] "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752"</span></span>
+<span><span class="co">##  [1] "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752"</span></span>
 <span></span>
 <span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/AllGenerics.html">focus</a></span><span class="op">(</span><span class="va">hic</span><span class="op">)</span></span>
 <span><span class="co">##  NULL</span></span>
@@ -1928,7 +1928,7 @@ <h1 class="title">
 <span><span class="va">hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 13,681,280 contacts over 12,165 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a5939a379f0_7836" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a270f71fe_7836" </span></span>
 <span><span class="co">##  focus: "whole genome" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 1000 </span></span>
@@ -2370,14 +2370,14 @@ <h1 class="title">
 <div class="sourceCode" id="cb63"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">yeast_hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 8,757,906 contacts over 763 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "whole genome" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 16000 </span></span>
 <span><span class="co">##  interactions: 267709 </span></span>
 <span><span class="co">##  scores(2): count balanced </span></span>
 <span><span class="co">##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) </span></span>
-<span><span class="co">##  pairsFile: /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753 </span></span>
+<span><span class="co">##  pairsFile: /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753 </span></span>
 <span><span class="co">##  metadata(3): ID org date</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 <section id="interactions" class="level4" data-number="2.4.2.1"><h4 data-number="2.4.2.1" class="anchored" data-anchor-id="interactions">
@@ -2693,8 +2693,8 @@ <h1 class="title">
 <div class="sourceCode" id="cb71"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/AllGenerics.html">pairsFile</a></span><span class="op">(</span><span class="va">yeast_hic</span><span class="op">)</span> <span class="op">&lt;-</span> <span class="va">pairsf</span></span>
 <span></span>
 <span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/AllGenerics.html">pairsFile</a></span><span class="op">(</span><span class="va">yeast_hic</span><span class="op">)</span></span>
-<span><span class="co">##                                                   EH7703 </span></span>
-<span><span class="co">##  "/github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753"</span></span>
+<span><span class="co">##                                                  EH7703 </span></span>
+<span><span class="co">##  "/github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753"</span></span>
 <span></span>
 <span><span class="fu"><a href="https://rdrr.io/r/base/readLines.html">readLines</a></span><span class="op">(</span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/AllGenerics.html">pairsFile</a></span><span class="op">(</span><span class="va">yeast_hic</span><span class="op">)</span>, <span class="fl">25</span><span class="op">)</span></span>
 <span><span class="co">##   [1] "## pairs format v1.0"                                                             </span></span>
@@ -2777,12 +2777,12 @@ <h1 class="title">
 
 
 </section><section id="bibliography" class="level1 unnumbered"><h1 class="unnumbered">References</h1>
-<div id="refs" class="references csl-bib-body hanging-indent" role="list" style="display: none">
-<div id="ref-Huber2015Feb" class="csl-entry" role="listitem">
-Huber, Wolfgang, Vincent J. Carey, Robert Gentleman, Simon Anders, Marc Carlson, Benilton S. Carvalho, Hector Corrada Bravo, et al. 2015. <span>“<span class="nocase">Orchestrating high-throughput genomic analysis with Bioconductor</span>.”</span> <em>Nat. Methods</em> 12 (2): 115–21. <a href="https://doi.org/10.1038/nmeth.3252">https://doi.org/10.1038/nmeth.3252</a>.
+<div id="refs" class="references csl-bib-body hanging-indent" data-line-spacing="2" role="list" style="display: none">
+<div id="ref-Huber_2015" class="csl-entry" role="listitem">
+Huber, W., Carey, V. J., Gentleman, R., Anders, S., Carlson, M., Carvalho, B. S., Bravo, H. C., Davis, S., Gatto, L., Girke, T., Gottardo, R., Hahne, F., Hansen, K. D., Irizarry, R. A., Lawrence, M., Love, M. I., MacDonald, J., Obenchain, V., Oleś, A. K., … Morgan, M. (2015). Orchestrating high-throughput genomic analysis with bioconductor. <em>Nature Methods</em>, <em>12</em>(2), 115–121. <a href="https://doi.org/10.1038/nmeth.3252">https://doi.org/10.1038/nmeth.3252</a>
 </div>
-<div id="ref-Lun2016May" class="csl-entry" role="listitem">
-Lun, Aaron T. L., Malcolm Perry, and Elizabeth Ing-Simmons. 2016. <span>“<span class="nocase">Infrastructure for genomic interactions: Bioconductor classes for Hi-C, ChIA-PET and related experiments</span>.”</span> <em>F1000Research</em> 5 (May): 950. <a href="https://doi.org/10.12688/f1000research.8759.2">https://doi.org/10.12688/f1000research.8759.2</a>.
+<div id="ref-Lun_2016" class="csl-entry" role="listitem">
+Lun, A. T. L., Perry, M., &amp; Ing-Simmons, E. (2016). Infrastructure for genomic interactions: Bioconductor classes for hi-c, <span>ChIA</span>-<span>PET</span> and related experiments. <em>F1000Research</em>, <em>5</em>, 950. <a href="https://doi.org/10.12688/f1000research.8759.2">https://doi.org/10.12688/f1000research.8759.2</a>
 </div>
 </div>
 </section></main><!-- /main --><script id="quarto-html-after-body" type="application/javascript">
diff --git a/disseminating.html b/disseminating.html
index fe0b023..275c49e 100644
--- a/disseminating.html
+++ b/disseminating.html
@@ -411,7 +411,7 @@ <h1 class="title">
 </div>
 <ul>
 <li>
-<code>type = 'insulation'</code> will fetch a <code>.bigwig</code> track file precomputed by the 4DN consortium. This track corresponds to the genome-wide insulation score computed by <code>cooltools</code> as described in <span class="citation" data-cites="Crane2015Jul">Crane et al. (<a href="interoperability.html#ref-Crane2015Jul" role="doc-biblioref">2015</a>)</span>. To know more about this, read <a href="https://data.4dnucleome.org/resources/data-analysis/insulation_compartment_scores#insulation_scores_and_boundaries_page_all">the excerpt from 4DN data portal</a>. Once fetched from the 4DN data portal, the local file can be imported in <code>R</code> using the <code>import</code> function, which will generate a <code>RleList</code> object.</li>
+<code>type = 'insulation'</code> will fetch a <code>.bigwig</code> track file precomputed by the 4DN consortium. This track corresponds to the genome-wide insulation score computed by <code>cooltools</code> as described in <span class="citation" data-cites="Crane_2015">Crane et al. (<a href="interoperability.html#ref-Crane_2015" role="doc-biblioref">2015</a>)</span>. To know more about this, read <a href="https://data.4dnucleome.org/resources/data-analysis/insulation_compartment_scores#insulation_scores_and_boundaries_page_all">the excerpt from 4DN data portal</a>. Once fetched from the 4DN data portal, the local file can be imported in <code>R</code> using the <code>import</code> function, which will generate a <code>RleList</code> object.</li>
 </ul>
 <div class="cell" data-layout-align="center" data-hash="disseminating_cache/html/unnamed-chunk-6_d5634a6345d6486d15a775e7526591b6">
 <div class="sourceCode" id="cb5"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="fu"><a href="https://rdrr.io/pkg/fourDNData/man/fourDNData.html">fourDNData</a></span><span class="op">(</span>experimentSetAccession <span class="op">=</span> <span class="st">'4DNES25ABNZ1'</span>, type <span class="op">=</span> <span class="st">'insulation'</span><span class="op">)</span> <span class="op">|&gt;</span> </span>
@@ -603,9 +603,9 @@ <h1 class="title">
 
 
 </section><section id="bibliography" class="level1 unnumbered"><h1 class="unnumbered">References</h1>
-<div id="refs" class="references csl-bib-body hanging-indent" role="list" style="display: none">
-<div id="ref-Crane2015Jul" class="csl-entry" role="listitem">
-Crane, Emily, Qian Bian, Rachel Patton McCord, Bryan R. Lajoie, Bayly S. Wheeler, Edward J. Ralston, Satoru Uzawa, Job Dekker, and Barbara J. Meyer. 2015. <span>“<span class="nocase">Condensin-driven remodelling of X chromosome topology during dosage compensation</span>.”</span> <em>Nature</em> 523 (July): 240–44. <a href="https://doi.org/10.1038/nature14450">https://doi.org/10.1038/nature14450</a>.
+<div id="refs" class="references csl-bib-body hanging-indent" data-line-spacing="2" role="list" style="display: none">
+<div id="ref-Crane_2015" class="csl-entry" role="listitem">
+Crane, E., Bian, Q., McCord, R. P., Lajoie, B. R., Wheeler, B. S., Ralston, E. J., Uzawa, S., Dekker, J., &amp; Meyer, B. J. (2015). Condensin-driven remodelling of x chromosome topology during dosage compensation. <em>Nature</em>, <em>523</em>(7559), 240–244. <a href="https://doi.org/10.1038/nature14450">https://doi.org/10.1038/nature14450</a>
 </div>
 </div>
 </section></main><!-- /main --><script id="quarto-html-after-body" type="application/javascript">
diff --git a/index.html b/index.html
index 20c7683..f0600b7 100644
--- a/index.html
+++ b/index.html
@@ -260,7 +260,7 @@
 <a href="#reproducibility" id="toc-reproducibility" class="nav-link" data-scroll-target="#reproducibility">Reproducibility</a>
   <ul class="collapse">
 <li><a href="#building-book" id="toc-building-book" class="nav-link" data-scroll-target="#building-book">Building book</a></li>
-  <li><a href="interoperability.html#session-info" id="toc-session-info" class="nav-link" data-scroll-target="#session-info">Session info</a></li>
+  <li><a href="#session-info" id="toc-session-info" class="nav-link" data-scroll-target="#session-info">Session info</a></li>
   </ul>
 </li>
   </ul><div class="toc-actions"><div><i class="bi bi-github"></i></div><div class="action-links"><p><a href="https://github.com/js2264/OHCA/edit/main/index.qmd" class="toc-action">Edit this page</a></p><p><a href="https://github.com/js2264/OHCA/issues/new" class="toc-action">Report an issue</a></p></div></div></nav>
@@ -283,7 +283,7 @@ <h1 class="title">Orchestrating Hi-C analysis with Bioconductor</h1>
 </header>
 <section id="welcome" class="unnumbered"><section class="level1 unnumbered"><h1 class="unnumbered">Welcome</h1>
 <p>This is the landing page of the <strong>“Orchestrating Hi-C analysis with Bioconductor”</strong> book. <strong>The primary aim of this book is to introduce the <code>R</code> user to Hi-C analysis</strong>. This book starts with key concepts important for the analysis of chromatin conformation capture and then presents <code>Bioconductor</code> tools that can be leveraged to process, analyze, explore and visualize Hi-C data.</p>
-<p><strong>Authors:</strong> Jacques Serizay [aut, cre]<br><strong>Version:</strong> 1.1.0<br><strong>Modified:</strong> 2023-04-14<br><strong>Compiled:</strong> 2023-09-25<br><strong>Environment:</strong> R version 4.3.1 (2023-06-16), Bioconductor 3.18<br><strong>License:</strong> MIT + file LICENSE<br><strong>Copyright:</strong> J. Serizay</p>
+<p><strong>Authors:</strong> Jacques Serizay [aut, cre]<br><strong>Version:</strong> 1.1.0<br><strong>Modified:</strong> 2023-04-14<br><strong>Compiled:</strong> 2023-10-19<br><strong>Environment:</strong> R version 4.3.1 (2023-06-16), Bioconductor 3.18<br><strong>License:</strong> MIT + file LICENSE<br><strong>Copyright:</strong> J. Serizay</p>
 </section><p><img src="cover.jpg" title="Orchestrating Hi-C analysis with Bioconductor" class="quarto-cover-image img-fluid"></p><section id="table-of-contents" class="level1 unnumbered"><h1 class="unnumbered">Table of contents</h1>
 <p>This book is divided in three parts:</p>
 <p><strong>Part I: Introduction to Hi-C analysis</strong></p>
@@ -452,7 +452,7 @@ <h1 class="title">Orchestrating Hi-C analysis with Bioconductor</h1>
 <span><span class="co">##   collate  en_US.UTF-8</span></span>
 <span><span class="co">##   ctype    en_US.UTF-8</span></span>
 <span><span class="co">##   tz       Etc/UTC</span></span>
-<span><span class="co">##   date     2023-09-25</span></span>
+<span><span class="co">##   date     2023-10-19</span></span>
 <span><span class="co">##   pandoc   3.1.1 @ /usr/local/bin/ (via rmarkdown)</span></span>
 <span><span class="co">##  </span></span>
 <span><span class="co">##  ─ Packages ────────────────────────────────────────────────────────────────</span></span>
@@ -460,7 +460,7 @@ <h1 class="title">Orchestrating Hi-C analysis with Bioconductor</h1>
 <span><span class="co">##   abind                    1.4-5     2016-07-21 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   AnnotationDbi            1.63.2    2023-07-02 [1] Bioconductor</span></span>
 <span><span class="co">##   AnnotationHub          * 3.9.2     2023-08-24 [1] Bioconductor</span></span>
-<span><span class="co">##   basilisk                 1.13.1    2023-06-07 [1] Bioconductor</span></span>
+<span><span class="co">##   basilisk                 1.13.4    2023-10-04 [1] Bioconductor</span></span>
 <span><span class="co">##   basilisk.utils           1.13.3    2023-09-04 [1] Bioconductor</span></span>
 <span><span class="co">##   beeswarm                 0.4.0     2021-06-01 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   Biobase                  2.61.0    2023-04-25 [1] Bioconductor</span></span>
@@ -476,56 +476,56 @@ <h1 class="title">Orchestrating Hi-C analysis with Bioconductor</h1>
 <span><span class="co">##   bit64                    4.0.5     2020-08-30 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   bitops                   1.0-7     2021-04-24 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   blob                     1.2.4     2023-03-17 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   bookdown                 0.35      2023-08-09 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   bookdown                 0.36      2023-10-16 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   cachem                   1.0.8     2023-05-01 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   cli                      3.6.1     2023-03-23 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   CodeDepends              0.6.5     2018-07-17 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   codetools                0.2-19    2023-02-01 [2] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   colorspace               2.1-0     2023-01-23 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   crayon                   1.5.2     2022-09-29 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   curl                     5.0.2     2023-08-14 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   curl                     5.1.0     2023-10-02 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   data.table               1.14.8    2023-02-17 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   DBI                      1.1.3     2022-06-18 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   dbplyr                 * 2.3.3     2023-07-07 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   dbplyr                 * 2.3.4     2023-09-26 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   DelayedArray             0.27.10   2023-07-28 [1] Bioconductor</span></span>
 <span><span class="co">##   digest                   0.6.33    2023-07-07 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   dir.expiry               1.9.0     2023-04-25 [1] Bioconductor</span></span>
 <span><span class="co">##   DNAZooData             * 1.1.0     2023-04-27 [1] Bioconductor</span></span>
 <span><span class="co">##   dplyr                    1.1.3     2023-09-03 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   ellipsis                 0.3.2     2021-04-29 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   evaluate                 0.21      2023-05-05 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   evaluate                 0.22      2023-09-29 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   ExperimentHub          * 2.9.1     2023-07-12 [1] Bioconductor</span></span>
-<span><span class="co">##   fansi                    1.0.4     2023-01-22 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   fansi                    1.0.5     2023-10-08 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   fastmap                  1.1.1     2023-02-24 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   filelock                 1.0.2     2018-10-05 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   fourDNData             * 1.1.0     2023-04-27 [1] Bioconductor</span></span>
 <span><span class="co">##   generics                 0.1.3     2022-07-05 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   GenomeInfoDb             1.37.4    2023-09-07 [1] Bioconductor</span></span>
-<span><span class="co">##   GenomeInfoDbData         1.2.10    2023-09-25 [1] Bioconductor</span></span>
-<span><span class="co">##   GenomicRanges            1.53.1    2023-05-04 [1] Bioconductor</span></span>
+<span><span class="co">##   GenomeInfoDb             1.37.6    2023-10-02 [1] Bioconductor</span></span>
+<span><span class="co">##   GenomeInfoDbData         1.2.11    2023-10-19 [1] Bioconductor</span></span>
+<span><span class="co">##   GenomicRanges            1.53.2    2023-10-08 [1] Bioconductor</span></span>
 <span><span class="co">##   ggbeeswarm               0.7.2     2023-04-29 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   ggplot2                  3.4.3     2023-08-14 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   ggplot2                  3.4.4     2023-10-12 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   ggrastr                  1.0.2     2023-06-01 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   glue                     1.6.2     2022-02-24 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   graph                    1.79.2    2023-09-22 [1] Bioconductor</span></span>
+<span><span class="co">##   graph                    1.79.4    2023-10-09 [1] Bioconductor</span></span>
 <span><span class="co">##   gtable                   0.3.4     2023-08-21 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   HiCExperiment          * 1.1.2     2023-09-04 [1] Bioconductor</span></span>
 <span><span class="co">##   HiContacts             * 1.3.2     2023-09-04 [1] Bioconductor</span></span>
 <span><span class="co">##   HiContactsData         * 1.3.0     2023-04-27 [1] Bioconductor</span></span>
 <span><span class="co">##   HiCool                 * 1.1.0     2023-05-19 [1] Bioconductor</span></span>
 <span><span class="co">##   hms                      1.1.3     2023-03-21 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   htmltools                0.5.6     2023-08-10 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   htmltools                0.5.6.1   2023-10-06 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   htmlwidgets              1.6.2     2023-03-17 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   httpuv                   1.6.11    2023-05-11 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   httr                     1.4.7     2023-08-15 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   InteractionSet           1.29.1    2023-06-14 [1] Bioconductor</span></span>
 <span><span class="co">##   interactiveDisplayBase   1.39.0    2023-04-25 [1] Bioconductor</span></span>
-<span><span class="co">##   IRanges                  2.35.2    2023-06-22 [1] Bioconductor</span></span>
+<span><span class="co">##   IRanges                  2.35.3    2023-10-12 [1] Bioconductor</span></span>
 <span><span class="co">##   jsonlite                 1.8.7     2023-06-29 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   KEGGREST                 1.41.0    2023-04-25 [1] Bioconductor</span></span>
+<span><span class="co">##   KEGGREST                 1.41.4    2023-09-25 [1] Bioconductor</span></span>
 <span><span class="co">##   knitr                    1.44      2023-09-11 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   later                    1.3.1     2023-05-02 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   lattice                  0.21-8    2023-04-05 [2] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   lattice                  0.21-9    2023-10-01 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   lazyeval                 0.2.2     2019-03-15 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   lifecycle                1.0.3     2022-10-07 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   magrittr                 2.0.3     2022-03-30 [1] CRAN (R 4.3.1)</span></span>
@@ -535,7 +535,7 @@ <h1 class="title">Orchestrating Hi-C analysis with Bioconductor</h1>
 <span><span class="co">##   memoise                  2.0.1     2021-11-26 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   mime                     0.12      2021-09-28 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   munsell                  0.5.0     2018-06-12 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   OHCA                   * 1.1.0     2023-09-25 [1] local</span></span>
+<span><span class="co">##   OHCA                   * 1.1.0     2023-10-19 [1] local</span></span>
 <span><span class="co">##   pillar                   1.9.0     2023-03-22 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   pkgconfig                2.0.3     2019-09-22 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   plotly                   4.10.2    2023-06-03 [1] CRAN (R 4.3.1)</span></span>
@@ -548,7 +548,7 @@ <h1 class="title">Orchestrating Hi-C analysis with Bioconductor</h1>
 <span><span class="co">##   RCurl                    1.98-1.12 2023-03-27 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   readr                    2.1.4     2023-02-10 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   rebook                   1.11.1    2023-05-25 [1] Bioconductor</span></span>
-<span><span class="co">##   reticulate               1.32.0    2023-09-11 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   reticulate               1.34.0    2023-10-12 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   rhdf5                    2.45.1    2023-07-10 [1] Bioconductor</span></span>
 <span><span class="co">##   rhdf5filters             1.13.5    2023-07-19 [1] Bioconductor</span></span>
 <span><span class="co">##   Rhdf5lib                 1.23.2    2023-09-10 [1] Bioconductor</span></span>
@@ -560,10 +560,10 @@ <h1 class="title">Orchestrating Hi-C analysis with Bioconductor</h1>
 <span><span class="co">##   RSQLite                  2.3.1     2023-04-03 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   rstudioapi               0.15.0    2023-07-07 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   S4Arrays                 1.1.6     2023-08-30 [1] Bioconductor</span></span>
-<span><span class="co">##   S4Vectors                0.39.2    2023-09-22 [1] Bioconductor</span></span>
+<span><span class="co">##   S4Vectors                0.39.3    2023-10-11 [1] Bioconductor</span></span>
 <span><span class="co">##   scales                   1.2.1     2022-08-20 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   sessioninfo              1.2.2     2021-12-06 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   shiny                    1.7.5     2023-08-12 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   shiny                    1.7.5.1   2023-10-14 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   SparseArray              1.1.12    2023-08-31 [1] Bioconductor</span></span>
 <span><span class="co">##   strawr                   0.0.91    2023-03-29 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   stringi                  1.7.12    2023-01-11 [1] CRAN (R 4.3.1)</span></span>
@@ -574,10 +574,10 @@ <h1 class="title">Orchestrating Hi-C analysis with Bioconductor</h1>
 <span><span class="co">##   tidyselect               1.2.0     2022-10-10 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   tzdb                     0.4.0     2023-05-12 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   utf8                     1.2.3     2023-01-31 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   vctrs                    0.6.3     2023-06-14 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   vctrs                    0.6.4     2023-10-12 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   vipor                    0.4.5     2017-03-22 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   viridisLite              0.4.2     2023-05-02 [1] CRAN (R 4.3.1)</span></span>
-<span><span class="co">##   vroom                    1.6.3     2023-04-28 [1] CRAN (R 4.3.1)</span></span>
+<span><span class="co">##   vroom                    1.6.4     2023-10-02 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   xfun                     0.40      2023-08-09 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   XML                      3.99-0.14 2023-03-19 [1] CRAN (R 4.3.1)</span></span>
 <span><span class="co">##   xtable                   1.8-4     2019-04-21 [1] CRAN (R 4.3.1)</span></span>
diff --git a/interactions-centric.html b/interactions-centric.html
index 9d4b6a3..3520889 100644
--- a/interactions-centric.html
+++ b/interactions-centric.html
@@ -372,7 +372,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb2"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 471,364 contacts over 407 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -400,7 +400,7 @@ <h1 class="title">
 <div class="cell" data-layout-align="center" data-hash="interactions-centric_cache/html/unnamed-chunk-5_3da1a691410bb1a0f2b340bcd81e3f08">
 <div class="sourceCode" id="cb4"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">pf</span></span>
 <span><span class="co">##  PairsFile object</span></span>
-<span><span class="co">##  resource: /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<span><span class="co">##  resource: /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 <div class="callout callout-style-default callout-note callout-titled">
 <div class="callout-header d-flex align-content-center">
@@ -452,7 +452,7 @@ <h1 class="title">
 <div class="cell" data-layout-align="center" data-hash="interactions-centric_cache/html/unnamed-chunk-7_61a6b73bdd59d810076d1d2bca92deb1">
 <div class="sourceCode" id="cb6"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="kw"><a href="https://rdrr.io/r/base/library.html">library</a></span><span class="op">(</span><span class="va"><a href="https://github.com/js2264/HiContacts">HiContacts</a></span><span class="op">)</span></span>
 <span><span class="va">ps</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/pkg/HiContacts/man/Ps.html">distanceLaw</a></span><span class="op">(</span><span class="va">pf</span>, by_chr <span class="op">=</span> <span class="cn">TRUE</span><span class="op">)</span> </span>
-<span><span class="co">##  Importing pairs file /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753 in memory. This may take a while...</span></span>
+<span><span class="co">##  Importing pairs file /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753 in memory. This may take a while...</span></span>
 <span><span class="va">ps</span></span>
 <span><span class="co">##  # A tibble: 115 × 6</span></span>
 <span><span class="co">##    chr   binned_distance          p     norm_p norm_p_unity slope</span></span>
@@ -513,7 +513,7 @@ <h1 class="title">
 </div>
 <div class="cell" data-layout-align="center" data-hash="interactions-centric_cache/html/unnamed-chunk-10_a32782588d84e43b06cfc497d21b583a">
 <div class="sourceCode" id="cb11"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">eco1_ps</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/pkg/HiContacts/man/Ps.html">distanceLaw</a></span><span class="op">(</span><span class="va">eco1_pf</span>, by_chr <span class="op">=</span> <span class="cn">TRUE</span><span class="op">)</span> </span>
-<span><span class="co">##  Importing pairs file /github/home/.cache/R/ExperimentHub/21b95aa8e2b4_7755 in memory. This may take a while...</span></span>
+<span><span class="co">##  Importing pairs file /github/home/.cache/R/ExperimentHub/21fb251da216_7755 in memory. This may take a while...</span></span>
 <span><span class="va">eco1_ps</span></span>
 <span><span class="co">##  # A tibble: 115 × 6</span></span>
 <span><span class="co">##    chr   binned_distance          p     norm_p norm_p_unity slope</span></span>
@@ -694,12 +694,12 @@ <h1 class="title">
 <p>This clearly highlights trans interactions of the chromosome <code>II</code> centromere with the centromeres from other chromosomes.</p>
 </section><section id="scalograms" class="level2 page-columns page-full" data-number="6.4"><h2 data-number="6.4" class="anchored" data-anchor-id="scalograms">
 <span class="header-section-number">6.4</span> Scalograms</h2>
-<p>Scalograms were introduced in <span class="citation" data-cites="Lioy2018Feb">Lioy et al. (<a href="interoperability.html#ref-Lioy2018Feb" role="doc-biblioref">2018</a>)</span> to investigate distance-dependent contact frequencies for individual genomic bins along chromosomes.<br>
+<p>Scalograms were introduced in <span class="citation" data-cites="Lioy_2018">Lioy et al. (<a href="interoperability.html#ref-Lioy_2018" role="doc-biblioref">2018</a>)</span> to investigate distance-dependent contact frequencies for individual genomic bins along chromosomes.<br>
 To generate a scalogram, one needs to provide a <code>HiCExperiment</code> object with a valid associated <code>pairsFile</code>.</p>
 <div class="cell" data-layout-align="center" data-hash="interactions-centric_cache/html/unnamed-chunk-17_4e0d15336a002cea58219b80a7c4dc89">
 <div class="sourceCode" id="cb20"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/AllGenerics.html">pairsFile</a></span><span class="op">(</span><span class="va">hic</span><span class="op">)</span> <span class="op">&lt;-</span> <span class="va">pairsf</span></span>
 <span><span class="va">scalo</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/pkg/HiContacts/man/scalogram.html">scalogram</a></span><span class="op">(</span><span class="va">hic</span><span class="op">)</span> </span>
-<span><span class="co">##  Importing pairs file /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753 in memory. This may take a while...</span></span>
+<span><span class="co">##  Importing pairs file /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753 in memory. This may take a while...</span></span>
 <span><span class="fu"><a href="https://rdrr.io/pkg/HiContacts/man/plotScalogram.html">plotScalogram</a></span><span class="op">(</span><span class="va">scalo</span> <span class="op">|&gt;</span> <span class="fu"><a href="https://dplyr.tidyverse.org/reference/filter.html">filter</a></span><span class="op">(</span><span class="va">chr</span> <span class="op">==</span> <span class="st">'II'</span><span class="op">)</span>, ylim <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/c.html">c</a></span><span class="op">(</span><span class="fl">1e3</span>, <span class="fl">1e5</span><span class="op">)</span><span class="op">)</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 <div class="cell-output-display">
 <div class="quarto-figure quarto-figure-center">
@@ -723,7 +723,7 @@ <h1 class="title">
 <span><span class="co">##  loading from cache</span></span>
 <span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/AllGenerics.html">pairsFile</a></span><span class="op">(</span><span class="va">eco1_hic</span><span class="op">)</span> <span class="op">&lt;-</span> <span class="va">eco1_pairsf</span></span>
 <span><span class="va">eco1_scalo</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/pkg/HiContacts/man/scalogram.html">scalogram</a></span><span class="op">(</span><span class="va">eco1_hic</span><span class="op">)</span> </span>
-<span><span class="co">##  Importing pairs file /github/home/.cache/R/ExperimentHub/21b95aa8e2b4_7755 in memory. This may take a while...</span></span>
+<span><span class="co">##  Importing pairs file /github/home/.cache/R/ExperimentHub/21fb251da216_7755 in memory. This may take a while...</span></span>
 <span><span class="va">merged_scalo</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/pkg/BiocGenerics/man/cbind.html">rbind</a></span><span class="op">(</span></span>
 <span>    <span class="va">scalo</span> <span class="op">|&gt;</span> <span class="fu"><a href="https://dplyr.tidyverse.org/reference/mutate.html">mutate</a></span><span class="op">(</span>sample <span class="op">=</span> <span class="st">'WT'</span><span class="op">)</span>, </span>
 <span>    <span class="va">eco1_scalo</span> <span class="op">|&gt;</span> <span class="fu"><a href="https://dplyr.tidyverse.org/reference/mutate.html">mutate</a></span><span class="op">(</span>sample <span class="op">=</span> <span class="st">'eco1'</span><span class="op">)</span></span>
@@ -742,9 +742,9 @@ <h1 class="title">
 
 
 </section><section id="bibliography" class="level1 unnumbered"><h1 class="unnumbered">References</h1>
-<div id="refs" class="references csl-bib-body hanging-indent" role="list" style="display: none">
-<div id="ref-Lioy2018Feb" class="csl-entry" role="listitem">
-Lioy, Virginia S., Axel Cournac, Martial Marbouty, Stéphane Duigou, Julien Mozziconacci, Olivier Espéli, Frédéric Boccard, and Romain Koszul. 2018. <span>“<span class="nocase">Multiscale Structuring of the E. coli Chromosome by Nucleoid-Associated and Condensin Proteins</span>.”</span> <em>Cell</em> 172 (4): 771–78318. <a href="https://doi.org/10.1016/j.cell.2017.12.027">https://doi.org/10.1016/j.cell.2017.12.027</a>.
+<div id="refs" class="references csl-bib-body hanging-indent" data-line-spacing="2" role="list" style="display: none">
+<div id="ref-Lioy_2018" class="csl-entry" role="listitem">
+Lioy, V. S., Cournac, A., Marbouty, M., Duigou, S., Mozziconacci, J., Espéli, O., Boccard, F., &amp; Koszul, R. (2018). Multiscale structuring of the e.&nbsp;Coli chromosome by nucleoid-associated and condensin proteins. <em>Cell</em>, <em>172</em>(4), 771–783.e18. <a href="https://doi.org/10.1016/j.cell.2017.12.027">https://doi.org/10.1016/j.cell.2017.12.027</a>
 </div>
 </div>
 </section></main><!-- /main --><script id="quarto-html-after-body" type="application/javascript">
diff --git a/interoperability.html b/interoperability.html
index 4e1b587..f9b1c78 100644
--- a/interoperability.html
+++ b/interoperability.html
@@ -274,6 +274,7 @@
   <li><a href="#gothic" id="toc-gothic" class="nav-link" data-scroll-target="#gothic"><span class="header-section-number">9.4</span> GOTHiC</a></li>
   <li><a href="#references" id="toc-references" class="nav-link" data-scroll-target="#references">References</a></li>
   <li><a href="#session-info" id="toc-session-info" class="nav-link" data-scroll-target="#session-info">Session info</a></li>
+  <li><a href="#bibliography" id="toc-bibliography" class="nav-link" data-scroll-target="#bibliography">References</a></li>
   </ul><div class="toc-actions"><div><i class="bi bi-github"></i></div><div class="action-links"><p><a href="https://github.com/js2264/OHCA/edit/main/interoperability.qmd" class="toc-action">Edit this page</a></p><p><a href="https://github.com/js2264/OHCA/issues/new" class="toc-action">Report an issue</a></p></div></div></nav>
     </div>
 <!-- main -->
@@ -316,7 +317,7 @@ <h1 class="title">
 </div>
 <section id="hicrep" class="level2" data-number="9.1"><h2 data-number="9.1" class="anchored" data-anchor-id="hicrep">
 <span class="header-section-number">9.1</span> HiCrep</h2>
-<p><code>hicrep</code> is a popular package to compute <strong>stratum-adjusted correlations</strong> between Hi-C datasets (<span class="citation" data-cites="Yang2017Nov">Yang et al. (<a href="interoperability.html#ref-Yang2017Nov" role="doc-biblioref">2017</a>)</span>). “Stratum” refers to the distance from the main diagonal: with increase distance from the main diagonal, interactions of the DNA polymer are bound to decrease. <code>hicrep</code> computes a “per-stratum” correlation score and computes a weighted average correlation for entire chromosomes.</p>
+<p><code>hicrep</code> is a popular package to compute <strong>stratum-adjusted correlations</strong> between Hi-C datasets (<span class="citation" data-cites="Yang_2017">Yang et al. (<a href="interoperability.html#ref-Yang_2017" role="doc-biblioref">2017</a>)</span>). “Stratum” refers to the distance from the main diagonal: with increase distance from the main diagonal, interactions of the DNA polymer are bound to decrease. <code>hicrep</code> computes a “per-stratum” correlation score and computes a weighted average correlation for entire chromosomes.</p>
 <div class="callout callout-style-default callout-tip callout-titled">
 <div class="callout-header d-flex align-content-center">
 <div class="callout-icon-container">
@@ -352,7 +353,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb4"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic_wt</span></span>
 <span><span class="co">##  `HiCExperiment` object with 8,757,906 contacts over 12,079 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "whole genome" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 1000 </span></span>
@@ -413,7 +414,7 @@ <h1 class="title">
 </div>
 </section><section id="multihiccompare" class="level2" data-number="9.2"><h2 data-number="9.2" class="anchored" data-anchor-id="multihiccompare">
 <span class="header-section-number">9.2</span> multiHiCcompare</h2>
-<p>The <code>multiHiCcompare</code> package provides functions for joint normalization and difference detection in multiple Hi-C datasets (<span class="citation" data-cites="Stansfield2019Sep">Stansfield, Cresswell, and Dozmorov (<a href="interoperability.html#ref-Stansfield2019Sep" role="doc-biblioref">2019</a>)</span>). According to its excerpt, to perform differential interaction analysis, it requires a <code>list</code> of <strong>raw counts</strong> for different samples/replicates, stored in <strong>data frames with four columns</strong> (<code>chr</code>, <code>start1</code>, <code>start2</code>, <code>count</code>).<br>
+<p>The <code>multiHiCcompare</code> package provides functions for joint normalization and difference detection in multiple Hi-C datasets (<span class="citation" data-cites="Stansfield_2019">Stansfield et al. (<a href="interoperability.html#ref-Stansfield_2019" role="doc-biblioref">2019</a>)</span>). According to its excerpt, to perform differential interaction analysis, it requires a <code>list</code> of <strong>raw counts</strong> for different samples/replicates, stored in <strong>data frames with four columns</strong> (<code>chr</code>, <code>start1</code>, <code>start2</code>, <code>count</code>).<br>
 Manipulate a <code>HiCExperiment</code> object to coerce it into such structure is straightforward.</p>
 <div class="cell" data-layout-align="center" data-hash="interoperability_cache/html/unnamed-chunk-6_d24ebec732326e797e3e4682b18fdbf7">
 <div class="sourceCode" id="cb5"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="kw"><a href="https://rdrr.io/r/base/library.html">library</a></span><span class="op">(</span><span class="va"><a href="https://dplyr.tidyverse.org">dplyr</a></span><span class="op">)</span></span>
@@ -464,7 +465,7 @@ <h1 class="title">
 </div>
 </section><section id="topdom" class="level2" data-number="9.3"><h2 data-number="9.3" class="anchored" data-anchor-id="topdom">
 <span class="header-section-number">9.3</span> TopDom</h2>
-<p>The <code>TopDom</code> method is widely used to annotate topological domains in genomes from Hi-C data (<span class="citation" data-cites="Shin2016Apr">Shin et al. (<a href="interoperability.html#ref-Shin2016Apr" role="doc-biblioref">2016</a>)</span>). The <code>TopDom</code> package was created to implement this method in <code>R</code> (<span class="citation" data-cites="BibEntry2021May">Bengtsson et al. (<a href="interoperability.html#ref-BibEntry2021May" role="doc-biblioref">2020</a>)</span>).</p>
+<p>The <code>TopDom</code> method is widely used to annotate topological domains in genomes from Hi-C data (<span class="citation" data-cites="Shin_2016">(<a href="interoperability.html#ref-Shin_2016" role="doc-biblioref"><strong>Shin_2016?</strong></a>)</span>). The <code>TopDom</code> package was created to implement this method in <code>R</code> (<span class="citation" data-cites="Bengtsson_2020">Bengtsson et al. (<a href="interoperability.html#ref-Bengtsson_2020" role="doc-biblioref">2020</a>)</span>).</p>
 <p>Unfortunately, the format of the input to <code>TopDom</code> is rather tricky (see <code><a href="https://rdrr.io/pkg/TopDom/man/readHiC.html">?TopDom::readHiC</a></code>). The following chunk of code shows how to coerce a <code>HiCExperiment</code> object into a <code>TopDom</code>-compatible object.</p>
 <div class="cell" data-layout-align="center" data-hash="interoperability_cache/html/unnamed-chunk-8_07c490529053a4c73281535a7f52a573">
 <div class="sourceCode" id="cb7"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="kw"><a href="https://rdrr.io/r/base/library.html">library</a></span><span class="op">(</span><span class="va"><a href="https://github.com/HenrikBengtsson/TopDom">TopDom</a></span><span class="op">)</span></span>
@@ -557,7 +558,7 @@ <h1 class="title">
 </div>
 </section><section id="gothic" class="level2" data-number="9.4"><h2 data-number="9.4" class="anchored" data-anchor-id="gothic">
 <span class="header-section-number">9.4</span> GOTHiC</h2>
-<p><code>GOTHiC</code> relies on a cumulative binomial test to detect interactions between distal genomic loci that have significantly more reads than expected by chance in Hi-C experiments (<span class="citation" data-cites="Mifsud2017Apr">Mifsud et al. (<a href="interoperability.html#ref-Mifsud2017Apr" role="doc-biblioref">2017</a>)</span>).</p>
+<p><code>GOTHiC</code> relies on a cumulative binomial test to detect interactions between distal genomic loci that have significantly more reads than expected by chance in Hi-C experiments (<span class="citation" data-cites="Mifsud_2017">Mifsud et al. (<a href="interoperability.html#ref-Mifsud_2017" role="doc-biblioref">2017</a>)</span>).</p>
 <div class="callout callout-style-default callout-important callout-titled">
 <div class="callout-header d-flex align-content-center">
 <div class="callout-icon-container">
@@ -647,7 +648,7 @@ <h1 class="title">
 <span><span class="va">res</span></span>
 <span><span class="co">##  `HiCExperiment` object with 471,364 contacts over 802 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 1000 </span></span>
@@ -704,196 +705,6 @@ <h1 class="title">
 </div>
 <!-- ## HiCDOC -->
 </section><section id="references" class="level2 unnumbered"><h2 class="unnumbered anchored" data-anchor-id="references">References</h2>
-<div id="refs" class="references csl-bib-body hanging-indent" role="list">
-<div id="ref-Abdennur2020Jan" class="csl-entry" role="listitem">
-Abdennur, Nezar, and Leonid A. Mirny. 2020. <span>“<span class="nocase">Cooler: scalable storage for Hi-C data and other
-genomically labeled arrays</span>.”</span> <em>Bioinformatics</em> 36
-(1): 311–16. <a href="https://doi.org/10.1093/bioinformatics/btz540">https://doi.org/10.1093/bioinformatics/btz540</a>.
-</div>
-<div id="ref-Ay2014Feb" class="csl-entry" role="listitem">
-Ay, Ferhat, Timothy L. Bailey, and William Stafford Noble. 2014.
-<span>“<span class="nocase">Statistical confidence estimation for Hi-C
-data reveals regulatory chromatin contacts</span>.”</span> <em>Genome
-Res.</em> 24 (6): 999–1011. <a href="https://doi.org/10.1101/gr.160374.113">https://doi.org/10.1101/gr.160374.113</a>.
-</div>
-<div id="ref-BibEntry2021May" class="csl-entry" role="listitem">
-Bengtsson, Henrik, Hanjun Shin, Harris Lazaris, Gangqing Hu, and
-Xianghong Zhou. 2020. <em>R Package TopDom: An Efficient and
-Deterministic Method for Identifying Topological Domains in
-Genomes</em>. <a href="https://github.com/HenrikBengtsson/TopDom">https://github.com/HenrikBengtsson/TopDom</a>.
-</div>
-<div id="ref-Cournac2012Dec" class="csl-entry" role="listitem">
-Cournac, Axel, Hervé Marie-Nelly, Martial Marbouty, Romain Koszul, and
-Julien Mozziconacci. 2012. <span>“<span class="nocase">Normalization of
-a chromosomal contact map</span>.”</span> <em>BMC Genomics</em> 13 (1):
-1–13. <a href="https://doi.org/10.1186/1471-2164-13-436">https://doi.org/10.1186/1471-2164-13-436</a>.
-</div>
-<div id="ref-Crane2015Jul" class="csl-entry" role="listitem">
-Crane, Emily, Qian Bian, Rachel Patton McCord, Bryan R. Lajoie, Bayly S.
-Wheeler, Edward J. Ralston, Satoru Uzawa, Job Dekker, and Barbara J.
-Meyer. 2015. <span>“<span class="nocase">Condensin-driven remodelling of
-X chromosome topology during dosage compensation</span>.”</span>
-<em>Nature</em> 523 (July): 240–44. <a href="https://doi.org/10.1038/nature14450">https://doi.org/10.1038/nature14450</a>.
-</div>
-<div id="ref-Dekker2002Feb" class="csl-entry" role="listitem">
-Dekker, Job, Karsten Rippe, Martijn Dekker, and Nancy Kleckner. 2002.
-<span>“<span>Capturing Chromosome Conformation</span>.”</span>
-<em>Science</em> 295 (5558): 1306–11. <a href="https://doi.org/10.1126/science.1067799">https://doi.org/10.1126/science.1067799</a>.
-</div>
-<div id="ref-Deshpande2022Oct" class="csl-entry" role="listitem">
-Deshpande, Aditya S., Netha Ulahannan, Matthew Pendleton, Xiaoguang Dai,
-Lynn Ly, Julie M. Behr, Stefan Schwenk, et al. 2022. <span>“<span class="nocase">Identifying synergistic high-order 3D chromatin
-conformations from genome-scale nanopore concatemer
-sequencing</span>.”</span> <em>Nat. Biotechnol.</em> 40 (October):
-1488–99. <a href="https://doi.org/10.1038/s41587-022-01289-z">https://doi.org/10.1038/s41587-022-01289-z</a>.
-</div>
-<div id="ref-Duan2010May" class="csl-entry" role="listitem">
-Duan, Zhijun, Mirela Andronescu, Kevin Schutz, Sean McIlwain, Yoo Jung
-Kim, Choli Lee, Jay Shendure, Stanley Fields, C. Anthony Blau, and
-William S. Noble. 2010. <span>“<span class="nocase">A three-dimensional
-model of the yeast genome</span>.”</span> <em>Nature</em> 465 (May):
-363–67. <a href="https://doi.org/10.1038/nature08973">https://doi.org/10.1038/nature08973</a>.
-</div>
-<div id="ref-Durand2016Jul" class="csl-entry" role="listitem">
-Durand, Neva C., Muhammad S. Shamim, Ido Machol, Suhas S. P. Rao, Miriam
-H. Huntley, Eric S. Lander, and Erez Lieberman Aiden. 2016. <span>“<span class="nocase">Juicer provides a one-click system for analyzing
-loop-resolution Hi-C experiments</span>.”</span> <em>Cell Systems</em> 3
-(1): 95. <a href="https://doi.org/10.1016/j.cels.2016.07.002">https://doi.org/10.1016/j.cels.2016.07.002</a>.
-</div>
-<div id="ref-Gibcus2018Feb" class="csl-entry" role="listitem">
-Gibcus, Johan H., Kumiko Samejima, Anton Goloborodko, Itaru Samejima,
-Natalia Naumova, Johannes Nuebler, Masato T. Kanemaki, et al. 2018.
-<span>“<span class="nocase">A pathway for mitotic chromosome
-formation</span>.”</span> <em>Science</em> 359 (6376): eaao6135. <a href="https://doi.org/10.1126/science.aao6135">https://doi.org/10.1126/science.aao6135</a>.
-</div>
-<div id="ref-Huber2015Feb" class="csl-entry" role="listitem">
-Huber, Wolfgang, Vincent J. Carey, Robert Gentleman, Simon Anders, Marc
-Carlson, Benilton S. Carvalho, Hector Corrada Bravo, et al. 2015.
-<span>“<span class="nocase">Orchestrating high-throughput genomic
-analysis with Bioconductor</span>.”</span> <em>Nat. Methods</em> 12 (2):
-115–21. <a href="https://doi.org/10.1038/nmeth.3252">https://doi.org/10.1038/nmeth.3252</a>.
-</div>
-<div id="ref-Imakaev2012Oct" class="csl-entry" role="listitem">
-Imakaev, Maxim, Geoffrey Fudenberg, Rachel Patton McCord, Natalia
-Naumova, Anton Goloborodko, Bryan R. Lajoie, Job Dekker, and Leonid A.
-Mirny. 2012. <span>“<span class="nocase">Iterative correction of Hi-C
-data reveals hallmarks of chromosome organization</span>.”</span>
-<em>Nat. Methods</em> 9 (October): 999–1003. <a href="https://doi.org/10.1038/nmeth.2148">https://doi.org/10.1038/nmeth.2148</a>.
-</div>
-<div id="ref-J.O.2017Jan" class="csl-entry" role="listitem">
-J. O., Davies, Oudelaar A. M., Higgs D. R., and Hughes J. R. 2017.
-<span>“<span class="nocase">How best to identify chromosomal
-interactions: a comparison of approaches.</span>”</span> <em>Nat.
-Methods</em> 14 (2): 125–34. <a href="https://doi.org/10.1038/nmeth.4146">https://doi.org/10.1038/nmeth.4146</a>.
-</div>
-<div id="ref-Krietenstein2020May" class="csl-entry" role="listitem">
-Krietenstein, Nils, Sameer Abraham, Sergey V. Venev, Nezar Abdennur,
-Johan Gibcus, Tsung-Han S. Hsieh, Krishna Mohan Parsi, et al. 2020.
-<span>“<span class="nocase">Ultrastructural Details of Mammalian
-Chromosome Architecture</span>.”</span> <em>Mol. Cell</em> 78 (3):
-554–565.e7. <a href="https://doi.org/10.1016/j.molcel.2020.03.003">https://doi.org/10.1016/j.molcel.2020.03.003</a>.
-</div>
-<div id="ref-Krismer2020Apr" class="csl-entry" role="listitem">
-Krismer, Konstantin, Yuchun Guo, and David K. Gifford. 2020.
-<span>“<span class="nocase">IDR2D identifies reproducible genomic
-interactions</span>.”</span> <em>Nucleic Acids Res.</em> 48 (6): e31. <a href="https://doi.org/10.1093/nar/gkaa030">https://doi.org/10.1093/nar/gkaa030</a>.
-</div>
-<div id="ref-Lieberman-Aiden2009Oct" class="csl-entry" role="listitem">
-Lieberman-Aiden, Erez, Nynke L. van Berkum, Louise Williams, Maxim
-Imakaev, Tobias Ragoczy, Agnes Telling, Ido Amit, et al. 2009.
-<span>“<span class="nocase">Comprehensive mapping of long range
-interactions reveals folding principles of the human
-genome</span>.”</span> <em>Science (New York, N.Y.)</em> 326 (5950):
-289. <a href="https://doi.org/10.1126/science.1181369">https://doi.org/10.1126/science.1181369</a>.
-</div>
-<div id="ref-Lioy2018Feb" class="csl-entry" role="listitem">
-Lioy, Virginia S., Axel Cournac, Martial Marbouty, Stéphane Duigou,
-Julien Mozziconacci, Olivier Espéli, Frédéric Boccard, and Romain
-Koszul. 2018. <span>“<span class="nocase">Multiscale Structuring of the
-E. coli Chromosome by Nucleoid-Associated and Condensin
-Proteins</span>.”</span> <em>Cell</em> 172 (4): 771–78318. <a href="https://doi.org/10.1016/j.cell.2017.12.027">https://doi.org/10.1016/j.cell.2017.12.027</a>.
-</div>
-<div id="ref-Lun2016May" class="csl-entry" role="listitem">
-Lun, Aaron T. L., Malcolm Perry, and Elizabeth Ing-Simmons. 2016.
-<span>“<span class="nocase">Infrastructure for genomic interactions:
-Bioconductor classes for Hi-C, ChIA-PET and related
-experiments</span>.”</span> <em>F1000Research</em> 5 (May): 950. <a href="https://doi.org/10.12688/f1000research.8759.2">https://doi.org/10.12688/f1000research.8759.2</a>.
-</div>
-<div id="ref-MattheyDoret2020Nov" class="csl-entry" role="listitem">
-Matthey-Doret, Cyril, Lyam Baudry, Axel Breuer, Rémi Montagne, Nadège
-Guiglielmoni, Vittore Scolari, Etienne Jean, et al. 2020. <span>“<span class="nocase">Computer vision for pattern detection in chromosome
-contact maps</span>.”</span> <em>Nat. Commun.</em> 11 (5795): 1–11. <a href="https://doi.org/10.1038/s41467-020-19562-7">https://doi.org/10.1038/s41467-020-19562-7</a>.
-</div>
-<div id="ref-Mifsud2017Apr" class="csl-entry" role="listitem">
-Mifsud, Borbala, Inigo Martincorena, Elodie Darbo, Robert Sugar, Stefan
-Schoenfelder, Peter Fraser, and Nicholas M. Luscombe. 2017. <span>“<span class="nocase">GOTHiC, a probabilistic model to resolve complex biases
-and to identify real interactions in Hi-C data</span>.”</span> <em>PLoS
-One</em> 12 (4): e0174744. <a href="https://doi.org/10.1371/journal.pone.0174744">https://doi.org/10.1371/journal.pone.0174744</a>.
-</div>
-<div id="ref-Open2C2023Jan" class="csl-entry" role="listitem">
-Open2C, Nezar Abdennur, Geoffrey Fudenberg, Ilya M. Flyamer, Aleksandra
-A. Galitsyna, Anton Goloborodko, Maxim Imakaev, and Sergey V. Venev.
-2023. <span>“<span class="nocase">Pairtools: from sequencing data to
-chromosome contacts</span>.”</span> <em>bioRxiv</em>, February,
-2023.02.13.528389. <a href="https://doi.org/10.1101/2023.02.13.528389">https://doi.org/10.1101/2023.02.13.528389</a>.
-</div>
-<div id="ref-Pope2014Nov" class="csl-entry" role="listitem">
-Pope, Benjamin D., Tyrone Ryba, Vishnu Dileep, Feng Yue, Weisheng Wu,
-Olgert Denas, Daniel L. Vera, et al. 2014. <span>“<span class="nocase">Topologically associating domains are stable units of
-replication-timing regulation</span>.”</span> <em>Nature</em> 515
-(7527): 402–5. <a href="https://doi.org/10.1038/nature13986">https://doi.org/10.1038/nature13986</a>.
-</div>
-<div id="ref-Schmitt2016Nov" class="csl-entry" role="listitem">
-Schmitt, Anthony D., Ming Hu, Inkyung Jung, Zheng Xu, Yunjiang Qiu,
-Catherine L. Tan, Yun Li, et al. 2016. <span>“<span class="nocase">A
-Compendium of Chromatin Contact Maps Reveals Spatially Active Regions in
-the Human Genome</span>.”</span> <em>Cell Rep.</em> 17 (8): 2042–59. <a href="https://doi.org/10.1016/j.celrep.2016.10.061">https://doi.org/10.1016/j.celrep.2016.10.061</a>.
-</div>
-<div id="ref-Sefer2022Dec" class="csl-entry" role="listitem">
-Sefer, Emre. 2022. <span>“<span class="nocase">A comparison of
-topologically associating domain callers over mammals at high
-resolution</span>.”</span> <em>BMC Bioinf.</em> 23 (1): 1–39. <a href="https://doi.org/10.1186/s12859-022-04674-2">https://doi.org/10.1186/s12859-022-04674-2</a>.
-</div>
-<div id="ref-Servant2015Dec" class="csl-entry" role="listitem">
-Servant, Nicolas, Nelle Varoquaux, Bryan R. Lajoie, Eric Viara,
-Chong-Jian Chen, Jean-Philippe Vert, Edith Heard, Job Dekker, and
-Emmanuel Barillot. 2015. <span>“<span class="nocase">HiC-Pro: an
-optimized and flexible pipeline for Hi-C data processing</span>.”</span>
-<em>Genome Biol.</em> 16 (1): 1–11. <a href="https://doi.org/10.1186/s13059-015-0831-x">https://doi.org/10.1186/s13059-015-0831-x</a>.
-</div>
-<div id="ref-Shin2016Apr" class="csl-entry" role="listitem">
-Shin, Hanjun, Yi Shi, Chao Dai, Harianto Tjong, Ke Gong, Frank Alber,
-and Xianghong Jasmine Zhou. 2016. <span>“TopDom: An Efficient and
-Deterministic Method for Identifying Topological Domains in
-Genomes.”</span> <em>Nucleic Acids Research</em> 44 (7): e70. <a href="https://doi.org/10.1093/nar/gkv1505">https://doi.org/10.1093/nar/gkv1505</a>.
-</div>
-<div id="ref-Stadhouders2019May" class="csl-entry" role="listitem">
-Stadhouders, Ralph, Guillaume J. Filion, and Thomas Graf. 2019.
-<span>“<span class="nocase">Transcription factors and 3D genome
-conformation in cell-fate decisions</span>.”</span> <em>Nature</em> 569
-(7756): 345–54. <a href="https://doi.org/10.1038/s41586-019-1182-7">https://doi.org/10.1038/s41586-019-1182-7</a>.
-</div>
-<div id="ref-Stansfield2019Sep" class="csl-entry" role="listitem">
-Stansfield, John C., Kellen G. Cresswell, and Mikhail G. Dozmorov. 2019.
-<span>“<span class="nocase">multiHiCcompare: joint normalization and
-comparative analysis of complex Hi-C experiments</span>.”</span>
-<em>Bioinformatics</em> 35 (17): 2916–23. <a href="https://doi.org/10.1093/bioinformatics/btz048">https://doi.org/10.1093/bioinformatics/btz048</a>.
-</div>
-<div id="ref-Tavares-Cadete2020Dec" class="csl-entry" role="listitem">
-Tavares-Cadete, Filipe, Davood Norouzi, Bastiaan Dekker, Yu Liu, and Job
-Dekker. 2020. <span>“<span class="nocase">Multi-contact 3C reveals that
-the human genome during interphase is largely not
-entangled</span>.”</span> <em>Nat. Struct. Mol. Biol.</em> 27
-(December): 1105–14. <a href="https://doi.org/10.1038/s41594-020-0506-5">https://doi.org/10.1038/s41594-020-0506-5</a>.
-</div>
-<div id="ref-Yang2017Nov" class="csl-entry" role="listitem">
-Yang, Tao, Feipeng Zhang, Galip Gürkan Yardımcı, Fan Song, Ross C.
-Hardison, William Stafford Noble, Feng Yue, and Qunhua Li. 2017.
-<span>“<span class="nocase">HiCRep: assessing the reproducibility of
-Hi-C data using a stratum-adjusted correlation
-coefficient</span>.”</span> <em>Genome Res.</em> 27 (11): 1939–49. <a href="https://doi.org/10.1101/gr.220640.117">https://doi.org/10.1101/gr.220640.117</a>.
-</div>
-</div>
 </section><section id="session-info" class="level2 unnumbered"><h2 class="unnumbered anchored" data-anchor-id="session-info">Session info</h2>
 <div class="cell" data-layout-align="center" data-hash="interoperability_cache/html/unnamed-chunk-13_a84c50b6e020bde5c90193c539123c78">
 <pre><code>##  ─ Session info ────────────────────────────────────────────────────────────
@@ -906,7 +717,7 @@ <h1 class="title">
 ##   collate  en_US.UTF-8
 ##   ctype    en_US.UTF-8
 ##   tz       Etc/UTC
-##   date     2023-09-25
+##   date     2023-10-19
 ##   pandoc   3.1.1 @ /usr/local/bin/ (via rmarkdown)
 ##  
 ##  ─ Packages ────────────────────────────────────────────────────────────────
@@ -933,26 +744,26 @@ <h1 class="title">
 ##   codetools                0.2-19    2023-02-01 [2] CRAN (R 4.3.1)
 ##   colorspace               2.1-0     2023-01-23 [1] CRAN (R 4.3.1)
 ##   crayon                   1.5.2     2022-09-29 [1] CRAN (R 4.3.1)
-##   curl                     5.0.2     2023-08-14 [1] CRAN (R 4.3.1)
+##   curl                     5.1.0     2023-10-02 [1] CRAN (R 4.3.1)
 ##   data.table               1.14.8    2023-02-17 [1] CRAN (R 4.3.1)
 ##   DBI                      1.1.3     2022-06-18 [1] CRAN (R 4.3.1)
-##   dbplyr                 * 2.3.3     2023-07-07 [1] CRAN (R 4.3.1)
+##   dbplyr                 * 2.3.4     2023-09-26 [1] CRAN (R 4.3.1)
 ##   DelayedArray             0.27.10   2023-07-28 [1] Bioconductor
 ##   digest                   0.6.33    2023-07-07 [1] CRAN (R 4.3.1)
 ##   dplyr                  * 1.1.3     2023-09-03 [1] CRAN (R 4.3.1)
-##   edgeR                    3.43.8    2023-08-02 [1] Bioconductor
+##   edgeR                    3.99.3    2023-10-16 [1] Bioconductor
 ##   ellipsis                 0.3.2     2021-04-29 [1] CRAN (R 4.3.1)
-##   evaluate                 0.21      2023-05-05 [1] CRAN (R 4.3.1)
+##   evaluate                 0.22      2023-09-29 [1] CRAN (R 4.3.1)
 ##   ExperimentHub          * 2.9.1     2023-07-12 [1] Bioconductor
-##   fansi                    1.0.4     2023-01-22 [1] CRAN (R 4.3.1)
+##   fansi                    1.0.5     2023-10-08 [1] CRAN (R 4.3.1)
 ##   fastmap                  1.1.1     2023-02-24 [1] CRAN (R 4.3.1)
 ##   filelock                 1.0.2     2018-10-05 [1] CRAN (R 4.3.1)
 ##   generics                 0.1.3     2022-07-05 [1] CRAN (R 4.3.1)
-##   GenomeInfoDb           * 1.37.4    2023-09-07 [1] Bioconductor
-##   GenomeInfoDbData         1.2.10    2023-09-25 [1] Bioconductor
+##   GenomeInfoDb           * 1.37.6    2023-10-02 [1] Bioconductor
+##   GenomeInfoDbData         1.2.11    2023-10-19 [1] Bioconductor
 ##   GenomicAlignments        1.37.0    2023-04-25 [1] Bioconductor
-##   GenomicRanges          * 1.53.1    2023-05-04 [1] Bioconductor
-##   ggplot2                * 3.4.3     2023-08-14 [1] CRAN (R 4.3.1)
+##   GenomicRanges          * 1.53.2    2023-10-08 [1] Bioconductor
+##   ggplot2                * 3.4.4     2023-10-12 [1] CRAN (R 4.3.1)
 ##   glue                     1.6.2     2022-02-24 [1] CRAN (R 4.3.1)
 ##   gridExtra                2.3       2017-09-09 [1] CRAN (R 4.3.1)
 ##   gtable                   0.3.4     2023-08-21 [1] CRAN (R 4.3.1)
@@ -960,22 +771,22 @@ <h1 class="title">
 ##   HiCcompare               1.23.1    2023-06-04 [1] Bioconductor
 ##   HiCExperiment          * 1.1.2     2023-09-04 [1] Bioconductor
 ##   HiContactsData         * 1.3.0     2023-04-27 [1] Bioconductor
-##   hicrep                 * 1.12.2    2023-09-25 [1] Github (TaoYang-dev/hicrep@e485dfa)
-##   htmltools                0.5.6     2023-08-10 [1] CRAN (R 4.3.1)
+##   hicrep                 * 1.12.2    2023-10-19 [1] Github (TaoYang-dev/hicrep@e485dfa)
+##   htmltools                0.5.6.1   2023-10-06 [1] CRAN (R 4.3.1)
 ##   htmlwidgets              1.6.2     2023-03-17 [1] CRAN (R 4.3.1)
 ##   httpuv                   1.6.11    2023-05-11 [1] CRAN (R 4.3.1)
 ##   httr                     1.4.7     2023-08-15 [1] CRAN (R 4.3.1)
 ##   InteractionSet         * 1.29.1    2023-06-14 [1] Bioconductor
 ##   interactiveDisplayBase   1.39.0    2023-04-25 [1] Bioconductor
-##   IRanges                * 2.35.2    2023-06-22 [1] Bioconductor
+##   IRanges                * 2.35.3    2023-10-12 [1] Bioconductor
 ##   jsonlite                 1.8.7     2023-06-29 [1] CRAN (R 4.3.1)
-##   KEGGREST                 1.41.0    2023-04-25 [1] Bioconductor
+##   KEGGREST                 1.41.4    2023-09-25 [1] Bioconductor
 ##   KernSmooth               2.23-22   2023-07-10 [1] CRAN (R 4.3.1)
 ##   knitr                    1.44      2023-09-11 [1] CRAN (R 4.3.1)
 ##   later                    1.3.1     2023-05-02 [1] CRAN (R 4.3.1)
-##   lattice                  0.21-8    2023-04-05 [2] CRAN (R 4.3.1)
+##   lattice                  0.21-9    2023-10-01 [1] CRAN (R 4.3.1)
 ##   lifecycle                1.0.3     2022-10-07 [1] CRAN (R 4.3.1)
-##   limma                    3.57.7    2023-08-02 [1] Bioconductor
+##   limma                    3.57.10   2023-10-11 [1] Bioconductor
 ##   locfit                   1.5-9.8   2023-06-11 [1] CRAN (R 4.3.1)
 ##   magrittr                 2.0.3     2022-03-30 [1] CRAN (R 4.3.1)
 ##   MASS                     7.3-60    2023-05-04 [2] CRAN (R 4.3.1)
@@ -992,7 +803,7 @@ <h1 class="title">
 ##   pheatmap                 1.0.12    2019-01-04 [1] CRAN (R 4.3.1)
 ##   pillar                   1.9.0     2023-03-22 [1] CRAN (R 4.3.1)
 ##   pkgconfig                2.0.3     2019-09-22 [1] CRAN (R 4.3.1)
-##   plyr                     1.8.8     2022-11-11 [1] CRAN (R 4.3.1)
+##   plyr                     1.8.9     2023-10-02 [1] CRAN (R 4.3.1)
 ##   png                      0.1-8     2022-11-29 [1] CRAN (R 4.3.1)
 ##   promises                 1.2.1     2023-08-10 [1] CRAN (R 4.3.1)
 ##   purrr                  * 1.0.2     2023-08-10 [1] CRAN (R 4.3.1)
@@ -1015,10 +826,10 @@ <h1 class="title">
 ##   rstudioapi               0.15.0    2023-07-07 [1] CRAN (R 4.3.1)
 ##   rtracklayer              1.61.1    2023-08-15 [1] Bioconductor
 ##   S4Arrays                 1.1.6     2023-08-30 [1] Bioconductor
-##   S4Vectors              * 0.39.2    2023-09-22 [1] Bioconductor
+##   S4Vectors              * 0.39.3    2023-10-11 [1] Bioconductor
 ##   scales                   1.2.1     2022-08-20 [1] CRAN (R 4.3.1)
 ##   sessioninfo              1.2.2     2021-12-06 [1] CRAN (R 4.3.1)
-##   shiny                    1.7.5     2023-08-12 [1] CRAN (R 4.3.1)
+##   shiny                    1.7.5.1   2023-10-14 [1] CRAN (R 4.3.1)
 ##   SparseArray              1.1.12    2023-08-31 [1] Bioconductor
 ##   statmod                  1.5.0     2023-01-06 [1] CRAN (R 4.3.1)
 ##   strawr                   0.0.91    2023-03-29 [1] CRAN (R 4.3.1)
@@ -1031,9 +842,9 @@ <h1 class="title">
 ##   TopDom                 * 0.10.1    2021-05-06 [1] CRAN (R 4.3.1)
 ##   tzdb                     0.4.0     2023-05-12 [1] CRAN (R 4.3.1)
 ##   utf8                     1.2.3     2023-01-31 [1] CRAN (R 4.3.1)
-##   vctrs                    0.6.3     2023-06-14 [1] CRAN (R 4.3.1)
-##   vroom                    1.6.3     2023-04-28 [1] CRAN (R 4.3.1)
-##   withr                    2.5.0     2022-03-03 [1] CRAN (R 4.3.1)
+##   vctrs                    0.6.4     2023-10-12 [1] CRAN (R 4.3.1)
+##   vroom                    1.6.4     2023-10-02 [1] CRAN (R 4.3.1)
+##   withr                    2.5.1     2023-09-26 [1] CRAN (R 4.3.1)
 ##   xfun                     0.40      2023-08-09 [1] CRAN (R 4.3.1)
 ##   XML                      3.99-0.14 2023-03-19 [1] CRAN (R 4.3.1)
 ##   xtable                   1.8-4     2019-04-21 [1] CRAN (R 4.3.1)
@@ -1048,6 +859,186 @@ <h1 class="title">
 </div>
 
 
+</section><section id="bibliography" class="level1 unnumbered"><h1 class="unnumbered">References</h1>
+<div id="refs" class="references csl-bib-body hanging-indent" data-line-spacing="2" role="list">
+<div id="ref-Abdennur_2019" class="csl-entry" role="listitem">
+Abdennur, N., &amp; Mirny, L. A. (2019). Cooler: Scalable storage for
+hi-c data and other genomically labeled arrays. <em>Bioinformatics</em>,
+<em>36</em>(1), 311–316. <a href="https://doi.org/10.1093/bioinformatics/btz540">https://doi.org/10.1093/bioinformatics/btz540</a>
+</div>
+<div id="ref-Ay_2014" class="csl-entry" role="listitem">
+Ay, F., Bailey, T. L., &amp; Noble, W. S. (2014). Statistical confidence
+estimation for hi-c data reveals regulatory chromatin contacts.
+<em>Genome Research</em>, <em>24</em>(6), 999–1011. <a href="https://doi.org/10.1101/gr.160374.113">https://doi.org/10.1101/gr.160374.113</a>
+</div>
+<div id="ref-Bengtsson_2020" class="csl-entry" role="listitem">
+Bengtsson, H., Shin, H., Lazaris, H., Hu, G., &amp; Zhou, X. (2020).
+<em>R package TopDom: An efficient and deterministic method for
+identifying topological domains in genomes</em>. <a href="https://github.com/HenrikBengtsson/TopDom">https://github.com/HenrikBengtsson/TopDom</a>
+</div>
+<div id="ref-Cournac_2012" class="csl-entry" role="listitem">
+Cournac, A., Marie-Nelly, H., Marbouty, M., Koszul, R., &amp;
+Mozziconacci, J. (2012). Normalization of a chromosomal contact map.
+<em><span>BMC</span> Genomics</em>, <em>13</em>(1). <a href="https://doi.org/10.1186/1471-2164-13-436">https://doi.org/10.1186/1471-2164-13-436</a>
+</div>
+<div id="ref-Crane_2015" class="csl-entry" role="listitem">
+Crane, E., Bian, Q., McCord, R. P., Lajoie, B. R., Wheeler, B. S.,
+Ralston, E. J., Uzawa, S., Dekker, J., &amp; Meyer, B. J. (2015).
+Condensin-driven remodelling of x chromosome topology during dosage
+compensation. <em>Nature</em>, <em>523</em>(7559), 240–244. <a href="https://doi.org/10.1038/nature14450">https://doi.org/10.1038/nature14450</a>
+</div>
+<div id="ref-Davies_2017" class="csl-entry" role="listitem">
+Davies, J. O. J., Oudelaar, A. M., Higgs, D. R., &amp; Hughes, J. R.
+(2017). How best to identify chromosomal interactions: A comparison of
+approaches. <em>Nature Methods</em>, <em>14</em>(2), 125–134. <a href="https://doi.org/10.1038/nmeth.4146">https://doi.org/10.1038/nmeth.4146</a>
+</div>
+<div id="ref-Dekker_2002" class="csl-entry" role="listitem">
+Dekker, J., Rippe, K., Dekker, M., &amp; Kleckner, N. (2002). Capturing
+chromosome conformation. <em>Science</em>, <em>295</em>(5558),
+1306–1311. <a href="https://doi.org/10.1126/science.1067799">https://doi.org/10.1126/science.1067799</a>
+</div>
+<div id="ref-Deshpande_2022" class="csl-entry" role="listitem">
+Deshpande, A. S., Ulahannan, N., Pendleton, M., Dai, X., Ly, L., Behr,
+J. M., Schwenk, S., Liao, W., Augello, M. A., Tyer, C., Rughani, P.,
+Kudman, S., Tian, H., Otis, H. G., Adney, E., Wilkes, D., Mosquera, J.
+M., Barbieri, C. E., Melnick, A., … Imieliński, M. (2022). Identifying
+synergistic high-order 3D chromatin conformations from genome-scale
+nanopore concatemer sequencing. <em>Nature Biotechnology</em>,
+<em>40</em>(10), 1488–1499. <a href="https://doi.org/10.1038/s41587-022-01289-z">https://doi.org/10.1038/s41587-022-01289-z</a>
+</div>
+<div id="ref-Duan_2010" class="csl-entry" role="listitem">
+Duan, Z., Andronescu, M., Schutz, K., McIlwain, S., Kim, Y. J., Lee, C.,
+Shendure, J., Fields, S., Blau, C. A., &amp; Noble, W. S. (2010). A
+three-dimensional model of the yeast genome. <em>Nature</em>,
+<em>465</em>(7296), 363–367. <a href="https://doi.org/10.1038/nature08973">https://doi.org/10.1038/nature08973</a>
+</div>
+<div id="ref-Durand_2016" class="csl-entry" role="listitem">
+Durand, N. C., Shamim, M. S., Machol, I., Rao, S. S. P., Huntley, M. H.,
+Lander, E. S., &amp; Aiden, E. L. (2016). Juicer provides a one-click
+system for analyzing loop-resolution hi-c experiments. <em>Cell
+Systems</em>, <em>3</em>(1), 95–98. <a href="https://doi.org/10.1016/j.cels.2016.07.002">https://doi.org/10.1016/j.cels.2016.07.002</a>
+</div>
+<div id="ref-Gibcus_2018" class="csl-entry" role="listitem">
+Gibcus, J. H., Samejima, K., Goloborodko, A., Samejima, I., Naumova, N.,
+Nuebler, J., Kanemaki, M. T., Xie, L., Paulson, J. R., Earnshaw, W. C.,
+Mirny, L. A., &amp; Dekker, J. (2018). A pathway for mitotic chromosome
+formation. <em>Science</em>, <em>359</em>(6376). <a href="https://doi.org/10.1126/science.aao6135">https://doi.org/10.1126/science.aao6135</a>
+</div>
+<div id="ref-Huber_2015" class="csl-entry" role="listitem">
+Huber, W., Carey, V. J., Gentleman, R., Anders, S., Carlson, M.,
+Carvalho, B. S., Bravo, H. C., Davis, S., Gatto, L., Girke, T.,
+Gottardo, R., Hahne, F., Hansen, K. D., Irizarry, R. A., Lawrence, M.,
+Love, M. I., MacDonald, J., Obenchain, V., Oleś, A. K., … Morgan, M.
+(2015). Orchestrating high-throughput genomic analysis with
+bioconductor. <em>Nature Methods</em>, <em>12</em>(2), 115–121. <a href="https://doi.org/10.1038/nmeth.3252">https://doi.org/10.1038/nmeth.3252</a>
+</div>
+<div id="ref-Imakaev_2012" class="csl-entry" role="listitem">
+Imakaev, M., Fudenberg, G., McCord, R. P., Naumova, N., Goloborodko, A.,
+Lajoie, B. R., Dekker, J., &amp; Mirny, L. A. (2012). Iterative
+correction of hi-c data reveals hallmarks of chromosome organization.
+<em>Nature Methods</em>, <em>9</em>(10), 999–1003. <a href="https://doi.org/10.1038/nmeth.2148">https://doi.org/10.1038/nmeth.2148</a>
+</div>
+<div id="ref-Krietenstein_2020" class="csl-entry" role="listitem">
+Krietenstein, N., Abraham, S., Venev, S. V., Abdennur, N., Gibcus, J.,
+Hsieh, T.-H. S., Parsi, K. M., Yang, L., Maehr, R., Mirny, L. A.,
+Dekker, J., &amp; Rando, O. J. (2020). Ultrastructural details of
+mammalian chromosome architecture. <em>Molecular Cell</em>,
+<em>78</em>(3), 554–565.e7. <a href="https://doi.org/10.1016/j.molcel.2020.03.003">https://doi.org/10.1016/j.molcel.2020.03.003</a>
+</div>
+<div id="ref-Krismer_2020" class="csl-entry" role="listitem">
+Krismer, K., Guo, Y., &amp; Gifford, D. K. (2020). <span>IDR</span>2D
+identifies reproducible genomic interactions. <em>Nucleic Acids
+Research</em>, <em>48</em>(6), e31–e31. <a href="https://doi.org/10.1093/nar/gkaa030">https://doi.org/10.1093/nar/gkaa030</a>
+</div>
+<div id="ref-Lieberman_Aiden_2009" class="csl-entry" role="listitem">
+Lieberman-Aiden, E., Berkum, N. L. van, Williams, L., Imakaev, M.,
+Ragoczy, T., Telling, A., Amit, I., Lajoie, B. R., Sabo, P. J.,
+Dorschner, M. O., Sandstrom, R., Bernstein, B., Bender, M. A., Groudine,
+M., Gnirke, A., Stamatoyannopoulos, J., Mirny, L. A., Lander, E. S.,
+&amp; Dekker, J. (2009). Comprehensive mapping of long-range
+interactions reveals folding principles of the human genome.
+<em>Science</em>, <em>326</em>(5950), 289–293. <a href="https://doi.org/10.1126/science.1181369">https://doi.org/10.1126/science.1181369</a>
+</div>
+<div id="ref-Lioy_2018" class="csl-entry" role="listitem">
+Lioy, V. S., Cournac, A., Marbouty, M., Duigou, S., Mozziconacci, J.,
+Espéli, O., Boccard, F., &amp; Koszul, R. (2018). Multiscale structuring
+of the e.&nbsp;Coli chromosome by nucleoid-associated and condensin proteins.
+<em>Cell</em>, <em>172</em>(4), 771–783.e18. <a href="https://doi.org/10.1016/j.cell.2017.12.027">https://doi.org/10.1016/j.cell.2017.12.027</a>
+</div>
+<div id="ref-Lun_2016" class="csl-entry" role="listitem">
+Lun, A. T. L., Perry, M., &amp; Ing-Simmons, E. (2016). Infrastructure
+for genomic interactions: Bioconductor classes for hi-c,
+<span>ChIA</span>-<span>PET</span> and related experiments.
+<em>F1000Research</em>, <em>5</em>, 950. <a href="https://doi.org/10.12688/f1000research.8759.2">https://doi.org/10.12688/f1000research.8759.2</a>
+</div>
+<div id="ref-Matthey_Doret_2020" class="csl-entry" role="listitem">
+Matthey-Doret, C., Baudry, L., Breuer, A., Montagne, R., Guiglielmoni,
+N., Scolari, V., Jean, E., Campeas, A., Chanut, P. H., Oriol, E., Méot,
+A., Politis, L., Vigouroux, A., Moreau, P., Koszul, R., &amp; Cournac,
+A. (2020). Computer vision for pattern detection in chromosome contact
+maps. <em>Nature Communications</em>, <em>11</em>(1). <a href="https://doi.org/10.1038/s41467-020-19562-7">https://doi.org/10.1038/s41467-020-19562-7</a>
+</div>
+<div id="ref-Mifsud_2017" class="csl-entry" role="listitem">
+Mifsud, B., Martincorena, I., Darbo, E., Sugar, R., Schoenfelder, S.,
+Fraser, P., &amp; Luscombe, N. M. (2017). <span>GOTHiC</span>, a
+probabilistic model to resolve complex biases and to identify real
+interactions in hi-c data. <em><span>PLOS</span> <span>ONE</span></em>,
+<em>12</em>(4), e0174744. <a href="https://doi.org/10.1371/journal.pone.0174744">https://doi.org/10.1371/journal.pone.0174744</a>
+</div>
+<div id="ref-Open2C_2023" class="csl-entry" role="listitem">
+Open2C, Abdennur, N., Fudenberg, G., Flyamer, I. M., Galitsyna, A. A.,
+Goloborodko, A., Imakaev, M., &amp; Venev, S. V. (2023). <em>Pairtools:
+From sequencing data to chromosome contacts</em>. <a href="https://doi.org/10.1101/2023.02.13.528389">https://doi.org/10.1101/2023.02.13.528389</a>
+</div>
+<div id="ref-Pope_2014" class="csl-entry" role="listitem">
+Pope, B. D., Ryba, T., Dileep, V., Yue, F., Wu, W., Denas, O., Vera, D.
+L., Wang, Y., Hansen, R. S., Canfield, T. K., Thurman, R. E., Cheng, Y.,
+Gülsoy, G., Dennis, J. H., Snyder, M. P., Stamatoyannopoulos, J. A.,
+Taylor, J., Hardison, R. C., Kahveci, T., … Gilbert, D. M. (2014).
+Topologically associating domains are stable units of replication-timing
+regulation. <em>Nature</em>, <em>515</em>(7527), 402–405. <a href="https://doi.org/10.1038/nature13986">https://doi.org/10.1038/nature13986</a>
+</div>
+<div id="ref-Schmitt_2016" class="csl-entry" role="listitem">
+Schmitt, A. D., Hu, M., Jung, I., Xu, Z., Qiu, Y., Tan, C. L., Li, Y.,
+Lin, S., Lin, Y., Barr, C. L., &amp; Ren, B. (2016). A compendium of
+chromatin contact maps reveals spatially active regions in the human
+genome. <em>Cell Reports</em>, <em>17</em>(8), 2042–2059. <a href="https://doi.org/10.1016/j.celrep.2016.10.061">https://doi.org/10.1016/j.celrep.2016.10.061</a>
+</div>
+<div id="ref-Sefer_2022" class="csl-entry" role="listitem">
+Sefer, E. (2022). A comparison of topologically associating domain
+callers over mammals at high resolution. <em><span>BMC</span>
+Bioinformatics</em>, <em>23</em>(1). <a href="https://doi.org/10.1186/s12859-022-04674-2">https://doi.org/10.1186/s12859-022-04674-2</a>
+</div>
+<div id="ref-Servant_2015" class="csl-entry" role="listitem">
+Servant, N., Varoquaux, N., Lajoie, B. R., Viara, E., Chen, C.-J., Vert,
+J.-P., Heard, E., Dekker, J., &amp; Barillot, E. (2015).
+<span>HiC</span>-pro: An optimized and flexible pipeline for hi-c data
+processing. <em>Genome Biology</em>, <em>16</em>(1). <a href="https://doi.org/10.1186/s13059-015-0831-x">https://doi.org/10.1186/s13059-015-0831-x</a>
+</div>
+<div id="ref-Stadhouders_2019" class="csl-entry" role="listitem">
+Stadhouders, R., Filion, G. J., &amp; Graf, T. (2019). Transcription
+factors and 3D genome conformation in cell-fate decisions.
+<em>Nature</em>, <em>569</em>(7756), 345–354. <a href="https://doi.org/10.1038/s41586-019-1182-7">https://doi.org/10.1038/s41586-019-1182-7</a>
+</div>
+<div id="ref-Stansfield_2019" class="csl-entry" role="listitem">
+Stansfield, J. C., Cresswell, K. G., &amp; Dozmorov, M. G. (2019). <span class="nocase">multiHiCcompare</span>: Joint normalization and
+comparative analysis of complex hi-c experiments.
+<em>Bioinformatics</em>, <em>35</em>(17), 2916–2923. <a href="https://doi.org/10.1093/bioinformatics/btz048">https://doi.org/10.1093/bioinformatics/btz048</a>
+</div>
+<div id="ref-Tavares_Cadete_2020" class="csl-entry" role="listitem">
+Tavares-Cadete, F., Norouzi, D., Dekker, B., Liu, Y., &amp; Dekker, J.
+(2020). Multi-contact 3C reveals that the human genome during interphase
+is largely not entangled. <em>Nature Structural
+<span>&amp;</span>Amp<span class="math inline">$\mathsemicolon$</span>
+Molecular Biology</em>, <em>27</em>(12), 1105–1114. <a href="https://doi.org/10.1038/s41594-020-0506-5">https://doi.org/10.1038/s41594-020-0506-5</a>
+</div>
+<div id="ref-Yang_2017" class="csl-entry" role="listitem">
+Yang, T., Zhang, F., Yardımcı, G. G., Song, F., Hardison, R. C., Noble,
+W. S., Yue, F., &amp; Li, Q. (2017). <span>HiCRep</span>: Assessing the
+reproducibility of hi-c data using a stratum-adjusted correlation
+coefficient. <em>Genome Research</em>, <em>27</em>(11), 1939–1949. <a href="https://doi.org/10.1101/gr.220640.117">https://doi.org/10.1101/gr.220640.117</a>
+</div>
+</div>
 </section></main><!-- /main --><script id="quarto-html-after-body" type="application/javascript">
 window.document.addEventListener("DOMContentLoaded", function (event) {
   const toggleBodyColorMode = (bsSheetEl) => {
diff --git a/matrix-centric.html b/matrix-centric.html
index 209c46a..32d5c7c 100644
--- a/matrix-centric.html
+++ b/matrix-centric.html
@@ -381,7 +381,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb2"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 471,364 contacts over 407 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -399,13 +399,13 @@ <h1 class="title">
 <section id="balancing-a-raw-interaction-count-map" class="level3 page-columns page-full" data-number="5.1.1"><h3 data-number="5.1.1" class="anchored" data-anchor-id="balancing-a-raw-interaction-count-map">
 <span class="header-section-number">5.1.1</span> Balancing a raw interaction count map</h3>
 <p>Hi-C sequencing coverage is systematically affected by multiple confounding factors, e.g.&nbsp; density of restriction sites, GC%, genome mappability, etc.. Overall, it generally ends up not homogenous throughout the entire genome and this leads to artifacts in un-normalized <code>count</code> matrices.</p>
-<p>To correct for sequencing coverage heterogeneity of raw <code>count</code> maps, Hi-C data can be normalized using matrix balancing approaches (<span class="citation" data-cites="Cournac2012Dec">Cournac et al. (<a href="interoperability.html#ref-Cournac2012Dec" role="doc-biblioref">2012</a>)</span>, <span class="citation" data-cites="Imakaev2012Oct">Imakaev et al. (<a href="interoperability.html#ref-Imakaev2012Oct" role="doc-biblioref">2012</a>)</span>). This is generally done directly on the disk-stored matrices using out-of-memory strategies (e.g.&nbsp;with <code>cooler balance &lt;.cool&gt;</code>). However, if contact matrix files are imported into a <code>HiCExperiment</code> object but no <code>balanced</code> scores are available, in-memory balancing can be performed using the <code>normalize</code> function.</p>
+<p>To correct for sequencing coverage heterogeneity of raw <code>count</code> maps, Hi-C data can be normalized using matrix balancing approaches (<span class="citation" data-cites="Cournac_2012">Cournac et al. (<a href="interoperability.html#ref-Cournac_2012" role="doc-biblioref">2012</a>)</span>, <span class="citation" data-cites="Imakaev_2012">Imakaev et al. (<a href="interoperability.html#ref-Imakaev_2012" role="doc-biblioref">2012</a>)</span>). This is generally done directly on the disk-stored matrices using out-of-memory strategies (e.g.&nbsp;with <code>cooler balance &lt;.cool&gt;</code>). However, if contact matrix files are imported into a <code>HiCExperiment</code> object but no <code>balanced</code> scores are available, in-memory balancing can be performed using the <code>normalize</code> function.</p>
 <div class="cell" data-layout-align="center" data-hash="matrix-centric_cache/html/unnamed-chunk-4_a8656a185d0ffb0d8086dac30b97c407">
 <div class="sourceCode" id="cb3"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">normalized_hic</span> <span class="op">&lt;-</span> <span class="fu">normalize</span><span class="op">(</span><span class="va">hic</span><span class="op">)</span></span>
 <span><span class="va">normalized_hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 471,364 contacts over 407 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -455,7 +455,7 @@ <h1 class="title">
 <span><span class="va">detrended_hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 471,364 contacts over 407 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -520,7 +520,7 @@ <h1 class="title">
 </div>
 </section><section id="computing-autocorrelated-map" class="level3" data-number="5.1.3"><h3 data-number="5.1.3" class="anchored" data-anchor-id="computing-autocorrelated-map">
 <span class="header-section-number">5.1.3</span> Computing autocorrelated map</h3>
-<p>Correlation matrices are often calculated from balanced Hi-C matrices. For instance, in genomes composed of eu- and heterochromatin, a correlation matrix can be used to reveal a checkerboard pattern emphasizing the segregation of chromatin into two A/B compartments (<span class="citation" data-cites="Lieberman-Aiden2009Oct">Lieberman-Aiden et al. (<a href="interoperability.html#ref-Lieberman-Aiden2009Oct" role="doc-biblioref">2009</a>)</span>).</p>
+<p>Correlation matrices are often calculated from balanced Hi-C matrices. For instance, in genomes composed of eu- and heterochromatin, a correlation matrix can be used to reveal a checkerboard pattern emphasizing the segregation of chromatin into two A/B compartments (<span class="citation" data-cites="Lieberman_Aiden_2009">Lieberman-Aiden et al. (<a href="interoperability.html#ref-Lieberman_Aiden_2009" role="doc-biblioref">2009</a>)</span>).</p>
 <p>The <code>autocorrelate</code> function is used to compute a correlation matrix of a <code>HiCExperiment</code> object. For each pair of interacting loci, the <code>autocorrelated</code> score represents the correlation between their respective interaction profiles with the rest of the genome.</p>
 <div class="cell" data-layout-align="center" data-hash="matrix-centric_cache/html/unnamed-chunk-8_f38822885e3d353465dd7647fa009713">
 <div class="sourceCode" id="cb7"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">autocorr_hic</span> <span class="op">&lt;-</span> <span class="fu">autocorrelate</span><span class="op">(</span><span class="va">hic</span><span class="op">)</span></span>
@@ -528,7 +528,7 @@ <h1 class="title">
 <span><span class="va">autocorr_hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 471,364 contacts over 407 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -555,7 +555,7 @@ <h1 class="title">
 </div>
 </div>
 <div class="callout-body-container callout-body">
-<p>Here we have illustrated how to compute an autocorrelation matrix from a <code>HiCExperiment</code> object using the example <strong>yeast</strong> Hi-C experiment. Bear in mind that this is unusual and not very useful, as yeast chromatin is not segregated in two compartments but rather follows a Rabl conformation (<span class="citation" data-cites="Duan2010May">Duan et al. (<a href="interoperability.html#ref-Duan2010May" role="doc-biblioref">2010</a>)</span>). An example of autocorrelation map from a vertebrate Hi-C experiment (for which chromatin is segregated in A/B compartments) is shown in <a href="./workflow-chicken.html">Chapter 10</a>.</p>
+<p>Here we have illustrated how to compute an autocorrelation matrix from a <code>HiCExperiment</code> object using the example <strong>yeast</strong> Hi-C experiment. Bear in mind that this is unusual and not very useful, as yeast chromatin is not segregated in two compartments but rather follows a Rabl conformation (<span class="citation" data-cites="Duan_2010">Duan et al. (<a href="interoperability.html#ref-Duan_2010" role="doc-biblioref">2010</a>)</span>). An example of autocorrelation map from a vertebrate Hi-C experiment (for which chromatin is segregated in A/B compartments) is shown in <a href="./workflow-chicken.html">Chapter 10</a>.</p>
 </div>
 </div>
 <div class="cell" data-layout-align="center" data-hash="matrix-centric_cache/html/unnamed-chunk-10_1144d89783a617fb804b5399c35e60b9">
@@ -600,7 +600,7 @@ <h1 class="title">
 <span><span class="va">hic2</span></span>
 <span><span class="co">##  `HiCExperiment` object with 168,785 contacts over 150 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:400,000-700,000" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -720,18 +720,18 @@ <h1 class="title">
 
 
 </section></section><section id="bibliography" class="level1 unnumbered"><h1 class="unnumbered">References</h1>
-<div id="refs" class="references csl-bib-body hanging-indent" role="list" style="display: none">
-<div id="ref-Cournac2012Dec" class="csl-entry" role="listitem">
-Cournac, Axel, Hervé Marie-Nelly, Martial Marbouty, Romain Koszul, and Julien Mozziconacci. 2012. <span>“<span class="nocase">Normalization of a chromosomal contact map</span>.”</span> <em>BMC Genomics</em> 13 (1): 1–13. <a href="https://doi.org/10.1186/1471-2164-13-436">https://doi.org/10.1186/1471-2164-13-436</a>.
+<div id="refs" class="references csl-bib-body hanging-indent" data-line-spacing="2" role="list" style="display: none">
+<div id="ref-Cournac_2012" class="csl-entry" role="listitem">
+Cournac, A., Marie-Nelly, H., Marbouty, M., Koszul, R., &amp; Mozziconacci, J. (2012). Normalization of a chromosomal contact map. <em><span>BMC</span> Genomics</em>, <em>13</em>(1). <a href="https://doi.org/10.1186/1471-2164-13-436">https://doi.org/10.1186/1471-2164-13-436</a>
 </div>
-<div id="ref-Duan2010May" class="csl-entry" role="listitem">
-Duan, Zhijun, Mirela Andronescu, Kevin Schutz, Sean McIlwain, Yoo Jung Kim, Choli Lee, Jay Shendure, Stanley Fields, C. Anthony Blau, and William S. Noble. 2010. <span>“<span class="nocase">A three-dimensional model of the yeast genome</span>.”</span> <em>Nature</em> 465 (May): 363–67. <a href="https://doi.org/10.1038/nature08973">https://doi.org/10.1038/nature08973</a>.
+<div id="ref-Duan_2010" class="csl-entry" role="listitem">
+Duan, Z., Andronescu, M., Schutz, K., McIlwain, S., Kim, Y. J., Lee, C., Shendure, J., Fields, S., Blau, C. A., &amp; Noble, W. S. (2010). A three-dimensional model of the yeast genome. <em>Nature</em>, <em>465</em>(7296), 363–367. <a href="https://doi.org/10.1038/nature08973">https://doi.org/10.1038/nature08973</a>
 </div>
-<div id="ref-Imakaev2012Oct" class="csl-entry" role="listitem">
-Imakaev, Maxim, Geoffrey Fudenberg, Rachel Patton McCord, Natalia Naumova, Anton Goloborodko, Bryan R. Lajoie, Job Dekker, and Leonid A. Mirny. 2012. <span>“<span class="nocase">Iterative correction of Hi-C data reveals hallmarks of chromosome organization</span>.”</span> <em>Nat. Methods</em> 9 (October): 999–1003. <a href="https://doi.org/10.1038/nmeth.2148">https://doi.org/10.1038/nmeth.2148</a>.
+<div id="ref-Imakaev_2012" class="csl-entry" role="listitem">
+Imakaev, M., Fudenberg, G., McCord, R. P., Naumova, N., Goloborodko, A., Lajoie, B. R., Dekker, J., &amp; Mirny, L. A. (2012). Iterative correction of hi-c data reveals hallmarks of chromosome organization. <em>Nature Methods</em>, <em>9</em>(10), 999–1003. <a href="https://doi.org/10.1038/nmeth.2148">https://doi.org/10.1038/nmeth.2148</a>
 </div>
-<div id="ref-Lieberman-Aiden2009Oct" class="csl-entry" role="listitem">
-Lieberman-Aiden, Erez, Nynke L. van Berkum, Louise Williams, Maxim Imakaev, Tobias Ragoczy, Agnes Telling, Ido Amit, et al. 2009. <span>“<span class="nocase">Comprehensive mapping of long range interactions reveals folding principles of the human genome</span>.”</span> <em>Science (New York, N.Y.)</em> 326 (5950): 289. <a href="https://doi.org/10.1126/science.1181369">https://doi.org/10.1126/science.1181369</a>.
+<div id="ref-Lieberman_Aiden_2009" class="csl-entry" role="listitem">
+Lieberman-Aiden, E., Berkum, N. L. van, Williams, L., Imakaev, M., Ragoczy, T., Telling, A., Amit, I., Lajoie, B. R., Sabo, P. J., Dorschner, M. O., Sandstrom, R., Bernstein, B., Bender, M. A., Groudine, M., Gnirke, A., Stamatoyannopoulos, J., Mirny, L. A., Lander, E. S., &amp; Dekker, J. (2009). Comprehensive mapping of long-range interactions reveals folding principles of the human genome. <em>Science</em>, <em>326</em>(5950), 289–293. <a href="https://doi.org/10.1126/science.1181369">https://doi.org/10.1126/science.1181369</a>
 </div>
 </div>
 </section></main><!-- /main --><script id="quarto-html-after-body" type="application/javascript">
diff --git a/matrix-centric_files/figure-html/unnamed-chunk-10-1.png b/matrix-centric_files/figure-html/unnamed-chunk-10-1.png
index e5ed693..7039b43 100644
Binary files a/matrix-centric_files/figure-html/unnamed-chunk-10-1.png and b/matrix-centric_files/figure-html/unnamed-chunk-10-1.png differ
diff --git a/matrix-centric_files/figure-html/unnamed-chunk-13-1.png b/matrix-centric_files/figure-html/unnamed-chunk-13-1.png
index fe29745..8671b5c 100644
Binary files a/matrix-centric_files/figure-html/unnamed-chunk-13-1.png and b/matrix-centric_files/figure-html/unnamed-chunk-13-1.png differ
diff --git a/matrix-centric_files/figure-html/unnamed-chunk-16-1.png b/matrix-centric_files/figure-html/unnamed-chunk-16-1.png
index a6a9d16..b4a86b1 100644
Binary files a/matrix-centric_files/figure-html/unnamed-chunk-16-1.png and b/matrix-centric_files/figure-html/unnamed-chunk-16-1.png differ
diff --git a/parsing.html b/parsing.html
index 68b4a70..9a659c6 100644
--- a/parsing.html
+++ b/parsing.html
@@ -350,7 +350,7 @@ <h1 class="title">
 <span><span class="va">hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 10,801 contacts over 11 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:10,000-50,000" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 4000 </span></span>
@@ -425,7 +425,7 @@ <h1 class="title">
 <span><span class="va">cf</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/CoolFile-class.html">CoolFile</a></span><span class="op">(</span><span class="va">coolf</span><span class="op">)</span></span>
 <span><span class="va">cf</span></span>
 <span><span class="co">##  CoolFile object</span></span>
-<span><span class="co">##  .mcool file: /github/home/.cache/R/ExperimentHub/1a594277bd62_7752 </span></span>
+<span><span class="co">##  .mcool file: /github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752 </span></span>
 <span><span class="co">##  resolution: 1000 </span></span>
 <span><span class="co">##  pairs file: </span></span>
 <span><span class="co">##  metadata(0):</span></span>
@@ -435,7 +435,7 @@ <h1 class="title">
 <span><span class="va">hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 306,212 contacts over 257 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300,001-813,184" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -492,7 +492,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb7"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="fu"><a href="https://rdrr.io/pkg/BiocIO/man/IO.html">import</a></span><span class="op">(</span><span class="va">cf</span>, focus <span class="op">=</span> <span class="st">'II:300001-800000'</span>, resolution <span class="op">=</span> <span class="fl">2000</span><span class="op">)</span></span>
 <span><span class="co">##  `HiCExperiment` object with 301,018 contacts over 250 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300,001-800,000" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -509,7 +509,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb8"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="fu"><a href="https://rdrr.io/pkg/BiocIO/man/IO.html">import</a></span><span class="op">(</span><span class="va">cf</span>, focus <span class="op">=</span> <span class="st">'II:300001-400000|II:600001-700000'</span>, resolution <span class="op">=</span> <span class="fl">2000</span><span class="op">)</span></span>
 <span><span class="co">##  `HiCExperiment` object with 402 contacts over 100 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300001-400000|II:600001-700000" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -526,7 +526,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb9"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="fu"><a href="https://rdrr.io/pkg/BiocIO/man/IO.html">import</a></span><span class="op">(</span><span class="va">cf</span>, focus <span class="op">=</span> <span class="st">'II'</span>, resolution <span class="op">=</span> <span class="fl">2000</span><span class="op">)</span></span>
 <span><span class="co">##  `HiCExperiment` object with 471,364 contacts over 407 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -543,7 +543,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb10"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="fu"><a href="https://rdrr.io/pkg/BiocIO/man/IO.html">import</a></span><span class="op">(</span><span class="va">cf</span>, focus <span class="op">=</span> <span class="st">'II|III'</span>, resolution <span class="op">=</span> <span class="fl">2000</span><span class="op">)</span></span>
 <span><span class="co">##  `HiCExperiment` object with 9,092 contacts over 566 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II|III" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -560,7 +560,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb11"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="fu"><a href="https://rdrr.io/pkg/BiocIO/man/IO.html">import</a></span><span class="op">(</span><span class="va">cf</span>, focus <span class="op">=</span> <span class="st">'II:300001-800000|V:1-500000'</span>, resolution <span class="op">=</span> <span class="fl">2000</span><span class="op">)</span></span>
 <span><span class="co">##  `HiCExperiment` object with 7,147 contacts over 500 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300001-800000|V:1-500000" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -592,7 +592,7 @@ <h1 class="title">
 hic 
 ##  `HiCExperiment` object with 306,212 contacts over 257 regions 
 ##  -------
-##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" 
+##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" 
 ##  focus: "II:300,001-813,184" 
 ##  resolutions(5): 1000 2000 4000 8000 16000
 ##  active resolution: 2000 
@@ -607,7 +607,7 @@ <h1 class="title">
 import(cf, focus = 'III', resolution = 2000)
 ##  `HiCExperiment` object with 151,990 contacts over 159 regions 
 ##  -------
-##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" 
+##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" 
 ##  focus: "III" 
 ##  resolutions(5): 1000 2000 4000 8000 16000
 ##  active resolution: 2000 
@@ -622,7 +622,7 @@ <h1 class="title">
 refocus(hic, 'III')
 ##  `HiCExperiment` object with 151,990 contacts over 159 regions 
 ##  -------
-##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" 
+##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" 
 ##  focus: "III" 
 ##  resolutions(5): 1000 2000 4000 8000 16000
 ##  active resolution: 2000 
@@ -757,7 +757,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb15"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic</span><span class="op">[</span><span class="st">"II:800001-813184"</span><span class="op">]</span></span>
 <span><span class="co">##  `HiCExperiment` object with 1,040 contacts over 6 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:800,001-813,184" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -774,7 +774,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb16"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic</span><span class="op">[</span><span class="st">"II:300001-320000|II:800001-813184"</span><span class="op">]</span></span>
 <span><span class="co">##  `HiCExperiment` object with 3 contacts over 6 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300001-320000|II:800001-813184" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -791,7 +791,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb17"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic</span><span class="op">[</span><span class="st">"II"</span><span class="op">]</span></span>
 <span><span class="co">##  `HiCExperiment` object with 306,212 contacts over 257 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -808,7 +808,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb18"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic</span><span class="op">[</span><span class="st">"II|IV"</span><span class="op">]</span></span>
 <span><span class="co">##  `HiCExperiment` object with 0 contacts over 0 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:1-813184|IV:1-1531933" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -825,7 +825,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb19"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic</span><span class="op">[</span><span class="st">"II:300001-320000|IV:1-100000"</span><span class="op">]</span></span>
 <span><span class="co">##  `HiCExperiment` object with 0 contacts over 0 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300001-320000|IV:1-100000" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -842,7 +842,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb20"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic</span><span class="op">[</span><span class="fu"><a href="https://rdrr.io/r/base/c.html">c</a></span><span class="op">(</span><span class="st">'II'</span>, <span class="st">'III'</span>, <span class="st">'IV'</span><span class="op">)</span><span class="op">]</span></span>
 <span><span class="co">##  `HiCExperiment` object with 306,212 contacts over 257 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II, III, IV" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -880,7 +880,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb21"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 306,212 contacts over 257 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300,001-813,184" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -893,7 +893,7 @@ <h1 class="title">
 <span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/AllGenerics.html">zoom</a></span><span class="op">(</span><span class="va">hic</span>, <span class="fl">4000</span><span class="op">)</span></span>
 <span><span class="co">##  `HiCExperiment` object with 306,212 contacts over 129 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300,001-813,184" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 4000 </span></span>
@@ -906,7 +906,7 @@ <h1 class="title">
 <span><span class="fu"><a href="https://rdrr.io/pkg/HiCExperiment/man/AllGenerics.html">zoom</a></span><span class="op">(</span><span class="va">hic</span>, <span class="fl">1000</span><span class="op">)</span></span>
 <span><span class="co">##  `HiCExperiment` object with 306,212 contacts over 514 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300,001-813,184" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 1000 </span></span>
@@ -1076,7 +1076,7 @@ <h1 class="title">
 <span><span class="va">hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 306,212 contacts over 257 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300,001-813,184" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -1120,14 +1120,14 @@ <h1 class="title">
 <span><span class="va">hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 306,212 contacts over 257 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "II:300,001-813,184" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
 <span><span class="co">##  interactions: 18513 </span></span>
 <span><span class="co">##  scores(3): count balanced random </span></span>
 <span><span class="co">##  topologicalFeatures: compartments(0) borders(0) loops(9) viewpoints(0) CTCF(4) </span></span>
-<span><span class="co">##  pairsFile: /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753 </span></span>
+<span><span class="co">##  pairsFile: /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753 </span></span>
 <span><span class="co">##  metadata(0):</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 </section><section id="metadata" class="level4" data-number="3.2.2.4"><h4 data-number="3.2.2.4" class="anchored" data-anchor-id="metadata">
@@ -1144,7 +1144,7 @@ <h1 class="title">
 <span><span class="co">##  [1] "HiCExperiment created from an example .mcool file from `HiContactsData`"</span></span>
 <span><span class="co">##  </span></span>
 <span><span class="co">##  $date</span></span>
-<span><span class="co">##  [1] "Mon Sep 25 17:02:32 2023"</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<span><span class="co">##  [1] "Thu Oct 19 10:04:41 2023"</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 </section></section></section><section id="coercing-hicexperiment-objects" class="level2" data-number="3.3"><h2 data-number="3.3" class="anchored" data-anchor-id="coercing-hicexperiment-objects">
 <span class="header-section-number">3.3</span> Coercing <code>HiCExperiment</code> objects</h2>
diff --git a/preamble.html b/preamble.html
index b9f0217..664e4a3 100644
--- a/preamble.html
+++ b/preamble.html
@@ -275,7 +275,7 @@ <h1 class="title">Preamble</h1>
 
 </header>
 
-<p>Hi-C is an experimental method to quantify spatial interactions between any pair of genomic loci. While a number of command-line interfaces (CLI) exist to process and manipulate Hi-C data (e.g.&nbsp;<code>cooler</code> (<span class="citation" data-cites="Abdennur2020Jan">Abdennur and Mirny (<a href="interoperability.html#ref-Abdennur2020Jan" role="doc-biblioref">2020</a>)</span>), <code>juicer</code> (<span class="citation" data-cites="Durand2016Jul">Durand et al. (<a href="interoperability.html#ref-Durand2016Jul" role="doc-biblioref">2016</a>)</span>) and HiC-Pro (<span class="citation" data-cites="Servant2015Dec">Servant et al. (<a href="interoperability.html#ref-Servant2015Dec" role="doc-biblioref">2015</a>)</span>)), they generally suffer from several limitations often found in emerging genomics techniques:</p>
+<p>Hi-C is an experimental method to quantify spatial interactions between any pair of genomic loci. While a number of command-line interfaces (CLI) exist to process and manipulate Hi-C data (e.g.&nbsp;<code>cooler</code> (<span class="citation" data-cites="Abdennur_2019">Abdennur &amp; Mirny (<a href="interoperability.html#ref-Abdennur_2019" role="doc-biblioref">2019</a>)</span>), <code>juicer</code> (<span class="citation" data-cites="Durand_2016">Durand et al. (<a href="interoperability.html#ref-Durand_2016" role="doc-biblioref">2016</a>)</span>) and HiC-Pro (<span class="citation" data-cites="Servant_2015">Servant et al. (<a href="interoperability.html#ref-Servant_2015" role="doc-biblioref">2015</a>)</span>)), they generally suffer from several limitations often found in emerging genomics techniques:</p>
 <ul>
 <li><strong>No genomic representation of Hi-C processed data</strong>: the existing CLIs can efficiently parse Hi-C data as a numerical matrix and perform a few standard quantitative operations (e.g.&nbsp;contact matrix binning and normalization, dimensionality reduction, etc). However, they systematically fail to <strong>represent a Hi-C contact matrix as a genomic object</strong>. Qualitative analyses (e.g.&nbsp;intersecting chromatin loops with genomic features, finding genes overlapping with domains, etc) therefore remain extremely tedious.</li>
 <li><strong>No format-agnostic analysis libraries</strong>. Three competing file format standards (<code>.(m)cool</code>, <code>.hic</code> and <code>HiC-Pro</code> files) currently exist to store Hi-C processed data and dedicated CLIs propose sets of tools specifically working with their corresponding Hi-C processed data file format. This has curbed the development of generic Hi-C data analysis libraries by favoring the emergence of several redundant tools.</li>
@@ -347,15 +347,15 @@ <h1 class="unnumbered">Package status</h1>
 </section>
 <section id="bibliography" class="level1 unnumbered">
 <h1 class="unnumbered">References</h1>
-<div id="refs" class="references csl-bib-body hanging-indent" role="list" style="display: none">
-<div id="ref-Abdennur2020Jan" class="csl-entry" role="listitem">
-Abdennur, Nezar, and Leonid A. Mirny. 2020. <span>“<span class="nocase">Cooler: scalable storage for Hi-C data and other genomically labeled arrays</span>.”</span> <em>Bioinformatics</em> 36 (1): 311–16. <a href="https://doi.org/10.1093/bioinformatics/btz540">https://doi.org/10.1093/bioinformatics/btz540</a>.
+<div id="refs" class="references csl-bib-body hanging-indent" data-line-spacing="2" role="list" style="display: none">
+<div id="ref-Abdennur_2019" class="csl-entry" role="listitem">
+Abdennur, N., &amp; Mirny, L. A. (2019). Cooler: Scalable storage for hi-c data and other genomically labeled arrays. <em>Bioinformatics</em>, <em>36</em>(1), 311–316. <a href="https://doi.org/10.1093/bioinformatics/btz540">https://doi.org/10.1093/bioinformatics/btz540</a>
 </div>
-<div id="ref-Durand2016Jul" class="csl-entry" role="listitem">
-Durand, Neva C., Muhammad S. Shamim, Ido Machol, Suhas S. P. Rao, Miriam H. Huntley, Eric S. Lander, and Erez Lieberman Aiden. 2016. <span>“<span class="nocase">Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments</span>.”</span> <em>Cell Systems</em> 3 (1): 95. <a href="https://doi.org/10.1016/j.cels.2016.07.002">https://doi.org/10.1016/j.cels.2016.07.002</a>.
+<div id="ref-Durand_2016" class="csl-entry" role="listitem">
+Durand, N. C., Shamim, M. S., Machol, I., Rao, S. S. P., Huntley, M. H., Lander, E. S., &amp; Aiden, E. L. (2016). Juicer provides a one-click system for analyzing loop-resolution hi-c experiments. <em>Cell Systems</em>, <em>3</em>(1), 95–98. <a href="https://doi.org/10.1016/j.cels.2016.07.002">https://doi.org/10.1016/j.cels.2016.07.002</a>
 </div>
-<div id="ref-Servant2015Dec" class="csl-entry" role="listitem">
-Servant, Nicolas, Nelle Varoquaux, Bryan R. Lajoie, Eric Viara, Chong-Jian Chen, Jean-Philippe Vert, Edith Heard, Job Dekker, and Emmanuel Barillot. 2015. <span>“<span class="nocase">HiC-Pro: an optimized and flexible pipeline for Hi-C data processing</span>.”</span> <em>Genome Biol.</em> 16 (1): 1–11. <a href="https://doi.org/10.1186/s13059-015-0831-x">https://doi.org/10.1186/s13059-015-0831-x</a>.
+<div id="ref-Servant_2015" class="csl-entry" role="listitem">
+Servant, N., Varoquaux, N., Lajoie, B. R., Viara, E., Chen, C.-J., Vert, J.-P., Heard, E., Dekker, J., &amp; Barillot, E. (2015). <span>HiC</span>-pro: An optimized and flexible pipeline for hi-c data processing. <em>Genome Biology</em>, <em>16</em>(1). <a href="https://doi.org/10.1186/s13059-015-0831-x">https://doi.org/10.1186/s13059-015-0831-x</a>
 </div>
 </div>
 </section>
diff --git a/principles.html b/principles.html
index c3b71e8..fcde423 100644
--- a/principles.html
+++ b/principles.html
@@ -110,7 +110,7 @@
     "search-submit-button-title": "Submit",
     "search-label": "Search"
   }
-}</script>
+}</script><script src="https://polyfill.io/v3/polyfill.min.js?features=es6"></script><script src="https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-chtml-full.js" type="text/javascript"></script>
 </head>
 <body class="nav-sidebar floating">
 
@@ -291,6 +291,7 @@
   <li><a href="#hicool-hicstuff-within-r" id="toc-hicool-hicstuff-within-r" class="nav-link" data-scroll-target="#hicool-hicstuff-within-r"><span class="header-section-number">1.3.3</span> HiCool: hicstuff within R</a></li>
   </ul>
 </li>
+  <li><a href="#exploratory-data-analysis-of-processed-hi-c-files" id="toc-exploratory-data-analysis-of-processed-hi-c-files" class="nav-link" data-scroll-target="#exploratory-data-analysis-of-processed-hi-c-files"><span class="header-section-number">1.4</span> Exploratory data analysis of processed Hi-C files</a></li>
   <li><a href="#bibliography" id="toc-bibliography" class="nav-link" data-scroll-target="#bibliography">References</a></li>
   </ul><div class="toc-actions"><div><i class="bi bi-github"></i></div><div class="action-links"><p><a href="https://github.com/js2264/OHCA/edit/main/principles.qmd" class="toc-action">Edit this page</a></p><p><a href="https://github.com/js2264/OHCA/issues/new" class="toc-action">Report an issue</a></p></div></div></nav>
     </div>
@@ -328,14 +329,14 @@ <h1 class="title">
 <span class="header-section-number">1.1</span> Experimental considerations</h2>
 <section id="experimental-approach" class="level3" data-number="1.1.1"><h3 data-number="1.1.1" class="anchored" data-anchor-id="experimental-approach">
 <span class="header-section-number">1.1.1</span> Experimental approach</h3>
-<p>The Hi-C procedure (<span class="citation" data-cites="Lieberman-Aiden2009Oct">Lieberman-Aiden et al. (<a href="interoperability.html#ref-Lieberman-Aiden2009Oct" role="doc-biblioref">2009</a>)</span>) stems from the clever combination of high-throughput sequencing and Chromatin Conformation Capture (3C) experimental approach (<span class="citation" data-cites="Dekker2002Feb">Dekker et al. (<a href="interoperability.html#ref-Dekker2002Feb" role="doc-biblioref">2002</a>)</span>).<br>
+<p>The Hi-C procedure (<span class="citation" data-cites="Lieberman_Aiden_2009">Lieberman-Aiden et al. (<a href="interoperability.html#ref-Lieberman_Aiden_2009" role="doc-biblioref">2009</a>)</span>) stems from the clever combination of high-throughput sequencing and Chromatin Conformation Capture (3C) experimental approach (<span class="citation" data-cites="Dekker_2002">Dekker et al. (<a href="interoperability.html#ref-Dekker_2002" role="doc-biblioref">2002</a>)</span>).<br>
 In Hi-C, chromatin is crosslinked within intact nuclei and enzymatically digested (usually with one or several restriction enzymes, but Hi-C variants using MNase or DNase exist). End-repair introduces biotinylated dNTPs and is followed by religation, which generates chimeric DNA fragments consisting of genomic loci originally lying in spatial proximity, usually crosslinked to a shared protein complex. After religation, DNA fragments are sheared, biotin-containing fragments are pulled-down and converted into a sequencing library.</p>
 <p><img src="images/20230215221337.png" class="img-fluid"></p>
 </section><section id="c-variants" class="level3" data-number="1.1.2"><h3 data-number="1.1.2" class="anchored" data-anchor-id="c-variants">
 <span class="header-section-number">1.1.2</span> C variants</h3>
-<p>A number of C variants have been proposed since the publication of the original 3C method (reviewed by <span class="citation" data-cites="J.O.2017Jan">J. O. et al. (<a href="interoperability.html#ref-J.O.2017Jan" role="doc-biblioref">2017</a>)</span>), the main ones being Capture-C and ChIA-PET (see procedure below).</p>
+<p>A number of C variants have been proposed since the publication of the original 3C method (reviewed by <span class="citation" data-cites="Davies_2017">Davies et al. (<a href="interoperability.html#ref-Davies_2017" role="doc-biblioref">2017</a>)</span>), the main ones being Capture-C and ChIA-PET (see procedure below).</p>
 <p><img src="images/20230221172531.png" class="img-fluid"></p>
-<p>Capture-C is useful to quantify interactions between a set of regulatory elements of interest. ChIA-PET, on the other hand, can identify interactions mediated by a specific protein of interest. Finally, an increasing number of Hi-C approaches rely on long-read sequencing (e.g. <span class="citation" data-cites="Deshpande2022Oct">Deshpande et al. (<a href="interoperability.html#ref-Deshpande2022Oct" role="doc-biblioref">2022</a>)</span>, <span class="citation" data-cites="Tavares-Cadete2020Dec">Tavares-Cadete et al. (<a href="interoperability.html#ref-Tavares-Cadete2020Dec" role="doc-biblioref">2020</a>)</span>) to identify clusters of 3D contacts.</p>
+<p>Capture-C is useful to quantify interactions between a set of regulatory elements of interest. ChIA-PET, on the other hand, can identify interactions mediated by a specific protein of interest. Finally, an increasing number of Hi-C approaches rely on long-read sequencing (e.g. <span class="citation" data-cites="Deshpande_2022">Deshpande et al. (<a href="interoperability.html#ref-Deshpande_2022" role="doc-biblioref">2022</a>)</span>, <span class="citation" data-cites="Tavares_Cadete_2020">Tavares-Cadete et al. (<a href="interoperability.html#ref-Tavares_Cadete_2020" role="doc-biblioref">2020</a>)</span>) to identify clusters of 3D contacts.</p>
 </section><section id="sequencing" class="level3" data-number="1.1.3"><h3 data-number="1.1.3" class="anchored" data-anchor-id="sequencing">
 <span class="header-section-number">1.1.3</span> Sequencing</h3>
 <p>Hi-C libraries are traditionally sequenced with short-read technology, and are by essence paired-end libraries. For this reason, the end result of the experimental side of the Hi-C consists of two fastq files, each one containing sequences for one extremity of the DNA fragments purified during Hi-C. These are the two files we need to move on to the computational side of Hi-C.</p>
@@ -464,7 +465,7 @@ <h1 class="title">
 </div>
 </div>
 </div>
-<p>More information about the conventions related to this text file are provided by the <a href="https://github.com/4dn-dcic/pairix/blob/master/pairs_format_specification.md">4DN consortium</a>, which originally formalized the specifications of this file format.</p>
+<p><a href="https://github.com/4dn-dcic/pairix/blob/master/pairs_format_specification.md">More information</a> about the conventions related to this text file are provided by the <a href="https://www.4dnucleome.org/">4DN consortium</a>, which originally formalized the specifications of this file format.</p>
 </section><section id="binned-contact-matrix-files" class="level3" data-number="1.2.2"><h3 data-number="1.2.2" class="anchored" data-anchor-id="binned-contact-matrix-files">
 <span class="header-section-number">1.2.2</span> Binned contact matrix files</h3>
 <section id="binning-pairs-into-a-matrix" class="level4" data-number="1.2.2.1"><h4 data-number="1.2.2.1" class="anchored" data-anchor-id="binning-pairs-into-a-matrix">
@@ -532,12 +533,12 @@ <h1 class="title">
 <p>In this context, the <code>regions.bed</code> acts as a secondary “dictionary” describing the nature of <code>i</code> and <code>j</code> indices, i.e.&nbsp;the location of genomic bins.</p>
 </section><section id="plain-text-matrices-hic-pro-style" class="level4" data-number="1.2.2.2"><h4 data-number="1.2.2.2" class="anchored" data-anchor-id="plain-text-matrices-hic-pro-style">
 <span class="header-section-number">1.2.2.2</span> Plain-text matrices: HiC-Pro style</h4>
-<p>The HiC-Pro pipeline (<span class="citation" data-cites="Servant2015Dec">Servant et al. (<a href="interoperability.html#ref-Servant2015Dec" role="doc-biblioref">2015</a>)</span>) outputs 2 text files: a <code>regions.bed</code> file and a <code>count.matrix</code> file. They are generated by the exact process explained above.</p>
+<p>The HiC-Pro pipeline (<span class="citation" data-cites="Servant_2015">Servant et al. (<a href="interoperability.html#ref-Servant_2015" role="doc-biblioref">2015</a>)</span>) outputs 2 text files: a <code>regions.bed</code> file and a <code>count.matrix</code> file. They are generated by the exact process explained above.</p>
 <p>Together, these two files can describe the interaction frequency between any pair of genomic loci. They are non-binarized text files, and as such are technically human-readable. However, it is relatively hard to get a grasp of these files compared to a plain <code>.pairs</code> file, as information regarding genomic bins and interaction frequencies are stored in separate files. Moreover, because they are non-binarized, these files often end up using a large disk space and cannot be easily indexed. This prevents easy subsetting of the data stored in these files.</p>
 <p><code>.(m)cool</code> and <code>.hic</code> file formats are two standards addressing these limitations.</p>
 </section><section id="mcool-matrices" class="level4" data-number="1.2.2.3"><h4 data-number="1.2.2.3" class="anchored" data-anchor-id="mcool-matrices">
 <span class="header-section-number">1.2.2.3</span> <code>.(m)cool</code> matrices</h4>
-<p>The <code>.cool</code> format has been formally defined in <span class="citation" data-cites="Abdennur2020Jan">Abdennur and Mirny (<a href="interoperability.html#ref-Abdennur2020Jan" role="doc-biblioref">2020</a>)</span> and is a particular type of <code>HDF5</code> (<code>Hierarchical Data Format</code>) file. It is an indexed archive file storing rectangular tables called:</p>
+<p>The <code>.cool</code> format has been formally defined in <span class="citation" data-cites="Abdennur_2019">Abdennur &amp; Mirny (<a href="interoperability.html#ref-Abdennur_2019" role="doc-biblioref">2019</a>)</span> and is a particular type of <code>HDF5</code> (<code>Hierarchical Data Format</code>) file. It is an indexed archive file storing rectangular tables called:</p>
 <ul>
 <li>
 <code>bins</code>: containing the same information than the <code>regions.bed</code> file;</li>
@@ -560,7 +561,7 @@ <h1 class="title">
 <p>Moreover, parsing <code>.cool</code> files is possible using <code>HDF</code> standard APIs.</p>
 </section><section id="hic-matrices" class="level4" data-number="1.2.2.4"><h4 data-number="1.2.2.4" class="anchored" data-anchor-id="hic-matrices">
 <span class="header-section-number">1.2.2.4</span> <code>.hic</code> matrices</h4>
-<p>The <code>.hic</code> format is another type of binarized, indexed and highly-compressed file (<span class="citation" data-cites="Durand2016Jul">Durand et al. (<a href="interoperability.html#ref-Durand2016Jul" role="doc-biblioref">2016</a>)</span>). It can store virtually the same information than a <code>.cool</code> file. However, parsing <code>.hic</code> files is not as straightforward as <code>.cool</code> files, as it does not rely on a generic file standard. Still, the <code>straw</code> library has been implemented in several computing languages to facilitate parsing of <code>.hic</code> files (<span class="citation" data-cites="Durand2016Jul">Durand et al. (<a href="interoperability.html#ref-Durand2016Jul" role="doc-biblioref">2016</a>)</span>).</p>
+<p>The <code>.hic</code> format is another type of binarized, indexed and highly-compressed file (<span class="citation" data-cites="Durand_2016">Durand et al. (<a href="interoperability.html#ref-Durand_2016" role="doc-biblioref">2016</a>)</span>). It can store virtually the same information than a <code>.cool</code> file. However, parsing <code>.hic</code> files is not as straightforward as <code>.cool</code> files, as it does not rely on a generic file standard. Still, the <code>straw</code> library has been implemented in several computing languages to facilitate parsing of <code>.hic</code> files (<span class="citation" data-cites="Durand_2016">Durand et al. (<a href="interoperability.html#ref-Durand_2016" role="doc-biblioref">2016</a>)</span>).</p>
 </section></section></section><section id="pre-processing-hi-c-data" class="level2" data-number="1.3"><h2 data-number="1.3" class="anchored" data-anchor-id="pre-processing-hi-c-data">
 <span class="header-section-number">1.3</span> Pre-processing Hi-C data</h2>
 <section id="processing-workflow" class="level3" data-number="1.3.1"><h3 data-number="1.3.1" class="anchored" data-anchor-id="processing-workflow">
@@ -575,7 +576,7 @@ <h1 class="title">
 <li>Normalization of contact matrix and multi-resolution matrix generation</li>
 </ol>
 <p><img src="images/20230303125432.png" class="img-fluid"></p>
-<p>In practice, a minimal workflow to pre-process Hi-C data is the following (adapted from <span class="citation" data-cites="Open2C2023Jan">Open2C et al. (<a href="interoperability.html#ref-Open2C2023Jan" role="doc-biblioref">2023</a>)</span>):</p>
+<p>In practice, a minimal workflow to pre-process Hi-C data is the following (adapted from <span class="citation" data-cites="Open2C_2023">Open2C et al. (<a href="interoperability.html#ref-Open2C_2023" role="doc-biblioref">2023</a>)</span>):</p>
 <div class="cell" data-layout-align="center" data-hash="principles_cache/html/unnamed-chunk-2_80c5f3240e155b0eb90bdc95f4bda3bc">
 <div class="sourceCode cell-code" id="cb10"><pre class="sourceCode sh code-with-copy"><code class="sourceCode bash"><span id="cb10-1"><a href="#cb10-1" aria-hidden="true" tabindex="-1"></a><span class="co">## Note these fields have to be replaced by appropriate variables: </span></span>
 <span id="cb10-2"><a href="#cb10-2" aria-hidden="true" tabindex="-1"></a><span class="co">##    &lt;index&gt;</span></span>
@@ -596,9 +597,9 @@ <h1 class="title">
 <code>nf-distiller</code>: a combination of an aligner + <code>pairtools</code> + <code>cooler</code>
 </li>
 <li>
-<code>HiC-pro</code> (<span class="citation" data-cites="Servant2015Dec">Servant et al. (<a href="interoperability.html#ref-Servant2015Dec" role="doc-biblioref">2015</a>)</span>)</li>
+<code>HiC-pro</code> (<span class="citation" data-cites="Servant_2015">Servant et al. (<a href="interoperability.html#ref-Servant_2015" role="doc-biblioref">2015</a>)</span>)</li>
 <li>
-<code>Juicer</code> (<span class="citation" data-cites="Durand2016Jul">Durand et al. (<a href="interoperability.html#ref-Durand2016Jul" role="doc-biblioref">2016</a>)</span>)</li>
+<code>Juicer</code> (<span class="citation" data-cites="Durand_2016">Durand et al. (<a href="interoperability.html#ref-Durand_2016" role="doc-biblioref">2016</a>)</span>)</li>
 </ul>
 <div class="callout callout-style-default callout-note callout-titled">
 <div class="callout-header d-flex align-content-center">
@@ -610,7 +611,7 @@ <h1 class="title">
 </div>
 </div>
 <div class="callout-body-container callout-body">
-<p>For larger genomes (&gt; 1Gb) with more than few tens of M of reads per fastq (e.g.&nbsp;&gt; 100M), we recommend pre-processing data on an HPC cluster. Aligners, pairs processing and matrix binning can greatly benefit from parallelization over multiple CPUs (<span class="citation" data-cites="Open2C2023Jan">Open2C et al. (<a href="interoperability.html#ref-Open2C2023Jan" role="doc-biblioref">2023</a>)</span>)).<br>
+<p>For larger genomes (&gt; 1Gb) with more than few tens of M of reads per fastq (e.g.&nbsp;&gt; 100M), we recommend pre-processing data on an HPC cluster. Aligners, pairs processing and matrix binning can greatly benefit from parallelization over multiple CPUs (<span class="citation" data-cites="Open2C_2023">Open2C et al. (<a href="interoperability.html#ref-Open2C_2023" role="doc-biblioref">2023</a>)</span>)).<br>
 To scale <strong>up</strong> data pre-processing, we recommend to rely on an efficient read mapper such as <code>bwa</code>, followed by pairs parsing, sorting and deduplication with <code>pairtools</code> and binning with <code>cooler</code>.</p>
 </div>
 </div>
@@ -661,10 +662,10 @@ <h1 class="title">
 <span><span class="op">)</span></span>
 <span><span class="co">##  HiCool :: Fetching bowtie genome index files from AWS iGenomes S3 bucket...</span></span>
 <span><span class="co">##  HiCool :: Recovering bowtie2 genome index from AWS iGenomes...</span></span>
-<span><span class="co">##  + /github/home/.cache/R/basilisk/1.13.1/0/bin/conda 'create' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.1/HiCool/1.1.0/env' 'python=3.7.12' '--quiet' '-c' 'conda-forge' '-c' 'bioconda'</span></span>
-<span><span class="co">##  + /github/home/.cache/R/basilisk/1.13.1/0/bin/conda 'install' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.1/HiCool/1.1.0/env' 'python=3.7.12'</span></span>
-<span><span class="co">##  + /github/home/.cache/R/basilisk/1.13.1/0/bin/conda 'install' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.1/HiCool/1.1.0/env' '-c' 'conda-forge' '-c' 'bioconda' 'python=3.7.12' 'python=3.7.12' 'bowtie2=2.5.0' 'samtools=1.16.1' 'hicstuff=3.1.5' 'chromosight=1.6.3' 'cooler=0.9.1'</span></span>
-<span><span class="co">##  HiCool :: Initiating processing of fastq files [tmp folder: /tmp/RtmpyLujmT/WL4DIE]...</span></span>
+<span><span class="co">##  + /github/home/.cache/R/basilisk/1.13.4/0/bin/conda 'create' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.4/HiCool/1.1.0/env' 'python=3.7.12' '--quiet' '-c' 'conda-forge' '-c' 'bioconda'</span></span>
+<span><span class="co">##  + /github/home/.cache/R/basilisk/1.13.4/0/bin/conda 'install' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.4/HiCool/1.1.0/env' 'python=3.7.12'</span></span>
+<span><span class="co">##  + /github/home/.cache/R/basilisk/1.13.4/0/bin/conda 'install' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.4/HiCool/1.1.0/env' '-c' 'conda-forge' '-c' 'bioconda' 'python=3.7.12' 'python=3.7.12' 'bowtie2=2.5.0' 'samtools=1.16.1' 'hicstuff=3.1.5' 'chromosight=1.6.3' 'cooler=0.9.1'</span></span>
+<span><span class="co">##  HiCool :: Initiating processing of fastq files [tmp folder: /tmp/RtmpIWmk55/WL4DIE]...</span></span>
 <span><span class="co">##  HiCool :: Mapping fastq files...</span></span>
 <span><span class="co">##  HiCool :: Removing unwanted chromosomes...</span></span>
 <span><span class="co">##  HiCool :: Parsing pairs into .cool file...</span></span>
@@ -674,12 +675,12 @@ <h1 class="title">
 <span><span class="co">##  HiCool :: .fastq to .mcool processing done!</span></span>
 <span><span class="co">##  HiCool :: Check ./HiCool/folder to find the generated files</span></span>
 <span><span class="co">##  HiCool :: Generating HiCool report. This might take a while.</span></span>
-<span><span class="co">##  HiCool :: Report generated and available @ /__w/OHCA/OHCA/HiCool/148151d75a8_7833^mapped-R64-1-1^WL4DIE.html</span></span>
+<span><span class="co">##  HiCool :: Report generated and available @ /__w/OHCA/OHCA/HiCool/14976d56f7a_7833^mapped-R64-1-1^WL4DIE.html</span></span>
 <span><span class="co">##  HiCool :: All processing successfully achieved. Congrats!</span></span>
 <span><span class="co">##  CoolFile object</span></span>
-<span><span class="co">##  .mcool file: ./HiCool//matrices/148151d75a8_7833^mapped-R64-1-1^WL4DIE.mcool </span></span>
+<span><span class="co">##  .mcool file: ./HiCool//matrices/14976d56f7a_7833^mapped-R64-1-1^WL4DIE.mcool </span></span>
 <span><span class="co">##  resolution: 4000 </span></span>
-<span><span class="co">##  pairs file: ./HiCool//pairs/148151d75a8_7833^mapped-R64-1-1^WL4DIE.pairs </span></span>
+<span><span class="co">##  pairs file: ./HiCool//pairs/14976d56f7a_7833^mapped-R64-1-1^WL4DIE.pairs </span></span>
 <span><span class="co">##  metadata(3): log args stats</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 <div class="callout callout-style-default callout-tip callout-titled">
@@ -708,16 +709,16 @@ <h1 class="title">
 <div class="cell" data-layout-align="center" data-hash="principles_cache/html/unnamed-chunk-4_7a9232306d921b18185f38a5b6cbb5b1">
 <div class="sourceCode" id="cb14"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="fu">fs</span><span class="fu">::</span><span class="fu"><a href="https://fs.r-lib.org/reference/dir_tree.html">dir_tree</a></span><span class="op">(</span><span class="st">'HiCool/'</span><span class="op">)</span></span>
 <span><span class="co">##  HiCool/</span></span>
-<span><span class="co">##  ├── 148151d75a8_7833^mapped-R64-1-1^WL4DIE.html</span></span>
+<span><span class="co">##  ├── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE.html</span></span>
 <span><span class="co">##  ├── logs</span></span>
-<span><span class="co">##  │   └── 148151d75a8_7833^mapped-R64-1-1^WL4DIE.log</span></span>
+<span><span class="co">##  │   └── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE.log</span></span>
 <span><span class="co">##  ├── matrices</span></span>
-<span><span class="co">##  │   └── 148151d75a8_7833^mapped-R64-1-1^WL4DIE.mcool</span></span>
+<span><span class="co">##  │   └── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE.mcool</span></span>
 <span><span class="co">##  ├── pairs</span></span>
-<span><span class="co">##  │   └── 148151d75a8_7833^mapped-R64-1-1^WL4DIE.pairs</span></span>
+<span><span class="co">##  │   └── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE.pairs</span></span>
 <span><span class="co">##  └── plots</span></span>
-<span><span class="co">##      ├── 148151d75a8_7833^mapped-R64-1-1^WL4DIE_event_distance.pdf</span></span>
-<span><span class="co">##      └── 148151d75a8_7833^mapped-R64-1-1^WL4DIE_event_distribution.pdf</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
+<span><span class="co">##      ├── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE_event_distance.pdf</span></span>
+<span><span class="co">##      └── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE_event_distribution.pdf</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
 </div>
 <ul>
 <li>The <code>*.pairs</code> and <code>*.mcool</code> files are the pairs and contact matrix files, respectively. <strong>These are the output files the end-user is generally looking for.</strong>
@@ -739,36 +740,93 @@ <h1 class="title">
 <p>All the files generated by a single <code>HiCool</code> pipeline execution contain the same 6-letter unique hash to make sure they are not overwritten if re-executing the same command.</p>
 </div>
 </div>
+</section></section></section><section id="exploratory-data-analysis-of-processed-hi-c-files" class="level2" data-number="1.4"><h2 data-number="1.4" class="anchored" data-anchor-id="exploratory-data-analysis-of-processed-hi-c-files">
+<span class="header-section-number">1.4</span> Exploratory data analysis of processed Hi-C files</h2>
+<p>Once Hi-C raw data has been transformed into a set of processed files, exploratory data analysis is typically conducted following two main routes:</p>
+<ul>
+<li>Data visualization;</li>
+<li>Data investigation.</li>
+</ul>
+<p>During the last decade, a number of softwares have been developed to unlock <strong>Hi-C data visualization and investigation</strong>. Here we provide a non-exhaustive list of notable tools developed throughout the recent years for downstream Hi-C analysis, selected from <a href="https://github.com/mdozmorov/HiC_tools">this longer list</a>.</p>
+<ul>
+<li>
+<p>2012-2015:</p>
+<ul>
+<li>HiTC (2012)</li>
+<li>HiCCUPS (2014)</li>
+<li>HiCseg (2014)</li>
+<li>Fit-Hi-C (2014)</li>
+<li>HiC-Pro (2015)</li>
+<li>diffHic (2015)</li>
+<li>cooltools (2015)</li>
+<li>HiCUP (2015)</li>
+<li>HiCPlotter (2015)</li>
+<li>HiFive (2015)</li>
+</ul>
+</li>
+<li>
+<p>2016-2019:</p>
+<ul>
+<li>CHiCAGO (2016)</li>
+<li>TADbit (2017)</li>
+<li>HiCRep (2017)</li>
+<li>HiC-DC (2017)</li>
+<li>GoTHIC (2017)</li>
+<li>HiCExplorer (2018)</li>
+<li>Boost-HiC (2018)</li>
+<li>HiCcompare (2018)</li>
+<li>HiPiler (2018)</li>
+<li>coolpuppy (2019)</li>
+</ul>
+</li>
+<li>
+<p>2020-present:</p>
+<ul>
+<li>Serpentine (2020)</li>
+<li>CHESS (2020)</li>
+<li>DeepHiC (2020)</li>
+<li>Chromosight (2020)</li>
+<li>Mustache (2020)</li>
+<li>TADcompare (2020)</li>
+<li>POSSUM (2021)</li>
+<li>Calder (2021)</li>
+<li>HICDCPlus (2021)</li>
+<li>plotgardener (2021)</li>
+<li>GENOVA (2021)</li>
+</ul>
+</li>
+</ul>
+<p>All references as well as many other softwares and references are available <a href="https://github.com/mdozmorov/HiC_tools">here</a>.</p>
 
 
-</section></section></section><section id="bibliography" class="level1 unnumbered"><h1 class="unnumbered">References</h1>
-<div id="refs" class="references csl-bib-body hanging-indent" role="list" style="display: none">
-<div id="ref-Abdennur2020Jan" class="csl-entry" role="listitem">
-Abdennur, Nezar, and Leonid A. Mirny. 2020. <span>“<span class="nocase">Cooler: scalable storage for Hi-C data and other genomically labeled arrays</span>.”</span> <em>Bioinformatics</em> 36 (1): 311–16. <a href="https://doi.org/10.1093/bioinformatics/btz540">https://doi.org/10.1093/bioinformatics/btz540</a>.
+</section><section id="bibliography" class="level1 unnumbered"><h1 class="unnumbered">References</h1>
+<div id="refs" class="references csl-bib-body hanging-indent" data-line-spacing="2" role="list" style="display: none">
+<div id="ref-Abdennur_2019" class="csl-entry" role="listitem">
+Abdennur, N., &amp; Mirny, L. A. (2019). Cooler: Scalable storage for hi-c data and other genomically labeled arrays. <em>Bioinformatics</em>, <em>36</em>(1), 311–316. <a href="https://doi.org/10.1093/bioinformatics/btz540">https://doi.org/10.1093/bioinformatics/btz540</a>
 </div>
-<div id="ref-Dekker2002Feb" class="csl-entry" role="listitem">
-Dekker, Job, Karsten Rippe, Martijn Dekker, and Nancy Kleckner. 2002. <span>“<span>Capturing Chromosome Conformation</span>.”</span> <em>Science</em> 295 (5558): 1306–11. <a href="https://doi.org/10.1126/science.1067799">https://doi.org/10.1126/science.1067799</a>.
+<div id="ref-Davies_2017" class="csl-entry" role="listitem">
+Davies, J. O. J., Oudelaar, A. M., Higgs, D. R., &amp; Hughes, J. R. (2017). How best to identify chromosomal interactions: A comparison of approaches. <em>Nature Methods</em>, <em>14</em>(2), 125–134. <a href="https://doi.org/10.1038/nmeth.4146">https://doi.org/10.1038/nmeth.4146</a>
 </div>
-<div id="ref-Deshpande2022Oct" class="csl-entry" role="listitem">
-Deshpande, Aditya S., Netha Ulahannan, Matthew Pendleton, Xiaoguang Dai, Lynn Ly, Julie M. Behr, Stefan Schwenk, et al. 2022. <span>“<span class="nocase">Identifying synergistic high-order 3D chromatin conformations from genome-scale nanopore concatemer sequencing</span>.”</span> <em>Nat. Biotechnol.</em> 40 (October): 1488–99. <a href="https://doi.org/10.1038/s41587-022-01289-z">https://doi.org/10.1038/s41587-022-01289-z</a>.
+<div id="ref-Dekker_2002" class="csl-entry" role="listitem">
+Dekker, J., Rippe, K., Dekker, M., &amp; Kleckner, N. (2002). Capturing chromosome conformation. <em>Science</em>, <em>295</em>(5558), 1306–1311. <a href="https://doi.org/10.1126/science.1067799">https://doi.org/10.1126/science.1067799</a>
 </div>
-<div id="ref-Durand2016Jul" class="csl-entry" role="listitem">
-Durand, Neva C., Muhammad S. Shamim, Ido Machol, Suhas S. P. Rao, Miriam H. Huntley, Eric S. Lander, and Erez Lieberman Aiden. 2016. <span>“<span class="nocase">Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments</span>.”</span> <em>Cell Systems</em> 3 (1): 95. <a href="https://doi.org/10.1016/j.cels.2016.07.002">https://doi.org/10.1016/j.cels.2016.07.002</a>.
+<div id="ref-Deshpande_2022" class="csl-entry" role="listitem">
+Deshpande, A. S., Ulahannan, N., Pendleton, M., Dai, X., Ly, L., Behr, J. M., Schwenk, S., Liao, W., Augello, M. A., Tyer, C., Rughani, P., Kudman, S., Tian, H., Otis, H. G., Adney, E., Wilkes, D., Mosquera, J. M., Barbieri, C. E., Melnick, A., … Imieliński, M. (2022). Identifying synergistic high-order 3D chromatin conformations from genome-scale nanopore concatemer sequencing. <em>Nature Biotechnology</em>, <em>40</em>(10), 1488–1499. <a href="https://doi.org/10.1038/s41587-022-01289-z">https://doi.org/10.1038/s41587-022-01289-z</a>
 </div>
-<div id="ref-J.O.2017Jan" class="csl-entry" role="listitem">
-J. O., Davies, Oudelaar A. M., Higgs D. R., and Hughes J. R. 2017. <span>“<span class="nocase">How best to identify chromosomal interactions: a comparison of approaches.</span>”</span> <em>Nat. Methods</em> 14 (2): 125–34. <a href="https://doi.org/10.1038/nmeth.4146">https://doi.org/10.1038/nmeth.4146</a>.
+<div id="ref-Durand_2016" class="csl-entry" role="listitem">
+Durand, N. C., Shamim, M. S., Machol, I., Rao, S. S. P., Huntley, M. H., Lander, E. S., &amp; Aiden, E. L. (2016). Juicer provides a one-click system for analyzing loop-resolution hi-c experiments. <em>Cell Systems</em>, <em>3</em>(1), 95–98. <a href="https://doi.org/10.1016/j.cels.2016.07.002">https://doi.org/10.1016/j.cels.2016.07.002</a>
 </div>
-<div id="ref-Lieberman-Aiden2009Oct" class="csl-entry" role="listitem">
-Lieberman-Aiden, Erez, Nynke L. van Berkum, Louise Williams, Maxim Imakaev, Tobias Ragoczy, Agnes Telling, Ido Amit, et al. 2009. <span>“<span class="nocase">Comprehensive mapping of long range interactions reveals folding principles of the human genome</span>.”</span> <em>Science (New York, N.Y.)</em> 326 (5950): 289. <a href="https://doi.org/10.1126/science.1181369">https://doi.org/10.1126/science.1181369</a>.
+<div id="ref-Lieberman_Aiden_2009" class="csl-entry" role="listitem">
+Lieberman-Aiden, E., Berkum, N. L. van, Williams, L., Imakaev, M., Ragoczy, T., Telling, A., Amit, I., Lajoie, B. R., Sabo, P. J., Dorschner, M. O., Sandstrom, R., Bernstein, B., Bender, M. A., Groudine, M., Gnirke, A., Stamatoyannopoulos, J., Mirny, L. A., Lander, E. S., &amp; Dekker, J. (2009). Comprehensive mapping of long-range interactions reveals folding principles of the human genome. <em>Science</em>, <em>326</em>(5950), 289–293. <a href="https://doi.org/10.1126/science.1181369">https://doi.org/10.1126/science.1181369</a>
 </div>
-<div id="ref-Open2C2023Jan" class="csl-entry" role="listitem">
-Open2C, Nezar Abdennur, Geoffrey Fudenberg, Ilya M. Flyamer, Aleksandra A. Galitsyna, Anton Goloborodko, Maxim Imakaev, and Sergey V. Venev. 2023. <span>“<span class="nocase">Pairtools: from sequencing data to chromosome contacts</span>.”</span> <em>bioRxiv</em>, February, 2023.02.13.528389. <a href="https://doi.org/10.1101/2023.02.13.528389">https://doi.org/10.1101/2023.02.13.528389</a>.
+<div id="ref-Open2C_2023" class="csl-entry" role="listitem">
+Open2C, Abdennur, N., Fudenberg, G., Flyamer, I. M., Galitsyna, A. A., Goloborodko, A., Imakaev, M., &amp; Venev, S. V. (2023). <em>Pairtools: From sequencing data to chromosome contacts</em>. <a href="https://doi.org/10.1101/2023.02.13.528389">https://doi.org/10.1101/2023.02.13.528389</a>
 </div>
-<div id="ref-Servant2015Dec" class="csl-entry" role="listitem">
-Servant, Nicolas, Nelle Varoquaux, Bryan R. Lajoie, Eric Viara, Chong-Jian Chen, Jean-Philippe Vert, Edith Heard, Job Dekker, and Emmanuel Barillot. 2015. <span>“<span class="nocase">HiC-Pro: an optimized and flexible pipeline for Hi-C data processing</span>.”</span> <em>Genome Biol.</em> 16 (1): 1–11. <a href="https://doi.org/10.1186/s13059-015-0831-x">https://doi.org/10.1186/s13059-015-0831-x</a>.
+<div id="ref-Servant_2015" class="csl-entry" role="listitem">
+Servant, N., Varoquaux, N., Lajoie, B. R., Viara, E., Chen, C.-J., Vert, J.-P., Heard, E., Dekker, J., &amp; Barillot, E. (2015). <span>HiC</span>-pro: An optimized and flexible pipeline for hi-c data processing. <em>Genome Biology</em>, <em>16</em>(1). <a href="https://doi.org/10.1186/s13059-015-0831-x">https://doi.org/10.1186/s13059-015-0831-x</a>
 </div>
-<div id="ref-Tavares-Cadete2020Dec" class="csl-entry" role="listitem">
-Tavares-Cadete, Filipe, Davood Norouzi, Bastiaan Dekker, Yu Liu, and Job Dekker. 2020. <span>“<span class="nocase">Multi-contact 3C reveals that the human genome during interphase is largely not entangled</span>.”</span> <em>Nat. Struct. Mol. Biol.</em> 27 (December): 1105–14. <a href="https://doi.org/10.1038/s41594-020-0506-5">https://doi.org/10.1038/s41594-020-0506-5</a>.
+<div id="ref-Tavares_Cadete_2020" class="csl-entry" role="listitem">
+Tavares-Cadete, F., Norouzi, D., Dekker, B., Liu, Y., &amp; Dekker, J. (2020). Multi-contact 3C reveals that the human genome during interphase is largely not entangled. <em>Nature Structural <span>&amp;</span>Amp<span class="math inline">\(\mathsemicolon\)</span> Molecular Biology</em>, <em>27</em>(12), 1105–1114. <a href="https://doi.org/10.1038/s41594-020-0506-5">https://doi.org/10.1038/s41594-020-0506-5</a>
 </div>
 </div>
 </section></main><!-- /main --><script id="quarto-html-after-body" type="application/javascript">
diff --git a/search.json b/search.json
index a3e5a08..62899b1 100644
--- a/search.json
+++ b/search.json
@@ -4,7 +4,7 @@
     "href": "index.html",
     "title": "Orchestrating Hi-C analysis with Bioconductor",
     "section": "",
-    "text": "Welcome\nThis is the landing page of the “Orchestrating Hi-C analysis with Bioconductor” book. The primary aim of this book is to introduce the R user to Hi-C analysis. This book starts with key concepts important for the analysis of chromatin conformation capture and then presents Bioconductor tools that can be leveraged to process, analyze, explore and visualize Hi-C data.\nAuthors: Jacques Serizay [aut, cre]Version: 1.1.0Modified: 2023-04-14Compiled: 2023-09-25Environment: R version 4.3.1 (2023-06-16), Bioconductor 3.18License: MIT + file LICENSECopyright: J. Serizay\nThis book is divided in three parts:\nPart I: Introduction to Hi-C analysis\nPart II: In-depth Hi-C analysis\nPart III: Hi-C analysis workflows"
+    "text": "Welcome\nThis is the landing page of the “Orchestrating Hi-C analysis with Bioconductor” book. The primary aim of this book is to introduce the R user to Hi-C analysis. This book starts with key concepts important for the analysis of chromatin conformation capture and then presents Bioconductor tools that can be leveraged to process, analyze, explore and visualize Hi-C data.\nAuthors: Jacques Serizay [aut, cre]Version: 1.1.0Modified: 2023-04-14Compiled: 2023-10-19Environment: R version 4.3.1 (2023-06-16), Bioconductor 3.18License: MIT + file LICENSECopyright: J. Serizay\nThis book is divided in three parts:\nPart I: Introduction to Hi-C analysis\nPart II: In-depth Hi-C analysis\nPart III: Hi-C analysis workflows"
   },
   {
     "objectID": "index.html#general-audience",
@@ -39,14 +39,14 @@
     "href": "index.html#session-info",
     "title": "Orchestrating Hi-C analysis with Bioconductor",
     "section": "Session info",
-    "text": "Session info\n\nsessioninfo::session_info()\n##  ─ Session info ────────────────────────────────────────────────────────────\n##   setting  value\n##   version  R version 4.3.1 (2023-06-16)\n##   os       Ubuntu 22.04.3 LTS\n##   system   x86_64, linux-gnu\n##   ui       X11\n##   language (EN)\n##   collate  en_US.UTF-8\n##   ctype    en_US.UTF-8\n##   tz       Etc/UTC\n##   date     2023-09-25\n##   pandoc   3.1.1 @ /usr/local/bin/ (via rmarkdown)\n##  \n##  ─ Packages ────────────────────────────────────────────────────────────────\n##   package                * version   date (UTC) lib source\n##   abind                    1.4-5     2016-07-21 [1] CRAN (R 4.3.1)\n##   AnnotationDbi            1.63.2    2023-07-02 [1] Bioconductor\n##   AnnotationHub          * 3.9.2     2023-08-24 [1] Bioconductor\n##   basilisk                 1.13.1    2023-06-07 [1] Bioconductor\n##   basilisk.utils           1.13.3    2023-09-04 [1] Bioconductor\n##   beeswarm                 0.4.0     2021-06-01 [1] CRAN (R 4.3.1)\n##   Biobase                  2.61.0    2023-04-25 [1] Bioconductor\n##   BiocFileCache          * 2.9.1     2023-07-12 [1] Bioconductor\n##   BiocGenerics           * 0.47.0    2023-04-25 [1] Bioconductor\n##   BiocIO                   1.11.0    2023-04-25 [1] Bioconductor\n##   BiocManager              1.30.22   2023-08-08 [1] CRAN (R 4.3.1)\n##   BiocParallel             1.35.4    2023-08-17 [1] Bioconductor\n##   BiocStyle                2.29.2    2023-09-14 [1] Bioconductor\n##   BiocVersion              3.18.0    2023-04-25 [1] Bioconductor\n##   Biostrings               2.69.2    2023-07-02 [1] Bioconductor\n##   bit                      4.0.5     2022-11-15 [1] CRAN (R 4.3.1)\n##   bit64                    4.0.5     2020-08-30 [1] CRAN (R 4.3.1)\n##   bitops                   1.0-7     2021-04-24 [1] CRAN (R 4.3.1)\n##   blob                     1.2.4     2023-03-17 [1] CRAN (R 4.3.1)\n##   bookdown                 0.35      2023-08-09 [1] CRAN (R 4.3.1)\n##   cachem                   1.0.8     2023-05-01 [1] CRAN (R 4.3.1)\n##   cli                      3.6.1     2023-03-23 [1] CRAN (R 4.3.1)\n##   CodeDepends              0.6.5     2018-07-17 [1] CRAN (R 4.3.1)\n##   codetools                0.2-19    2023-02-01 [2] CRAN (R 4.3.1)\n##   colorspace               2.1-0     2023-01-23 [1] CRAN (R 4.3.1)\n##   crayon                   1.5.2     2022-09-29 [1] CRAN (R 4.3.1)\n##   curl                     5.0.2     2023-08-14 [1] CRAN (R 4.3.1)\n##   data.table               1.14.8    2023-02-17 [1] CRAN (R 4.3.1)\n##   DBI                      1.1.3     2022-06-18 [1] CRAN (R 4.3.1)\n##   dbplyr                 * 2.3.3     2023-07-07 [1] CRAN (R 4.3.1)\n##   DelayedArray             0.27.10   2023-07-28 [1] Bioconductor\n##   digest                   0.6.33    2023-07-07 [1] CRAN (R 4.3.1)\n##   dir.expiry               1.9.0     2023-04-25 [1] Bioconductor\n##   DNAZooData             * 1.1.0     2023-04-27 [1] Bioconductor\n##   dplyr                    1.1.3     2023-09-03 [1] CRAN (R 4.3.1)\n##   ellipsis                 0.3.2     2021-04-29 [1] CRAN (R 4.3.1)\n##   evaluate                 0.21      2023-05-05 [1] CRAN (R 4.3.1)\n##   ExperimentHub          * 2.9.1     2023-07-12 [1] Bioconductor\n##   fansi                    1.0.4     2023-01-22 [1] CRAN (R 4.3.1)\n##   fastmap                  1.1.1     2023-02-24 [1] CRAN (R 4.3.1)\n##   filelock                 1.0.2     2018-10-05 [1] CRAN (R 4.3.1)\n##   fourDNData             * 1.1.0     2023-04-27 [1] Bioconductor\n##   generics                 0.1.3     2022-07-05 [1] CRAN (R 4.3.1)\n##   GenomeInfoDb             1.37.4    2023-09-07 [1] Bioconductor\n##   GenomeInfoDbData         1.2.10    2023-09-25 [1] Bioconductor\n##   GenomicRanges            1.53.1    2023-05-04 [1] Bioconductor\n##   ggbeeswarm               0.7.2     2023-04-29 [1] CRAN (R 4.3.1)\n##   ggplot2                  3.4.3     2023-08-14 [1] CRAN (R 4.3.1)\n##   ggrastr                  1.0.2     2023-06-01 [1] CRAN (R 4.3.1)\n##   glue                     1.6.2     2022-02-24 [1] CRAN (R 4.3.1)\n##   graph                    1.79.2    2023-09-22 [1] Bioconductor\n##   gtable                   0.3.4     2023-08-21 [1] CRAN (R 4.3.1)\n##   HiCExperiment          * 1.1.2     2023-09-04 [1] Bioconductor\n##   HiContacts             * 1.3.2     2023-09-04 [1] Bioconductor\n##   HiContactsData         * 1.3.0     2023-04-27 [1] Bioconductor\n##   HiCool                 * 1.1.0     2023-05-19 [1] Bioconductor\n##   hms                      1.1.3     2023-03-21 [1] CRAN (R 4.3.1)\n##   htmltools                0.5.6     2023-08-10 [1] CRAN (R 4.3.1)\n##   htmlwidgets              1.6.2     2023-03-17 [1] CRAN (R 4.3.1)\n##   httpuv                   1.6.11    2023-05-11 [1] CRAN (R 4.3.1)\n##   httr                     1.4.7     2023-08-15 [1] CRAN (R 4.3.1)\n##   InteractionSet           1.29.1    2023-06-14 [1] Bioconductor\n##   interactiveDisplayBase   1.39.0    2023-04-25 [1] Bioconductor\n##   IRanges                  2.35.2    2023-06-22 [1] Bioconductor\n##   jsonlite                 1.8.7     2023-06-29 [1] CRAN (R 4.3.1)\n##   KEGGREST                 1.41.0    2023-04-25 [1] Bioconductor\n##   knitr                    1.44      2023-09-11 [1] CRAN (R 4.3.1)\n##   later                    1.3.1     2023-05-02 [1] CRAN (R 4.3.1)\n##   lattice                  0.21-8    2023-04-05 [2] CRAN (R 4.3.1)\n##   lazyeval                 0.2.2     2019-03-15 [1] CRAN (R 4.3.1)\n##   lifecycle                1.0.3     2022-10-07 [1] CRAN (R 4.3.1)\n##   magrittr                 2.0.3     2022-03-30 [1] CRAN (R 4.3.1)\n##   Matrix                   1.6-1.1   2023-09-18 [1] CRAN (R 4.3.1)\n##   MatrixGenerics           1.13.1    2023-07-25 [1] Bioconductor\n##   matrixStats              1.0.0     2023-06-02 [1] CRAN (R 4.3.1)\n##   memoise                  2.0.1     2021-11-26 [1] CRAN (R 4.3.1)\n##   mime                     0.12      2021-09-28 [1] CRAN (R 4.3.1)\n##   munsell                  0.5.0     2018-06-12 [1] CRAN (R 4.3.1)\n##   OHCA                   * 1.1.0     2023-09-25 [1] local\n##   pillar                   1.9.0     2023-03-22 [1] CRAN (R 4.3.1)\n##   pkgconfig                2.0.3     2019-09-22 [1] CRAN (R 4.3.1)\n##   plotly                   4.10.2    2023-06-03 [1] CRAN (R 4.3.1)\n##   png                      0.1-8     2022-11-29 [1] CRAN (R 4.3.1)\n##   promises                 1.2.1     2023-08-10 [1] CRAN (R 4.3.1)\n##   purrr                    1.0.2     2023-08-10 [1] CRAN (R 4.3.1)\n##   R6                       2.5.1     2021-08-19 [1] CRAN (R 4.3.1)\n##   rappdirs                 0.3.3     2021-01-31 [1] CRAN (R 4.3.1)\n##   Rcpp                     1.0.11    2023-07-06 [1] CRAN (R 4.3.1)\n##   RCurl                    1.98-1.12 2023-03-27 [1] CRAN (R 4.3.1)\n##   readr                    2.1.4     2023-02-10 [1] CRAN (R 4.3.1)\n##   rebook                   1.11.1    2023-05-25 [1] Bioconductor\n##   reticulate               1.32.0    2023-09-11 [1] CRAN (R 4.3.1)\n##   rhdf5                    2.45.1    2023-07-10 [1] Bioconductor\n##   rhdf5filters             1.13.5    2023-07-19 [1] Bioconductor\n##   Rhdf5lib                 1.23.2    2023-09-10 [1] Bioconductor\n##   rjson                    0.2.21    2022-01-09 [1] CRAN (R 4.3.1)\n##   rlang                    1.1.1     2023-04-28 [1] CRAN (R 4.3.1)\n##   rmarkdown                2.25      2023-09-18 [1] CRAN (R 4.3.1)\n##   rmdformats               1.0.4     2022-05-17 [1] CRAN (R 4.3.1)\n##   RSpectra                 0.16-1    2022-04-24 [1] CRAN (R 4.3.1)\n##   RSQLite                  2.3.1     2023-04-03 [1] CRAN (R 4.3.1)\n##   rstudioapi               0.15.0    2023-07-07 [1] CRAN (R 4.3.1)\n##   S4Arrays                 1.1.6     2023-08-30 [1] Bioconductor\n##   S4Vectors                0.39.2    2023-09-22 [1] Bioconductor\n##   scales                   1.2.1     2022-08-20 [1] CRAN (R 4.3.1)\n##   sessioninfo              1.2.2     2021-12-06 [1] CRAN (R 4.3.1)\n##   shiny                    1.7.5     2023-08-12 [1] CRAN (R 4.3.1)\n##   SparseArray              1.1.12    2023-08-31 [1] Bioconductor\n##   strawr                   0.0.91    2023-03-29 [1] CRAN (R 4.3.1)\n##   stringi                  1.7.12    2023-01-11 [1] CRAN (R 4.3.1)\n##   stringr                  1.5.0     2022-12-02 [1] CRAN (R 4.3.1)\n##   SummarizedExperiment     1.31.1    2023-05-01 [1] Bioconductor\n##   tibble                   3.2.1     2023-03-20 [1] CRAN (R 4.3.1)\n##   tidyr                    1.3.0     2023-01-24 [1] CRAN (R 4.3.1)\n##   tidyselect               1.2.0     2022-10-10 [1] CRAN (R 4.3.1)\n##   tzdb                     0.4.0     2023-05-12 [1] CRAN (R 4.3.1)\n##   utf8                     1.2.3     2023-01-31 [1] CRAN (R 4.3.1)\n##   vctrs                    0.6.3     2023-06-14 [1] CRAN (R 4.3.1)\n##   vipor                    0.4.5     2017-03-22 [1] CRAN (R 4.3.1)\n##   viridisLite              0.4.2     2023-05-02 [1] CRAN (R 4.3.1)\n##   vroom                    1.6.3     2023-04-28 [1] CRAN (R 4.3.1)\n##   xfun                     0.40      2023-08-09 [1] CRAN (R 4.3.1)\n##   XML                      3.99-0.14 2023-03-19 [1] CRAN (R 4.3.1)\n##   xtable                   1.8-4     2019-04-21 [1] CRAN (R 4.3.1)\n##   XVector                  0.41.1    2023-05-03 [1] Bioconductor\n##   yaml                     2.3.7     2023-01-23 [1] CRAN (R 4.3.1)\n##   zlibbioc                 1.47.0    2023-04-25 [1] Bioconductor\n##  \n##   [1] /usr/local/lib/R/site-library\n##   [2] /usr/local/lib/R/library\n##  \n##  ───────────────────────────────────────────────────────────────────────────"
+    "text": "Session info\n\nsessioninfo::session_info()\n##  ─ Session info ────────────────────────────────────────────────────────────\n##   setting  value\n##   version  R version 4.3.1 (2023-06-16)\n##   os       Ubuntu 22.04.3 LTS\n##   system   x86_64, linux-gnu\n##   ui       X11\n##   language (EN)\n##   collate  en_US.UTF-8\n##   ctype    en_US.UTF-8\n##   tz       Etc/UTC\n##   date     2023-10-19\n##   pandoc   3.1.1 @ /usr/local/bin/ (via rmarkdown)\n##  \n##  ─ Packages ────────────────────────────────────────────────────────────────\n##   package                * version   date (UTC) lib source\n##   abind                    1.4-5     2016-07-21 [1] CRAN (R 4.3.1)\n##   AnnotationDbi            1.63.2    2023-07-02 [1] Bioconductor\n##   AnnotationHub          * 3.9.2     2023-08-24 [1] Bioconductor\n##   basilisk                 1.13.4    2023-10-04 [1] Bioconductor\n##   basilisk.utils           1.13.3    2023-09-04 [1] Bioconductor\n##   beeswarm                 0.4.0     2021-06-01 [1] CRAN (R 4.3.1)\n##   Biobase                  2.61.0    2023-04-25 [1] Bioconductor\n##   BiocFileCache          * 2.9.1     2023-07-12 [1] Bioconductor\n##   BiocGenerics           * 0.47.0    2023-04-25 [1] Bioconductor\n##   BiocIO                   1.11.0    2023-04-25 [1] Bioconductor\n##   BiocManager              1.30.22   2023-08-08 [1] CRAN (R 4.3.1)\n##   BiocParallel             1.35.4    2023-08-17 [1] Bioconductor\n##   BiocStyle                2.29.2    2023-09-14 [1] Bioconductor\n##   BiocVersion              3.18.0    2023-04-25 [1] Bioconductor\n##   Biostrings               2.69.2    2023-07-02 [1] Bioconductor\n##   bit                      4.0.5     2022-11-15 [1] CRAN (R 4.3.1)\n##   bit64                    4.0.5     2020-08-30 [1] CRAN (R 4.3.1)\n##   bitops                   1.0-7     2021-04-24 [1] CRAN (R 4.3.1)\n##   blob                     1.2.4     2023-03-17 [1] CRAN (R 4.3.1)\n##   bookdown                 0.36      2023-10-16 [1] CRAN (R 4.3.1)\n##   cachem                   1.0.8     2023-05-01 [1] CRAN (R 4.3.1)\n##   cli                      3.6.1     2023-03-23 [1] CRAN (R 4.3.1)\n##   CodeDepends              0.6.5     2018-07-17 [1] CRAN (R 4.3.1)\n##   codetools                0.2-19    2023-02-01 [2] CRAN (R 4.3.1)\n##   colorspace               2.1-0     2023-01-23 [1] CRAN (R 4.3.1)\n##   crayon                   1.5.2     2022-09-29 [1] CRAN (R 4.3.1)\n##   curl                     5.1.0     2023-10-02 [1] CRAN (R 4.3.1)\n##   data.table               1.14.8    2023-02-17 [1] CRAN (R 4.3.1)\n##   DBI                      1.1.3     2022-06-18 [1] CRAN (R 4.3.1)\n##   dbplyr                 * 2.3.4     2023-09-26 [1] CRAN (R 4.3.1)\n##   DelayedArray             0.27.10   2023-07-28 [1] Bioconductor\n##   digest                   0.6.33    2023-07-07 [1] CRAN (R 4.3.1)\n##   dir.expiry               1.9.0     2023-04-25 [1] Bioconductor\n##   DNAZooData             * 1.1.0     2023-04-27 [1] Bioconductor\n##   dplyr                    1.1.3     2023-09-03 [1] CRAN (R 4.3.1)\n##   ellipsis                 0.3.2     2021-04-29 [1] CRAN (R 4.3.1)\n##   evaluate                 0.22      2023-09-29 [1] CRAN (R 4.3.1)\n##   ExperimentHub          * 2.9.1     2023-07-12 [1] Bioconductor\n##   fansi                    1.0.5     2023-10-08 [1] CRAN (R 4.3.1)\n##   fastmap                  1.1.1     2023-02-24 [1] CRAN (R 4.3.1)\n##   filelock                 1.0.2     2018-10-05 [1] CRAN (R 4.3.1)\n##   fourDNData             * 1.1.0     2023-04-27 [1] Bioconductor\n##   generics                 0.1.3     2022-07-05 [1] CRAN (R 4.3.1)\n##   GenomeInfoDb             1.37.6    2023-10-02 [1] Bioconductor\n##   GenomeInfoDbData         1.2.11    2023-10-19 [1] Bioconductor\n##   GenomicRanges            1.53.2    2023-10-08 [1] Bioconductor\n##   ggbeeswarm               0.7.2     2023-04-29 [1] CRAN (R 4.3.1)\n##   ggplot2                  3.4.4     2023-10-12 [1] CRAN (R 4.3.1)\n##   ggrastr                  1.0.2     2023-06-01 [1] CRAN (R 4.3.1)\n##   glue                     1.6.2     2022-02-24 [1] CRAN (R 4.3.1)\n##   graph                    1.79.4    2023-10-09 [1] Bioconductor\n##   gtable                   0.3.4     2023-08-21 [1] CRAN (R 4.3.1)\n##   HiCExperiment          * 1.1.2     2023-09-04 [1] Bioconductor\n##   HiContacts             * 1.3.2     2023-09-04 [1] Bioconductor\n##   HiContactsData         * 1.3.0     2023-04-27 [1] Bioconductor\n##   HiCool                 * 1.1.0     2023-05-19 [1] Bioconductor\n##   hms                      1.1.3     2023-03-21 [1] CRAN (R 4.3.1)\n##   htmltools                0.5.6.1   2023-10-06 [1] CRAN (R 4.3.1)\n##   htmlwidgets              1.6.2     2023-03-17 [1] CRAN (R 4.3.1)\n##   httpuv                   1.6.11    2023-05-11 [1] CRAN (R 4.3.1)\n##   httr                     1.4.7     2023-08-15 [1] CRAN (R 4.3.1)\n##   InteractionSet           1.29.1    2023-06-14 [1] Bioconductor\n##   interactiveDisplayBase   1.39.0    2023-04-25 [1] Bioconductor\n##   IRanges                  2.35.3    2023-10-12 [1] Bioconductor\n##   jsonlite                 1.8.7     2023-06-29 [1] CRAN (R 4.3.1)\n##   KEGGREST                 1.41.4    2023-09-25 [1] Bioconductor\n##   knitr                    1.44      2023-09-11 [1] CRAN (R 4.3.1)\n##   later                    1.3.1     2023-05-02 [1] CRAN (R 4.3.1)\n##   lattice                  0.21-9    2023-10-01 [1] CRAN (R 4.3.1)\n##   lazyeval                 0.2.2     2019-03-15 [1] CRAN (R 4.3.1)\n##   lifecycle                1.0.3     2022-10-07 [1] CRAN (R 4.3.1)\n##   magrittr                 2.0.3     2022-03-30 [1] CRAN (R 4.3.1)\n##   Matrix                   1.6-1.1   2023-09-18 [1] CRAN (R 4.3.1)\n##   MatrixGenerics           1.13.1    2023-07-25 [1] Bioconductor\n##   matrixStats              1.0.0     2023-06-02 [1] CRAN (R 4.3.1)\n##   memoise                  2.0.1     2021-11-26 [1] CRAN (R 4.3.1)\n##   mime                     0.12      2021-09-28 [1] CRAN (R 4.3.1)\n##   munsell                  0.5.0     2018-06-12 [1] CRAN (R 4.3.1)\n##   OHCA                   * 1.1.0     2023-10-19 [1] local\n##   pillar                   1.9.0     2023-03-22 [1] CRAN (R 4.3.1)\n##   pkgconfig                2.0.3     2019-09-22 [1] CRAN (R 4.3.1)\n##   plotly                   4.10.2    2023-06-03 [1] CRAN (R 4.3.1)\n##   png                      0.1-8     2022-11-29 [1] CRAN (R 4.3.1)\n##   promises                 1.2.1     2023-08-10 [1] CRAN (R 4.3.1)\n##   purrr                    1.0.2     2023-08-10 [1] CRAN (R 4.3.1)\n##   R6                       2.5.1     2021-08-19 [1] CRAN (R 4.3.1)\n##   rappdirs                 0.3.3     2021-01-31 [1] CRAN (R 4.3.1)\n##   Rcpp                     1.0.11    2023-07-06 [1] CRAN (R 4.3.1)\n##   RCurl                    1.98-1.12 2023-03-27 [1] CRAN (R 4.3.1)\n##   readr                    2.1.4     2023-02-10 [1] CRAN (R 4.3.1)\n##   rebook                   1.11.1    2023-05-25 [1] Bioconductor\n##   reticulate               1.34.0    2023-10-12 [1] CRAN (R 4.3.1)\n##   rhdf5                    2.45.1    2023-07-10 [1] Bioconductor\n##   rhdf5filters             1.13.5    2023-07-19 [1] Bioconductor\n##   Rhdf5lib                 1.23.2    2023-09-10 [1] Bioconductor\n##   rjson                    0.2.21    2022-01-09 [1] CRAN (R 4.3.1)\n##   rlang                    1.1.1     2023-04-28 [1] CRAN (R 4.3.1)\n##   rmarkdown                2.25      2023-09-18 [1] CRAN (R 4.3.1)\n##   rmdformats               1.0.4     2022-05-17 [1] CRAN (R 4.3.1)\n##   RSpectra                 0.16-1    2022-04-24 [1] CRAN (R 4.3.1)\n##   RSQLite                  2.3.1     2023-04-03 [1] CRAN (R 4.3.1)\n##   rstudioapi               0.15.0    2023-07-07 [1] CRAN (R 4.3.1)\n##   S4Arrays                 1.1.6     2023-08-30 [1] Bioconductor\n##   S4Vectors                0.39.3    2023-10-11 [1] Bioconductor\n##   scales                   1.2.1     2022-08-20 [1] CRAN (R 4.3.1)\n##   sessioninfo              1.2.2     2021-12-06 [1] CRAN (R 4.3.1)\n##   shiny                    1.7.5.1   2023-10-14 [1] CRAN (R 4.3.1)\n##   SparseArray              1.1.12    2023-08-31 [1] Bioconductor\n##   strawr                   0.0.91    2023-03-29 [1] CRAN (R 4.3.1)\n##   stringi                  1.7.12    2023-01-11 [1] CRAN (R 4.3.1)\n##   stringr                  1.5.0     2022-12-02 [1] CRAN (R 4.3.1)\n##   SummarizedExperiment     1.31.1    2023-05-01 [1] Bioconductor\n##   tibble                   3.2.1     2023-03-20 [1] CRAN (R 4.3.1)\n##   tidyr                    1.3.0     2023-01-24 [1] CRAN (R 4.3.1)\n##   tidyselect               1.2.0     2022-10-10 [1] CRAN (R 4.3.1)\n##   tzdb                     0.4.0     2023-05-12 [1] CRAN (R 4.3.1)\n##   utf8                     1.2.3     2023-01-31 [1] CRAN (R 4.3.1)\n##   vctrs                    0.6.4     2023-10-12 [1] CRAN (R 4.3.1)\n##   vipor                    0.4.5     2017-03-22 [1] CRAN (R 4.3.1)\n##   viridisLite              0.4.2     2023-05-02 [1] CRAN (R 4.3.1)\n##   vroom                    1.6.4     2023-10-02 [1] CRAN (R 4.3.1)\n##   xfun                     0.40      2023-08-09 [1] CRAN (R 4.3.1)\n##   XML                      3.99-0.14 2023-03-19 [1] CRAN (R 4.3.1)\n##   xtable                   1.8-4     2019-04-21 [1] CRAN (R 4.3.1)\n##   XVector                  0.41.1    2023-05-03 [1] Bioconductor\n##   yaml                     2.3.7     2023-01-23 [1] CRAN (R 4.3.1)\n##   zlibbioc                 1.47.0    2023-04-25 [1] Bioconductor\n##  \n##   [1] /usr/local/lib/R/site-library\n##   [2] /usr/local/lib/R/library\n##  \n##  ───────────────────────────────────────────────────────────────────────────"
   },
   {
     "objectID": "preamble.html",
     "href": "preamble.html",
     "title": "Preamble",
     "section": "",
-    "text": "Hi-C is an experimental method to quantify spatial interactions between any pair of genomic loci. While a number of command-line interfaces (CLI) exist to process and manipulate Hi-C data (e.g. cooler (Abdennur and Mirny (2020)), juicer (Durand et al. (2016)) and HiC-Pro (Servant et al. (2015))), they generally suffer from several limitations often found in emerging genomics techniques:\n\nNo genomic representation of Hi-C processed data: the existing CLIs can efficiently parse Hi-C data as a numerical matrix and perform a few standard quantitative operations (e.g. contact matrix binning and normalization, dimensionality reduction, etc). However, they systematically fail to represent a Hi-C contact matrix as a genomic object. Qualitative analyses (e.g. intersecting chromatin loops with genomic features, finding genes overlapping with domains, etc) therefore remain extremely tedious.\nNo format-agnostic analysis libraries. Three competing file format standards (.(m)cool, .hic and HiC-Pro files) currently exist to store Hi-C processed data and dedicated CLIs propose sets of tools specifically working with their corresponding Hi-C processed data file format. This has curbed the development of generic Hi-C data analysis libraries by favoring the emergence of several redundant tools.\nLack of integration within a biology-oriented community. While rapid development of Hi-C analysis methodology is ongoing, it is primarily driven by small-scale teams rather than by a community as a whole. This oriented development is less likely to fulfill the needs met by other investigators.\n\nIn this book, we provide an overview of a set of tools that enable processing, visualization and in-depth investigation of Hi-C data in R, ensuring intuitive integration of Hi-C data in the existing Bioconductor ecosystem. We introduce a high-level HiCExperiment data structure to represent Hi-C data, directly extending robust, pre-existing core genomic classes offered by Bioconductor. This guarantees a stable and intuitive Hi-C data representation in R as a genomic entity, which is highly interoperable and can be used by all existing analysis packages in R.\n\nOn top of the HiCExperiment data structure, the HiContacts package offers extended functionalities to perform matrix-centric and interaction-centric analysis directly on HiCExperiment objects and provides powerful visualization tools specifically designed for Hi-C data to facilitate exploratory data analysis. In addition, the HiCool package implements a processing workflow based on a lightweight library to process raw Hi-C data into binned Hi-C contact matrices ready to be imported as HiCExperiment objects. Finally, the fourDNData and DNAZooData packages offer a gateway to major public data repositories directly in R.\n\n\nPackage status\n\n\n\nGithub repo 💾\nDoc 📘\nGithub checks ✅\nBioc builds 🏗\nLifecycle 🌱\n\n\n\n\nHiCExperiment\nDoc\n\nBioc release: Bioc devel:\n\n\n\nHiContacts\nDoc\n\nBioc release: Bioc devel:\n\n\n\nHiCool\nDoc\n\nBioc release: Bioc devel:\n\n\n\nHiContactsData\nDoc\n\nBioc release: Bioc devel:\n\n\n\nDNAZooData\nDoc\n\nBioc release: Bioc devel:\n\n\n\nfourDNData\nDoc\n\nBioc release: Bioc devel:\n\n\n\n\n\n\n\n\nReferences\n\n\nAbdennur, Nezar, and Leonid A. Mirny. 2020. “Cooler: scalable storage for Hi-C data and other genomically labeled arrays.” Bioinformatics 36 (1): 311–16. https://doi.org/10.1093/bioinformatics/btz540.\n\n\nDurand, Neva C., Muhammad S. Shamim, Ido Machol, Suhas S. P. Rao, Miriam H. Huntley, Eric S. Lander, and Erez Lieberman Aiden. 2016. “Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments.” Cell Systems 3 (1): 95. https://doi.org/10.1016/j.cels.2016.07.002.\n\n\nServant, Nicolas, Nelle Varoquaux, Bryan R. Lajoie, Eric Viara, Chong-Jian Chen, Jean-Philippe Vert, Edith Heard, Job Dekker, and Emmanuel Barillot. 2015. “HiC-Pro: an optimized and flexible pipeline for Hi-C data processing.” Genome Biol. 16 (1): 1–11. https://doi.org/10.1186/s13059-015-0831-x."
+    "text": "Hi-C is an experimental method to quantify spatial interactions between any pair of genomic loci. While a number of command-line interfaces (CLI) exist to process and manipulate Hi-C data (e.g. cooler (Abdennur & Mirny (2019)), juicer (Durand et al. (2016)) and HiC-Pro (Servant et al. (2015))), they generally suffer from several limitations often found in emerging genomics techniques:\n\nNo genomic representation of Hi-C processed data: the existing CLIs can efficiently parse Hi-C data as a numerical matrix and perform a few standard quantitative operations (e.g. contact matrix binning and normalization, dimensionality reduction, etc). However, they systematically fail to represent a Hi-C contact matrix as a genomic object. Qualitative analyses (e.g. intersecting chromatin loops with genomic features, finding genes overlapping with domains, etc) therefore remain extremely tedious.\nNo format-agnostic analysis libraries. Three competing file format standards (.(m)cool, .hic and HiC-Pro files) currently exist to store Hi-C processed data and dedicated CLIs propose sets of tools specifically working with their corresponding Hi-C processed data file format. This has curbed the development of generic Hi-C data analysis libraries by favoring the emergence of several redundant tools.\nLack of integration within a biology-oriented community. While rapid development of Hi-C analysis methodology is ongoing, it is primarily driven by small-scale teams rather than by a community as a whole. This oriented development is less likely to fulfill the needs met by other investigators.\n\nIn this book, we provide an overview of a set of tools that enable processing, visualization and in-depth investigation of Hi-C data in R, ensuring intuitive integration of Hi-C data in the existing Bioconductor ecosystem. We introduce a high-level HiCExperiment data structure to represent Hi-C data, directly extending robust, pre-existing core genomic classes offered by Bioconductor. This guarantees a stable and intuitive Hi-C data representation in R as a genomic entity, which is highly interoperable and can be used by all existing analysis packages in R.\n\nOn top of the HiCExperiment data structure, the HiContacts package offers extended functionalities to perform matrix-centric and interaction-centric analysis directly on HiCExperiment objects and provides powerful visualization tools specifically designed for Hi-C data to facilitate exploratory data analysis. In addition, the HiCool package implements a processing workflow based on a lightweight library to process raw Hi-C data into binned Hi-C contact matrices ready to be imported as HiCExperiment objects. Finally, the fourDNData and DNAZooData packages offer a gateway to major public data repositories directly in R.\n\n\nPackage status\n\n\n\nGithub repo 💾\nDoc 📘\nGithub checks ✅\nBioc builds 🏗\nLifecycle 🌱\n\n\n\n\nHiCExperiment\nDoc\n\nBioc release: Bioc devel:\n\n\n\nHiContacts\nDoc\n\nBioc release: Bioc devel:\n\n\n\nHiCool\nDoc\n\nBioc release: Bioc devel:\n\n\n\nHiContactsData\nDoc\n\nBioc release: Bioc devel:\n\n\n\nDNAZooData\nDoc\n\nBioc release: Bioc devel:\n\n\n\nfourDNData\nDoc\n\nBioc release: Bioc devel:\n\n\n\n\n\n\n\n\nReferences\n\n\nAbdennur, N., & Mirny, L. A. (2019). Cooler: Scalable storage for hi-c data and other genomically labeled arrays. Bioinformatics, 36(1), 311–316. https://doi.org/10.1093/bioinformatics/btz540\n\n\nDurand, N. C., Shamim, M. S., Machol, I., Rao, S. S. P., Huntley, M. H., Lander, E. S., & Aiden, E. L. (2016). Juicer provides a one-click system for analyzing loop-resolution hi-c experiments. Cell Systems, 3(1), 95–98. https://doi.org/10.1016/j.cels.2016.07.002\n\n\nServant, N., Varoquaux, N., Lajoie, B. R., Viara, E., Chen, C.-J., Vert, J.-P., Heard, E., Dekker, J., & Barillot, E. (2015). HiC-pro: An optimized and flexible pipeline for hi-c data processing. Genome Biology, 16(1). https://doi.org/10.1186/s13059-015-0831-x"
   },
   {
     "objectID": "principles.html",
@@ -60,21 +60,28 @@
     "href": "principles.html#experimental-considerations",
     "title": "\n1  Hi-C pre-processing steps\n",
     "section": "\n1.1 Experimental considerations",
-    "text": "1.1 Experimental considerations\n\n1.1.1 Experimental approach\nThe Hi-C procedure (Lieberman-Aiden et al. (2009)) stems from the clever combination of high-throughput sequencing and Chromatin Conformation Capture (3C) experimental approach (Dekker et al. (2002)).\nIn Hi-C, chromatin is crosslinked within intact nuclei and enzymatically digested (usually with one or several restriction enzymes, but Hi-C variants using MNase or DNase exist). End-repair introduces biotinylated dNTPs and is followed by religation, which generates chimeric DNA fragments consisting of genomic loci originally lying in spatial proximity, usually crosslinked to a shared protein complex. After religation, DNA fragments are sheared, biotin-containing fragments are pulled-down and converted into a sequencing library.\n\n\n1.1.2 C variants\nA number of C variants have been proposed since the publication of the original 3C method (reviewed by J. O. et al. (2017)), the main ones being Capture-C and ChIA-PET (see procedure below).\n\nCapture-C is useful to quantify interactions between a set of regulatory elements of interest. ChIA-PET, on the other hand, can identify interactions mediated by a specific protein of interest. Finally, an increasing number of Hi-C approaches rely on long-read sequencing (e.g. Deshpande et al. (2022), Tavares-Cadete et al. (2020)) to identify clusters of 3D contacts.\n\n1.1.3 Sequencing\nHi-C libraries are traditionally sequenced with short-read technology, and are by essence paired-end libraries. For this reason, the end result of the experimental side of the Hi-C consists of two fastq files, each one containing sequences for one extremity of the DNA fragments purified during Hi-C. These are the two files we need to move on to the computational side of Hi-C.\n\n\n\n\n\n\nWhat is a fastq file?\n\n\n\nFastq files are plain text files (usually compressed, with the .gz extension). They are generated by the sequencing machine during a sequencing run, and for Hi-C, necessarily come in pairs, generally called *_R1.fq.gz and *_R2.fq.gz.\nHere is the first read listed in sample_R1.fq.gz file:\n\n\nsample_R1.fq.gz\n\n@SRR5399542.1.1 DH1DQQN1:393:H9GEWADXX:1:1101:1187:2211 length=24\nCAACTTCAATACCAGCAGCAGCAA\n+\nCCCFFFFFHHHHHJJJJJIJJJJJ\n\nAnd here is the first read listed in sample_R2.fq.gz file:\n\n\nsample_R2.fq.gz\n\n@SRR5399542.1.1 DH1DQQN1:393:H9GEWADXX:1:1101:1187:2211 length=24\nGCTGTTGTTGTTGTTGTATTTGCA\n+\n@@@FFFFFFHHHHIJJIJJHIIEH\n\nThese two reads are the first listed in their respective file. Notice how they bear the same name (first line): they form a pair. The second line corresponds to the sequence read by the sequencer, the third line is a single + separator, and the last line indicates the per-base sequencing quality following a nebulous cypher."
+    "text": "1.1 Experimental considerations\n\n1.1.1 Experimental approach\nThe Hi-C procedure (Lieberman-Aiden et al. (2009)) stems from the clever combination of high-throughput sequencing and Chromatin Conformation Capture (3C) experimental approach (Dekker et al. (2002)).\nIn Hi-C, chromatin is crosslinked within intact nuclei and enzymatically digested (usually with one or several restriction enzymes, but Hi-C variants using MNase or DNase exist). End-repair introduces biotinylated dNTPs and is followed by religation, which generates chimeric DNA fragments consisting of genomic loci originally lying in spatial proximity, usually crosslinked to a shared protein complex. After religation, DNA fragments are sheared, biotin-containing fragments are pulled-down and converted into a sequencing library.\n\n\n1.1.2 C variants\nA number of C variants have been proposed since the publication of the original 3C method (reviewed by Davies et al. (2017)), the main ones being Capture-C and ChIA-PET (see procedure below).\n\nCapture-C is useful to quantify interactions between a set of regulatory elements of interest. ChIA-PET, on the other hand, can identify interactions mediated by a specific protein of interest. Finally, an increasing number of Hi-C approaches rely on long-read sequencing (e.g. Deshpande et al. (2022), Tavares-Cadete et al. (2020)) to identify clusters of 3D contacts.\n\n1.1.3 Sequencing\nHi-C libraries are traditionally sequenced with short-read technology, and are by essence paired-end libraries. For this reason, the end result of the experimental side of the Hi-C consists of two fastq files, each one containing sequences for one extremity of the DNA fragments purified during Hi-C. These are the two files we need to move on to the computational side of Hi-C.\n\n\n\n\n\n\nWhat is a fastq file?\n\n\n\nFastq files are plain text files (usually compressed, with the .gz extension). They are generated by the sequencing machine during a sequencing run, and for Hi-C, necessarily come in pairs, generally called *_R1.fq.gz and *_R2.fq.gz.\nHere is the first read listed in sample_R1.fq.gz file:\n\n\nsample_R1.fq.gz\n\n@SRR5399542.1.1 DH1DQQN1:393:H9GEWADXX:1:1101:1187:2211 length=24\nCAACTTCAATACCAGCAGCAGCAA\n+\nCCCFFFFFHHHHHJJJJJIJJJJJ\n\nAnd here is the first read listed in sample_R2.fq.gz file:\n\n\nsample_R2.fq.gz\n\n@SRR5399542.1.1 DH1DQQN1:393:H9GEWADXX:1:1101:1187:2211 length=24\nGCTGTTGTTGTTGTTGTATTTGCA\n+\n@@@FFFFFFHHHHIJJIJJHIIEH\n\nThese two reads are the first listed in their respective file. Notice how they bear the same name (first line): they form a pair. The second line corresponds to the sequence read by the sequencer, the third line is a single + separator, and the last line indicates the per-base sequencing quality following a nebulous cypher."
   },
   {
     "objectID": "principles.html#hi-c-file-formats",
     "href": "principles.html#hi-c-file-formats",
     "title": "\n1  Hi-C pre-processing steps\n",
     "section": "\n1.2 Hi-C file formats",
-    "text": "1.2 Hi-C file formats\nTwo important output files are typically generated during Hi-C data pre-processing:\n\nA “pairs” file\nA binned “contact matrix” file\n\nWe will now describe the structure of these different types of files. Directly jump to the next chapter if you want to know more about importing data from a contact matrix or a pairs file in R.\n\n1.2.1 Pairs files\nA “pairs” file (optionally, but generally filtered and sorted) is the direct output of processing Hi-C fastq files. It stores information about putative proximity contacts identified by digestion/religation, in the lossless, human-readable, indexable format: the .pairs format.\n\n\n\n\n\n\nWhat is a .pairs file?\n\n\n\nA .pairs file is organized in a header followed by a body:\n\n\nheader: starts with #\n\nRequired entries\n\nFirst line: ## pairs format v1.0\n\n\n#columns: column contents and ordering (e.g. #columns: readID chr1 pos1 chr2 pos2 strand1 strand2 &lt;column_name&gt; &lt;column_name&gt; ...)\n\n#chromsize: chromosome names and their size in bp, one chromosome per line, in the same order that defines ordering between mates (e.g. #chromsize: chr1 230218). Chromosome order is actually defined by this header, not by the order of pairs listed in the body!\n\n\nOptional entries with reserved header keys (sorted, shape, command, genome_assembly)\n\n\n#sorted: to indicate the sorting mechanism (e.g. #sorted: chr1-chr2-pos1-pos2, #sorted: chr1-pos1, #sorted: none)\n\n#shape: to specify whether the matrix is stored as upper triangle or lower triangle (#shape: upper triangle, #shape: lower triangle)\n\n#command: to specify any command, e.g. the command used to generate the pairs file (#command: bam2pairs mysample.bam mysample)\n\n#genome_assembly: to specify the genome assembly (e.g. #genome_assembly: hg38)\n\n\n\n\n\nbody: tab-separated columns\n\n7 reserved (4 of them required) columns:\n\nreadID, chr1, pos1, chr2, pos2, strand1, strand2\nColumns 2-5 (chr1, pos1, chr2, pos2) are required and cannot have missing values\nFor column 1, 6 & 7: missing values are annotated with a single-character dummy (.)\n\n\n2 extra reserved, optional column names:\n\n\nfrag1, frag2: restriction enzyme fragment index used by Juicer\n\n\n\nAny number of optional columns can be added\n\n\n\n\n\nsample.pairs\n\n## pairs format v1.0\n#sorted: chr1-chr2-pos1-pos2\n#shape: upper triangle\n#genome_assembly: hg38\n#chromsize: chr1 249250621\n#chromsize: chr2 243199373\n#chromsize: chr3 198022430\n...\n#columns: readID chr1 pos1 chr2 pos2 strand1 strand2\nEAS139:136:FC706VJ:2:2104:23462:197393 chr1 10000 chr1 20000 + +\nEAS139:136:FC706VJ:2:8762:23765:128766 chr1 50000 chr1 70000 + +\nEAS139:136:FC706VJ:2:2342:15343:9863 chr1 60000 chr2 10000 + +\nEAS139:136:FC706VJ:2:1286:25:275154 chr1 30000 chr3 40000 + -\n\n\n\nMore information about the conventions related to this text file are provided by the 4DN consortium, which originally formalized the specifications of this file format.\n\n1.2.2 Binned contact matrix files\n\n1.2.2.1 Binning pairs into a matrix\nThe action of “binning” a .pairs file into a contact matrix consists in (1) discretizing a genome reference into genomic bins, (2) attributing bins for each pair’s extremity and (3) computing the interaction frequency between any pair of genomic bins, i.e. the “contact matrix”.\nFor instance, here is a dummy .pairs file with a total of 5 pairs:\n\n\ndummy.pairs\n\n## pairs format v1.0\n#sorted: chr1-chr2-pos1-pos2\n#columns: readID chr1 pos1 chr2 pos2 strand1 strand2\n#chromsize: chr1 389\n. chr1 162 chr1 172 . . \n. chr1 180 chr1 192 . . \n. chr1 183 chr1 254 . .\n. chr1 221 chr1 273 . . \n. chr1 254 chr1 298 . . \n\nNote that this genome reference is made of a single chromosome (chr1), very short (length of 389). By binning this chromosome in 100bp-wide bins (100 bp is the resolution), one would optain the following four bins:\n\n\nbins.bed\n\n&lt;chr&gt;  &lt;pos&gt; &lt;bin&gt;\nchr1   1     100\nchr1   101   200\nchr1   201   300\nchr1   301   389\n\nEach pair extremity can be changed to an integer indicating the position of the bin it falls in, e.g. for the left-hand extremity of the pairs file printed hereinabove (bin1):\n&lt;chr1&gt;  &lt;pos1&gt;  -&gt;  &lt;bin1&gt;\nchr1    162     -&gt;  2\nchr1    180     -&gt;  2\nchr1    183     -&gt;  2\nchr1    221     -&gt;  3\nchr1    254     -&gt;  3\nSimilarly for the right-hand extremity of the pairs file (bin2):\n&lt;chr2&gt;  &lt;pos2&gt;  -&gt;  &lt;bin2&gt;\nchr1    172     -&gt;  chr1 2\nchr1    192     -&gt;  chr1 2\nchr1    254     -&gt;  chr1 3\nchr1    273     -&gt;  chr1 3\nchr1    298     -&gt;  chr1 3\nBy pasting side-to-side the left-hand and right-hand extremities of each pair, the .pairs file can be turned into something like:\n&lt;bin1&gt; &lt;bin2&gt;\n2      2\n2      2\n2      3\n3      3\n3      3\nAnd if we now count the number of each &lt;bin1&gt; &lt;bin2&gt; combinaison, adding a third &lt;count&gt; column, we end up with a count.matrix text file:\n\n\ncount.matrix\n\n&lt;bin1&gt; &lt;bin2&gt;  &lt;count&gt;\n2      2       2\n2      3       1\n3      3       2\n\nThis count.matrix file lists a total of 5 pairs, and in which bin each extremity of each pair is contained. Thus, a count matrix is a lossy file format, as it “rounds up” the position of each pair’s extremity to the genomic bin containing it.\nThis “i-j-x” 3-column format, in which i-j relate to a pair of “coordinates” indices (or a pair of genomic bin indices) in a matrix, and x relates to a score associated with the pair of indices, is generally called a “COO sparse matrix”.\nIn this context, the regions.bed acts as a secondary “dictionary” describing the nature of i and j indices, i.e. the location of genomic bins.\n\n1.2.2.2 Plain-text matrices: HiC-Pro style\nThe HiC-Pro pipeline (Servant et al. (2015)) outputs 2 text files: a regions.bed file and a count.matrix file. They are generated by the exact process explained above.\nTogether, these two files can describe the interaction frequency between any pair of genomic loci. They are non-binarized text files, and as such are technically human-readable. However, it is relatively hard to get a grasp of these files compared to a plain .pairs file, as information regarding genomic bins and interaction frequencies are stored in separate files. Moreover, because they are non-binarized, these files often end up using a large disk space and cannot be easily indexed. This prevents easy subsetting of the data stored in these files.\n.(m)cool and .hic file formats are two standards addressing these limitations.\n\n1.2.2.3 .(m)cool matrices\nThe .cool format has been formally defined in Abdennur and Mirny (2020) and is a particular type of HDF5 (Hierarchical Data Format) file. It is an indexed archive file storing rectangular tables called:\n\n\nbins: containing the same information than the regions.bed file;\n\npixels: containing the same information than the count.matrix (each “pixel” is a pair of 2 bins and has one or several associated scores);\n\nchroms: summarizing the order and length of the chromosomes present in a Hi-C contact matrix;\n\nindexes: allowing random access, i.e. parsing of only a subset of the data without having to read through the entire set of data.\n\n\nA single .pairs file binned at different resolutions can also be saved into a single, multi-resolution .mcool file. .mcool essentially consists of nested .cool files.\nImportantly, as an HDF5-based format, .cool files are binarized, indexed and highly-compressed. This has two major benefits:\n\nSmaller disk storage footprint\n\nRapid subsetting of the data through random access\n\n\nMoreover, parsing .cool files is possible using HDF standard APIs.\n\n1.2.2.4 .hic matrices\nThe .hic format is another type of binarized, indexed and highly-compressed file (Durand et al. (2016)). It can store virtually the same information than a .cool file. However, parsing .hic files is not as straightforward as .cool files, as it does not rely on a generic file standard. Still, the straw library has been implemented in several computing languages to facilitate parsing of .hic files (Durand et al. (2016))."
+    "text": "1.2 Hi-C file formats\nTwo important output files are typically generated during Hi-C data pre-processing:\n\nA “pairs” file\nA binned “contact matrix” file\n\nWe will now describe the structure of these different types of files. Directly jump to the next chapter if you want to know more about importing data from a contact matrix or a pairs file in R.\n\n1.2.1 Pairs files\nA “pairs” file (optionally, but generally filtered and sorted) is the direct output of processing Hi-C fastq files. It stores information about putative proximity contacts identified by digestion/religation, in the lossless, human-readable, indexable format: the .pairs format.\n\n\n\n\n\n\nWhat is a .pairs file?\n\n\n\nA .pairs file is organized in a header followed by a body:\n\n\nheader: starts with #\n\nRequired entries\n\nFirst line: ## pairs format v1.0\n\n\n#columns: column contents and ordering (e.g. #columns: readID chr1 pos1 chr2 pos2 strand1 strand2 &lt;column_name&gt; &lt;column_name&gt; ...)\n\n#chromsize: chromosome names and their size in bp, one chromosome per line, in the same order that defines ordering between mates (e.g. #chromsize: chr1 230218). Chromosome order is actually defined by this header, not by the order of pairs listed in the body!\n\n\nOptional entries with reserved header keys (sorted, shape, command, genome_assembly)\n\n\n#sorted: to indicate the sorting mechanism (e.g. #sorted: chr1-chr2-pos1-pos2, #sorted: chr1-pos1, #sorted: none)\n\n#shape: to specify whether the matrix is stored as upper triangle or lower triangle (#shape: upper triangle, #shape: lower triangle)\n\n#command: to specify any command, e.g. the command used to generate the pairs file (#command: bam2pairs mysample.bam mysample)\n\n#genome_assembly: to specify the genome assembly (e.g. #genome_assembly: hg38)\n\n\n\n\n\nbody: tab-separated columns\n\n7 reserved (4 of them required) columns:\n\nreadID, chr1, pos1, chr2, pos2, strand1, strand2\nColumns 2-5 (chr1, pos1, chr2, pos2) are required and cannot have missing values\nFor column 1, 6 & 7: missing values are annotated with a single-character dummy (.)\n\n\n2 extra reserved, optional column names:\n\n\nfrag1, frag2: restriction enzyme fragment index used by Juicer\n\n\n\nAny number of optional columns can be added\n\n\n\n\n\nsample.pairs\n\n## pairs format v1.0\n#sorted: chr1-chr2-pos1-pos2\n#shape: upper triangle\n#genome_assembly: hg38\n#chromsize: chr1 249250621\n#chromsize: chr2 243199373\n#chromsize: chr3 198022430\n...\n#columns: readID chr1 pos1 chr2 pos2 strand1 strand2\nEAS139:136:FC706VJ:2:2104:23462:197393 chr1 10000 chr1 20000 + +\nEAS139:136:FC706VJ:2:8762:23765:128766 chr1 50000 chr1 70000 + +\nEAS139:136:FC706VJ:2:2342:15343:9863 chr1 60000 chr2 10000 + +\nEAS139:136:FC706VJ:2:1286:25:275154 chr1 30000 chr3 40000 + -\n\n\n\nMore information about the conventions related to this text file are provided by the 4DN consortium, which originally formalized the specifications of this file format.\n\n1.2.2 Binned contact matrix files\n\n1.2.2.1 Binning pairs into a matrix\nThe action of “binning” a .pairs file into a contact matrix consists in (1) discretizing a genome reference into genomic bins, (2) attributing bins for each pair’s extremity and (3) computing the interaction frequency between any pair of genomic bins, i.e. the “contact matrix”.\nFor instance, here is a dummy .pairs file with a total of 5 pairs:\n\n\ndummy.pairs\n\n## pairs format v1.0\n#sorted: chr1-chr2-pos1-pos2\n#columns: readID chr1 pos1 chr2 pos2 strand1 strand2\n#chromsize: chr1 389\n. chr1 162 chr1 172 . . \n. chr1 180 chr1 192 . . \n. chr1 183 chr1 254 . .\n. chr1 221 chr1 273 . . \n. chr1 254 chr1 298 . . \n\nNote that this genome reference is made of a single chromosome (chr1), very short (length of 389). By binning this chromosome in 100bp-wide bins (100 bp is the resolution), one would optain the following four bins:\n\n\nbins.bed\n\n&lt;chr&gt;  &lt;pos&gt; &lt;bin&gt;\nchr1   1     100\nchr1   101   200\nchr1   201   300\nchr1   301   389\n\nEach pair extremity can be changed to an integer indicating the position of the bin it falls in, e.g. for the left-hand extremity of the pairs file printed hereinabove (bin1):\n&lt;chr1&gt;  &lt;pos1&gt;  -&gt;  &lt;bin1&gt;\nchr1    162     -&gt;  2\nchr1    180     -&gt;  2\nchr1    183     -&gt;  2\nchr1    221     -&gt;  3\nchr1    254     -&gt;  3\nSimilarly for the right-hand extremity of the pairs file (bin2):\n&lt;chr2&gt;  &lt;pos2&gt;  -&gt;  &lt;bin2&gt;\nchr1    172     -&gt;  chr1 2\nchr1    192     -&gt;  chr1 2\nchr1    254     -&gt;  chr1 3\nchr1    273     -&gt;  chr1 3\nchr1    298     -&gt;  chr1 3\nBy pasting side-to-side the left-hand and right-hand extremities of each pair, the .pairs file can be turned into something like:\n&lt;bin1&gt; &lt;bin2&gt;\n2      2\n2      2\n2      3\n3      3\n3      3\nAnd if we now count the number of each &lt;bin1&gt; &lt;bin2&gt; combinaison, adding a third &lt;count&gt; column, we end up with a count.matrix text file:\n\n\ncount.matrix\n\n&lt;bin1&gt; &lt;bin2&gt;  &lt;count&gt;\n2      2       2\n2      3       1\n3      3       2\n\nThis count.matrix file lists a total of 5 pairs, and in which bin each extremity of each pair is contained. Thus, a count matrix is a lossy file format, as it “rounds up” the position of each pair’s extremity to the genomic bin containing it.\nThis “i-j-x” 3-column format, in which i-j relate to a pair of “coordinates” indices (or a pair of genomic bin indices) in a matrix, and x relates to a score associated with the pair of indices, is generally called a “COO sparse matrix”.\nIn this context, the regions.bed acts as a secondary “dictionary” describing the nature of i and j indices, i.e. the location of genomic bins.\n\n1.2.2.2 Plain-text matrices: HiC-Pro style\nThe HiC-Pro pipeline (Servant et al. (2015)) outputs 2 text files: a regions.bed file and a count.matrix file. They are generated by the exact process explained above.\nTogether, these two files can describe the interaction frequency between any pair of genomic loci. They are non-binarized text files, and as such are technically human-readable. However, it is relatively hard to get a grasp of these files compared to a plain .pairs file, as information regarding genomic bins and interaction frequencies are stored in separate files. Moreover, because they are non-binarized, these files often end up using a large disk space and cannot be easily indexed. This prevents easy subsetting of the data stored in these files.\n.(m)cool and .hic file formats are two standards addressing these limitations.\n\n1.2.2.3 .(m)cool matrices\nThe .cool format has been formally defined in Abdennur & Mirny (2019) and is a particular type of HDF5 (Hierarchical Data Format) file. It is an indexed archive file storing rectangular tables called:\n\n\nbins: containing the same information than the regions.bed file;\n\npixels: containing the same information than the count.matrix (each “pixel” is a pair of 2 bins and has one or several associated scores);\n\nchroms: summarizing the order and length of the chromosomes present in a Hi-C contact matrix;\n\nindexes: allowing random access, i.e. parsing of only a subset of the data without having to read through the entire set of data.\n\n\nA single .pairs file binned at different resolutions can also be saved into a single, multi-resolution .mcool file. .mcool essentially consists of nested .cool files.\nImportantly, as an HDF5-based format, .cool files are binarized, indexed and highly-compressed. This has two major benefits:\n\nSmaller disk storage footprint\n\nRapid subsetting of the data through random access\n\n\nMoreover, parsing .cool files is possible using HDF standard APIs.\n\n1.2.2.4 .hic matrices\nThe .hic format is another type of binarized, indexed and highly-compressed file (Durand et al. (2016)). It can store virtually the same information than a .cool file. However, parsing .hic files is not as straightforward as .cool files, as it does not rely on a generic file standard. Still, the straw library has been implemented in several computing languages to facilitate parsing of .hic files (Durand et al. (2016))."
   },
   {
     "objectID": "principles.html#pre-processing-hi-c-data",
     "href": "principles.html#pre-processing-hi-c-data",
     "title": "\n1  Hi-C pre-processing steps\n",
     "section": "\n1.3 Pre-processing Hi-C data",
-    "text": "1.3 Pre-processing Hi-C data\n\n1.3.1 Processing workflow\nFundamentally, the main steps performed to pre-process Hi-C are:\n\nSeparate read mapping\nPairs parsing\nPairs sorting\nPairs filtering\nPairs binning into a contact matrix\nNormalization of contact matrix and multi-resolution matrix generation\n\n\nIn practice, a minimal workflow to pre-process Hi-C data is the following (adapted from Open2C et al. (2023)):\n\n## Note these fields have to be replaced by appropriate variables: \n##    &lt;index&gt;\n##    &lt;input.R1.fq.gz&gt;\n##    &lt;input.R2.fq.gz&gt;\n##    &lt;chromsizes.txt&gt;\n##    &lt;prefix&gt;\nbwa mem2 -SP5M &lt;index&gt; &lt;input.R1.fq.gz&gt; &lt;input.R2.fq.gz&gt; \\\n    | pairtools parse -c &lt;chromsizes.txt&gt; \\\n    | pairtools sort \\\n    | pairtools dedup \\\n    | cooler cload pairs -c1 2 -p1 3 -c2 4 -p2 5 &lt;chromsizes.txt&gt;:10000 - &lt;prefix&gt;.cool\ncooler zoomify --balance --nproc 32 --resolutions 5000N --out &lt;prefix&gt;.mcool &lt;prefix&gt;.cool\n\nSeveral pipelines have been developed to facilitate Hi-C data pre-processing. A few of them stand out from the crowd:\n\n\nnf-distiller: a combination of an aligner + pairtools + cooler\n\n\nHiC-pro (Servant et al. (2015))\n\nJuicer (Durand et al. (2016))\n\n\n\n\n\n\n\nNote\n\n\n\nFor larger genomes (&gt; 1Gb) with more than few tens of M of reads per fastq (e.g. &gt; 100M), we recommend pre-processing data on an HPC cluster. Aligners, pairs processing and matrix binning can greatly benefit from parallelization over multiple CPUs (Open2C et al. (2023))).\nTo scale up data pre-processing, we recommend to rely on an efficient read mapper such as bwa, followed by pairs parsing, sorting and deduplication with pairtools and binning with cooler.\n\n\n\n1.3.2 hicstuff: lightweight Hi-C pipeline\nhicstuff is an integrated workflow to process Hi-C data. Some advantages compared to solutions mentioned above are its simplicity, flexibility and lightweight. For shallow sequencing or Hi-C on smaller genomes, it efficiently parses fastq reads and processes data into binned contact matrices with a single terminal command.\nhicstuff provides both a command-line interface (CLI) and a python API to process fastq reads into a binned contact matrix. A processing pipeline can be launched using the standard command pipeline as follows:\n\n## Note these fields have to be replaced by appropriate variables: \n##    &lt;hicstuff-options&gt;\n##    &lt;genome.fa&gt;\n##    &lt;input.R1.fq.gz&gt;\n##    &lt;input.R2.fq.gz&gt;\nhicstuff pipeline \\\n   &lt;hicstuff-options&gt; \\\n   --genome &lt;genome.fa&gt; \\\n   &lt;input.R1.fq.gz&gt; \\\n   &lt;input.R2.fq.gz&gt;  \n\nhicstuff documentation website is available here: https://hicstuff.readthedocs.io/ to read more about available options and internal processing steps.\n\n1.3.3 HiCool: hicstuff within R\nhicstuff is available as a standalone (conda install -c bioconda hicstuff it!). It is also shipped in an R package: HiCool. Thus, HiCool can process fastq files directly within an R console.\n\n1.3.3.1 Executing HiCool\nTo demonstrate this, we first fetch example .fastq files:\n\nlibrary(HiContactsData)\nr1 &lt;- HiContactsData(sample = 'yeast_wt', format = 'fastq_R1')\nr2 &lt;- HiContactsData(sample = 'yeast_wt', format = 'fastq_R2')\n\nWe then load the HiCool library and execute the main HiCool function.\n\nlibrary(HiCool)\n##  Loading required package: HiCExperiment\n##  Consider using the `HiContacts` package to perform advanced genomic operations \n##  on `HiCExperiment` objects.\n##  \n##  Read \"Orchestrating Hi-C analysis with Bioconductor\" online book to learn more:\n##  https://js2264.github.io/OHCA/\nHiCool(\n    r1, \n    r2, \n    restriction = 'DpnII,HinfI', \n    resolutions = c(4000, 8000, 16000), \n    genome = 'R64-1-1', \n    output = './HiCool/'\n)\n##  HiCool :: Fetching bowtie genome index files from AWS iGenomes S3 bucket...\n##  HiCool :: Recovering bowtie2 genome index from AWS iGenomes...\n##  + /github/home/.cache/R/basilisk/1.13.1/0/bin/conda 'create' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.1/HiCool/1.1.0/env' 'python=3.7.12' '--quiet' '-c' 'conda-forge' '-c' 'bioconda'\n##  + /github/home/.cache/R/basilisk/1.13.1/0/bin/conda 'install' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.1/HiCool/1.1.0/env' 'python=3.7.12'\n##  + /github/home/.cache/R/basilisk/1.13.1/0/bin/conda 'install' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.1/HiCool/1.1.0/env' '-c' 'conda-forge' '-c' 'bioconda' 'python=3.7.12' 'python=3.7.12' 'bowtie2=2.5.0' 'samtools=1.16.1' 'hicstuff=3.1.5' 'chromosight=1.6.3' 'cooler=0.9.1'\n##  HiCool :: Initiating processing of fastq files [tmp folder: /tmp/RtmpyLujmT/WL4DIE]...\n##  HiCool :: Mapping fastq files...\n##  HiCool :: Removing unwanted chromosomes...\n##  HiCool :: Parsing pairs into .cool file...\n##  HiCool :: Generating multi-resolution .mcool file...\n##  HiCool :: Balancing .mcool file...\n##  HiCool :: Tidying up everything for you...\n##  HiCool :: .fastq to .mcool processing done!\n##  HiCool :: Check ./HiCool/folder to find the generated files\n##  HiCool :: Generating HiCool report. This might take a while.\n##  HiCool :: Report generated and available @ /__w/OHCA/OHCA/HiCool/148151d75a8_7833^mapped-R64-1-1^WL4DIE.html\n##  HiCool :: All processing successfully achieved. Congrats!\n##  CoolFile object\n##  .mcool file: ./HiCool//matrices/148151d75a8_7833^mapped-R64-1-1^WL4DIE.mcool \n##  resolution: 4000 \n##  pairs file: ./HiCool//pairs/148151d75a8_7833^mapped-R64-1-1^WL4DIE.pairs \n##  metadata(3): log args stats\n\n\n\n\n\n\n\nHiCool arguments\n\n\n\nSeveral arguments can be passed to HiCool and some are worth mentioning them:\n- restriction: (default: \"DpnII,HinfI\")\n- resolutions: (default: NULL, automatically inferring resolutions based on genome size)\n- iterative: (default: TRUE)\n- filter: (default: TRUE)\n- balancing_args: (default: \" --cis-only --min-nnz 3 --mad-max 7 \")\n- threads: (default: 1L)\n\n\nOther HiCool arguments can be listed by checking HiCool documentation in R: ?HiCool::HiCool.\n\n1.3.3.2 HiCool outputs\nWe can check the generated output files placed in the HiCool/ directory.\n\nfs::dir_tree('HiCool/')\n##  HiCool/\n##  ├── 148151d75a8_7833^mapped-R64-1-1^WL4DIE.html\n##  ├── logs\n##  │   └── 148151d75a8_7833^mapped-R64-1-1^WL4DIE.log\n##  ├── matrices\n##  │   └── 148151d75a8_7833^mapped-R64-1-1^WL4DIE.mcool\n##  ├── pairs\n##  │   └── 148151d75a8_7833^mapped-R64-1-1^WL4DIE.pairs\n##  └── plots\n##      ├── 148151d75a8_7833^mapped-R64-1-1^WL4DIE_event_distance.pdf\n##      └── 148151d75a8_7833^mapped-R64-1-1^WL4DIE_event_distribution.pdf\n\n\nThe *.pairs and *.mcool files are the pairs and contact matrix files, respectively. These are the output files the end-user is generally looking for.\n\nThe *.html file is a report summarizing pairs numbers, filtering, etc…\nThe *.log file contains all output and error messages, as well as the full list of commands that have been executed to pre-process the input dataset.\nThe *.pdf graphic files provide a visual representation of the distribution of informative/non-informative pairs.\n\n\n\n\n\n\n\nTip\n\n\n\nAll the files generated by a single HiCool pipeline execution contain the same 6-letter unique hash to make sure they are not overwritten if re-executing the same command."
+    "text": "1.3 Pre-processing Hi-C data\n\n1.3.1 Processing workflow\nFundamentally, the main steps performed to pre-process Hi-C are:\n\nSeparate read mapping\nPairs parsing\nPairs sorting\nPairs filtering\nPairs binning into a contact matrix\nNormalization of contact matrix and multi-resolution matrix generation\n\n\nIn practice, a minimal workflow to pre-process Hi-C data is the following (adapted from Open2C et al. (2023)):\n\n## Note these fields have to be replaced by appropriate variables: \n##    &lt;index&gt;\n##    &lt;input.R1.fq.gz&gt;\n##    &lt;input.R2.fq.gz&gt;\n##    &lt;chromsizes.txt&gt;\n##    &lt;prefix&gt;\nbwa mem2 -SP5M &lt;index&gt; &lt;input.R1.fq.gz&gt; &lt;input.R2.fq.gz&gt; \\\n    | pairtools parse -c &lt;chromsizes.txt&gt; \\\n    | pairtools sort \\\n    | pairtools dedup \\\n    | cooler cload pairs -c1 2 -p1 3 -c2 4 -p2 5 &lt;chromsizes.txt&gt;:10000 - &lt;prefix&gt;.cool\ncooler zoomify --balance --nproc 32 --resolutions 5000N --out &lt;prefix&gt;.mcool &lt;prefix&gt;.cool\n\nSeveral pipelines have been developed to facilitate Hi-C data pre-processing. A few of them stand out from the crowd:\n\n\nnf-distiller: a combination of an aligner + pairtools + cooler\n\n\nHiC-pro (Servant et al. (2015))\n\nJuicer (Durand et al. (2016))\n\n\n\n\n\n\n\nNote\n\n\n\nFor larger genomes (&gt; 1Gb) with more than few tens of M of reads per fastq (e.g. &gt; 100M), we recommend pre-processing data on an HPC cluster. Aligners, pairs processing and matrix binning can greatly benefit from parallelization over multiple CPUs (Open2C et al. (2023))).\nTo scale up data pre-processing, we recommend to rely on an efficient read mapper such as bwa, followed by pairs parsing, sorting and deduplication with pairtools and binning with cooler.\n\n\n\n1.3.2 hicstuff: lightweight Hi-C pipeline\nhicstuff is an integrated workflow to process Hi-C data. Some advantages compared to solutions mentioned above are its simplicity, flexibility and lightweight. For shallow sequencing or Hi-C on smaller genomes, it efficiently parses fastq reads and processes data into binned contact matrices with a single terminal command.\nhicstuff provides both a command-line interface (CLI) and a python API to process fastq reads into a binned contact matrix. A processing pipeline can be launched using the standard command pipeline as follows:\n\n## Note these fields have to be replaced by appropriate variables: \n##    &lt;hicstuff-options&gt;\n##    &lt;genome.fa&gt;\n##    &lt;input.R1.fq.gz&gt;\n##    &lt;input.R2.fq.gz&gt;\nhicstuff pipeline \\\n   &lt;hicstuff-options&gt; \\\n   --genome &lt;genome.fa&gt; \\\n   &lt;input.R1.fq.gz&gt; \\\n   &lt;input.R2.fq.gz&gt;  \n\nhicstuff documentation website is available here: https://hicstuff.readthedocs.io/ to read more about available options and internal processing steps.\n\n1.3.3 HiCool: hicstuff within R\nhicstuff is available as a standalone (conda install -c bioconda hicstuff it!). It is also shipped in an R package: HiCool. Thus, HiCool can process fastq files directly within an R console.\n\n1.3.3.1 Executing HiCool\nTo demonstrate this, we first fetch example .fastq files:\n\nlibrary(HiContactsData)\nr1 &lt;- HiContactsData(sample = 'yeast_wt', format = 'fastq_R1')\nr2 &lt;- HiContactsData(sample = 'yeast_wt', format = 'fastq_R2')\n\nWe then load the HiCool library and execute the main HiCool function.\n\nlibrary(HiCool)\n##  Loading required package: HiCExperiment\n##  Consider using the `HiContacts` package to perform advanced genomic operations \n##  on `HiCExperiment` objects.\n##  \n##  Read \"Orchestrating Hi-C analysis with Bioconductor\" online book to learn more:\n##  https://js2264.github.io/OHCA/\nHiCool(\n    r1, \n    r2, \n    restriction = 'DpnII,HinfI', \n    resolutions = c(4000, 8000, 16000), \n    genome = 'R64-1-1', \n    output = './HiCool/'\n)\n##  HiCool :: Fetching bowtie genome index files from AWS iGenomes S3 bucket...\n##  HiCool :: Recovering bowtie2 genome index from AWS iGenomes...\n##  + /github/home/.cache/R/basilisk/1.13.4/0/bin/conda 'create' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.4/HiCool/1.1.0/env' 'python=3.7.12' '--quiet' '-c' 'conda-forge' '-c' 'bioconda'\n##  + /github/home/.cache/R/basilisk/1.13.4/0/bin/conda 'install' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.4/HiCool/1.1.0/env' 'python=3.7.12'\n##  + /github/home/.cache/R/basilisk/1.13.4/0/bin/conda 'install' '--yes' '--prefix' '/github/home/.cache/R/basilisk/1.13.4/HiCool/1.1.0/env' '-c' 'conda-forge' '-c' 'bioconda' 'python=3.7.12' 'python=3.7.12' 'bowtie2=2.5.0' 'samtools=1.16.1' 'hicstuff=3.1.5' 'chromosight=1.6.3' 'cooler=0.9.1'\n##  HiCool :: Initiating processing of fastq files [tmp folder: /tmp/RtmpIWmk55/WL4DIE]...\n##  HiCool :: Mapping fastq files...\n##  HiCool :: Removing unwanted chromosomes...\n##  HiCool :: Parsing pairs into .cool file...\n##  HiCool :: Generating multi-resolution .mcool file...\n##  HiCool :: Balancing .mcool file...\n##  HiCool :: Tidying up everything for you...\n##  HiCool :: .fastq to .mcool processing done!\n##  HiCool :: Check ./HiCool/folder to find the generated files\n##  HiCool :: Generating HiCool report. This might take a while.\n##  HiCool :: Report generated and available @ /__w/OHCA/OHCA/HiCool/14976d56f7a_7833^mapped-R64-1-1^WL4DIE.html\n##  HiCool :: All processing successfully achieved. Congrats!\n##  CoolFile object\n##  .mcool file: ./HiCool//matrices/14976d56f7a_7833^mapped-R64-1-1^WL4DIE.mcool \n##  resolution: 4000 \n##  pairs file: ./HiCool//pairs/14976d56f7a_7833^mapped-R64-1-1^WL4DIE.pairs \n##  metadata(3): log args stats\n\n\n\n\n\n\n\nHiCool arguments\n\n\n\nSeveral arguments can be passed to HiCool and some are worth mentioning them:\n- restriction: (default: \"DpnII,HinfI\")\n- resolutions: (default: NULL, automatically inferring resolutions based on genome size)\n- iterative: (default: TRUE)\n- filter: (default: TRUE)\n- balancing_args: (default: \" --cis-only --min-nnz 3 --mad-max 7 \")\n- threads: (default: 1L)\n\n\nOther HiCool arguments can be listed by checking HiCool documentation in R: ?HiCool::HiCool.\n\n1.3.3.2 HiCool outputs\nWe can check the generated output files placed in the HiCool/ directory.\n\nfs::dir_tree('HiCool/')\n##  HiCool/\n##  ├── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE.html\n##  ├── logs\n##  │   └── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE.log\n##  ├── matrices\n##  │   └── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE.mcool\n##  ├── pairs\n##  │   └── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE.pairs\n##  └── plots\n##      ├── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE_event_distance.pdf\n##      └── 14976d56f7a_7833^mapped-R64-1-1^WL4DIE_event_distribution.pdf\n\n\nThe *.pairs and *.mcool files are the pairs and contact matrix files, respectively. These are the output files the end-user is generally looking for.\n\nThe *.html file is a report summarizing pairs numbers, filtering, etc…\nThe *.log file contains all output and error messages, as well as the full list of commands that have been executed to pre-process the input dataset.\nThe *.pdf graphic files provide a visual representation of the distribution of informative/non-informative pairs.\n\n\n\n\n\n\n\nTip\n\n\n\nAll the files generated by a single HiCool pipeline execution contain the same 6-letter unique hash to make sure they are not overwritten if re-executing the same command."
+  },
+  {
+    "objectID": "principles.html#exploratory-data-analysis-of-processed-hi-c-files",
+    "href": "principles.html#exploratory-data-analysis-of-processed-hi-c-files",
+    "title": "\n1  Hi-C pre-processing steps\n",
+    "section": "\n1.4 Exploratory data analysis of processed Hi-C files",
+    "text": "1.4 Exploratory data analysis of processed Hi-C files\nOnce Hi-C raw data has been transformed into a set of processed files, exploratory data analysis is typically conducted following two main routes:\n\nData visualization;\nData investigation.\n\nDuring the last decade, a number of softwares have been developed to unlock Hi-C data visualization and investigation. Here we provide a non-exhaustive list of notable tools developed throughout the recent years for downstream Hi-C analysis, selected from this longer list.\n\n\n2012-2015:\n\nHiTC (2012)\nHiCCUPS (2014)\nHiCseg (2014)\nFit-Hi-C (2014)\nHiC-Pro (2015)\ndiffHic (2015)\ncooltools (2015)\nHiCUP (2015)\nHiCPlotter (2015)\nHiFive (2015)\n\n\n\n2016-2019:\n\nCHiCAGO (2016)\nTADbit (2017)\nHiCRep (2017)\nHiC-DC (2017)\nGoTHIC (2017)\nHiCExplorer (2018)\nBoost-HiC (2018)\nHiCcompare (2018)\nHiPiler (2018)\ncoolpuppy (2019)\n\n\n\n2020-present:\n\nSerpentine (2020)\nCHESS (2020)\nDeepHiC (2020)\nChromosight (2020)\nMustache (2020)\nTADcompare (2020)\nPOSSUM (2021)\nCalder (2021)\nHICDCPlus (2021)\nplotgardener (2021)\nGENOVA (2021)\n\n\n\nAll references as well as many other softwares and references are available here."
   },
   {
     "objectID": "data-representation.html",
@@ -95,21 +102,21 @@
     "href": "data-representation.html#ginteractions-class",
     "title": "\n2  Hi-C data structures in R\n",
     "section": "\n2.2 GInteractions class",
-    "text": "2.2 GInteractions class\nGRanges describe genomic ranges and hence are of general use to study 1D genome organization. To study chromatin interactions, we need a way to link pairs of GRanges. This is exactly what the GInteractions class does. This data structure is defined in the InteractionSet package and has been published in the 2016 paper by Lun et al. (Lun, Perry, and Ing-Simmons (2016)).\n\n\n2.2.1 Building a GInteractions object from scratch\nLet’s first define two parallel GRanges objects (i.e. two GRanges of same length). Each GRanges will contain 5 ranges.\n\ngr_first &lt;- GRanges(c(\n    'chr1:1-100', \n    'chr1:1001-2000', \n    'chr1:5001-6000', \n    'chr1:8001-9000', \n    'chr1:7001-8000'  \n))\ngr_second &lt;- GRanges(c(\n    'chr1:1-100', \n    'chr1:3001-4000', \n    'chr1:8001-9000', \n    'chr1:7001-8000', \n    'chr2:13000-14000'  \n))\n\nBecause these two GRanges objects are of same length (5), one can “bind” them together by using the GInteractionsfunction. This effectively associate each entry from one GRanges to the entry aligned in the other GRanges object.\n\nlibrary(InteractionSet)\ngi &lt;- GInteractions(gr_first, gr_second)\ngi\n##  GInteractions object with 5 interactions and 0 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000\n##    [4]      chr1 8001-9000 ---      chr1   7001-8000\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000\n##    -------\n##    regions: 7 ranges and 0 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nThe way GInteractions objects are printed in an R console mimics that of GRanges, but pairs two “ends” (a.k.a. anchors) of an interaction together, each end being represented as a separate GRanges range.\n\n\n\n\n\n\nNotes\n\n\n\n\nNote that it is possible to have interactions joining two identical anchors.\n\n\ngi[1]\n##  GInteractions object with 1 interaction and 0 metadata columns:\n##        seqnames1   ranges1     seqnames2   ranges2\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt;\n##    [1]      chr1     1-100 ---      chr1     1-100\n##    -------\n##    regions: 7 ranges and 0 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\nIt is also technically possible (though not advised) to have interactions for which the “first” end is located after the “second” end along the chromosome.\n\n\ngi[4]\n##  GInteractions object with 1 interaction and 0 metadata columns:\n##        seqnames1   ranges1     seqnames2   ranges2\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt;\n##    [1]      chr1 8001-9000 ---      chr1 7001-8000\n##    -------\n##    regions: 7 ranges and 0 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\nFinally, it is possible to define inter-chromosomal interactions (a.k.a. trans interactions).\n\n\ngi[5]\n##  GInteractions object with 1 interaction and 0 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt;\n##    [1]      chr1 7001-8000 ---      chr2 13000-14000\n##    -------\n##    regions: 7 ranges and 0 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\n\n\n2.2.2 GInteractions specific slots\nCompared to GRanges, extra slots are available for GInteractions objects, e.g. anchors and regions.\n\n2.2.2.1 Anchors\n“Anchors” of a single genomic interaction refer to the two ends of this interaction. These anchors can be extracted from a GInteractions object using the anchors() function. This outputs a list of two GRanges, the first corresponding to the “left” end of interactions (when printed to the console) and the second corresponding to the “right” end of interactions (when printed to the console).\n\n# ----- This extracts the two sets of anchors (\"first\" and \"second\") from a GInteractions object\nanchors(gi)\n##  $first\n##  GRanges object with 5 ranges and 0 metadata columns:\n##        seqnames    ranges strand\n##           &lt;Rle&gt; &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1     1-100      *\n##    [2]     chr1 1001-2000      *\n##    [3]     chr1 5001-6000      *\n##    [4]     chr1 8001-9000      *\n##    [5]     chr1 7001-8000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n##  \n##  $second\n##  GRanges object with 5 ranges and 0 metadata columns:\n##        seqnames      ranges strand\n##           &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1       1-100      *\n##    [2]     chr1   3001-4000      *\n##    [3]     chr1   8001-9000      *\n##    [4]     chr1   7001-8000      *\n##    [5]     chr2 13000-14000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n# ----- We can query for the \"first\" or \"second\" set of anchors directly\nanchors(gi, \"first\")\n##  GRanges object with 5 ranges and 0 metadata columns:\n##        seqnames    ranges strand\n##           &lt;Rle&gt; &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1     1-100      *\n##    [2]     chr1 1001-2000      *\n##    [3]     chr1 5001-6000      *\n##    [4]     chr1 8001-9000      *\n##    [5]     chr1 7001-8000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nanchors(gi, \"second\")\n##  GRanges object with 5 ranges and 0 metadata columns:\n##        seqnames      ranges strand\n##           &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1       1-100      *\n##    [2]     chr1   3001-4000      *\n##    [3]     chr1   8001-9000      *\n##    [4]     chr1   7001-8000      *\n##    [5]     chr2 13000-14000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\n2.2.2.2 Regions\n“Regions” of a set of interactions refer to the universe of unique anchors represented in a set of interactions. Therefore, the length of the regions can only be equal to or strictly lower than twice the length of anchors.\nThe regions function returns the regions associated with a GInteractions object, stored as a GRanges object.\n\nregions(gi)\n##  GRanges object with 7 ranges and 0 metadata columns:\n##        seqnames      ranges strand\n##           &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1       1-100      *\n##    [2]     chr1   1001-2000      *\n##    [3]     chr1   3001-4000      *\n##    [4]     chr1   5001-6000      *\n##    [5]     chr1   7001-8000      *\n##    [6]     chr1   8001-9000      *\n##    [7]     chr2 13000-14000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nlength(regions(gi))\n##  [1] 7\n\nlength(anchors(gi, \"first\"))\n##  [1] 5\n\n\n2.2.3 GInteractions methods\nGInteractions behave as an extension of GRanges. For this reason, many methods that work with GRanges will work seamlessly with GInteractions.\n\n2.2.3.1 Metadata\nOne can add metadata columns directly to a GInteractions object.\n\nmcols(gi)\n##  DataFrame with 5 rows and 0 columns\nmcols(gi) &lt;- data.frame(\n    idx = seq(1, length(gi)),\n    type = c(\"cis\", \"cis\", \"cis\", \"trans\", \"cis\")\n)\ngi\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 0 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\ngi$type\n##  [1] \"cis\"   \"cis\"   \"cis\"   \"trans\" \"cis\"\n\nImportantly, metadata columns can also be directly added to regions of a GInteractions object, since these regions are a GRanges object themselves!\n\nregions(gi)\n##  GRanges object with 7 ranges and 0 metadata columns:\n##        seqnames      ranges strand\n##           &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1       1-100      *\n##    [2]     chr1   1001-2000      *\n##    [3]     chr1   3001-4000      *\n##    [4]     chr1   5001-6000      *\n##    [5]     chr1   7001-8000      *\n##    [6]     chr1   8001-9000      *\n##    [7]     chr2 13000-14000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\nregions(gi)$binID &lt;- seq_along(regions(gi))\nregions(gi)$type &lt;- c(\"P\", \"P\", \"P\", \"E\", \"E\", \"P\", \"P\")\nregions(gi)\n##  GRanges object with 7 ranges and 2 metadata columns:\n##        seqnames      ranges strand |     binID        type\n##           &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]     chr1       1-100      * |         1           P\n##    [2]     chr1   1001-2000      * |         2           P\n##    [3]     chr1   3001-4000      * |         3           P\n##    [4]     chr1   5001-6000      * |         4           E\n##    [5]     chr1   7001-8000      * |         5           E\n##    [6]     chr1   8001-9000      * |         6           P\n##    [7]     chr2 13000-14000      * |         7           P\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\n2.2.3.2 Sorting GInteractions\n\nThe sort function works seamlessly with GInteractions objects. It sorts the interactions using a similar approach to that performed by pairtools sort ... for disk-stored .pairs files, sorting on the “first” anchor first, then for interactions with the same “first” anchors, sorting on the “second” anchor.\n\ngi\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nsort(gi)\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    [5]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\n2.2.3.3 Swapping GInteractions anchors\nFor an individual interaction contained in a GInteractions object, the “first” and “second” anchors themselves can be sorted as well. This is called “pairs swapping”, and it is performed similarly to pairtools flip ... for disk-stored .pairs files. This ensures that interactions, when represented as a contact matrix, generate an upper-triangular matrix.\n\ngi\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nswapAnchors(gi)\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 7001-8000 ---      chr1   8001-9000 |         4       trans\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\n\n\n\n\n\nNote\n\n\n\n“Sorting” and “swapping” a GInteractions object are two entirely different actions:\n\n“sorting” reorganizes all rows (interactions);\n“swapping” anchors reorganizes “first” and “second” anchors for each interaction independently.\n\n\n\n\n2.2.3.4 GInteractions distance method\n“Distance”, when applied to genomic interactions, typically refers to the genomic distance between the two anchors of a single interaction. For GInteractions, this is computed using the pairdist function.\n\ngi\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\npairdist(gi)\n##  [1]    0 2000 3000 1000   NA\n\nNote that for “trans” inter-chromosomal interactions, i.e. interactions with anchors on different chromosomes, the notion of genomic distance is meaningless and for this reason, pairdist returns a NA value.\n\n\n\n\n\n\nAdvanced pairdist arguments\n\n\n\nThe type argument can be tweaked to specify which type of “distance” should be computed:\n\n\nmid: The distance between the midpoints of the two regions (rounded down to the nearest integer) is returned (Default).\n\ngap: The length of the gap between the closest points of the two regions is computed - negative lengths are returned for overlapping regions, indicating the length of the overlap.\n\nspan: The distance between the furthermost points of the two regions is computed.\n\ndiag: The difference between the anchor indices is returned. This corresponds to a diagonal on the interaction space when bins are used in the ‘regions’ slot of ‘x’.\n\n\n\n\n2.2.3.5 GInteractions overlap methods\n“Overlaps” for genomic interactions could be computed in different contexts:\n\nCase 1: Overlap between any of the two anchors of an interaction with a genomic range\nCase 2: Overlap between anchors of an interaction with anchors of another interaction\nCase 3: Spanning of the interaction “across” a genomic range\n\n\nCase 1: Overlap between any of the two anchors of an interaction with a genomic range\n\nThis is the default behavior of findOverlaps when providing a GInteractions object as query and a GRanges as a subject.\n\ngr &lt;- GRanges(c(\"chr1:7501-7600\", \"chr1:8501-8600\"))\nfindOverlaps(query = gi, subject = gr)\n##  Hits object with 4 hits and 0 metadata columns:\n##        queryHits subjectHits\n##        &lt;integer&gt;   &lt;integer&gt;\n##    [1]         3           2\n##    [2]         4           1\n##    [3]         4           2\n##    [4]         5           1\n##    -------\n##    queryLength: 5 / subjectLength: 2\n\ncountOverlaps(gi, gr)\n##  [1] 0 0 1 2 1\n\nsubsetByOverlaps(gi, gr)\n##  GInteractions object with 3 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [2]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [3]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nHere again, the order matters!\n\ncountOverlaps(gr, gi)\n##  [1] 2 2\n\nAnd again, the %over% operator can be used here:\n\ngi %over% gr\n##  [1] FALSE FALSE  TRUE  TRUE  TRUE\n\ngi[gi %over% gr] # ----- Equivalent to `subsetByOverlaps(gi, gr)`\n##  GInteractions object with 3 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [2]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [3]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\nCase 2: Overlap between anchors of an interaction with anchors of another interaction\n\nThis slightly different scenario involves overlapping two sets of interactions, to see whether any interaction in Set-1 has its two anchors overlapping anchors from an interaction in Set-2.\n\ngi2 &lt;- GInteractions(\n    GRanges(\"chr1:1081-1090\"), \n    GRanges(\"chr1:3401-3501\")\n)\ngi %over% gi2\n##  [1] FALSE  TRUE FALSE FALSE FALSE\n\nNote that both anchors of an interaction from a query have to overlap to a pair of anchors of a single interaction from a subject with this method!\n\ngi3 &lt;- GInteractions(\n    GRanges(\"chr1:1-1000\"), \n    GRanges(\"chr1:3401-3501\")\n)\ngi %over% gi3\n##  [1] FALSE FALSE FALSE FALSE FALSE\n\n\nCase 3 : Spanning of the interaction “accross” a genomic range\n\nThis requires a bit of wrangling, to mimic an overlap between two GRanges objects:\n\ngi &lt;- swapAnchors(gi) # ----- Make sure anchors are correctly sorted\ngi &lt;- sort(gi) # ----- Make sure interactions are correctly sorted\ngi &lt;- gi[!is.na(pairdist(gi))] # ----- Remove inter-chromosomal interactions\nspanning_gi &lt;- GRanges(\n    seqnames = seqnames(anchors(gi)[[1]]), \n    ranges = IRanges(\n        start(anchors(gi)[[1]]), \n        end(anchors(gi)[[2]])\n    )\n)\nspanning_gi \n##  GRanges object with 4 ranges and 0 metadata columns:\n##        seqnames    ranges strand\n##           &lt;Rle&gt; &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1     1-100      *\n##    [2]     chr1 1001-4000      *\n##    [3]     chr1 5001-9000      *\n##    [4]     chr1 7001-9000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nspanning_gi %over% gr\n##  [1] FALSE FALSE  TRUE  TRUE\n\n\n\n\n\n\n\nGoing further\n\n\n\nA detailed manual of overlap methods available for GInteractions object can be read by typing ?`Interaction-overlaps` in R."
+    "text": "2.2 GInteractions class\nGRanges describe genomic ranges and hence are of general use to study 1D genome organization. To study chromatin interactions, we need a way to link pairs of GRanges. This is exactly what the GInteractions class does. This data structure is defined in the InteractionSet package and has been published in the 2016 paper by Lun et al. (Lun et al. (2016)).\n\n\n2.2.1 Building a GInteractions object from scratch\nLet’s first define two parallel GRanges objects (i.e. two GRanges of same length). Each GRanges will contain 5 ranges.\n\ngr_first &lt;- GRanges(c(\n    'chr1:1-100', \n    'chr1:1001-2000', \n    'chr1:5001-6000', \n    'chr1:8001-9000', \n    'chr1:7001-8000'  \n))\ngr_second &lt;- GRanges(c(\n    'chr1:1-100', \n    'chr1:3001-4000', \n    'chr1:8001-9000', \n    'chr1:7001-8000', \n    'chr2:13000-14000'  \n))\n\nBecause these two GRanges objects are of same length (5), one can “bind” them together by using the GInteractionsfunction. This effectively associate each entry from one GRanges to the entry aligned in the other GRanges object.\n\nlibrary(InteractionSet)\ngi &lt;- GInteractions(gr_first, gr_second)\ngi\n##  GInteractions object with 5 interactions and 0 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000\n##    [4]      chr1 8001-9000 ---      chr1   7001-8000\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000\n##    -------\n##    regions: 7 ranges and 0 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nThe way GInteractions objects are printed in an R console mimics that of GRanges, but pairs two “ends” (a.k.a. anchors) of an interaction together, each end being represented as a separate GRanges range.\n\n\n\n\n\n\nNotes\n\n\n\n\nNote that it is possible to have interactions joining two identical anchors.\n\n\ngi[1]\n##  GInteractions object with 1 interaction and 0 metadata columns:\n##        seqnames1   ranges1     seqnames2   ranges2\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt;\n##    [1]      chr1     1-100 ---      chr1     1-100\n##    -------\n##    regions: 7 ranges and 0 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\nIt is also technically possible (though not advised) to have interactions for which the “first” end is located after the “second” end along the chromosome.\n\n\ngi[4]\n##  GInteractions object with 1 interaction and 0 metadata columns:\n##        seqnames1   ranges1     seqnames2   ranges2\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt;\n##    [1]      chr1 8001-9000 ---      chr1 7001-8000\n##    -------\n##    regions: 7 ranges and 0 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\nFinally, it is possible to define inter-chromosomal interactions (a.k.a. trans interactions).\n\n\ngi[5]\n##  GInteractions object with 1 interaction and 0 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt;\n##    [1]      chr1 7001-8000 ---      chr2 13000-14000\n##    -------\n##    regions: 7 ranges and 0 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\n\n\n2.2.2 GInteractions specific slots\nCompared to GRanges, extra slots are available for GInteractions objects, e.g. anchors and regions.\n\n2.2.2.1 Anchors\n“Anchors” of a single genomic interaction refer to the two ends of this interaction. These anchors can be extracted from a GInteractions object using the anchors() function. This outputs a list of two GRanges, the first corresponding to the “left” end of interactions (when printed to the console) and the second corresponding to the “right” end of interactions (when printed to the console).\n\n# ----- This extracts the two sets of anchors (\"first\" and \"second\") from a GInteractions object\nanchors(gi)\n##  $first\n##  GRanges object with 5 ranges and 0 metadata columns:\n##        seqnames    ranges strand\n##           &lt;Rle&gt; &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1     1-100      *\n##    [2]     chr1 1001-2000      *\n##    [3]     chr1 5001-6000      *\n##    [4]     chr1 8001-9000      *\n##    [5]     chr1 7001-8000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n##  \n##  $second\n##  GRanges object with 5 ranges and 0 metadata columns:\n##        seqnames      ranges strand\n##           &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1       1-100      *\n##    [2]     chr1   3001-4000      *\n##    [3]     chr1   8001-9000      *\n##    [4]     chr1   7001-8000      *\n##    [5]     chr2 13000-14000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n# ----- We can query for the \"first\" or \"second\" set of anchors directly\nanchors(gi, \"first\")\n##  GRanges object with 5 ranges and 0 metadata columns:\n##        seqnames    ranges strand\n##           &lt;Rle&gt; &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1     1-100      *\n##    [2]     chr1 1001-2000      *\n##    [3]     chr1 5001-6000      *\n##    [4]     chr1 8001-9000      *\n##    [5]     chr1 7001-8000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nanchors(gi, \"second\")\n##  GRanges object with 5 ranges and 0 metadata columns:\n##        seqnames      ranges strand\n##           &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1       1-100      *\n##    [2]     chr1   3001-4000      *\n##    [3]     chr1   8001-9000      *\n##    [4]     chr1   7001-8000      *\n##    [5]     chr2 13000-14000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\n2.2.2.2 Regions\n“Regions” of a set of interactions refer to the universe of unique anchors represented in a set of interactions. Therefore, the length of the regions can only be equal to or strictly lower than twice the length of anchors.\nThe regions function returns the regions associated with a GInteractions object, stored as a GRanges object.\n\nregions(gi)\n##  GRanges object with 7 ranges and 0 metadata columns:\n##        seqnames      ranges strand\n##           &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1       1-100      *\n##    [2]     chr1   1001-2000      *\n##    [3]     chr1   3001-4000      *\n##    [4]     chr1   5001-6000      *\n##    [5]     chr1   7001-8000      *\n##    [6]     chr1   8001-9000      *\n##    [7]     chr2 13000-14000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nlength(regions(gi))\n##  [1] 7\n\nlength(anchors(gi, \"first\"))\n##  [1] 5\n\n\n2.2.3 GInteractions methods\nGInteractions behave as an extension of GRanges. For this reason, many methods that work with GRanges will work seamlessly with GInteractions.\n\n2.2.3.1 Metadata\nOne can add metadata columns directly to a GInteractions object.\n\nmcols(gi)\n##  DataFrame with 5 rows and 0 columns\nmcols(gi) &lt;- data.frame(\n    idx = seq(1, length(gi)),\n    type = c(\"cis\", \"cis\", \"cis\", \"trans\", \"cis\")\n)\ngi\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 0 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\ngi$type\n##  [1] \"cis\"   \"cis\"   \"cis\"   \"trans\" \"cis\"\n\nImportantly, metadata columns can also be directly added to regions of a GInteractions object, since these regions are a GRanges object themselves!\n\nregions(gi)\n##  GRanges object with 7 ranges and 0 metadata columns:\n##        seqnames      ranges strand\n##           &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1       1-100      *\n##    [2]     chr1   1001-2000      *\n##    [3]     chr1   3001-4000      *\n##    [4]     chr1   5001-6000      *\n##    [5]     chr1   7001-8000      *\n##    [6]     chr1   8001-9000      *\n##    [7]     chr2 13000-14000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\nregions(gi)$binID &lt;- seq_along(regions(gi))\nregions(gi)$type &lt;- c(\"P\", \"P\", \"P\", \"E\", \"E\", \"P\", \"P\")\nregions(gi)\n##  GRanges object with 7 ranges and 2 metadata columns:\n##        seqnames      ranges strand |     binID        type\n##           &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]     chr1       1-100      * |         1           P\n##    [2]     chr1   1001-2000      * |         2           P\n##    [3]     chr1   3001-4000      * |         3           P\n##    [4]     chr1   5001-6000      * |         4           E\n##    [5]     chr1   7001-8000      * |         5           E\n##    [6]     chr1   8001-9000      * |         6           P\n##    [7]     chr2 13000-14000      * |         7           P\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\n2.2.3.2 Sorting GInteractions\n\nThe sort function works seamlessly with GInteractions objects. It sorts the interactions using a similar approach to that performed by pairtools sort ... for disk-stored .pairs files, sorting on the “first” anchor first, then for interactions with the same “first” anchors, sorting on the “second” anchor.\n\ngi\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nsort(gi)\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    [5]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\n2.2.3.3 Swapping GInteractions anchors\nFor an individual interaction contained in a GInteractions object, the “first” and “second” anchors themselves can be sorted as well. This is called “pairs swapping”, and it is performed similarly to pairtools flip ... for disk-stored .pairs files. This ensures that interactions, when represented as a contact matrix, generate an upper-triangular matrix.\n\ngi\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nswapAnchors(gi)\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 7001-8000 ---      chr1   8001-9000 |         4       trans\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\n\n\n\n\n\nNote\n\n\n\n“Sorting” and “swapping” a GInteractions object are two entirely different actions:\n\n“sorting” reorganizes all rows (interactions);\n“swapping” anchors reorganizes “first” and “second” anchors for each interaction independently.\n\n\n\n\n2.2.3.4 GInteractions distance method\n“Distance”, when applied to genomic interactions, typically refers to the genomic distance between the two anchors of a single interaction. For GInteractions, this is computed using the pairdist function.\n\ngi\n##  GInteractions object with 5 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1     1-100 ---      chr1       1-100 |         1         cis\n##    [2]      chr1 1001-2000 ---      chr1   3001-4000 |         2         cis\n##    [3]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [4]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [5]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\npairdist(gi)\n##  [1]    0 2000 3000 1000   NA\n\nNote that for “trans” inter-chromosomal interactions, i.e. interactions with anchors on different chromosomes, the notion of genomic distance is meaningless and for this reason, pairdist returns a NA value.\n\n\n\n\n\n\nAdvanced pairdist arguments\n\n\n\nThe type argument can be tweaked to specify which type of “distance” should be computed:\n\n\nmid: The distance between the midpoints of the two regions (rounded down to the nearest integer) is returned (Default).\n\ngap: The length of the gap between the closest points of the two regions is computed - negative lengths are returned for overlapping regions, indicating the length of the overlap.\n\nspan: The distance between the furthermost points of the two regions is computed.\n\ndiag: The difference between the anchor indices is returned. This corresponds to a diagonal on the interaction space when bins are used in the ‘regions’ slot of ‘x’.\n\n\n\n\n2.2.3.5 GInteractions overlap methods\n“Overlaps” for genomic interactions could be computed in different contexts:\n\nCase 1: Overlap between any of the two anchors of an interaction with a genomic range\nCase 2: Overlap between anchors of an interaction with anchors of another interaction\nCase 3: Spanning of the interaction “across” a genomic range\n\n\nCase 1: Overlap between any of the two anchors of an interaction with a genomic range\n\nThis is the default behavior of findOverlaps when providing a GInteractions object as query and a GRanges as a subject.\n\ngr &lt;- GRanges(c(\"chr1:7501-7600\", \"chr1:8501-8600\"))\nfindOverlaps(query = gi, subject = gr)\n##  Hits object with 4 hits and 0 metadata columns:\n##        queryHits subjectHits\n##        &lt;integer&gt;   &lt;integer&gt;\n##    [1]         3           2\n##    [2]         4           1\n##    [3]         4           2\n##    [4]         5           1\n##    -------\n##    queryLength: 5 / subjectLength: 2\n\ncountOverlaps(gi, gr)\n##  [1] 0 0 1 2 1\n\nsubsetByOverlaps(gi, gr)\n##  GInteractions object with 3 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [2]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [3]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nHere again, the order matters!\n\ncountOverlaps(gr, gi)\n##  [1] 2 2\n\nAnd again, the %over% operator can be used here:\n\ngi %over% gr\n##  [1] FALSE FALSE  TRUE  TRUE  TRUE\n\ngi[gi %over% gr] # ----- Equivalent to `subsetByOverlaps(gi, gr)`\n##  GInteractions object with 3 interactions and 2 metadata columns:\n##        seqnames1   ranges1     seqnames2     ranges2 |       idx        type\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt;   &lt;IRanges&gt; | &lt;integer&gt; &lt;character&gt;\n##    [1]      chr1 5001-6000 ---      chr1   8001-9000 |         3         cis\n##    [2]      chr1 8001-9000 ---      chr1   7001-8000 |         4       trans\n##    [3]      chr1 7001-8000 ---      chr2 13000-14000 |         5         cis\n##    -------\n##    regions: 7 ranges and 2 metadata columns\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\n\nCase 2: Overlap between anchors of an interaction with anchors of another interaction\n\nThis slightly different scenario involves overlapping two sets of interactions, to see whether any interaction in Set-1 has its two anchors overlapping anchors from an interaction in Set-2.\n\ngi2 &lt;- GInteractions(\n    GRanges(\"chr1:1081-1090\"), \n    GRanges(\"chr1:3401-3501\")\n)\ngi %over% gi2\n##  [1] FALSE  TRUE FALSE FALSE FALSE\n\nNote that both anchors of an interaction from a query have to overlap to a pair of anchors of a single interaction from a subject with this method!\n\ngi3 &lt;- GInteractions(\n    GRanges(\"chr1:1-1000\"), \n    GRanges(\"chr1:3401-3501\")\n)\ngi %over% gi3\n##  [1] FALSE FALSE FALSE FALSE FALSE\n\n\nCase 3 : Spanning of the interaction “accross” a genomic range\n\nThis requires a bit of wrangling, to mimic an overlap between two GRanges objects:\n\ngi &lt;- swapAnchors(gi) # ----- Make sure anchors are correctly sorted\ngi &lt;- sort(gi) # ----- Make sure interactions are correctly sorted\ngi &lt;- gi[!is.na(pairdist(gi))] # ----- Remove inter-chromosomal interactions\nspanning_gi &lt;- GRanges(\n    seqnames = seqnames(anchors(gi)[[1]]), \n    ranges = IRanges(\n        start(anchors(gi)[[1]]), \n        end(anchors(gi)[[2]])\n    )\n)\nspanning_gi \n##  GRanges object with 4 ranges and 0 metadata columns:\n##        seqnames    ranges strand\n##           &lt;Rle&gt; &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]     chr1     1-100      *\n##    [2]     chr1 1001-4000      *\n##    [3]     chr1 5001-9000      *\n##    [4]     chr1 7001-9000      *\n##    -------\n##    seqinfo: 2 sequences from an unspecified genome; no seqlengths\n\nspanning_gi %over% gr\n##  [1] FALSE FALSE  TRUE  TRUE\n\n\n\n\n\n\n\nGoing further\n\n\n\nA detailed manual of overlap methods available for GInteractions object can be read by typing ?`Interaction-overlaps` in R."
   },
   {
     "objectID": "data-representation.html#contactfile-class",
     "href": "data-representation.html#contactfile-class",
     "title": "\n2  Hi-C data structures in R\n",
     "section": "\n2.3 ContactFile class",
-    "text": "2.3 ContactFile class\nHi-C contacts can be stored in four different formats (see previous chapter):\n\nAs a .(m)cool matrix (multi-scores, multi-resolution, indexed)\nAs a .hic matrix (multi-scores, multi-resolution, indexed)\nAs a HiC-pro derived matrix (single-score, single-resolution, non-indexed)\nUnbinned, Hi-C contacts can be stored in .pairs files\n\n\n2.3.1 Accessing example Hi-C files\nExample contact files can be downloaded using HiContactsData function.\n\nlibrary(HiContactsData)\ncoolf &lt;- HiContactsData('yeast_wt', 'mcool')\n\nThis fetches files from the cloud, download them locally and returns the path of the local file.\n\ncoolf\n##                                                   EH7702 \n##  \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\"\n\nSimilarly, example files are available for other file formats:\n\nhicf &lt;- HiContactsData('yeast_wt', 'hic')\nhicpromatrixf &lt;- HiContactsData('yeast_wt', 'hicpro_matrix')\nhicproregionsf &lt;- HiContactsData('yeast_wt', 'hicpro_bed')\npairsf &lt;- HiContactsData('yeast_wt', 'pairs.gz')\n\nWe can even check the content of some of these files to make sure they are actually what they are:\n\n# ---- HiC-Pro generates a tab-separated `regions.bed` file\nreadLines(hicproregionsf, 25)\n##   [1] \"I\\t0\\t1000\"      \"I\\t1000\\t2000\"   \"I\\t2000\\t3000\"   \"I\\t3000\\t4000\"  \n##   [5] \"I\\t4000\\t5000\"   \"I\\t5000\\t6000\"   \"I\\t6000\\t7000\"   \"I\\t7000\\t8000\"  \n##   [9] \"I\\t8000\\t9000\"   \"I\\t9000\\t10000\"  \"I\\t10000\\t11000\" \"I\\t11000\\t12000\"\n##  [13] \"I\\t12000\\t13000\" \"I\\t13000\\t14000\" \"I\\t14000\\t15000\" \"I\\t15000\\t16000\"\n##  [17] \"I\\t16000\\t17000\" \"I\\t17000\\t18000\" \"I\\t18000\\t19000\" \"I\\t19000\\t20000\"\n##  [21] \"I\\t20000\\t21000\" \"I\\t21000\\t22000\" \"I\\t22000\\t23000\" \"I\\t23000\\t24000\"\n##  [25] \"I\\t24000\\t25000\"\n\n# ---- Pairs are also tab-separated \nreadLines(pairsf, 25)\n##   [1] \"## pairs format v1.0\"                                                             \n##   [2] \"#sorted: chr1-pos1-chr2-pos2\"                                                     \n##   [3] \"#columns: readID chr1 pos1 chr2 pos2 strand1 strand2 frag1 frag2\"                 \n##   [4] \"#chromsize: I 230218\"                                                             \n##   [5] \"#chromsize: II 813184\"                                                            \n##   [6] \"#chromsize: III 316620\"                                                           \n##   [7] \"#chromsize: IV 1531933\"                                                           \n##   [8] \"#chromsize: V 576874\"                                                             \n##   [9] \"#chromsize: VI 270161\"                                                            \n##  [10] \"#chromsize: VII 1090940\"                                                          \n##  [11] \"#chromsize: VIII 562643\"                                                          \n##  [12] \"#chromsize: IX 439888\"                                                            \n##  [13] \"#chromsize: X 745751\"                                                             \n##  [14] \"#chromsize: XI 666816\"                                                            \n##  [15] \"#chromsize: XII 1078177\"                                                          \n##  [16] \"#chromsize: XIII 924431\"                                                          \n##  [17] \"#chromsize: XIV 784333\"                                                           \n##  [18] \"#chromsize: XV 1091291\"                                                           \n##  [19] \"#chromsize: XVI 948066\"                                                           \n##  [20] \"#chromsize: Mito 85779\"                                                           \n##  [21] \"NS500150:527:HHGYNBGXF:3:21611:19085:3986\\tII\\t105\\tII\\t48548\\t+\\t-\\t1358\\t1681\"  \n##  [22] \"NS500150:527:HHGYNBGXF:4:13604:19734:2406\\tII\\t113\\tII\\t45003\\t-\\t+\\t1358\\t1658\"  \n##  [23] \"NS500150:527:HHGYNBGXF:2:11108:25178:11036\\tII\\t119\\tII\\t687251\\t-\\t+\\t1358\\t5550\"\n##  [24] \"NS500150:527:HHGYNBGXF:1:22301:8468:1586\\tII\\t160\\tII\\t26124\\t+\\t-\\t1358\\t1510\"   \n##  [25] \"NS500150:527:HHGYNBGXF:4:23606:24037:2076\\tII\\t169\\tII\\t39052\\t+\\t+\\t1358\\t1613\"\n\n\n2.3.2 ContactFile fundamentals\nA ContactFile object establishes a connection with a disk-stored Hi-C file (e.g. a .cool file, or a .pairs file, …). ContactFile classes are defined in the HiCExperiment package.\nContactFiles come in four different flavors:\n\n\nCoolFile: connection to a .(m)cool file\n\nHicFile: connection to a .hic file\n\nHicproFile: connection to output files generated by HiC-Pro\n\nPairsFile: connection to a .pairs file\n\nTo create each flavor of ContactFile, one can use the corresponding function:\n\nlibrary(HiCExperiment)\n\n# ----- This creates a connection to a `.(m)cool` file (path stored in `coolf`)\nCoolFile(coolf)\n##  CoolFile object\n##  .mcool file: /github/home/.cache/R/ExperimentHub/1a594277bd62_7752 \n##  resolution: 1000 \n##  pairs file: \n##  metadata(0):\n\n# ----- This creates a connection to a `.hic` file (path stored in `hicf`)\nHicFile(hicf)\n##  HicFile object\n##  .hic file: /github/home/.cache/R/ExperimentHub/1a5939a379f0_7836 \n##  resolution: 1000 \n##  pairs file: \n##  metadata(0):\n\n# ----- This creates a connection to output files from HiC-Pro\nHicproFile(hicpromatrixf, hicproregionsf)\n##  HicproFile object\n##  HiC-Pro files:\n##    $ matrix:   /github/home/.cache/R/ExperimentHub/1a59dc812a9_7837 \n##    $ regions:  /github/home/.cache/R/ExperimentHub/1a591fa0216e_7838 \n##  resolution: 1000 \n##  pairs file: \n##  metadata(0):\n\n# ----- This creates a connection to a pairs file\nPairsFile(pairsf)\n##  PairsFile object\n##  resource: /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753\n\n\n2.3.3 ContactFile slots\nSeveral “slots” (i.e. pieces of information) are attached to a ContactFile object:\n\nThe path to the disk-stored contact matrix;\nThe active resolution (by default, the finest resolution available in a multi-resolution contact matrix);\nOptionally, the path to a matching pairs file (see below);\nSome metadata.\n\nSlots of a CoolFile object can be accessed as follow:\n\ncf &lt;- CoolFile(coolf)\ncf\n##  CoolFile object\n##  .mcool file: /github/home/.cache/R/ExperimentHub/1a594277bd62_7752 \n##  resolution: 1000 \n##  pairs file: \n##  metadata(0):\n\nresolution(cf)\n##  [1] 1000\n\npairsFile(cf)\n##  NULL\n\nmetadata(cf)\n##  list()\n\n\n\n\n\n\n\nImportant!\n\n\n\nContactFile objects are only connections to a disk-stored HiC file. Although metadata is available, they do not contain actual data!\n\n\n\n2.3.4 ContactFile methods\nTwo useful methods are available for ContactFiles:\n\n\navailableResolutions checks which resolutions are available in a ContactFile.\n\n\navailableResolutions(cf)\n##  resolutions(5): 1000 2000 4000 8000 16000\n##  \n\n\n\navailableChromosomes checks which chromosomes are available in a ContactFile, along with their length.\n\n\navailableChromosomes(cf)\n##  Seqinfo object with 16 sequences from an unspecified genome:\n##    seqnames seqlengths isCircular genome\n##    I            230218       &lt;NA&gt;   &lt;NA&gt;\n##    II           813184       &lt;NA&gt;   &lt;NA&gt;\n##    III          316620       &lt;NA&gt;   &lt;NA&gt;\n##    IV          1531933       &lt;NA&gt;   &lt;NA&gt;\n##    V            576874       &lt;NA&gt;   &lt;NA&gt;\n##    ...             ...        ...    ...\n##    XII         1078177       &lt;NA&gt;   &lt;NA&gt;\n##    XIII         924431       &lt;NA&gt;   &lt;NA&gt;\n##    XIV          784333       &lt;NA&gt;   &lt;NA&gt;\n##    XV          1091291       &lt;NA&gt;   &lt;NA&gt;\n##    XVI          948066       &lt;NA&gt;   &lt;NA&gt;"
+    "text": "2.3 ContactFile class\nHi-C contacts can be stored in four different formats (see previous chapter):\n\nAs a .(m)cool matrix (multi-scores, multi-resolution, indexed)\nAs a .hic matrix (multi-scores, multi-resolution, indexed)\nAs a HiC-pro derived matrix (single-score, single-resolution, non-indexed)\nUnbinned, Hi-C contacts can be stored in .pairs files\n\n\n2.3.1 Accessing example Hi-C files\nExample contact files can be downloaded using HiContactsData function.\n\nlibrary(HiContactsData)\ncoolf &lt;- HiContactsData('yeast_wt', 'mcool')\n\nThis fetches files from the cloud, download them locally and returns the path of the local file.\n\ncoolf\n##                                                   EH7702 \n##  \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\"\n\nSimilarly, example files are available for other file formats:\n\nhicf &lt;- HiContactsData('yeast_wt', 'hic')\nhicpromatrixf &lt;- HiContactsData('yeast_wt', 'hicpro_matrix')\nhicproregionsf &lt;- HiContactsData('yeast_wt', 'hicpro_bed')\npairsf &lt;- HiContactsData('yeast_wt', 'pairs.gz')\n\nWe can even check the content of some of these files to make sure they are actually what they are:\n\n# ---- HiC-Pro generates a tab-separated `regions.bed` file\nreadLines(hicproregionsf, 25)\n##   [1] \"I\\t0\\t1000\"      \"I\\t1000\\t2000\"   \"I\\t2000\\t3000\"   \"I\\t3000\\t4000\"  \n##   [5] \"I\\t4000\\t5000\"   \"I\\t5000\\t6000\"   \"I\\t6000\\t7000\"   \"I\\t7000\\t8000\"  \n##   [9] \"I\\t8000\\t9000\"   \"I\\t9000\\t10000\"  \"I\\t10000\\t11000\" \"I\\t11000\\t12000\"\n##  [13] \"I\\t12000\\t13000\" \"I\\t13000\\t14000\" \"I\\t14000\\t15000\" \"I\\t15000\\t16000\"\n##  [17] \"I\\t16000\\t17000\" \"I\\t17000\\t18000\" \"I\\t18000\\t19000\" \"I\\t19000\\t20000\"\n##  [21] \"I\\t20000\\t21000\" \"I\\t21000\\t22000\" \"I\\t22000\\t23000\" \"I\\t23000\\t24000\"\n##  [25] \"I\\t24000\\t25000\"\n\n# ---- Pairs are also tab-separated \nreadLines(pairsf, 25)\n##   [1] \"## pairs format v1.0\"                                                             \n##   [2] \"#sorted: chr1-pos1-chr2-pos2\"                                                     \n##   [3] \"#columns: readID chr1 pos1 chr2 pos2 strand1 strand2 frag1 frag2\"                 \n##   [4] \"#chromsize: I 230218\"                                                             \n##   [5] \"#chromsize: II 813184\"                                                            \n##   [6] \"#chromsize: III 316620\"                                                           \n##   [7] \"#chromsize: IV 1531933\"                                                           \n##   [8] \"#chromsize: V 576874\"                                                             \n##   [9] \"#chromsize: VI 270161\"                                                            \n##  [10] \"#chromsize: VII 1090940\"                                                          \n##  [11] \"#chromsize: VIII 562643\"                                                          \n##  [12] \"#chromsize: IX 439888\"                                                            \n##  [13] \"#chromsize: X 745751\"                                                             \n##  [14] \"#chromsize: XI 666816\"                                                            \n##  [15] \"#chromsize: XII 1078177\"                                                          \n##  [16] \"#chromsize: XIII 924431\"                                                          \n##  [17] \"#chromsize: XIV 784333\"                                                           \n##  [18] \"#chromsize: XV 1091291\"                                                           \n##  [19] \"#chromsize: XVI 948066\"                                                           \n##  [20] \"#chromsize: Mito 85779\"                                                           \n##  [21] \"NS500150:527:HHGYNBGXF:3:21611:19085:3986\\tII\\t105\\tII\\t48548\\t+\\t-\\t1358\\t1681\"  \n##  [22] \"NS500150:527:HHGYNBGXF:4:13604:19734:2406\\tII\\t113\\tII\\t45003\\t-\\t+\\t1358\\t1658\"  \n##  [23] \"NS500150:527:HHGYNBGXF:2:11108:25178:11036\\tII\\t119\\tII\\t687251\\t-\\t+\\t1358\\t5550\"\n##  [24] \"NS500150:527:HHGYNBGXF:1:22301:8468:1586\\tII\\t160\\tII\\t26124\\t+\\t-\\t1358\\t1510\"   \n##  [25] \"NS500150:527:HHGYNBGXF:4:23606:24037:2076\\tII\\t169\\tII\\t39052\\t+\\t+\\t1358\\t1613\"\n\n\n2.3.2 ContactFile fundamentals\nA ContactFile object establishes a connection with a disk-stored Hi-C file (e.g. a .cool file, or a .pairs file, …). ContactFile classes are defined in the HiCExperiment package.\nContactFiles come in four different flavors:\n\n\nCoolFile: connection to a .(m)cool file\n\nHicFile: connection to a .hic file\n\nHicproFile: connection to output files generated by HiC-Pro\n\nPairsFile: connection to a .pairs file\n\nTo create each flavor of ContactFile, one can use the corresponding function:\n\nlibrary(HiCExperiment)\n\n# ----- This creates a connection to a `.(m)cool` file (path stored in `coolf`)\nCoolFile(coolf)\n##  CoolFile object\n##  .mcool file: /github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752 \n##  resolution: 1000 \n##  pairs file: \n##  metadata(0):\n\n# ----- This creates a connection to a `.hic` file (path stored in `hicf`)\nHicFile(hicf)\n##  HicFile object\n##  .hic file: /github/home/.cache/R/ExperimentHub/1a9a270f71fe_7836 \n##  resolution: 1000 \n##  pairs file: \n##  metadata(0):\n\n# ----- This creates a connection to output files from HiC-Pro\nHicproFile(hicpromatrixf, hicproregionsf)\n##  HicproFile object\n##  HiC-Pro files:\n##    $ matrix:   /github/home/.cache/R/ExperimentHub/1a9a6531ab2c_7837 \n##    $ regions:  /github/home/.cache/R/ExperimentHub/1a9a3c1fca84_7838 \n##  resolution: 1000 \n##  pairs file: \n##  metadata(0):\n\n# ----- This creates a connection to a pairs file\nPairsFile(pairsf)\n##  PairsFile object\n##  resource: /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753\n\n\n2.3.3 ContactFile slots\nSeveral “slots” (i.e. pieces of information) are attached to a ContactFile object:\n\nThe path to the disk-stored contact matrix;\nThe active resolution (by default, the finest resolution available in a multi-resolution contact matrix);\nOptionally, the path to a matching pairs file (see below);\nSome metadata.\n\nSlots of a CoolFile object can be accessed as follow:\n\ncf &lt;- CoolFile(coolf)\ncf\n##  CoolFile object\n##  .mcool file: /github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752 \n##  resolution: 1000 \n##  pairs file: \n##  metadata(0):\n\nresolution(cf)\n##  [1] 1000\n\npairsFile(cf)\n##  NULL\n\nmetadata(cf)\n##  list()\n\n\n\n\n\n\n\nImportant!\n\n\n\nContactFile objects are only connections to a disk-stored HiC file. Although metadata is available, they do not contain actual data!\n\n\n\n2.3.4 ContactFile methods\nTwo useful methods are available for ContactFiles:\n\n\navailableResolutions checks which resolutions are available in a ContactFile.\n\n\navailableResolutions(cf)\n##  resolutions(5): 1000 2000 4000 8000 16000\n##  \n\n\n\navailableChromosomes checks which chromosomes are available in a ContactFile, along with their length.\n\n\navailableChromosomes(cf)\n##  Seqinfo object with 16 sequences from an unspecified genome:\n##    seqnames seqlengths isCircular genome\n##    I            230218       &lt;NA&gt;   &lt;NA&gt;\n##    II           813184       &lt;NA&gt;   &lt;NA&gt;\n##    III          316620       &lt;NA&gt;   &lt;NA&gt;\n##    IV          1531933       &lt;NA&gt;   &lt;NA&gt;\n##    V            576874       &lt;NA&gt;   &lt;NA&gt;\n##    ...             ...        ...    ...\n##    XII         1078177       &lt;NA&gt;   &lt;NA&gt;\n##    XIII         924431       &lt;NA&gt;   &lt;NA&gt;\n##    XIV          784333       &lt;NA&gt;   &lt;NA&gt;\n##    XV          1091291       &lt;NA&gt;   &lt;NA&gt;\n##    XVI          948066       &lt;NA&gt;   &lt;NA&gt;"
   },
   {
     "objectID": "data-representation.html#hicexperiment-class",
     "href": "data-representation.html#hicexperiment-class",
     "title": "\n2  Hi-C data structures in R\n",
     "section": "\n2.4 HiCExperiment class",
-    "text": "2.4 HiCExperiment class\nBased on the previous sections, we have different Bioconductor classes relevant for Hi-C:\n\n\nGInteractions which can be used to represent genomic interactions in R\n\nContactFiles which can be used to establish a connection with disk-stored Hi-C files\n\nHiCExperiment objects are created when parsing a ContactFile in R. The HiCExperiment class reads a ContactFile in memory and store genomic interactions as GInteractions. The HiCExperiment class is, quite obviously, defined in the HiCExperiment package.\n\n2.4.1 Creating a HiCExperiment object\n\n2.4.1.1 Importing a ContactFile\n\nIn practice, to create a HiCExperiment object from a ContactFile, one can use the import method.\n\n\n\n\n\n\nCaution\n\n\n\n\nCreating a HiCExperiment object means importing data from a Hi-C matrix (e.g.  from a ContactFile) in memory in R.\n\nCreating a HiCExperiment object from large disk-stored contact matrices can potentially take a long time.\n\n\n\n\ncf &lt;- CoolFile(coolf)\nhic &lt;- import(cf)\nhic\n##  `HiCExperiment` object with 8,757,906 contacts over 12,079 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"whole genome\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 2945692 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\nPrinting a HiCExperiment to the console will not reveal the actual data stored in the object (it would most likely crash your R session!). Instead, it gives a summary of the data stored in the object:\n\nThe fileName, i.e. the path to the disk-stored data file\nThe focus, i.e. the genomic location for which data has been imported (in the example above, \"whole genome\" implies that all the data has been imported in R)\n\nresolutions available in the disk-stored data file (this will be identical to availableResolutions(cf))\n\nactive resolution indicates at which resolution the data is currently imported\n\ninteractions refers to the actual GInteractions imported in R and “hidden” (for now!) in the HiCExperiment object\n\nscores refer to different interaction frequency estimates. These can be raw counts, balanced (if the contact matrix has been previously normalized), or whatever score the end-user want to attribute to each interaction (e.g. ratio of counts between two Hi-C maps, …)\n\ntopologicalFeatures is a list of GRanges or GInteractions objects to describe important topological features.\n\npairsFile is a pointer to an optional disk-stored .pairs file from which the contact matrix has been created. This is often useful to estimate some Hi-C metrics.\n\nmetadata is a list to further describe the experiment.\n\n\n\n\n\n\n\nHiCExperiment slots\n\n\n\nThese pieces of information are called slots. They can be directly accessed using getter functions, bearing the same name than the slot.\n\nfileName(hic)\n##  [1] \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\"\n\nfocus(hic)\n##  NULL\n\nresolutions(hic)\n##  [1]  1000  2000  4000  8000 16000\n\nresolution(hic)\n##  [1] 1000\n\ninteractions(hic)\n##  GInteractions object with 2945692 interactions and 4 metadata columns:\n##              seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##                  &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##          [1]         I        1-1000 ---         I        1-1000 |         0\n##          [2]         I        1-1000 ---         I     1001-2000 |         0\n##          [3]         I        1-1000 ---         I     2001-3000 |         0\n##          [4]         I        1-1000 ---         I     3001-4000 |         0\n##          [5]         I        1-1000 ---         I     4001-5000 |         0\n##          ...       ...           ... ...       ...           ... .       ...\n##    [2945688]       XVI 940001-941000 ---       XVI 942001-943000 |     12070\n##    [2945689]       XVI 940001-941000 ---       XVI 943001-944000 |     12070\n##    [2945690]       XVI 941001-942000 ---       XVI 941001-942000 |     12071\n##    [2945691]       XVI 941001-942000 ---       XVI 942001-943000 |     12071\n##    [2945692]       XVI 941001-942000 ---       XVI 943001-944000 |     12071\n##                bin_id2     count  balanced\n##              &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;\n##          [1]         0        15 0.0663491\n##          [2]         1        21 0.1273505\n##          [3]         2        21 0.0738691\n##          [4]         3        38 0.0827051\n##          [5]         4        17 0.0591984\n##          ...       ...       ...       ...\n##    [2945688]     12072        11 0.0575550\n##    [2945689]     12073         1       NaN\n##    [2945690]     12071        74 0.0504615\n##    [2945691]     12072        39 0.1624599\n##    [2945692]     12073         1       NaN\n##    -------\n##    regions: 12079 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome\n\nscores(hic)\n##  List of length 2\n##  names(2): count balanced\n\ntopologicalFeatures(hic)\n##  List of length 4\n##  names(4): compartments borders loops viewpoints\n\npairsFile(hic)\n##  NULL\n\nmetadata(hic)\n##  list()\n\n\n\n\n\n\n\n\n\nNotes\n\n\n\nimport also works for other types of ContactFile (HicFile, HicproFile, PairsFile), e.g. \n\nFor HicFile and HicproFile, import seamlessly returns a HiCExperiment as well:\n\n\nhf &lt;- HicFile(hicf)\nhic &lt;- import(hf)\nhic\n##  `HiCExperiment` object with 13,681,280 contacts over 12,165 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a5939a379f0_7836\" \n##  focus: \"whole genome\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 2965693 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nFor PairsFile, the returned object is a representation of Hi-C “pairs” in R, i.e. GInteractions\n\n\n\npf &lt;- PairsFile(pairsf)\npairs &lt;- import(pf)\npairs\n##  GInteractions object with 471364 interactions and 3 metadata columns:\n##             seqnames1   ranges1     seqnames2   ranges2 |     frag1     frag2\n##                 &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt; | &lt;numeric&gt; &lt;numeric&gt;\n##         [1]        II       105 ---        II     48548 |      1358      1681\n##         [2]        II       113 ---        II     45003 |      1358      1658\n##         [3]        II       119 ---        II    687251 |      1358      5550\n##         [4]        II       160 ---        II     26124 |      1358      1510\n##         [5]        II       169 ---        II     39052 |      1358      1613\n##         ...       ...       ... ...       ...       ... .       ...       ...\n##    [471360]        II    808605 ---        II    809683 |      6316      6320\n##    [471361]        II    808609 ---        II    809917 |      6316      6324\n##    [471362]        II    808617 ---        II    809506 |      6316      6319\n##    [471363]        II    809447 ---        II    809685 |      6319      6321\n##    [471364]        II    809472 ---        II    809675 |      6319      6320\n##              distance\n##             &lt;integer&gt;\n##         [1]     48443\n##         [2]     44890\n##         [3]    687132\n##         [4]     25964\n##         [5]     38883\n##         ...       ...\n##    [471360]      1078\n##    [471361]      1308\n##    [471362]       889\n##    [471363]       238\n##    [471364]       203\n##    -------\n##    regions: 549331 ranges and 0 metadata columns\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\n\n\n\n2.4.1.2 Customizing the import\n\nTo reduce the import to only parse the data that is relevant to the study, two arguments can be passed to import, along with a ContactFile.\n\n\n\n\n\n\nKey import arguments:\n\n\n\n\n\nfocus: This can be used to only parse data for a specific genomic location.\n\nresolution: This can be used to choose which resolution to parse the contact matrix at (this is ignored if the ContactFile is not multi-resolution, e.g. .cool or HiC-Pro generated matrices)\n\n\n\n\nImport interactions within a single chromosome:\n\n\nhic &lt;- import(cf, focus = 'II', resolution = 2000)\n\nregions(hic) # ---- `regions()` work on `HiCExperiment` the same way than on `GInteractions`\n##  GRanges object with 407 ranges and 4 metadata columns:\n##                     seqnames        ranges strand |    bin_id    weight   chr\n##                        &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##           II_1_2000       II        1-2000      * |       116       NaN    II\n##        II_2001_4000       II     2001-4000      * |       117       NaN    II\n##        II_4001_6000       II     4001-6000      * |       118       NaN    II\n##        II_6001_8000       II     6001-8000      * |       119       NaN    II\n##       II_8001_10000       II    8001-10000      * |       120 0.0461112    II\n##                 ...      ...           ...    ... .       ...       ...   ...\n##    II_804001_806000       II 804001-806000      * |       518 0.0493107    II\n##    II_806001_808000       II 806001-808000      * |       519 0.0611355    II\n##    II_808001_810000       II 808001-810000      * |       520       NaN    II\n##    II_810001_812000       II 810001-812000      * |       521       NaN    II\n##    II_812001_813184       II 812001-813184      * |       522       NaN    II\n##                        center\n##                     &lt;integer&gt;\n##           II_1_2000      1000\n##        II_2001_4000      3000\n##        II_4001_6000      5000\n##        II_6001_8000      7000\n##       II_8001_10000      9000\n##                 ...       ...\n##    II_804001_806000    805000\n##    II_806001_808000    807000\n##    II_808001_810000    809000\n##    II_810001_812000    811000\n##    II_812001_813184    812592\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\ntable(seqnames(regions(hic)))\n##  \n##     I   II  III   IV    V   VI  VII VIII   IX    X   XI  XII XIII  XIV   XV \n##     0  407    0    0    0    0    0    0    0    0    0    0    0    0    0 \n##   XVI \n##     0\n\nanchors(hic) # ---- `anchors()` work on `HiCExperiment` the same way than on `GInteractions`\n##  $first\n##  GRanges object with 34063 ranges and 4 metadata columns:\n##            seqnames        ranges strand |    bin_id    weight   chr\n##               &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##        [1]       II        1-2000      * |       116       NaN    II\n##        [2]       II        1-2000      * |       116       NaN    II\n##        [3]       II        1-2000      * |       116       NaN    II\n##        [4]       II        1-2000      * |       116       NaN    II\n##        [5]       II        1-2000      * |       116       NaN    II\n##        ...      ...           ...    ... .       ...       ...   ...\n##    [34059]       II 804001-806000      * |       518 0.0493107    II\n##    [34060]       II 806001-808000      * |       519 0.0611355    II\n##    [34061]       II 806001-808000      * |       519 0.0611355    II\n##    [34062]       II 806001-808000      * |       519 0.0611355    II\n##    [34063]       II 808001-810000      * |       520       NaN    II\n##               center\n##            &lt;integer&gt;\n##        [1]      1000\n##        [2]      1000\n##        [3]      1000\n##        [4]      1000\n##        [5]      1000\n##        ...       ...\n##    [34059]    805000\n##    [34060]    807000\n##    [34061]    807000\n##    [34062]    807000\n##    [34063]    809000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n##  \n##  $second\n##  GRanges object with 34063 ranges and 4 metadata columns:\n##            seqnames        ranges strand |    bin_id    weight   chr\n##               &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##        [1]       II        1-2000      * |       116       NaN    II\n##        [2]       II     4001-6000      * |       118       NaN    II\n##        [3]       II     6001-8000      * |       119       NaN    II\n##        [4]       II    8001-10000      * |       120 0.0461112    II\n##        [5]       II   10001-12000      * |       121 0.0334807    II\n##        ...      ...           ...    ... .       ...       ...   ...\n##    [34059]       II 810001-812000      * |       521       NaN    II\n##    [34060]       II 806001-808000      * |       519 0.0611355    II\n##    [34061]       II 808001-810000      * |       520       NaN    II\n##    [34062]       II 810001-812000      * |       521       NaN    II\n##    [34063]       II 808001-810000      * |       520       NaN    II\n##               center\n##            &lt;integer&gt;\n##        [1]      1000\n##        [2]      5000\n##        [3]      7000\n##        [4]      9000\n##        [5]     11000\n##        ...       ...\n##    [34059]    811000\n##    [34060]    807000\n##    [34061]    809000\n##    [34062]    811000\n##    [34063]    809000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\nImport interactions within a segment of a chromosome:\n\n\nhic &lt;- import(cf, focus = 'II:40000-60000', resolution = 1000)\n\nregions(hic) \n##  GRanges object with 21 ranges and 4 metadata columns:\n##                   seqnames      ranges strand |    bin_id    weight   chr\n##                      &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##    II_39001_40000       II 39001-40000      * |       270 0.0220798    II\n##    II_40001_41000       II 40001-41000      * |       271 0.0246775    II\n##    II_41001_42000       II 41001-42000      * |       272 0.0269232    II\n##    II_42001_43000       II 42001-43000      * |       273 0.0341849    II\n##    II_43001_44000       II 43001-44000      * |       274 0.0265386    II\n##               ...      ...         ...    ... .       ...       ...   ...\n##    II_55001_56000       II 55001-56000      * |       286 0.0213532    II\n##    II_56001_57000       II 56001-57000      * |       287 0.0569839    II\n##    II_57001_58000       II 57001-58000      * |       288 0.0338612    II\n##    II_58001_59000       II 58001-59000      * |       289 0.0294531    II\n##    II_59001_60000       II 59001-60000      * |       290 0.0306662    II\n##                      center\n##                   &lt;integer&gt;\n##    II_39001_40000     39500\n##    II_40001_41000     40500\n##    II_41001_42000     41500\n##    II_42001_43000     42500\n##    II_43001_44000     43500\n##               ...       ...\n##    II_55001_56000     55500\n##    II_56001_57000     56500\n##    II_57001_58000     57500\n##    II_58001_59000     58500\n##    II_59001_60000     59500\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\nanchors(hic)\n##  $first\n##  GRanges object with 210 ranges and 4 metadata columns:\n##          seqnames      ranges strand |    bin_id    weight   chr    center\n##             &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt; &lt;integer&gt;\n##      [1]       II 40001-41000      * |       271 0.0246775    II     40500\n##      [2]       II 40001-41000      * |       271 0.0246775    II     40500\n##      [3]       II 40001-41000      * |       271 0.0246775    II     40500\n##      [4]       II 40001-41000      * |       271 0.0246775    II     40500\n##      [5]       II 40001-41000      * |       271 0.0246775    II     40500\n##      ...      ...         ...    ... .       ...       ...   ...       ...\n##    [206]       II 57001-58000      * |       288 0.0338612    II     57500\n##    [207]       II 57001-58000      * |       288 0.0338612    II     57500\n##    [208]       II 58001-59000      * |       289 0.0294531    II     58500\n##    [209]       II 58001-59000      * |       289 0.0294531    II     58500\n##    [210]       II 59001-60000      * |       290 0.0306662    II     59500\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n##  \n##  $second\n##  GRanges object with 210 ranges and 4 metadata columns:\n##          seqnames      ranges strand |    bin_id    weight   chr    center\n##             &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt; &lt;integer&gt;\n##      [1]       II 40001-41000      * |       271 0.0246775    II     40500\n##      [2]       II 41001-42000      * |       272 0.0269232    II     41500\n##      [3]       II 42001-43000      * |       273 0.0341849    II     42500\n##      [4]       II 43001-44000      * |       274 0.0265386    II     43500\n##      [5]       II 44001-45000      * |       275 0.0488968    II     44500\n##      ...      ...         ...    ... .       ...       ...   ...       ...\n##    [206]       II 58001-59000      * |       289 0.0294531    II     58500\n##    [207]       II 59001-60000      * |       290 0.0306662    II     59500\n##    [208]       II 58001-59000      * |       289 0.0294531    II     58500\n##    [209]       II 59001-60000      * |       290 0.0306662    II     59500\n##    [210]       II 59001-60000      * |       290 0.0306662    II     59500\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\nImport interactions between two chromosomes:\n\n\nhic2 &lt;- import(cf, focus = 'II|XV', resolution = 4000)\n\nregions(hic2)\n##  GRanges object with 477 ranges and 4 metadata columns:\n##                       seqnames          ranges strand |    bin_id    weight\n##                          &lt;Rle&gt;       &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt;\n##             II_1_4000       II          1-4000      * |        58       NaN\n##          II_4001_8000       II       4001-8000      * |        59       NaN\n##         II_8001_12000       II      8001-12000      * |        60 0.0274474\n##        II_12001_16000       II     12001-16000      * |        61 0.0342116\n##        II_16001_20000       II     16001-20000      * |        62 0.0195128\n##                   ...      ...             ...    ... .       ...       ...\n##    XV_1072001_1076000       XV 1072001-1076000      * |      2783  0.041763\n##    XV_1076001_1080000       XV 1076001-1080000      * |      2784       NaN\n##    XV_1080001_1084000       XV 1080001-1084000      * |      2785       NaN\n##    XV_1084001_1088000       XV 1084001-1088000      * |      2786       NaN\n##    XV_1088001_1091291       XV 1088001-1091291      * |      2787       NaN\n##                         chr    center\n##                       &lt;Rle&gt; &lt;integer&gt;\n##             II_1_4000    II      2000\n##          II_4001_8000    II      6000\n##         II_8001_12000    II     10000\n##        II_12001_16000    II     14000\n##        II_16001_20000    II     18000\n##                   ...   ...       ...\n##    XV_1072001_1076000    XV   1074000\n##    XV_1076001_1080000    XV   1078000\n##    XV_1080001_1084000    XV   1082000\n##    XV_1084001_1088000    XV   1086000\n##    XV_1088001_1091291    XV   1089646\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\nanchors(hic2)\n##  $first\n##  GRanges object with 18032 ranges and 4 metadata columns:\n##            seqnames        ranges strand |    bin_id    weight   chr\n##               &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##        [1]       II        1-4000      * |        58       NaN    II\n##        [2]       II        1-4000      * |        58       NaN    II\n##        [3]       II        1-4000      * |        58       NaN    II\n##        [4]       II        1-4000      * |        58       NaN    II\n##        [5]       II        1-4000      * |        58       NaN    II\n##        ...      ...           ...    ... .       ...       ...   ...\n##    [18028]       II 808001-812000      * |       260       NaN    II\n##    [18029]       II 808001-812000      * |       260       NaN    II\n##    [18030]       II 808001-812000      * |       260       NaN    II\n##    [18031]       II 808001-812000      * |       260       NaN    II\n##    [18032]       II 808001-812000      * |       260       NaN    II\n##               center\n##            &lt;integer&gt;\n##        [1]      2000\n##        [2]      2000\n##        [3]      2000\n##        [4]      2000\n##        [5]      2000\n##        ...       ...\n##    [18028]    810000\n##    [18029]    810000\n##    [18030]    810000\n##    [18031]    810000\n##    [18032]    810000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n##  \n##  $second\n##  GRanges object with 18032 ranges and 4 metadata columns:\n##            seqnames          ranges strand |    bin_id    weight   chr\n##               &lt;Rle&gt;       &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##        [1]       XV     48001-52000      * |      2527 0.0185354    XV\n##        [2]       XV   348001-352000      * |      2602 0.0233750    XV\n##        [3]       XV   468001-472000      * |      2632 0.0153615    XV\n##        [4]       XV   472001-476000      * |      2633 0.0189624    XV\n##        [5]       XV   584001-588000      * |      2661 0.0167715    XV\n##        ...      ...             ...    ... .       ...       ...   ...\n##    [18028]       XV   980001-984000      * |      2760 0.0187827    XV\n##    [18029]       XV   984001-988000      * |      2761 0.0250094    XV\n##    [18030]       XV   992001-996000      * |      2763 0.0185599    XV\n##    [18031]       XV 1004001-1008000      * |      2766 0.0196942    XV\n##    [18032]       XV 1064001-1068000      * |      2781 0.0208220    XV\n##               center\n##            &lt;integer&gt;\n##        [1]     50000\n##        [2]    350000\n##        [3]    470000\n##        [4]    474000\n##        [5]    586000\n##        ...       ...\n##    [18028]    982000\n##    [18029]    986000\n##    [18030]    994000\n##    [18031]   1006000\n##    [18032]   1066000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\nImport interactions between segments of two chromosomes:\n\n\nhic3 &lt;- import(cf, focus = 'III:10000-40000|XV:10000-40000', resolution = 2000)\n\nregions(hic3)\n##  GRanges object with 32 ranges and 4 metadata columns:\n##                    seqnames      ranges strand |    bin_id    weight   chr\n##                       &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##     III_8001_10000      III  8001-10000      * |       527       NaN   III\n##    III_10001_12000      III 10001-12000      * |       528       NaN   III\n##    III_12001_14000      III 12001-14000      * |       529       NaN   III\n##    III_14001_16000      III 14001-16000      * |       530 0.0356351   III\n##    III_16001_18000      III 16001-18000      * |       531 0.0230693   III\n##                ...      ...         ...    ... .       ...       ...   ...\n##     XV_30001_32000       XV 30001-32000      * |      5039 0.0482465    XV\n##     XV_32001_34000       XV 32001-34000      * |      5040 0.0241580    XV\n##     XV_34001_36000       XV 34001-36000      * |      5041 0.0273166    XV\n##     XV_36001_38000       XV 36001-38000      * |      5042 0.0542235    XV\n##     XV_38001_40000       XV 38001-40000      * |      5043 0.0206849    XV\n##                       center\n##                    &lt;integer&gt;\n##     III_8001_10000      9000\n##    III_10001_12000     11000\n##    III_12001_14000     13000\n##    III_14001_16000     15000\n##    III_16001_18000     17000\n##                ...       ...\n##     XV_30001_32000     31000\n##     XV_32001_34000     33000\n##     XV_34001_36000     35000\n##     XV_36001_38000     37000\n##     XV_38001_40000     39000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\nanchors(hic3)\n##  $first\n##  GRanges object with 11 ranges and 4 metadata columns:\n##         seqnames      ranges strand |    bin_id    weight   chr    center\n##            &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt; &lt;integer&gt;\n##     [1]      III 14001-16000      * |       530 0.0356351   III     15000\n##     [2]      III 16001-18000      * |       531 0.0230693   III     17000\n##     [3]      III 16001-18000      * |       531 0.0230693   III     17000\n##     [4]      III 20001-22000      * |       533 0.0343250   III     21000\n##     [5]      III 22001-24000      * |       534 0.0258604   III     23000\n##     [6]      III 24001-26000      * |       535 0.0290757   III     25000\n##     [7]      III 28001-30000      * |       537 0.0290713   III     29000\n##     [8]      III 30001-32000      * |       538 0.0266373   III     31000\n##     [9]      III 32001-34000      * |       539 0.0201137   III     33000\n##    [10]      III 32001-34000      * |       539 0.0201137   III     33000\n##    [11]      III 36001-38000      * |       541 0.0220603   III     37000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n##  \n##  $second\n##  GRanges object with 11 ranges and 4 metadata columns:\n##         seqnames      ranges strand |    bin_id    weight   chr    center\n##            &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt; &lt;integer&gt;\n##     [1]       XV 16001-18000      * |      5032 0.0187250    XV     17000\n##     [2]       XV 16001-18000      * |      5032 0.0187250    XV     17000\n##     [3]       XV 20001-22000      * |      5034 0.0247973    XV     21000\n##     [4]       XV 14001-16000      * |      5031 0.0379727    XV     15000\n##     [5]       XV 10001-12000      * |      5029 0.0296913    XV     11000\n##     [6]       XV 32001-34000      * |      5040 0.0241580    XV     33000\n##     [7]       XV 16001-18000      * |      5032 0.0187250    XV     17000\n##     [8]       XV 38001-40000      * |      5043 0.0206849    XV     39000\n##     [9]       XV 22001-24000      * |      5035 0.0613856    XV     23000\n##    [10]       XV 30001-32000      * |      5039 0.0482465    XV     31000\n##    [11]       XV 10001-12000      * |      5029 0.0296913    XV     11000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n2.4.2 Interacting with HiCExperiment data\n\nAn HiCExperiment object allows parsing of a disk-stored contact matrix.\nAn HiCExperiment object operates by wrapping together (1) a ContactFile (i.e. a connection to a disk-stored data file) and (2) a GInteractions generated by parsing the data file.\n\nWe will use the yeast_hic HiCExperiment object to demonstrate how to parse information from a HiCExperiment object.\n\nyeast_hic &lt;- contacts_yeast()\n\n\nyeast_hic\n##  `HiCExperiment` object with 8,757,906 contacts over 763 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"whole genome\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 16000 \n##  interactions: 267709 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753 \n##  metadata(3): ID org date\n\n\n2.4.2.1 Interactions\nThe imported genomic interactions can be directly exposed using the interactions function and are returned as a GInteractions object.\n\ninteractions(yeast_hic)\n##  GInteractions object with 267709 interactions and 4 metadata columns:\n##             seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##                 &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##         [1]         I       1-16000 ---         I       1-16000 |         0\n##         [2]         I       1-16000 ---         I   16001-32000 |         0\n##         [3]         I       1-16000 ---         I   32001-48000 |         0\n##         [4]         I       1-16000 ---         I   48001-64000 |         0\n##         [5]         I       1-16000 ---         I   64001-80000 |         0\n##         ...       ...           ... ...       ...           ... .       ...\n##    [267705]       XVI 896001-912000 ---       XVI 912001-928000 |       759\n##    [267706]       XVI 896001-912000 ---       XVI 928001-944000 |       759\n##    [267707]       XVI 912001-928000 ---       XVI 912001-928000 |       760\n##    [267708]       XVI 912001-928000 ---       XVI 928001-944000 |       760\n##    [267709]       XVI 928001-944000 ---       XVI 928001-944000 |       761\n##               bin_id2     count  balanced\n##             &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;\n##         [1]         0      2836 1.0943959\n##         [2]         1      2212 0.9592069\n##         [3]         2      1183 0.4385242\n##         [4]         3       831 0.2231192\n##         [5]         4       310 0.0821255\n##         ...       ...       ...       ...\n##    [267705]       760      3565  1.236371\n##    [267706]       761      1359  0.385016\n##    [267707]       760      3534  2.103988\n##    [267708]       761      3055  1.485794\n##    [267709]       761      4308  1.711565\n##    -------\n##    regions: 763 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n\n\n\n\n\nNote\n\n\n\nBecause genomic interactions are actually stored as GInteractions, regions and anchors work on HiCExperiment objects just as they work with GInteractions!\n\n\n\nregions(yeast_hic)\n##  GRanges object with 763 ranges and 4 metadata columns:\n##                      seqnames        ranges strand |    bin_id     weight\n##                         &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt;  &lt;numeric&gt;\n##            I_1_16000        I       1-16000      * |         0  0.0196442\n##        I_16001_32000        I   16001-32000      * |         1  0.0220746\n##        I_32001_48000        I   32001-48000      * |         2  0.0188701\n##        I_48001_64000        I   48001-64000      * |         3  0.0136679\n##        I_64001_80000        I   64001-80000      * |         4  0.0134860\n##                  ...      ...           ...    ... .       ...        ...\n##    XVI_880001_896000      XVI 880001-896000      * |       758 0.00910873\n##    XVI_896001_912000      XVI 896001-912000      * |       759 0.01421350\n##    XVI_912001_928000      XVI 912001-928000      * |       760 0.02439992\n##    XVI_928001_944000      XVI 928001-944000      * |       761 0.01993237\n##    XVI_944001_948066      XVI 944001-948066      * |       762        NaN\n##                        chr    center\n##                      &lt;Rle&gt; &lt;integer&gt;\n##            I_1_16000     I      8000\n##        I_16001_32000     I     24000\n##        I_32001_48000     I     40000\n##        I_48001_64000     I     56000\n##        I_64001_80000     I     72000\n##                  ...   ...       ...\n##    XVI_880001_896000   XVI    888000\n##    XVI_896001_912000   XVI    904000\n##    XVI_912001_928000   XVI    920000\n##    XVI_928001_944000   XVI    936000\n##    XVI_944001_948066   XVI    946033\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\nanchors(yeast_hic)\n##  $first\n##  GRanges object with 267709 ranges and 4 metadata columns:\n##             seqnames        ranges strand |    bin_id    weight   chr\n##                &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##         [1]        I       1-16000      * |         0 0.0196442     I\n##         [2]        I       1-16000      * |         0 0.0196442     I\n##         [3]        I       1-16000      * |         0 0.0196442     I\n##         [4]        I       1-16000      * |         0 0.0196442     I\n##         [5]        I       1-16000      * |         0 0.0196442     I\n##         ...      ...           ...    ... .       ...       ...   ...\n##    [267705]      XVI 896001-912000      * |       759 0.0142135   XVI\n##    [267706]      XVI 896001-912000      * |       759 0.0142135   XVI\n##    [267707]      XVI 912001-928000      * |       760 0.0243999   XVI\n##    [267708]      XVI 912001-928000      * |       760 0.0243999   XVI\n##    [267709]      XVI 928001-944000      * |       761 0.0199324   XVI\n##                center\n##             &lt;integer&gt;\n##         [1]      8000\n##         [2]      8000\n##         [3]      8000\n##         [4]      8000\n##         [5]      8000\n##         ...       ...\n##    [267705]    904000\n##    [267706]    904000\n##    [267707]    920000\n##    [267708]    920000\n##    [267709]    936000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n##  \n##  $second\n##  GRanges object with 267709 ranges and 4 metadata columns:\n##             seqnames        ranges strand |    bin_id    weight   chr\n##                &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##         [1]        I       1-16000      * |         0 0.0196442     I\n##         [2]        I   16001-32000      * |         1 0.0220746     I\n##         [3]        I   32001-48000      * |         2 0.0188701     I\n##         [4]        I   48001-64000      * |         3 0.0136679     I\n##         [5]        I   64001-80000      * |         4 0.0134860     I\n##         ...      ...           ...    ... .       ...       ...   ...\n##    [267705]      XVI 912001-928000      * |       760 0.0243999   XVI\n##    [267706]      XVI 928001-944000      * |       761 0.0199324   XVI\n##    [267707]      XVI 912001-928000      * |       760 0.0243999   XVI\n##    [267708]      XVI 928001-944000      * |       761 0.0199324   XVI\n##    [267709]      XVI 928001-944000      * |       761 0.0199324   XVI\n##                center\n##             &lt;integer&gt;\n##         [1]      8000\n##         [2]     24000\n##         [3]     40000\n##         [4]     56000\n##         [5]     72000\n##         ...       ...\n##    [267705]    920000\n##    [267706]    936000\n##    [267707]    920000\n##    [267708]    936000\n##    [267709]    936000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n2.4.2.2 Bins and seqinfo\nAdditional useful information can be recovered from a HiCExperiment object. This includes:\n\nThe seqinfo of the HiCExperiment:\n\n\nseqinfo(yeast_hic)\n##  Seqinfo object with 16 sequences from an unspecified genome:\n##    seqnames seqlengths isCircular genome\n##    I            230218       &lt;NA&gt;   &lt;NA&gt;\n##    II           813184       &lt;NA&gt;   &lt;NA&gt;\n##    III          316620       &lt;NA&gt;   &lt;NA&gt;\n##    IV          1531933       &lt;NA&gt;   &lt;NA&gt;\n##    V            576874       &lt;NA&gt;   &lt;NA&gt;\n##    ...             ...        ...    ...\n##    XII         1078177       &lt;NA&gt;   &lt;NA&gt;\n##    XIII         924431       &lt;NA&gt;   &lt;NA&gt;\n##    XIV          784333       &lt;NA&gt;   &lt;NA&gt;\n##    XV          1091291       &lt;NA&gt;   &lt;NA&gt;\n##    XVI          948066       &lt;NA&gt;   &lt;NA&gt;\n\nThis lists the different chromosomes available to parse along with their length.\n\nThe bins of the HiCExperiment:\n\n\nbins(yeast_hic)\n##  GRanges object with 763 ranges and 2 metadata columns:\n##                      seqnames        ranges strand |    bin_id     weight\n##                         &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt;  &lt;numeric&gt;\n##            I_1_16000        I       1-16000      * |         0  0.0196442\n##        I_16001_32000        I   16001-32000      * |         1  0.0220746\n##        I_32001_48000        I   32001-48000      * |         2  0.0188701\n##        I_48001_64000        I   48001-64000      * |         3  0.0136679\n##        I_64001_80000        I   64001-80000      * |         4  0.0134860\n##                  ...      ...           ...    ... .       ...        ...\n##    XVI_880001_896000      XVI 880001-896000      * |       758 0.00910873\n##    XVI_896001_912000      XVI 896001-912000      * |       759 0.01421350\n##    XVI_912001_928000      XVI 912001-928000      * |       760 0.02439992\n##    XVI_928001_944000      XVI 928001-944000      * |       761 0.01993237\n##    XVI_944001_948066      XVI 944001-948066      * |       762        NaN\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n\n\n\n\n\nDifference between bins and regions\n\n\n\nbins are not equivalent to regions of an HiCExperiment.\n\n\nbins refer to all the possible regions of a HiCExperiment. For instance, for a HiCExperiment with a total genome size of 1,000,000 and a resolution of 2000, bins will always return a GRanges object with 500 ranges.\n\nregions, on the opposite, refer to the union of anchors of all the interactions imported in a HiCExperiment object.\n\nThus, all the regions will necessarily be a subset of the HiCExperiment bins, or equal to bins if no focus has been specified when importing a ContactFile.\n\n\n\n2.4.2.3 Scores\nOf course, what the end-user would be looking for is the frequency for each genomic interaction. Such frequency scores are available using the scores function. scores returns a list with a number of different types of scores.\n\nhead(scores(yeast_hic))\n##  List of length 2\n##  names(2): count balanced\n\nhead(scores(yeast_hic, \"count\"))\n##  [1] 2836 2212 1183  831  310  159\n\nhead(scores(yeast_hic, \"balanced\"))\n##  [1] 1.09439586 0.95920688 0.43852417 0.22311917 0.08212549 0.03345221\n\n\n\n\n\n\n\nTip\n\n\n\nCalling interactions(hic) returns a GInteractions with scores already stored in extra columns. This short-hand allows one to dynamically check scores directly from the interactions output.\n\ninteractions(yeast_hic)\n##  GInteractions object with 267709 interactions and 4 metadata columns:\n##             seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##                 &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##         [1]         I       1-16000 ---         I       1-16000 |         0\n##         [2]         I       1-16000 ---         I   16001-32000 |         0\n##         [3]         I       1-16000 ---         I   32001-48000 |         0\n##         [4]         I       1-16000 ---         I   48001-64000 |         0\n##         [5]         I       1-16000 ---         I   64001-80000 |         0\n##         ...       ...           ... ...       ...           ... .       ...\n##    [267705]       XVI 896001-912000 ---       XVI 912001-928000 |       759\n##    [267706]       XVI 896001-912000 ---       XVI 928001-944000 |       759\n##    [267707]       XVI 912001-928000 ---       XVI 912001-928000 |       760\n##    [267708]       XVI 912001-928000 ---       XVI 928001-944000 |       760\n##    [267709]       XVI 928001-944000 ---       XVI 928001-944000 |       761\n##               bin_id2     count  balanced\n##             &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;\n##         [1]         0      2836 1.0943959\n##         [2]         1      2212 0.9592069\n##         [3]         2      1183 0.4385242\n##         [4]         3       831 0.2231192\n##         [5]         4       310 0.0821255\n##         ...       ...       ...       ...\n##    [267705]       760      3565  1.236371\n##    [267706]       761      1359  0.385016\n##    [267707]       760      3534  2.103988\n##    [267708]       761      3055  1.485794\n##    [267709]       761      4308  1.711565\n##    -------\n##    regions: 763 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome\n\nhead(interactions(yeast_hic)$count)\n##  [1] 2836 2212 1183  831  310  159\n\n\n\n\n2.4.2.4 topologicalFeatures\nIn Hi-C studies, “topological features” refer to genomic structures identified (usually from a Hi-C map, but not necessarily). For instance, one may want to study known structural loops anchored at CTCF sites, or interactions around or over centromeres, or simply specific genomic “viewpoints”.\nHiCExperiment objects can store topologicalFeatures to facilitate this analysis. By default, four empty topologicalFeatures are stored in a list:\n\ncompartments\nborders\nloops\nviewpoints\n\nAdditional topologicalFeatures can be added to this list (read next chapter for more detail).\n\ntopologicalFeatures(yeast_hic)\n##  List of length 5\n##  names(5): compartments borders loops viewpoints centromeres\n\ntopologicalFeatures(yeast_hic, 'centromeres')\n##  GRanges object with 16 ranges and 0 metadata columns:\n##         seqnames        ranges strand\n##            &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt;\n##     [1]        I 151583-151641      +\n##     [2]       II 238361-238419      +\n##     [3]      III 114322-114380      +\n##     [4]       IV 449879-449937      +\n##     [5]        V 152522-152580      +\n##     ...      ...           ...    ...\n##    [12]      XII 151366-151424      +\n##    [13]     XIII 268222-268280      +\n##    [14]      XIV 628588-628646      +\n##    [15]       XV 326897-326955      +\n##    [16]      XVI 556255-556313      +\n##    -------\n##    seqinfo: 17 sequences (1 circular) from R64-1-1 genome\n\n\n2.4.2.5 pairsFile\nAs a contact matrix is typically obtained from binning a .pairs file, it is often the case that the matching .pairs file is available to then end-user. A PairsFile can thus be created and associated to the corresponding HiCExperiment object. This allows more accurate estimation of contact distribution, e.g. when calculating distance-dependent genomic interaction frequency.\n\npairsFile(yeast_hic) &lt;- pairsf\n\npairsFile(yeast_hic)\n##                                                   EH7703 \n##  \"/github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753\"\n\nreadLines(pairsFile(yeast_hic), 25)\n##   [1] \"## pairs format v1.0\"                                                             \n##   [2] \"#sorted: chr1-pos1-chr2-pos2\"                                                     \n##   [3] \"#columns: readID chr1 pos1 chr2 pos2 strand1 strand2 frag1 frag2\"                 \n##   [4] \"#chromsize: I 230218\"                                                             \n##   [5] \"#chromsize: II 813184\"                                                            \n##   [6] \"#chromsize: III 316620\"                                                           \n##   [7] \"#chromsize: IV 1531933\"                                                           \n##   [8] \"#chromsize: V 576874\"                                                             \n##   [9] \"#chromsize: VI 270161\"                                                            \n##  [10] \"#chromsize: VII 1090940\"                                                          \n##  [11] \"#chromsize: VIII 562643\"                                                          \n##  [12] \"#chromsize: IX 439888\"                                                            \n##  [13] \"#chromsize: X 745751\"                                                             \n##  [14] \"#chromsize: XI 666816\"                                                            \n##  [15] \"#chromsize: XII 1078177\"                                                          \n##  [16] \"#chromsize: XIII 924431\"                                                          \n##  [17] \"#chromsize: XIV 784333\"                                                           \n##  [18] \"#chromsize: XV 1091291\"                                                           \n##  [19] \"#chromsize: XVI 948066\"                                                           \n##  [20] \"#chromsize: Mito 85779\"                                                           \n##  [21] \"NS500150:527:HHGYNBGXF:3:21611:19085:3986\\tII\\t105\\tII\\t48548\\t+\\t-\\t1358\\t1681\"  \n##  [22] \"NS500150:527:HHGYNBGXF:4:13604:19734:2406\\tII\\t113\\tII\\t45003\\t-\\t+\\t1358\\t1658\"  \n##  [23] \"NS500150:527:HHGYNBGXF:2:11108:25178:11036\\tII\\t119\\tII\\t687251\\t-\\t+\\t1358\\t5550\"\n##  [24] \"NS500150:527:HHGYNBGXF:1:22301:8468:1586\\tII\\t160\\tII\\t26124\\t+\\t-\\t1358\\t1510\"   \n##  [25] \"NS500150:527:HHGYNBGXF:4:23606:24037:2076\\tII\\t169\\tII\\t39052\\t+\\t+\\t1358\\t1613\"\n\n\n\n\n\n\n\nImporting a PairsFile\n\n\n\nThe .pairs file linked to a HiCExperiment object can itself be imported in a GInteractions object:\n\nimport(pairsFile(yeast_hic), format = 'pairs')\n##  GInteractions object with 471364 interactions and 3 metadata columns:\n##             seqnames1   ranges1     seqnames2   ranges2 |     frag1     frag2\n##                 &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt; | &lt;numeric&gt; &lt;numeric&gt;\n##         [1]        II       105 ---        II     48548 |      1358      1681\n##         [2]        II       113 ---        II     45003 |      1358      1658\n##         [3]        II       119 ---        II    687251 |      1358      5550\n##         [4]        II       160 ---        II     26124 |      1358      1510\n##         [5]        II       169 ---        II     39052 |      1358      1613\n##         ...       ...       ... ...       ...       ... .       ...       ...\n##    [471360]        II    808605 ---        II    809683 |      6316      6320\n##    [471361]        II    808609 ---        II    809917 |      6316      6324\n##    [471362]        II    808617 ---        II    809506 |      6316      6319\n##    [471363]        II    809447 ---        II    809685 |      6319      6321\n##    [471364]        II    809472 ---        II    809675 |      6319      6320\n##              distance\n##             &lt;integer&gt;\n##         [1]     48443\n##         [2]     44890\n##         [3]    687132\n##         [4]     25964\n##         [5]     38883\n##         ...       ...\n##    [471360]      1078\n##    [471361]      1308\n##    [471362]       889\n##    [471363]       238\n##    [471364]       203\n##    -------\n##    regions: 549331 ranges and 0 metadata columns\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\nNote that these GInteractions are not binned, contrary to interactions extracted from a HiCExperiment. Anchors of the interactions listed in the GInteractions imported from a disk-stored .pairs file are all of width 1."
+    "text": "2.4 HiCExperiment class\nBased on the previous sections, we have different Bioconductor classes relevant for Hi-C:\n\n\nGInteractions which can be used to represent genomic interactions in R\n\nContactFiles which can be used to establish a connection with disk-stored Hi-C files\n\nHiCExperiment objects are created when parsing a ContactFile in R. The HiCExperiment class reads a ContactFile in memory and store genomic interactions as GInteractions. The HiCExperiment class is, quite obviously, defined in the HiCExperiment package.\n\n2.4.1 Creating a HiCExperiment object\n\n2.4.1.1 Importing a ContactFile\n\nIn practice, to create a HiCExperiment object from a ContactFile, one can use the import method.\n\n\n\n\n\n\nCaution\n\n\n\n\nCreating a HiCExperiment object means importing data from a Hi-C matrix (e.g.  from a ContactFile) in memory in R.\n\nCreating a HiCExperiment object from large disk-stored contact matrices can potentially take a long time.\n\n\n\n\ncf &lt;- CoolFile(coolf)\nhic &lt;- import(cf)\nhic\n##  `HiCExperiment` object with 8,757,906 contacts over 12,079 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"whole genome\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 2945692 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\nPrinting a HiCExperiment to the console will not reveal the actual data stored in the object (it would most likely crash your R session!). Instead, it gives a summary of the data stored in the object:\n\nThe fileName, i.e. the path to the disk-stored data file\nThe focus, i.e. the genomic location for which data has been imported (in the example above, \"whole genome\" implies that all the data has been imported in R)\n\nresolutions available in the disk-stored data file (this will be identical to availableResolutions(cf))\n\nactive resolution indicates at which resolution the data is currently imported\n\ninteractions refers to the actual GInteractions imported in R and “hidden” (for now!) in the HiCExperiment object\n\nscores refer to different interaction frequency estimates. These can be raw counts, balanced (if the contact matrix has been previously normalized), or whatever score the end-user want to attribute to each interaction (e.g. ratio of counts between two Hi-C maps, …)\n\ntopologicalFeatures is a list of GRanges or GInteractions objects to describe important topological features.\n\npairsFile is a pointer to an optional disk-stored .pairs file from which the contact matrix has been created. This is often useful to estimate some Hi-C metrics.\n\nmetadata is a list to further describe the experiment.\n\n\n\n\n\n\n\nHiCExperiment slots\n\n\n\nThese pieces of information are called slots. They can be directly accessed using getter functions, bearing the same name than the slot.\n\nfileName(hic)\n##  [1] \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\"\n\nfocus(hic)\n##  NULL\n\nresolutions(hic)\n##  [1]  1000  2000  4000  8000 16000\n\nresolution(hic)\n##  [1] 1000\n\ninteractions(hic)\n##  GInteractions object with 2945692 interactions and 4 metadata columns:\n##              seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##                  &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##          [1]         I        1-1000 ---         I        1-1000 |         0\n##          [2]         I        1-1000 ---         I     1001-2000 |         0\n##          [3]         I        1-1000 ---         I     2001-3000 |         0\n##          [4]         I        1-1000 ---         I     3001-4000 |         0\n##          [5]         I        1-1000 ---         I     4001-5000 |         0\n##          ...       ...           ... ...       ...           ... .       ...\n##    [2945688]       XVI 940001-941000 ---       XVI 942001-943000 |     12070\n##    [2945689]       XVI 940001-941000 ---       XVI 943001-944000 |     12070\n##    [2945690]       XVI 941001-942000 ---       XVI 941001-942000 |     12071\n##    [2945691]       XVI 941001-942000 ---       XVI 942001-943000 |     12071\n##    [2945692]       XVI 941001-942000 ---       XVI 943001-944000 |     12071\n##                bin_id2     count  balanced\n##              &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;\n##          [1]         0        15 0.0663491\n##          [2]         1        21 0.1273505\n##          [3]         2        21 0.0738691\n##          [4]         3        38 0.0827051\n##          [5]         4        17 0.0591984\n##          ...       ...       ...       ...\n##    [2945688]     12072        11 0.0575550\n##    [2945689]     12073         1       NaN\n##    [2945690]     12071        74 0.0504615\n##    [2945691]     12072        39 0.1624599\n##    [2945692]     12073         1       NaN\n##    -------\n##    regions: 12079 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome\n\nscores(hic)\n##  List of length 2\n##  names(2): count balanced\n\ntopologicalFeatures(hic)\n##  List of length 4\n##  names(4): compartments borders loops viewpoints\n\npairsFile(hic)\n##  NULL\n\nmetadata(hic)\n##  list()\n\n\n\n\n\n\n\n\n\nNotes\n\n\n\nimport also works for other types of ContactFile (HicFile, HicproFile, PairsFile), e.g. \n\nFor HicFile and HicproFile, import seamlessly returns a HiCExperiment as well:\n\n\nhf &lt;- HicFile(hicf)\nhic &lt;- import(hf)\nhic\n##  `HiCExperiment` object with 13,681,280 contacts over 12,165 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a270f71fe_7836\" \n##  focus: \"whole genome\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 2965693 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nFor PairsFile, the returned object is a representation of Hi-C “pairs” in R, i.e. GInteractions\n\n\n\npf &lt;- PairsFile(pairsf)\npairs &lt;- import(pf)\npairs\n##  GInteractions object with 471364 interactions and 3 metadata columns:\n##             seqnames1   ranges1     seqnames2   ranges2 |     frag1     frag2\n##                 &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt; | &lt;numeric&gt; &lt;numeric&gt;\n##         [1]        II       105 ---        II     48548 |      1358      1681\n##         [2]        II       113 ---        II     45003 |      1358      1658\n##         [3]        II       119 ---        II    687251 |      1358      5550\n##         [4]        II       160 ---        II     26124 |      1358      1510\n##         [5]        II       169 ---        II     39052 |      1358      1613\n##         ...       ...       ... ...       ...       ... .       ...       ...\n##    [471360]        II    808605 ---        II    809683 |      6316      6320\n##    [471361]        II    808609 ---        II    809917 |      6316      6324\n##    [471362]        II    808617 ---        II    809506 |      6316      6319\n##    [471363]        II    809447 ---        II    809685 |      6319      6321\n##    [471364]        II    809472 ---        II    809675 |      6319      6320\n##              distance\n##             &lt;integer&gt;\n##         [1]     48443\n##         [2]     44890\n##         [3]    687132\n##         [4]     25964\n##         [5]     38883\n##         ...       ...\n##    [471360]      1078\n##    [471361]      1308\n##    [471362]       889\n##    [471363]       238\n##    [471364]       203\n##    -------\n##    regions: 549331 ranges and 0 metadata columns\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\n\n\n\n2.4.1.2 Customizing the import\n\nTo reduce the import to only parse the data that is relevant to the study, two arguments can be passed to import, along with a ContactFile.\n\n\n\n\n\n\nKey import arguments:\n\n\n\n\n\nfocus: This can be used to only parse data for a specific genomic location.\n\nresolution: This can be used to choose which resolution to parse the contact matrix at (this is ignored if the ContactFile is not multi-resolution, e.g. .cool or HiC-Pro generated matrices)\n\n\n\n\nImport interactions within a single chromosome:\n\n\nhic &lt;- import(cf, focus = 'II', resolution = 2000)\n\nregions(hic) # ---- `regions()` work on `HiCExperiment` the same way than on `GInteractions`\n##  GRanges object with 407 ranges and 4 metadata columns:\n##                     seqnames        ranges strand |    bin_id    weight   chr\n##                        &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##           II_1_2000       II        1-2000      * |       116       NaN    II\n##        II_2001_4000       II     2001-4000      * |       117       NaN    II\n##        II_4001_6000       II     4001-6000      * |       118       NaN    II\n##        II_6001_8000       II     6001-8000      * |       119       NaN    II\n##       II_8001_10000       II    8001-10000      * |       120 0.0461112    II\n##                 ...      ...           ...    ... .       ...       ...   ...\n##    II_804001_806000       II 804001-806000      * |       518 0.0493107    II\n##    II_806001_808000       II 806001-808000      * |       519 0.0611355    II\n##    II_808001_810000       II 808001-810000      * |       520       NaN    II\n##    II_810001_812000       II 810001-812000      * |       521       NaN    II\n##    II_812001_813184       II 812001-813184      * |       522       NaN    II\n##                        center\n##                     &lt;integer&gt;\n##           II_1_2000      1000\n##        II_2001_4000      3000\n##        II_4001_6000      5000\n##        II_6001_8000      7000\n##       II_8001_10000      9000\n##                 ...       ...\n##    II_804001_806000    805000\n##    II_806001_808000    807000\n##    II_808001_810000    809000\n##    II_810001_812000    811000\n##    II_812001_813184    812592\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\ntable(seqnames(regions(hic)))\n##  \n##     I   II  III   IV    V   VI  VII VIII   IX    X   XI  XII XIII  XIV   XV \n##     0  407    0    0    0    0    0    0    0    0    0    0    0    0    0 \n##   XVI \n##     0\n\nanchors(hic) # ---- `anchors()` work on `HiCExperiment` the same way than on `GInteractions`\n##  $first\n##  GRanges object with 34063 ranges and 4 metadata columns:\n##            seqnames        ranges strand |    bin_id    weight   chr\n##               &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##        [1]       II        1-2000      * |       116       NaN    II\n##        [2]       II        1-2000      * |       116       NaN    II\n##        [3]       II        1-2000      * |       116       NaN    II\n##        [4]       II        1-2000      * |       116       NaN    II\n##        [5]       II        1-2000      * |       116       NaN    II\n##        ...      ...           ...    ... .       ...       ...   ...\n##    [34059]       II 804001-806000      * |       518 0.0493107    II\n##    [34060]       II 806001-808000      * |       519 0.0611355    II\n##    [34061]       II 806001-808000      * |       519 0.0611355    II\n##    [34062]       II 806001-808000      * |       519 0.0611355    II\n##    [34063]       II 808001-810000      * |       520       NaN    II\n##               center\n##            &lt;integer&gt;\n##        [1]      1000\n##        [2]      1000\n##        [3]      1000\n##        [4]      1000\n##        [5]      1000\n##        ...       ...\n##    [34059]    805000\n##    [34060]    807000\n##    [34061]    807000\n##    [34062]    807000\n##    [34063]    809000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n##  \n##  $second\n##  GRanges object with 34063 ranges and 4 metadata columns:\n##            seqnames        ranges strand |    bin_id    weight   chr\n##               &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##        [1]       II        1-2000      * |       116       NaN    II\n##        [2]       II     4001-6000      * |       118       NaN    II\n##        [3]       II     6001-8000      * |       119       NaN    II\n##        [4]       II    8001-10000      * |       120 0.0461112    II\n##        [5]       II   10001-12000      * |       121 0.0334807    II\n##        ...      ...           ...    ... .       ...       ...   ...\n##    [34059]       II 810001-812000      * |       521       NaN    II\n##    [34060]       II 806001-808000      * |       519 0.0611355    II\n##    [34061]       II 808001-810000      * |       520       NaN    II\n##    [34062]       II 810001-812000      * |       521       NaN    II\n##    [34063]       II 808001-810000      * |       520       NaN    II\n##               center\n##            &lt;integer&gt;\n##        [1]      1000\n##        [2]      5000\n##        [3]      7000\n##        [4]      9000\n##        [5]     11000\n##        ...       ...\n##    [34059]    811000\n##    [34060]    807000\n##    [34061]    809000\n##    [34062]    811000\n##    [34063]    809000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\nImport interactions within a segment of a chromosome:\n\n\nhic &lt;- import(cf, focus = 'II:40000-60000', resolution = 1000)\n\nregions(hic) \n##  GRanges object with 21 ranges and 4 metadata columns:\n##                   seqnames      ranges strand |    bin_id    weight   chr\n##                      &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##    II_39001_40000       II 39001-40000      * |       270 0.0220798    II\n##    II_40001_41000       II 40001-41000      * |       271 0.0246775    II\n##    II_41001_42000       II 41001-42000      * |       272 0.0269232    II\n##    II_42001_43000       II 42001-43000      * |       273 0.0341849    II\n##    II_43001_44000       II 43001-44000      * |       274 0.0265386    II\n##               ...      ...         ...    ... .       ...       ...   ...\n##    II_55001_56000       II 55001-56000      * |       286 0.0213532    II\n##    II_56001_57000       II 56001-57000      * |       287 0.0569839    II\n##    II_57001_58000       II 57001-58000      * |       288 0.0338612    II\n##    II_58001_59000       II 58001-59000      * |       289 0.0294531    II\n##    II_59001_60000       II 59001-60000      * |       290 0.0306662    II\n##                      center\n##                   &lt;integer&gt;\n##    II_39001_40000     39500\n##    II_40001_41000     40500\n##    II_41001_42000     41500\n##    II_42001_43000     42500\n##    II_43001_44000     43500\n##               ...       ...\n##    II_55001_56000     55500\n##    II_56001_57000     56500\n##    II_57001_58000     57500\n##    II_58001_59000     58500\n##    II_59001_60000     59500\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\nanchors(hic)\n##  $first\n##  GRanges object with 210 ranges and 4 metadata columns:\n##          seqnames      ranges strand |    bin_id    weight   chr    center\n##             &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt; &lt;integer&gt;\n##      [1]       II 40001-41000      * |       271 0.0246775    II     40500\n##      [2]       II 40001-41000      * |       271 0.0246775    II     40500\n##      [3]       II 40001-41000      * |       271 0.0246775    II     40500\n##      [4]       II 40001-41000      * |       271 0.0246775    II     40500\n##      [5]       II 40001-41000      * |       271 0.0246775    II     40500\n##      ...      ...         ...    ... .       ...       ...   ...       ...\n##    [206]       II 57001-58000      * |       288 0.0338612    II     57500\n##    [207]       II 57001-58000      * |       288 0.0338612    II     57500\n##    [208]       II 58001-59000      * |       289 0.0294531    II     58500\n##    [209]       II 58001-59000      * |       289 0.0294531    II     58500\n##    [210]       II 59001-60000      * |       290 0.0306662    II     59500\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n##  \n##  $second\n##  GRanges object with 210 ranges and 4 metadata columns:\n##          seqnames      ranges strand |    bin_id    weight   chr    center\n##             &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt; &lt;integer&gt;\n##      [1]       II 40001-41000      * |       271 0.0246775    II     40500\n##      [2]       II 41001-42000      * |       272 0.0269232    II     41500\n##      [3]       II 42001-43000      * |       273 0.0341849    II     42500\n##      [4]       II 43001-44000      * |       274 0.0265386    II     43500\n##      [5]       II 44001-45000      * |       275 0.0488968    II     44500\n##      ...      ...         ...    ... .       ...       ...   ...       ...\n##    [206]       II 58001-59000      * |       289 0.0294531    II     58500\n##    [207]       II 59001-60000      * |       290 0.0306662    II     59500\n##    [208]       II 58001-59000      * |       289 0.0294531    II     58500\n##    [209]       II 59001-60000      * |       290 0.0306662    II     59500\n##    [210]       II 59001-60000      * |       290 0.0306662    II     59500\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\nImport interactions between two chromosomes:\n\n\nhic2 &lt;- import(cf, focus = 'II|XV', resolution = 4000)\n\nregions(hic2)\n##  GRanges object with 477 ranges and 4 metadata columns:\n##                       seqnames          ranges strand |    bin_id    weight\n##                          &lt;Rle&gt;       &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt;\n##             II_1_4000       II          1-4000      * |        58       NaN\n##          II_4001_8000       II       4001-8000      * |        59       NaN\n##         II_8001_12000       II      8001-12000      * |        60 0.0274474\n##        II_12001_16000       II     12001-16000      * |        61 0.0342116\n##        II_16001_20000       II     16001-20000      * |        62 0.0195128\n##                   ...      ...             ...    ... .       ...       ...\n##    XV_1072001_1076000       XV 1072001-1076000      * |      2783  0.041763\n##    XV_1076001_1080000       XV 1076001-1080000      * |      2784       NaN\n##    XV_1080001_1084000       XV 1080001-1084000      * |      2785       NaN\n##    XV_1084001_1088000       XV 1084001-1088000      * |      2786       NaN\n##    XV_1088001_1091291       XV 1088001-1091291      * |      2787       NaN\n##                         chr    center\n##                       &lt;Rle&gt; &lt;integer&gt;\n##             II_1_4000    II      2000\n##          II_4001_8000    II      6000\n##         II_8001_12000    II     10000\n##        II_12001_16000    II     14000\n##        II_16001_20000    II     18000\n##                   ...   ...       ...\n##    XV_1072001_1076000    XV   1074000\n##    XV_1076001_1080000    XV   1078000\n##    XV_1080001_1084000    XV   1082000\n##    XV_1084001_1088000    XV   1086000\n##    XV_1088001_1091291    XV   1089646\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\nanchors(hic2)\n##  $first\n##  GRanges object with 18032 ranges and 4 metadata columns:\n##            seqnames        ranges strand |    bin_id    weight   chr\n##               &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##        [1]       II        1-4000      * |        58       NaN    II\n##        [2]       II        1-4000      * |        58       NaN    II\n##        [3]       II        1-4000      * |        58       NaN    II\n##        [4]       II        1-4000      * |        58       NaN    II\n##        [5]       II        1-4000      * |        58       NaN    II\n##        ...      ...           ...    ... .       ...       ...   ...\n##    [18028]       II 808001-812000      * |       260       NaN    II\n##    [18029]       II 808001-812000      * |       260       NaN    II\n##    [18030]       II 808001-812000      * |       260       NaN    II\n##    [18031]       II 808001-812000      * |       260       NaN    II\n##    [18032]       II 808001-812000      * |       260       NaN    II\n##               center\n##            &lt;integer&gt;\n##        [1]      2000\n##        [2]      2000\n##        [3]      2000\n##        [4]      2000\n##        [5]      2000\n##        ...       ...\n##    [18028]    810000\n##    [18029]    810000\n##    [18030]    810000\n##    [18031]    810000\n##    [18032]    810000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n##  \n##  $second\n##  GRanges object with 18032 ranges and 4 metadata columns:\n##            seqnames          ranges strand |    bin_id    weight   chr\n##               &lt;Rle&gt;       &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##        [1]       XV     48001-52000      * |      2527 0.0185354    XV\n##        [2]       XV   348001-352000      * |      2602 0.0233750    XV\n##        [3]       XV   468001-472000      * |      2632 0.0153615    XV\n##        [4]       XV   472001-476000      * |      2633 0.0189624    XV\n##        [5]       XV   584001-588000      * |      2661 0.0167715    XV\n##        ...      ...             ...    ... .       ...       ...   ...\n##    [18028]       XV   980001-984000      * |      2760 0.0187827    XV\n##    [18029]       XV   984001-988000      * |      2761 0.0250094    XV\n##    [18030]       XV   992001-996000      * |      2763 0.0185599    XV\n##    [18031]       XV 1004001-1008000      * |      2766 0.0196942    XV\n##    [18032]       XV 1064001-1068000      * |      2781 0.0208220    XV\n##               center\n##            &lt;integer&gt;\n##        [1]     50000\n##        [2]    350000\n##        [3]    470000\n##        [4]    474000\n##        [5]    586000\n##        ...       ...\n##    [18028]    982000\n##    [18029]    986000\n##    [18030]    994000\n##    [18031]   1006000\n##    [18032]   1066000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\nImport interactions between segments of two chromosomes:\n\n\nhic3 &lt;- import(cf, focus = 'III:10000-40000|XV:10000-40000', resolution = 2000)\n\nregions(hic3)\n##  GRanges object with 32 ranges and 4 metadata columns:\n##                    seqnames      ranges strand |    bin_id    weight   chr\n##                       &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##     III_8001_10000      III  8001-10000      * |       527       NaN   III\n##    III_10001_12000      III 10001-12000      * |       528       NaN   III\n##    III_12001_14000      III 12001-14000      * |       529       NaN   III\n##    III_14001_16000      III 14001-16000      * |       530 0.0356351   III\n##    III_16001_18000      III 16001-18000      * |       531 0.0230693   III\n##                ...      ...         ...    ... .       ...       ...   ...\n##     XV_30001_32000       XV 30001-32000      * |      5039 0.0482465    XV\n##     XV_32001_34000       XV 32001-34000      * |      5040 0.0241580    XV\n##     XV_34001_36000       XV 34001-36000      * |      5041 0.0273166    XV\n##     XV_36001_38000       XV 36001-38000      * |      5042 0.0542235    XV\n##     XV_38001_40000       XV 38001-40000      * |      5043 0.0206849    XV\n##                       center\n##                    &lt;integer&gt;\n##     III_8001_10000      9000\n##    III_10001_12000     11000\n##    III_12001_14000     13000\n##    III_14001_16000     15000\n##    III_16001_18000     17000\n##                ...       ...\n##     XV_30001_32000     31000\n##     XV_32001_34000     33000\n##     XV_34001_36000     35000\n##     XV_36001_38000     37000\n##     XV_38001_40000     39000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\nanchors(hic3)\n##  $first\n##  GRanges object with 11 ranges and 4 metadata columns:\n##         seqnames      ranges strand |    bin_id    weight   chr    center\n##            &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt; &lt;integer&gt;\n##     [1]      III 14001-16000      * |       530 0.0356351   III     15000\n##     [2]      III 16001-18000      * |       531 0.0230693   III     17000\n##     [3]      III 16001-18000      * |       531 0.0230693   III     17000\n##     [4]      III 20001-22000      * |       533 0.0343250   III     21000\n##     [5]      III 22001-24000      * |       534 0.0258604   III     23000\n##     [6]      III 24001-26000      * |       535 0.0290757   III     25000\n##     [7]      III 28001-30000      * |       537 0.0290713   III     29000\n##     [8]      III 30001-32000      * |       538 0.0266373   III     31000\n##     [9]      III 32001-34000      * |       539 0.0201137   III     33000\n##    [10]      III 32001-34000      * |       539 0.0201137   III     33000\n##    [11]      III 36001-38000      * |       541 0.0220603   III     37000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n##  \n##  $second\n##  GRanges object with 11 ranges and 4 metadata columns:\n##         seqnames      ranges strand |    bin_id    weight   chr    center\n##            &lt;Rle&gt;   &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt; &lt;integer&gt;\n##     [1]       XV 16001-18000      * |      5032 0.0187250    XV     17000\n##     [2]       XV 16001-18000      * |      5032 0.0187250    XV     17000\n##     [3]       XV 20001-22000      * |      5034 0.0247973    XV     21000\n##     [4]       XV 14001-16000      * |      5031 0.0379727    XV     15000\n##     [5]       XV 10001-12000      * |      5029 0.0296913    XV     11000\n##     [6]       XV 32001-34000      * |      5040 0.0241580    XV     33000\n##     [7]       XV 16001-18000      * |      5032 0.0187250    XV     17000\n##     [8]       XV 38001-40000      * |      5043 0.0206849    XV     39000\n##     [9]       XV 22001-24000      * |      5035 0.0613856    XV     23000\n##    [10]       XV 30001-32000      * |      5039 0.0482465    XV     31000\n##    [11]       XV 10001-12000      * |      5029 0.0296913    XV     11000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n2.4.2 Interacting with HiCExperiment data\n\nAn HiCExperiment object allows parsing of a disk-stored contact matrix.\nAn HiCExperiment object operates by wrapping together (1) a ContactFile (i.e. a connection to a disk-stored data file) and (2) a GInteractions generated by parsing the data file.\n\nWe will use the yeast_hic HiCExperiment object to demonstrate how to parse information from a HiCExperiment object.\n\nyeast_hic &lt;- contacts_yeast()\n\n\nyeast_hic\n##  `HiCExperiment` object with 8,757,906 contacts over 763 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"whole genome\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 16000 \n##  interactions: 267709 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753 \n##  metadata(3): ID org date\n\n\n2.4.2.1 Interactions\nThe imported genomic interactions can be directly exposed using the interactions function and are returned as a GInteractions object.\n\ninteractions(yeast_hic)\n##  GInteractions object with 267709 interactions and 4 metadata columns:\n##             seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##                 &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##         [1]         I       1-16000 ---         I       1-16000 |         0\n##         [2]         I       1-16000 ---         I   16001-32000 |         0\n##         [3]         I       1-16000 ---         I   32001-48000 |         0\n##         [4]         I       1-16000 ---         I   48001-64000 |         0\n##         [5]         I       1-16000 ---         I   64001-80000 |         0\n##         ...       ...           ... ...       ...           ... .       ...\n##    [267705]       XVI 896001-912000 ---       XVI 912001-928000 |       759\n##    [267706]       XVI 896001-912000 ---       XVI 928001-944000 |       759\n##    [267707]       XVI 912001-928000 ---       XVI 912001-928000 |       760\n##    [267708]       XVI 912001-928000 ---       XVI 928001-944000 |       760\n##    [267709]       XVI 928001-944000 ---       XVI 928001-944000 |       761\n##               bin_id2     count  balanced\n##             &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;\n##         [1]         0      2836 1.0943959\n##         [2]         1      2212 0.9592069\n##         [3]         2      1183 0.4385242\n##         [4]         3       831 0.2231192\n##         [5]         4       310 0.0821255\n##         ...       ...       ...       ...\n##    [267705]       760      3565  1.236371\n##    [267706]       761      1359  0.385016\n##    [267707]       760      3534  2.103988\n##    [267708]       761      3055  1.485794\n##    [267709]       761      4308  1.711565\n##    -------\n##    regions: 763 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n\n\n\n\n\nNote\n\n\n\nBecause genomic interactions are actually stored as GInteractions, regions and anchors work on HiCExperiment objects just as they work with GInteractions!\n\n\n\nregions(yeast_hic)\n##  GRanges object with 763 ranges and 4 metadata columns:\n##                      seqnames        ranges strand |    bin_id     weight\n##                         &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt;  &lt;numeric&gt;\n##            I_1_16000        I       1-16000      * |         0  0.0196442\n##        I_16001_32000        I   16001-32000      * |         1  0.0220746\n##        I_32001_48000        I   32001-48000      * |         2  0.0188701\n##        I_48001_64000        I   48001-64000      * |         3  0.0136679\n##        I_64001_80000        I   64001-80000      * |         4  0.0134860\n##                  ...      ...           ...    ... .       ...        ...\n##    XVI_880001_896000      XVI 880001-896000      * |       758 0.00910873\n##    XVI_896001_912000      XVI 896001-912000      * |       759 0.01421350\n##    XVI_912001_928000      XVI 912001-928000      * |       760 0.02439992\n##    XVI_928001_944000      XVI 928001-944000      * |       761 0.01993237\n##    XVI_944001_948066      XVI 944001-948066      * |       762        NaN\n##                        chr    center\n##                      &lt;Rle&gt; &lt;integer&gt;\n##            I_1_16000     I      8000\n##        I_16001_32000     I     24000\n##        I_32001_48000     I     40000\n##        I_48001_64000     I     56000\n##        I_64001_80000     I     72000\n##                  ...   ...       ...\n##    XVI_880001_896000   XVI    888000\n##    XVI_896001_912000   XVI    904000\n##    XVI_912001_928000   XVI    920000\n##    XVI_928001_944000   XVI    936000\n##    XVI_944001_948066   XVI    946033\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\nanchors(yeast_hic)\n##  $first\n##  GRanges object with 267709 ranges and 4 metadata columns:\n##             seqnames        ranges strand |    bin_id    weight   chr\n##                &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##         [1]        I       1-16000      * |         0 0.0196442     I\n##         [2]        I       1-16000      * |         0 0.0196442     I\n##         [3]        I       1-16000      * |         0 0.0196442     I\n##         [4]        I       1-16000      * |         0 0.0196442     I\n##         [5]        I       1-16000      * |         0 0.0196442     I\n##         ...      ...           ...    ... .       ...       ...   ...\n##    [267705]      XVI 896001-912000      * |       759 0.0142135   XVI\n##    [267706]      XVI 896001-912000      * |       759 0.0142135   XVI\n##    [267707]      XVI 912001-928000      * |       760 0.0243999   XVI\n##    [267708]      XVI 912001-928000      * |       760 0.0243999   XVI\n##    [267709]      XVI 928001-944000      * |       761 0.0199324   XVI\n##                center\n##             &lt;integer&gt;\n##         [1]      8000\n##         [2]      8000\n##         [3]      8000\n##         [4]      8000\n##         [5]      8000\n##         ...       ...\n##    [267705]    904000\n##    [267706]    904000\n##    [267707]    920000\n##    [267708]    920000\n##    [267709]    936000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n##  \n##  $second\n##  GRanges object with 267709 ranges and 4 metadata columns:\n##             seqnames        ranges strand |    bin_id    weight   chr\n##                &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;Rle&gt;\n##         [1]        I       1-16000      * |         0 0.0196442     I\n##         [2]        I   16001-32000      * |         1 0.0220746     I\n##         [3]        I   32001-48000      * |         2 0.0188701     I\n##         [4]        I   48001-64000      * |         3 0.0136679     I\n##         [5]        I   64001-80000      * |         4 0.0134860     I\n##         ...      ...           ...    ... .       ...       ...   ...\n##    [267705]      XVI 912001-928000      * |       760 0.0243999   XVI\n##    [267706]      XVI 928001-944000      * |       761 0.0199324   XVI\n##    [267707]      XVI 912001-928000      * |       760 0.0243999   XVI\n##    [267708]      XVI 928001-944000      * |       761 0.0199324   XVI\n##    [267709]      XVI 928001-944000      * |       761 0.0199324   XVI\n##                center\n##             &lt;integer&gt;\n##         [1]      8000\n##         [2]     24000\n##         [3]     40000\n##         [4]     56000\n##         [5]     72000\n##         ...       ...\n##    [267705]    920000\n##    [267706]    936000\n##    [267707]    920000\n##    [267708]    936000\n##    [267709]    936000\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n2.4.2.2 Bins and seqinfo\nAdditional useful information can be recovered from a HiCExperiment object. This includes:\n\nThe seqinfo of the HiCExperiment:\n\n\nseqinfo(yeast_hic)\n##  Seqinfo object with 16 sequences from an unspecified genome:\n##    seqnames seqlengths isCircular genome\n##    I            230218       &lt;NA&gt;   &lt;NA&gt;\n##    II           813184       &lt;NA&gt;   &lt;NA&gt;\n##    III          316620       &lt;NA&gt;   &lt;NA&gt;\n##    IV          1531933       &lt;NA&gt;   &lt;NA&gt;\n##    V            576874       &lt;NA&gt;   &lt;NA&gt;\n##    ...             ...        ...    ...\n##    XII         1078177       &lt;NA&gt;   &lt;NA&gt;\n##    XIII         924431       &lt;NA&gt;   &lt;NA&gt;\n##    XIV          784333       &lt;NA&gt;   &lt;NA&gt;\n##    XV          1091291       &lt;NA&gt;   &lt;NA&gt;\n##    XVI          948066       &lt;NA&gt;   &lt;NA&gt;\n\nThis lists the different chromosomes available to parse along with their length.\n\nThe bins of the HiCExperiment:\n\n\nbins(yeast_hic)\n##  GRanges object with 763 ranges and 2 metadata columns:\n##                      seqnames        ranges strand |    bin_id     weight\n##                         &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt;  &lt;numeric&gt;\n##            I_1_16000        I       1-16000      * |         0  0.0196442\n##        I_16001_32000        I   16001-32000      * |         1  0.0220746\n##        I_32001_48000        I   32001-48000      * |         2  0.0188701\n##        I_48001_64000        I   48001-64000      * |         3  0.0136679\n##        I_64001_80000        I   64001-80000      * |         4  0.0134860\n##                  ...      ...           ...    ... .       ...        ...\n##    XVI_880001_896000      XVI 880001-896000      * |       758 0.00910873\n##    XVI_896001_912000      XVI 896001-912000      * |       759 0.01421350\n##    XVI_912001_928000      XVI 912001-928000      * |       760 0.02439992\n##    XVI_928001_944000      XVI 928001-944000      * |       761 0.01993237\n##    XVI_944001_948066      XVI 944001-948066      * |       762        NaN\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n\n\n\n\n\nDifference between bins and regions\n\n\n\nbins are not equivalent to regions of an HiCExperiment.\n\n\nbins refer to all the possible regions of a HiCExperiment. For instance, for a HiCExperiment with a total genome size of 1,000,000 and a resolution of 2000, bins will always return a GRanges object with 500 ranges.\n\nregions, on the opposite, refer to the union of anchors of all the interactions imported in a HiCExperiment object.\n\nThus, all the regions will necessarily be a subset of the HiCExperiment bins, or equal to bins if no focus has been specified when importing a ContactFile.\n\n\n\n2.4.2.3 Scores\nOf course, what the end-user would be looking for is the frequency for each genomic interaction. Such frequency scores are available using the scores function. scores returns a list with a number of different types of scores.\n\nhead(scores(yeast_hic))\n##  List of length 2\n##  names(2): count balanced\n\nhead(scores(yeast_hic, \"count\"))\n##  [1] 2836 2212 1183  831  310  159\n\nhead(scores(yeast_hic, \"balanced\"))\n##  [1] 1.09439586 0.95920688 0.43852417 0.22311917 0.08212549 0.03345221\n\n\n\n\n\n\n\nTip\n\n\n\nCalling interactions(hic) returns a GInteractions with scores already stored in extra columns. This short-hand allows one to dynamically check scores directly from the interactions output.\n\ninteractions(yeast_hic)\n##  GInteractions object with 267709 interactions and 4 metadata columns:\n##             seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##                 &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##         [1]         I       1-16000 ---         I       1-16000 |         0\n##         [2]         I       1-16000 ---         I   16001-32000 |         0\n##         [3]         I       1-16000 ---         I   32001-48000 |         0\n##         [4]         I       1-16000 ---         I   48001-64000 |         0\n##         [5]         I       1-16000 ---         I   64001-80000 |         0\n##         ...       ...           ... ...       ...           ... .       ...\n##    [267705]       XVI 896001-912000 ---       XVI 912001-928000 |       759\n##    [267706]       XVI 896001-912000 ---       XVI 928001-944000 |       759\n##    [267707]       XVI 912001-928000 ---       XVI 912001-928000 |       760\n##    [267708]       XVI 912001-928000 ---       XVI 928001-944000 |       760\n##    [267709]       XVI 928001-944000 ---       XVI 928001-944000 |       761\n##               bin_id2     count  balanced\n##             &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;\n##         [1]         0      2836 1.0943959\n##         [2]         1      2212 0.9592069\n##         [3]         2      1183 0.4385242\n##         [4]         3       831 0.2231192\n##         [5]         4       310 0.0821255\n##         ...       ...       ...       ...\n##    [267705]       760      3565  1.236371\n##    [267706]       761      1359  0.385016\n##    [267707]       760      3534  2.103988\n##    [267708]       761      3055  1.485794\n##    [267709]       761      4308  1.711565\n##    -------\n##    regions: 763 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome\n\nhead(interactions(yeast_hic)$count)\n##  [1] 2836 2212 1183  831  310  159\n\n\n\n\n2.4.2.4 topologicalFeatures\nIn Hi-C studies, “topological features” refer to genomic structures identified (usually from a Hi-C map, but not necessarily). For instance, one may want to study known structural loops anchored at CTCF sites, or interactions around or over centromeres, or simply specific genomic “viewpoints”.\nHiCExperiment objects can store topologicalFeatures to facilitate this analysis. By default, four empty topologicalFeatures are stored in a list:\n\ncompartments\nborders\nloops\nviewpoints\n\nAdditional topologicalFeatures can be added to this list (read next chapter for more detail).\n\ntopologicalFeatures(yeast_hic)\n##  List of length 5\n##  names(5): compartments borders loops viewpoints centromeres\n\ntopologicalFeatures(yeast_hic, 'centromeres')\n##  GRanges object with 16 ranges and 0 metadata columns:\n##         seqnames        ranges strand\n##            &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt;\n##     [1]        I 151583-151641      +\n##     [2]       II 238361-238419      +\n##     [3]      III 114322-114380      +\n##     [4]       IV 449879-449937      +\n##     [5]        V 152522-152580      +\n##     ...      ...           ...    ...\n##    [12]      XII 151366-151424      +\n##    [13]     XIII 268222-268280      +\n##    [14]      XIV 628588-628646      +\n##    [15]       XV 326897-326955      +\n##    [16]      XVI 556255-556313      +\n##    -------\n##    seqinfo: 17 sequences (1 circular) from R64-1-1 genome\n\n\n2.4.2.5 pairsFile\nAs a contact matrix is typically obtained from binning a .pairs file, it is often the case that the matching .pairs file is available to then end-user. A PairsFile can thus be created and associated to the corresponding HiCExperiment object. This allows more accurate estimation of contact distribution, e.g. when calculating distance-dependent genomic interaction frequency.\n\npairsFile(yeast_hic) &lt;- pairsf\n\npairsFile(yeast_hic)\n##                                                  EH7703 \n##  \"/github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753\"\n\nreadLines(pairsFile(yeast_hic), 25)\n##   [1] \"## pairs format v1.0\"                                                             \n##   [2] \"#sorted: chr1-pos1-chr2-pos2\"                                                     \n##   [3] \"#columns: readID chr1 pos1 chr2 pos2 strand1 strand2 frag1 frag2\"                 \n##   [4] \"#chromsize: I 230218\"                                                             \n##   [5] \"#chromsize: II 813184\"                                                            \n##   [6] \"#chromsize: III 316620\"                                                           \n##   [7] \"#chromsize: IV 1531933\"                                                           \n##   [8] \"#chromsize: V 576874\"                                                             \n##   [9] \"#chromsize: VI 270161\"                                                            \n##  [10] \"#chromsize: VII 1090940\"                                                          \n##  [11] \"#chromsize: VIII 562643\"                                                          \n##  [12] \"#chromsize: IX 439888\"                                                            \n##  [13] \"#chromsize: X 745751\"                                                             \n##  [14] \"#chromsize: XI 666816\"                                                            \n##  [15] \"#chromsize: XII 1078177\"                                                          \n##  [16] \"#chromsize: XIII 924431\"                                                          \n##  [17] \"#chromsize: XIV 784333\"                                                           \n##  [18] \"#chromsize: XV 1091291\"                                                           \n##  [19] \"#chromsize: XVI 948066\"                                                           \n##  [20] \"#chromsize: Mito 85779\"                                                           \n##  [21] \"NS500150:527:HHGYNBGXF:3:21611:19085:3986\\tII\\t105\\tII\\t48548\\t+\\t-\\t1358\\t1681\"  \n##  [22] \"NS500150:527:HHGYNBGXF:4:13604:19734:2406\\tII\\t113\\tII\\t45003\\t-\\t+\\t1358\\t1658\"  \n##  [23] \"NS500150:527:HHGYNBGXF:2:11108:25178:11036\\tII\\t119\\tII\\t687251\\t-\\t+\\t1358\\t5550\"\n##  [24] \"NS500150:527:HHGYNBGXF:1:22301:8468:1586\\tII\\t160\\tII\\t26124\\t+\\t-\\t1358\\t1510\"   \n##  [25] \"NS500150:527:HHGYNBGXF:4:23606:24037:2076\\tII\\t169\\tII\\t39052\\t+\\t+\\t1358\\t1613\"\n\n\n\n\n\n\n\nImporting a PairsFile\n\n\n\nThe .pairs file linked to a HiCExperiment object can itself be imported in a GInteractions object:\n\nimport(pairsFile(yeast_hic), format = 'pairs')\n##  GInteractions object with 471364 interactions and 3 metadata columns:\n##             seqnames1   ranges1     seqnames2   ranges2 |     frag1     frag2\n##                 &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt; | &lt;numeric&gt; &lt;numeric&gt;\n##         [1]        II       105 ---        II     48548 |      1358      1681\n##         [2]        II       113 ---        II     45003 |      1358      1658\n##         [3]        II       119 ---        II    687251 |      1358      5550\n##         [4]        II       160 ---        II     26124 |      1358      1510\n##         [5]        II       169 ---        II     39052 |      1358      1613\n##         ...       ...       ... ...       ...       ... .       ...       ...\n##    [471360]        II    808605 ---        II    809683 |      6316      6320\n##    [471361]        II    808609 ---        II    809917 |      6316      6324\n##    [471362]        II    808617 ---        II    809506 |      6316      6319\n##    [471363]        II    809447 ---        II    809685 |      6319      6321\n##    [471364]        II    809472 ---        II    809675 |      6319      6320\n##              distance\n##             &lt;integer&gt;\n##         [1]     48443\n##         [2]     44890\n##         [3]    687132\n##         [4]     25964\n##         [5]     38883\n##         ...       ...\n##    [471360]      1078\n##    [471361]      1308\n##    [471362]       889\n##    [471363]       238\n##    [471364]       203\n##    -------\n##    regions: 549331 ranges and 0 metadata columns\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\nNote that these GInteractions are not binned, contrary to interactions extracted from a HiCExperiment. Anchors of the interactions listed in the GInteractions imported from a disk-stored .pairs file are all of width 1."
   },
   {
     "objectID": "data-representation.html#visual-summary-of-the-hicexperiment-data-structure",
@@ -123,14 +130,14 @@
     "href": "parsing.html#subsetting-a-contact-matrix",
     "title": "\n3  Manipulating Hi-C data in R\n",
     "section": "\n3.1 Subsetting a contact matrix",
-    "text": "3.1 Subsetting a contact matrix\nTwo entirely different approaches are possible to subset of a Hi-C contact matrix:\n\nSubsetting before importing: leveraging random access to a disk-stored contact matrix to only import interactions overlapping with a genomic locus of interest.\nSubsetting after importing: parsing the entire contact matrix in memory, and subsequently subset interactions overlapping with a genomic locus of interest.\n\n\n\n3.1.1 Subsetting before import: with focus\n\nSpecifying a focus when importing a dataset in R (i.e. \"Subset first, then parse\") is generally the recommended approach to import Hi-C data in R.\nThe focus argument can be set when importing a ContactFile in R, as follows:\n\nimport(cf, focus = \"...\")\n\nThis ensures that only the needed data is parsed in R, reducing memory load and accelerating the import. Thus, this should be the preferred way of parsing HiCExperiment data, as disk-stored contact matrices allow efficient random access to indexed data.\nfocus can be any of the following string types:\n\n#   \"II\"                                  --&gt; import contacts over an entire chromosome\n#   \"II:300001-800000\"                    --&gt; import on-diagonal contacts within a chromosome\n#   \"II:300001-400000|II:600001-700000\"   --&gt; import off-diagonal contacts within a chromosome\n#   \"II|III\"                              --&gt; import contacts between two chromosomes\n#   \"II:300001-800000|V:1-500000\"         --&gt; import contacts between segments of two chromosomes\n\n\n\n\n\n\n\nMore examples for import with focus argument 👇\n\n\n\n\n\n\nSubsetting to a specific on-diagonal genomic location using standard UCSC coordinates query:\n\n\nimport(cf, focus = 'II:300001-800000', resolution = 2000)\n##  `HiCExperiment` object with 301,018 contacts over 250 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:300,001-800,000\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 17974 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting to a specific off-diagonal genomic location using pairs of coordinates query:\n\n\nimport(cf, focus = 'II:300001-400000|II:600001-700000', resolution = 2000)\n##  `HiCExperiment` object with 402 contacts over 100 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:300001-400000|II:600001-700000\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 357 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those constrained within a single chromosome:\n\n\nimport(cf, focus = 'II', resolution = 2000)\n##  `HiCExperiment` object with 471,364 contacts over 407 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 34063 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those between two chromosomes:\n\n\nimport(cf, focus = 'II|III', resolution = 2000)\n##  `HiCExperiment` object with 9,092 contacts over 566 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II|III\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 7438 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those between parts of two chromosomes:\n\n\nimport(cf, focus = 'II:300001-800000|V:1-500000', resolution = 2000)\n##  `HiCExperiment` object with 7,147 contacts over 500 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:300001-800000|V:1-500000\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 6523 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n3.1.2 Subsetting after import\nIt may sometimes be desirable to import a full dataset from disk first, and only then perform in-memory subsetting of the HiCExperiment object (i.e. \"Parse first, then subset\"). This is for example necessary when the end user aims to investigate subsets of interactions across a large number of different areas of a contact matrix.\nSeveral strategies are possible to allow subsetting of imported data, either with subsetByOverlaps or [.\n\n3.1.2.1 subsetByOverlaps(&lt;HiCExperiment&gt;, &lt;GRanges&gt;)\n\nsubsetByOverlaps can take a HiCExperiment as a query and a GRanges as a query. In this case, the GRanges is used to extract a subset of a HiCExperiment constrained within a specific genomic location.\n\ntelomere &lt;- GRanges(\"II:700001-813184\")\nsubsetByOverlaps(hic, telomere) |&gt; interactions()\n##  GInteractions object with 1540 interactions and 4 metadata columns:\n##           seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##               &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##       [1]        II 700001-702000 ---        II 700001-702000 |       466\n##       [2]        II 700001-702000 ---        II 702001-704000 |       466\n##       [3]        II 700001-702000 ---        II 704001-706000 |       466\n##       [4]        II 700001-702000 ---        II 706001-708000 |       466\n##       [5]        II 700001-702000 ---        II 708001-710000 |       466\n##       ...       ...           ... ...       ...           ... .       ...\n##    [1536]        II 804001-806000 ---        II 810001-812000 |       518\n##    [1537]        II 806001-808000 ---        II 806001-808000 |       519\n##    [1538]        II 806001-808000 ---        II 808001-810000 |       519\n##    [1539]        II 806001-808000 ---        II 810001-812000 |       519\n##    [1540]        II 808001-810000 ---        II 808001-810000 |       520\n##             bin_id2     count  balanced\n##           &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;\n##       [1]       466        30 0.0283618\n##       [2]       467       145 0.0709380\n##       [3]       468       124 0.0704979\n##       [4]       469        59 0.0510221\n##       [5]       470        59 0.0384004\n##       ...       ...       ...       ...\n##    [1536]       521         1       NaN\n##    [1537]       519        15 0.0560633\n##    [1538]       520        25       NaN\n##    [1539]       521         1       NaN\n##    [1540]       520        10       NaN\n##    -------\n##    regions: 57 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n\n\n\n\n\ntype argument\n\n\n\nBy default, subsetByOverlaps(hic, telomere) will only recover interactions constrained within telomere, i.e. interactions for which both ends are in telomere.\nAlternatively, type = \"any\" can be specified to get all interactions with at least one of their anchors within telomere.\n\nsubsetByOverlaps(hic, telomere, type = \"any\") |&gt; interactions()\n##  GInteractions object with 6041 interactions and 4 metadata columns:\n##           seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##               &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##       [1]        II 300001-302000 ---        II 702001-704000 |       266\n##       [2]        II 300001-302000 ---        II 704001-706000 |       266\n##       [3]        II 300001-302000 ---        II 768001-770000 |       266\n##       [4]        II 300001-302000 ---        II 784001-786000 |       266\n##       [5]        II 302001-304000 ---        II 740001-742000 |       267\n##       ...       ...           ... ...       ...           ... .       ...\n##    [6037]        II 804001-806000 ---        II 810001-812000 |       518\n##    [6038]        II 806001-808000 ---        II 806001-808000 |       519\n##    [6039]        II 806001-808000 ---        II 808001-810000 |       519\n##    [6040]        II 806001-808000 ---        II 810001-812000 |       519\n##    [6041]        II 808001-810000 ---        II 808001-810000 |       520\n##             bin_id2     count    balanced\n##           &lt;numeric&gt; &lt;numeric&gt;   &lt;numeric&gt;\n##       [1]       467         1 0.000590999\n##       [2]       468         1 0.000686799\n##       [3]       500         1 0.000728215\n##       [4]       508         1 0.000923092\n##       [5]       486         1 0.000382222\n##       ...       ...       ...         ...\n##    [6037]       521         1         NaN\n##    [6038]       519        15   0.0560633\n##    [6039]       520        25         NaN\n##    [6040]       521         1         NaN\n##    [6041]       520        10         NaN\n##    -------\n##    regions: 257 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n\n\n3.1.2.2 &lt;HiCExperiment&gt;[\"...\"]\n\nThe square bracket operator [ allows for more advanced textual queries, similarly to focus arguments that can be used when importing contact matrices in memory.\nThis ensures that only the needed data is parsed in R, reducing memory load and accelerating the import. Thus, this should be the preferred way of parsing HiCExperiment data, as disk-stored contact matrices allow efficient random access to indexed data.\nThe following string types can be used to subset a HiCExperiment object with the [ notation:\n\n#   \"II\"                                  --&gt; import contacts over an entire chromosome\n#   \"II:300001-800000\"                    --&gt; import on-diagonal contacts within a chromosome\n#   \"II:300001-400000|II:600001-700000\"   --&gt; import off-diagonal contacts within a chromosome\n#   \"II|III\"                              --&gt; import contacts between two chromosomes\n#   \"II:300001-800000|V:1-500000\"         --&gt; import contacts between segments of two chromosomes\n#   c(\"II\", \"III\", \"IV\")                  --&gt; import contacts within and between several chromosomes\n\n\n\n\n\n\n\nMore examples for subsetting with [ 👇\n\n\n\n\n\n\nSubsetting to a specific on-diagonal genomic location using standard UCSC coordinates query:\n\n\nhic[\"II:800001-813184\"]\n##  `HiCExperiment` object with 1,040 contacts over 6 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:800,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 19 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting to a specific off-diagonal genomic location using pairs of coordinates query:\n\n\nhic[\"II:300001-320000|II:800001-813184\"]\n##  `HiCExperiment` object with 3 contacts over 6 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:300001-320000|II:800001-813184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 3 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those constrained within a single chromosome:\n\n\nhic[\"II\"]\n##  `HiCExperiment` object with 306,212 contacts over 257 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 18513 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those between two chromosomes:\n\n\nhic[\"II|IV\"]\n##  `HiCExperiment` object with 0 contacts over 0 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:1-813184|IV:1-1531933\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 0 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those between segments of two chromosomes:\n\n\nhic[\"II:300001-320000|IV:1-100000\"]\n##  `HiCExperiment` object with 0 contacts over 0 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:300001-320000|IV:1-100000\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 0 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those constrained within several chromosomes:\n\n\nhic[c('II', 'III', 'IV')]\n##  `HiCExperiment` object with 306,212 contacts over 257 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II, III, IV\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 18513 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n\n\n\n\nNote\n\n\n\n\nThis last example (subsetting for a vector of several chromosomes) is the only scenario for which [-based in-memory subsetting of pre-imported data is the only way to go, as such subsetting is not possible with focus from disk-stored data.\nAll the other [ subsetting scenarii illustrated above can be achieved more efficiently using the focus argument when importing data into a HiCExperiment object.\nHowever, keep in mind that subsetting preserves extra data, e.g. added scores, topologicalFeatures, metadata or pairsFile, whereas this information is lost using focus with import.\n\n\n\n\n3.1.3 Zooming on a HiCExperiment\n\n“Zooming” refers to dynamically changing the resolution of a HiCExperiment. By zooming a HiCExperiment, one can refine or coarsen the contact matrix. This operation takes aContactFile and focus from an existing HiCExperiment input and re-generates a new HiCExperiment with updated resolution, interactions and scores. Note that zoom will preserve existing metadata, topologicalFeatures and pairsFile information.\n\nhic\n##  `HiCExperiment` object with 306,212 contacts over 257 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:300,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 18513 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\nzoom(hic, 4000)\n##  `HiCExperiment` object with 306,212 contacts over 129 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:300,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 4000 \n##  interactions: 6800 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\nzoom(hic, 1000)\n##  `HiCExperiment` object with 306,212 contacts over 514 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:300,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 44363 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n\nNote\n\n\n\nThe sum of raw counts do not change after zooming, however the number of individual interactions and regions changes.\n\nlength(hic)\n##  [1] 18513\nlength(zoom(hic, 1000))\n##  [1] 44363\nlength(zoom(hic, 4000))\n##  [1] 6800\nsum(scores(hic, \"count\"))\n##  [1] 306212\nsum(scores(zoom(hic, 1000), \"count\"))\n##  [1] 306212\nsum(scores(zoom(hic, 4000), \"count\"))\n##  [1] 306212\n\n\n\n\n\n\n\n\n\nImportant\n\n\n\n\n\nzoom does not change the focus! It only affects the resolution (and consequently, the interactions).\n\nzoom will only work for multi-resolution contact matrices, e.g. .mcool or .hic."
+    "text": "3.1 Subsetting a contact matrix\nTwo entirely different approaches are possible to subset of a Hi-C contact matrix:\n\nSubsetting before importing: leveraging random access to a disk-stored contact matrix to only import interactions overlapping with a genomic locus of interest.\nSubsetting after importing: parsing the entire contact matrix in memory, and subsequently subset interactions overlapping with a genomic locus of interest.\n\n\n\n3.1.1 Subsetting before import: with focus\n\nSpecifying a focus when importing a dataset in R (i.e. \"Subset first, then parse\") is generally the recommended approach to import Hi-C data in R.\nThe focus argument can be set when importing a ContactFile in R, as follows:\n\nimport(cf, focus = \"...\")\n\nThis ensures that only the needed data is parsed in R, reducing memory load and accelerating the import. Thus, this should be the preferred way of parsing HiCExperiment data, as disk-stored contact matrices allow efficient random access to indexed data.\nfocus can be any of the following string types:\n\n#   \"II\"                                  --&gt; import contacts over an entire chromosome\n#   \"II:300001-800000\"                    --&gt; import on-diagonal contacts within a chromosome\n#   \"II:300001-400000|II:600001-700000\"   --&gt; import off-diagonal contacts within a chromosome\n#   \"II|III\"                              --&gt; import contacts between two chromosomes\n#   \"II:300001-800000|V:1-500000\"         --&gt; import contacts between segments of two chromosomes\n\n\n\n\n\n\n\nMore examples for import with focus argument 👇\n\n\n\n\n\n\nSubsetting to a specific on-diagonal genomic location using standard UCSC coordinates query:\n\n\nimport(cf, focus = 'II:300001-800000', resolution = 2000)\n##  `HiCExperiment` object with 301,018 contacts over 250 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:300,001-800,000\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 17974 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting to a specific off-diagonal genomic location using pairs of coordinates query:\n\n\nimport(cf, focus = 'II:300001-400000|II:600001-700000', resolution = 2000)\n##  `HiCExperiment` object with 402 contacts over 100 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:300001-400000|II:600001-700000\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 357 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those constrained within a single chromosome:\n\n\nimport(cf, focus = 'II', resolution = 2000)\n##  `HiCExperiment` object with 471,364 contacts over 407 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 34063 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those between two chromosomes:\n\n\nimport(cf, focus = 'II|III', resolution = 2000)\n##  `HiCExperiment` object with 9,092 contacts over 566 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II|III\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 7438 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those between parts of two chromosomes:\n\n\nimport(cf, focus = 'II:300001-800000|V:1-500000', resolution = 2000)\n##  `HiCExperiment` object with 7,147 contacts over 500 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:300001-800000|V:1-500000\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 6523 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n3.1.2 Subsetting after import\nIt may sometimes be desirable to import a full dataset from disk first, and only then perform in-memory subsetting of the HiCExperiment object (i.e. \"Parse first, then subset\"). This is for example necessary when the end user aims to investigate subsets of interactions across a large number of different areas of a contact matrix.\nSeveral strategies are possible to allow subsetting of imported data, either with subsetByOverlaps or [.\n\n3.1.2.1 subsetByOverlaps(&lt;HiCExperiment&gt;, &lt;GRanges&gt;)\n\nsubsetByOverlaps can take a HiCExperiment as a query and a GRanges as a query. In this case, the GRanges is used to extract a subset of a HiCExperiment constrained within a specific genomic location.\n\ntelomere &lt;- GRanges(\"II:700001-813184\")\nsubsetByOverlaps(hic, telomere) |&gt; interactions()\n##  GInteractions object with 1540 interactions and 4 metadata columns:\n##           seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##               &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##       [1]        II 700001-702000 ---        II 700001-702000 |       466\n##       [2]        II 700001-702000 ---        II 702001-704000 |       466\n##       [3]        II 700001-702000 ---        II 704001-706000 |       466\n##       [4]        II 700001-702000 ---        II 706001-708000 |       466\n##       [5]        II 700001-702000 ---        II 708001-710000 |       466\n##       ...       ...           ... ...       ...           ... .       ...\n##    [1536]        II 804001-806000 ---        II 810001-812000 |       518\n##    [1537]        II 806001-808000 ---        II 806001-808000 |       519\n##    [1538]        II 806001-808000 ---        II 808001-810000 |       519\n##    [1539]        II 806001-808000 ---        II 810001-812000 |       519\n##    [1540]        II 808001-810000 ---        II 808001-810000 |       520\n##             bin_id2     count  balanced\n##           &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;\n##       [1]       466        30 0.0283618\n##       [2]       467       145 0.0709380\n##       [3]       468       124 0.0704979\n##       [4]       469        59 0.0510221\n##       [5]       470        59 0.0384004\n##       ...       ...       ...       ...\n##    [1536]       521         1       NaN\n##    [1537]       519        15 0.0560633\n##    [1538]       520        25       NaN\n##    [1539]       521         1       NaN\n##    [1540]       520        10       NaN\n##    -------\n##    regions: 57 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n\n\n\n\n\ntype argument\n\n\n\nBy default, subsetByOverlaps(hic, telomere) will only recover interactions constrained within telomere, i.e. interactions for which both ends are in telomere.\nAlternatively, type = \"any\" can be specified to get all interactions with at least one of their anchors within telomere.\n\nsubsetByOverlaps(hic, telomere, type = \"any\") |&gt; interactions()\n##  GInteractions object with 6041 interactions and 4 metadata columns:\n##           seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##               &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##       [1]        II 300001-302000 ---        II 702001-704000 |       266\n##       [2]        II 300001-302000 ---        II 704001-706000 |       266\n##       [3]        II 300001-302000 ---        II 768001-770000 |       266\n##       [4]        II 300001-302000 ---        II 784001-786000 |       266\n##       [5]        II 302001-304000 ---        II 740001-742000 |       267\n##       ...       ...           ... ...       ...           ... .       ...\n##    [6037]        II 804001-806000 ---        II 810001-812000 |       518\n##    [6038]        II 806001-808000 ---        II 806001-808000 |       519\n##    [6039]        II 806001-808000 ---        II 808001-810000 |       519\n##    [6040]        II 806001-808000 ---        II 810001-812000 |       519\n##    [6041]        II 808001-810000 ---        II 808001-810000 |       520\n##             bin_id2     count    balanced\n##           &lt;numeric&gt; &lt;numeric&gt;   &lt;numeric&gt;\n##       [1]       467         1 0.000590999\n##       [2]       468         1 0.000686799\n##       [3]       500         1 0.000728215\n##       [4]       508         1 0.000923092\n##       [5]       486         1 0.000382222\n##       ...       ...       ...         ...\n##    [6037]       521         1         NaN\n##    [6038]       519        15   0.0560633\n##    [6039]       520        25         NaN\n##    [6040]       521         1         NaN\n##    [6041]       520        10         NaN\n##    -------\n##    regions: 257 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome\n\n\n\n\n3.1.2.2 &lt;HiCExperiment&gt;[\"...\"]\n\nThe square bracket operator [ allows for more advanced textual queries, similarly to focus arguments that can be used when importing contact matrices in memory.\nThis ensures that only the needed data is parsed in R, reducing memory load and accelerating the import. Thus, this should be the preferred way of parsing HiCExperiment data, as disk-stored contact matrices allow efficient random access to indexed data.\nThe following string types can be used to subset a HiCExperiment object with the [ notation:\n\n#   \"II\"                                  --&gt; import contacts over an entire chromosome\n#   \"II:300001-800000\"                    --&gt; import on-diagonal contacts within a chromosome\n#   \"II:300001-400000|II:600001-700000\"   --&gt; import off-diagonal contacts within a chromosome\n#   \"II|III\"                              --&gt; import contacts between two chromosomes\n#   \"II:300001-800000|V:1-500000\"         --&gt; import contacts between segments of two chromosomes\n#   c(\"II\", \"III\", \"IV\")                  --&gt; import contacts within and between several chromosomes\n\n\n\n\n\n\n\nMore examples for subsetting with [ 👇\n\n\n\n\n\n\nSubsetting to a specific on-diagonal genomic location using standard UCSC coordinates query:\n\n\nhic[\"II:800001-813184\"]\n##  `HiCExperiment` object with 1,040 contacts over 6 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:800,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 19 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting to a specific off-diagonal genomic location using pairs of coordinates query:\n\n\nhic[\"II:300001-320000|II:800001-813184\"]\n##  `HiCExperiment` object with 3 contacts over 6 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:300001-320000|II:800001-813184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 3 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those constrained within a single chromosome:\n\n\nhic[\"II\"]\n##  `HiCExperiment` object with 306,212 contacts over 257 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 18513 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those between two chromosomes:\n\n\nhic[\"II|IV\"]\n##  `HiCExperiment` object with 0 contacts over 0 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:1-813184|IV:1-1531933\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 0 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those between segments of two chromosomes:\n\n\nhic[\"II:300001-320000|IV:1-100000\"]\n##  `HiCExperiment` object with 0 contacts over 0 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:300001-320000|IV:1-100000\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 0 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\nSubsetting interactions to retain those constrained within several chromosomes:\n\n\nhic[c('II', 'III', 'IV')]\n##  `HiCExperiment` object with 306,212 contacts over 257 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II, III, IV\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 18513 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n\n\n\n\nNote\n\n\n\n\nThis last example (subsetting for a vector of several chromosomes) is the only scenario for which [-based in-memory subsetting of pre-imported data is the only way to go, as such subsetting is not possible with focus from disk-stored data.\nAll the other [ subsetting scenarii illustrated above can be achieved more efficiently using the focus argument when importing data into a HiCExperiment object.\nHowever, keep in mind that subsetting preserves extra data, e.g. added scores, topologicalFeatures, metadata or pairsFile, whereas this information is lost using focus with import.\n\n\n\n\n3.1.3 Zooming on a HiCExperiment\n\n“Zooming” refers to dynamically changing the resolution of a HiCExperiment. By zooming a HiCExperiment, one can refine or coarsen the contact matrix. This operation takes aContactFile and focus from an existing HiCExperiment input and re-generates a new HiCExperiment with updated resolution, interactions and scores. Note that zoom will preserve existing metadata, topologicalFeatures and pairsFile information.\n\nhic\n##  `HiCExperiment` object with 306,212 contacts over 257 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:300,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 18513 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\nzoom(hic, 4000)\n##  `HiCExperiment` object with 306,212 contacts over 129 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:300,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 4000 \n##  interactions: 6800 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\nzoom(hic, 1000)\n##  `HiCExperiment` object with 306,212 contacts over 514 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:300,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 44363 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n\nNote\n\n\n\nThe sum of raw counts do not change after zooming, however the number of individual interactions and regions changes.\n\nlength(hic)\n##  [1] 18513\nlength(zoom(hic, 1000))\n##  [1] 44363\nlength(zoom(hic, 4000))\n##  [1] 6800\nsum(scores(hic, \"count\"))\n##  [1] 306212\nsum(scores(zoom(hic, 1000), \"count\"))\n##  [1] 306212\nsum(scores(zoom(hic, 4000), \"count\"))\n##  [1] 306212\n\n\n\n\n\n\n\n\n\nImportant\n\n\n\n\n\nzoom does not change the focus! It only affects the resolution (and consequently, the interactions).\n\nzoom will only work for multi-resolution contact matrices, e.g. .mcool or .hic."
   },
   {
     "objectID": "parsing.html#updating-an-hicexperiment-object",
     "href": "parsing.html#updating-an-hicexperiment-object",
     "title": "\n3  Manipulating Hi-C data in R\n",
     "section": "\n3.2 Updating an HiCExperiment object",
-    "text": "3.2 Updating an HiCExperiment object\n\n\n\n\n\n\nTL;DR: Which HiCExperiment slots are mutable (✅) / immutable (⛔️)?\n\n\n\n\n\nfileName(hic): ⛔️ (obtained from disk-stored file)\n\nfocus(hic): 🤔 (see subsetting section)\n\nresolutions(hic): ⛔️ (obtained from disk-stored file)\n\nresolution(hic): 🤔 (see zooming section)\n\ninteractions(hic): ⛔️ (obtained from disk-stored file)\n\nscores(hic): ✅\n\ntopologicalFeatures(hic): ✅\n\npairsFile(hic): ✅\n\nmetadata(hic): ✅\n\n\n\n\n3.2.1 Immutable slots\nAn HiCExperiment object acts as an interface exposing disk-stored data. This implies that the fileName slot itself is immutable (i.e. cannot be changed). This should be obvious, as a HiCExperiment has to be associated with a disk-stored contact matrix to properly function (except in some advanced cases developed in next chapters).\nFor this reason, methods to manually modify interactions and resolutions slots are also not exposed in the HiCExperiment package.\nA corollary of this is that the associated regions and anchors of an HiCExperiment should not be modified by hand either, since they are directly linked to interactions.\n\n3.2.2 Mutable slots\nThat being said, HiCExperiment objects are flexible and can be partially modified in memory without having to change/overwrite the original, disk-stored contact matrix.\nSeveral slots can be modified in memory: slots, topologicalFeatures, pairsFile and metadata.\n\n3.2.2.1 scores\n\nWe have seen in the previous chapter that scores are stored in a list and are available using the scores function.\n\nscores(hic)\n##  List of length 2\n##  names(2): count balanced\n\nhead(scores(hic, \"count\"))\n##  [1]  7 92 75 61 38 43\n\nhead(scores(hic, \"balanced\"))\n##  [1] 0.009657438 0.076622340 0.054101992 0.042940512 0.040905212 0.029293930\n\nExtra scores can be added to this list, e.g. to describe the “expected” interaction frequency for each interaction stored in the HiCExperiment object). This can be achieved using the scores()&lt;- function.\n\nscores(hic, \"random\") &lt;- runif(length(hic))\n\nscores(hic)\n##  List of length 3\n##  names(3): count balanced random\n\nhead(scores(hic, \"random\"))\n##  [1] 0.080750138 0.834333037 0.600760886 0.157208442 0.007399441 0.466393497\n\n\n3.2.2.2 topologicalFeatures\n\nThe end-user can create additional topologicalFeatures or modify the existing ones using the topologicalFeatures()&lt;- function.\n\ntopologicalFeatures(hic, 'CTCF') &lt;- GRanges(c(\n    \"II:340-352\", \n    \"II:3520-3532\", \n    \"II:7980-7992\", \n    \"II:9240-9252\" \n))\ntopologicalFeatures(hic, 'CTCF')\n##  GRanges object with 4 ranges and 0 metadata columns:\n##        seqnames    ranges strand\n##           &lt;Rle&gt; &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]       II   340-352      *\n##    [2]       II 3520-3532      *\n##    [3]       II 7980-7992      *\n##    [4]       II 9240-9252      *\n##    -------\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\ntopologicalFeatures(hic, 'loops') &lt;- GInteractions(\n    topologicalFeatures(hic, 'CTCF')[rep(1:3, each = 3)],\n    topologicalFeatures(hic, 'CTCF')[rep(1:3, 3)]\n)\ntopologicalFeatures(hic, 'loops')\n##  GInteractions object with 9 interactions and 0 metadata columns:\n##        seqnames1   ranges1     seqnames2   ranges2\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt;\n##    [1]        II   340-352 ---        II   340-352\n##    [2]        II   340-352 ---        II 3520-3532\n##    [3]        II   340-352 ---        II 7980-7992\n##    [4]        II 3520-3532 ---        II   340-352\n##    [5]        II 3520-3532 ---        II 3520-3532\n##    [6]        II 3520-3532 ---        II 7980-7992\n##    [7]        II 7980-7992 ---        II   340-352\n##    [8]        II 7980-7992 ---        II 3520-3532\n##    [9]        II 7980-7992 ---        II 7980-7992\n##    -------\n##    regions: 3 ranges and 0 metadata columns\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\nhic\n##  `HiCExperiment` object with 306,212 contacts over 257 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:300,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 18513 \n##  scores(3): count balanced random \n##  topologicalFeatures: compartments(0) borders(0) loops(9) viewpoints(0) CTCF(4) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n\nNote\n\n\n\nAll these objects can be used in *Overlap methods, as they all extend the GRanges class of objects.\n\n# ---- This counts the number of times `CTCF` anchors are being used in the \n#      `loops` `GInteractions` object\ncountOverlaps(\n    query = topologicalFeatures(hic, 'CTCF'), \n    subject = topologicalFeatures(hic, 'loops')\n)\n##  [1] 5 5 5 0\n\n\n\n\n3.2.2.3 pairsFile\n\nIf pairsFile is not specified when importing the ContactFile into a HiCExperiment object, one can add it later.\n\npairsf &lt;- HiContactsData('yeast_wt', 'pairs.gz')\n\n\npairsFile(hic) &lt;- pairsf\nhic\n##  `HiCExperiment` object with 306,212 contacts over 257 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:300,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 18513 \n##  scores(3): count balanced random \n##  topologicalFeatures: compartments(0) borders(0) loops(9) viewpoints(0) CTCF(4) \n##  pairsFile: /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753 \n##  metadata(0):\n\n\n3.2.2.4 metadata\n\nMetadata associated with a HiCExperiment can be updated at any point.\n\nmetadata(hic) &lt;- list(\n    info = \"HiCExperiment created from an example .mcool file from `HiContactsData`\", \n    date = date()\n)\nmetadata(hic)\n##  $info\n##  [1] \"HiCExperiment created from an example .mcool file from `HiContactsData`\"\n##  \n##  $date\n##  [1] \"Mon Sep 25 17:02:32 2023\""
+    "text": "3.2 Updating an HiCExperiment object\n\n\n\n\n\n\nTL;DR: Which HiCExperiment slots are mutable (✅) / immutable (⛔️)?\n\n\n\n\n\nfileName(hic): ⛔️ (obtained from disk-stored file)\n\nfocus(hic): 🤔 (see subsetting section)\n\nresolutions(hic): ⛔️ (obtained from disk-stored file)\n\nresolution(hic): 🤔 (see zooming section)\n\ninteractions(hic): ⛔️ (obtained from disk-stored file)\n\nscores(hic): ✅\n\ntopologicalFeatures(hic): ✅\n\npairsFile(hic): ✅\n\nmetadata(hic): ✅\n\n\n\n\n3.2.1 Immutable slots\nAn HiCExperiment object acts as an interface exposing disk-stored data. This implies that the fileName slot itself is immutable (i.e. cannot be changed). This should be obvious, as a HiCExperiment has to be associated with a disk-stored contact matrix to properly function (except in some advanced cases developed in next chapters).\nFor this reason, methods to manually modify interactions and resolutions slots are also not exposed in the HiCExperiment package.\nA corollary of this is that the associated regions and anchors of an HiCExperiment should not be modified by hand either, since they are directly linked to interactions.\n\n3.2.2 Mutable slots\nThat being said, HiCExperiment objects are flexible and can be partially modified in memory without having to change/overwrite the original, disk-stored contact matrix.\nSeveral slots can be modified in memory: slots, topologicalFeatures, pairsFile and metadata.\n\n3.2.2.1 scores\n\nWe have seen in the previous chapter that scores are stored in a list and are available using the scores function.\n\nscores(hic)\n##  List of length 2\n##  names(2): count balanced\n\nhead(scores(hic, \"count\"))\n##  [1]  7 92 75 61 38 43\n\nhead(scores(hic, \"balanced\"))\n##  [1] 0.009657438 0.076622340 0.054101992 0.042940512 0.040905212 0.029293930\n\nExtra scores can be added to this list, e.g. to describe the “expected” interaction frequency for each interaction stored in the HiCExperiment object). This can be achieved using the scores()&lt;- function.\n\nscores(hic, \"random\") &lt;- runif(length(hic))\n\nscores(hic)\n##  List of length 3\n##  names(3): count balanced random\n\nhead(scores(hic, \"random\"))\n##  [1] 0.080750138 0.834333037 0.600760886 0.157208442 0.007399441 0.466393497\n\n\n3.2.2.2 topologicalFeatures\n\nThe end-user can create additional topologicalFeatures or modify the existing ones using the topologicalFeatures()&lt;- function.\n\ntopologicalFeatures(hic, 'CTCF') &lt;- GRanges(c(\n    \"II:340-352\", \n    \"II:3520-3532\", \n    \"II:7980-7992\", \n    \"II:9240-9252\" \n))\ntopologicalFeatures(hic, 'CTCF')\n##  GRanges object with 4 ranges and 0 metadata columns:\n##        seqnames    ranges strand\n##           &lt;Rle&gt; &lt;IRanges&gt;  &lt;Rle&gt;\n##    [1]       II   340-352      *\n##    [2]       II 3520-3532      *\n##    [3]       II 7980-7992      *\n##    [4]       II 9240-9252      *\n##    -------\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\ntopologicalFeatures(hic, 'loops') &lt;- GInteractions(\n    topologicalFeatures(hic, 'CTCF')[rep(1:3, each = 3)],\n    topologicalFeatures(hic, 'CTCF')[rep(1:3, 3)]\n)\ntopologicalFeatures(hic, 'loops')\n##  GInteractions object with 9 interactions and 0 metadata columns:\n##        seqnames1   ranges1     seqnames2   ranges2\n##            &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt;\n##    [1]        II   340-352 ---        II   340-352\n##    [2]        II   340-352 ---        II 3520-3532\n##    [3]        II   340-352 ---        II 7980-7992\n##    [4]        II 3520-3532 ---        II   340-352\n##    [5]        II 3520-3532 ---        II 3520-3532\n##    [6]        II 3520-3532 ---        II 7980-7992\n##    [7]        II 7980-7992 ---        II   340-352\n##    [8]        II 7980-7992 ---        II 3520-3532\n##    [9]        II 7980-7992 ---        II 7980-7992\n##    -------\n##    regions: 3 ranges and 0 metadata columns\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\nhic\n##  `HiCExperiment` object with 306,212 contacts over 257 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:300,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 18513 \n##  scores(3): count balanced random \n##  topologicalFeatures: compartments(0) borders(0) loops(9) viewpoints(0) CTCF(4) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n\nNote\n\n\n\nAll these objects can be used in *Overlap methods, as they all extend the GRanges class of objects.\n\n# ---- This counts the number of times `CTCF` anchors are being used in the \n#      `loops` `GInteractions` object\ncountOverlaps(\n    query = topologicalFeatures(hic, 'CTCF'), \n    subject = topologicalFeatures(hic, 'loops')\n)\n##  [1] 5 5 5 0\n\n\n\n\n3.2.2.3 pairsFile\n\nIf pairsFile is not specified when importing the ContactFile into a HiCExperiment object, one can add it later.\n\npairsf &lt;- HiContactsData('yeast_wt', 'pairs.gz')\n\n\npairsFile(hic) &lt;- pairsf\nhic\n##  `HiCExperiment` object with 306,212 contacts over 257 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:300,001-813,184\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 18513 \n##  scores(3): count balanced random \n##  topologicalFeatures: compartments(0) borders(0) loops(9) viewpoints(0) CTCF(4) \n##  pairsFile: /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753 \n##  metadata(0):\n\n\n3.2.2.4 metadata\n\nMetadata associated with a HiCExperiment can be updated at any point.\n\nmetadata(hic) &lt;- list(\n    info = \"HiCExperiment created from an example .mcool file from `HiContactsData`\", \n    date = date()\n)\nmetadata(hic)\n##  $info\n##  [1] \"HiCExperiment created from an example .mcool file from `HiContactsData`\"\n##  \n##  $date\n##  [1] \"Thu Oct 19 10:04:41 2023\""
   },
   {
     "objectID": "parsing.html#coercing-hicexperiment-objects",
@@ -165,7 +172,7 @@
     "href": "visualization.html#advanced-visualization",
     "title": "\n4  Hi-C data visualization\n",
     "section": "\n4.3 Advanced visualization",
-    "text": "4.3 Advanced visualization\n\n4.3.1 Overlaying topological features\nTopological features (e.g. chromatin loops, domain borders, A/B compartments, e.g. …) are often displayed over a Hi-C heatmap.\nTo illustrate how to do this, let’s import pre-computed chromatin loops in R. These loops have been identified using chromosight (Matthey-Doret et al. (2020)) on the contact matrix which we imported interactions from.\n\nlibrary(rtracklayer)\nlibrary(InteractionSet)\nloops &lt;- system.file('extdata', 'S288C-loops.bedpe', package = 'HiCExperiment') |&gt; \n    import() |&gt; \n    makeGInteractionsFromGRangesPairs()\nloops\n##  GInteractions object with 162 interactions and 0 metadata columns:\n##          seqnames1       ranges1     seqnames2       ranges2\n##              &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt;\n##      [1]         I     3001-4000 ---         I   29001-30000\n##      [2]         I   29001-30000 ---         I   50001-51000\n##      [3]         I   95001-96000 ---         I 128001-129000\n##      [4]         I 133001-134000 ---         I 157001-158000\n##      [5]        II     8001-9000 ---        II   46001-47000\n##      ...       ...           ... ...       ...           ...\n##    [158]       XVI 773001-774000 ---       XVI 803001-804000\n##    [159]       XVI 834001-835000 ---       XVI 859001-860000\n##    [160]       XVI 860001-861000 ---       XVI 884001-885000\n##    [161]       XVI 901001-902000 ---       XVI 940001-941000\n##    [162]       XVI 917001-918000 ---       XVI 939001-940000\n##    -------\n##    regions: 316 ranges and 0 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome; no seqlengths\n\nSimilarly, borders have also been mapped with chromosight. We can also import them in R.\n\nborders &lt;- system.file('extdata', 'S288C-borders.bed', package = 'HiCExperiment') |&gt; \n    import()\nborders\n##  GRanges object with 814 ranges and 0 metadata columns:\n##          seqnames        ranges strand\n##             &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt;\n##      [1]        I   73001-74000      *\n##      [2]        I 108001-109000      *\n##      [3]        I 181001-182000      *\n##      [4]       II   90001-91000      *\n##      [5]       II 119001-120000      *\n##      ...      ...           ...    ...\n##    [810]      XVI 777001-778000      *\n##    [811]      XVI 796001-797000      *\n##    [812]      XVI 811001-812000      *\n##    [813]      XVI 890001-891000      *\n##    [814]      XVI 933001-934000      *\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome; no seqlengths\n\nChromatin loops are stored in GInteractions while borders are GRanges. The former will be displayed as off-diagonal circles and the later as on-diagonal diamonds on the Hi-C heatmap.\n\nplotMatrix(hic, loops = loops, borders = borders)\n\n\n\n\n\n\n\n\n4.3.2 Aggregated Hi-C maps\nFinally, Hi-C map “snippets” (i.e. extracts) are often aggregated together to show an average signal. This analysis is sometimes referred to as APA (Aggregated Plot Analysis).\nAggregated Hi-C maps can be computed over a collection of targets using the aggregate function. These targets can be GRanges (to extract on-diagonal snippets) or GInteractions (to extract off-diagonal snippets). The flankingBins specifies how many matrix bins should be extracted on each side of the targets of interest.\nHere, we compute the aggregated Hi-C snippets of ± 15kb around each chromatin loop listed in loops.\n\nhic &lt;- zoom(hic, 1000)\naggr_loops &lt;- aggregate(hic, targets = loops, flankingBins = 15)\n##  Going through preflight checklist...\n##  Parsing the entire contact matrice as a sparse matrix...\n##  Modeling distance decay...\n##  Filtering for contacts within provided targets...\naggr_loops\n##  `AggrHiCExperiment` object over 148 targets \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: 148 targets \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 961 \n##  scores(4): count balanced expected detrended \n##  slices(4): count balanced expected detrended \n##  topologicalFeatures: targets(148) compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: N/A \n##  metadata(0):\n\naggregate generates a AggrHiCExperiment object, a flavor of HiCExperiment class of objects.\n\n\nAggrHiCExperiment objects have an extra slices slot. This stores a list of arrays, one per scores. Each array is of 3 dimensions, x and y representing the heatmap axes, and z representing the index of the target.\n\nAggrHiCExperiment objects also have a mandatory topologicalFeatures element named targets, storing the genomic loci provided in aggregate.\n\n\nslices(aggr_loops)\n##  List of length 4\n##  names(4): count balanced expected detrended\ndim(slices(aggr_loops, 'count'))\n##  [1]  31  31 148\ntopologicalFeatures(aggr_loops, 'targets')\n##  Pairs object with 148 pairs and 0 metadata columns:\n##                      first            second\n##                  &lt;GRanges&gt;         &lt;GRanges&gt;\n##      [1]     I:14501-44500     I:35501-65500\n##      [2]    I:80501-110500   I:113501-143500\n##      [3]   I:118501-148500   I:142501-172500\n##      [4]    II:33501-63500    II:63501-93500\n##      [5]  II:134501-164500  II:159501-189500\n##      ...               ...               ...\n##    [144] XVI:586501-616500 XVI:606501-636500\n##    [145] XVI:733501-763500 XVI:754501-784500\n##    [146] XVI:758501-788500 XVI:788501-818500\n##    [147] XVI:819501-849500 XVI:844501-874500\n##    [148] XVI:845501-875500 XVI:869501-899500\n\nThe resulting AggrHiCExperiment can be plotted using the same plotMatrix function with the arguments described above.\n\nplotMatrix(\n    aggr_loops, \n    use.scores = 'detrended', \n    scale = 'linear', \n    limits = c(-1, 1), \n    cmap = bgrColors()\n)"
+    "text": "4.3 Advanced visualization\n\n4.3.1 Overlaying topological features\nTopological features (e.g. chromatin loops, domain borders, A/B compartments, e.g. …) are often displayed over a Hi-C heatmap.\nTo illustrate how to do this, let’s import pre-computed chromatin loops in R. These loops have been identified using chromosight (Matthey-Doret et al. (2020)) on the contact matrix which we imported interactions from.\n\nlibrary(rtracklayer)\nlibrary(InteractionSet)\nloops &lt;- system.file('extdata', 'S288C-loops.bedpe', package = 'HiCExperiment') |&gt; \n    import() |&gt; \n    makeGInteractionsFromGRangesPairs()\nloops\n##  GInteractions object with 162 interactions and 0 metadata columns:\n##          seqnames1       ranges1     seqnames2       ranges2\n##              &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt;\n##      [1]         I     3001-4000 ---         I   29001-30000\n##      [2]         I   29001-30000 ---         I   50001-51000\n##      [3]         I   95001-96000 ---         I 128001-129000\n##      [4]         I 133001-134000 ---         I 157001-158000\n##      [5]        II     8001-9000 ---        II   46001-47000\n##      ...       ...           ... ...       ...           ...\n##    [158]       XVI 773001-774000 ---       XVI 803001-804000\n##    [159]       XVI 834001-835000 ---       XVI 859001-860000\n##    [160]       XVI 860001-861000 ---       XVI 884001-885000\n##    [161]       XVI 901001-902000 ---       XVI 940001-941000\n##    [162]       XVI 917001-918000 ---       XVI 939001-940000\n##    -------\n##    regions: 316 ranges and 0 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome; no seqlengths\n\nSimilarly, borders have also been mapped with chromosight. We can also import them in R.\n\nborders &lt;- system.file('extdata', 'S288C-borders.bed', package = 'HiCExperiment') |&gt; \n    import()\nborders\n##  GRanges object with 814 ranges and 0 metadata columns:\n##          seqnames        ranges strand\n##             &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt;\n##      [1]        I   73001-74000      *\n##      [2]        I 108001-109000      *\n##      [3]        I 181001-182000      *\n##      [4]       II   90001-91000      *\n##      [5]       II 119001-120000      *\n##      ...      ...           ...    ...\n##    [810]      XVI 777001-778000      *\n##    [811]      XVI 796001-797000      *\n##    [812]      XVI 811001-812000      *\n##    [813]      XVI 890001-891000      *\n##    [814]      XVI 933001-934000      *\n##    -------\n##    seqinfo: 16 sequences from an unspecified genome; no seqlengths\n\nChromatin loops are stored in GInteractions while borders are GRanges. The former will be displayed as off-diagonal circles and the later as on-diagonal diamonds on the Hi-C heatmap.\n\nplotMatrix(hic, loops = loops, borders = borders)\n\n\n\n\n\n\n\n\n4.3.2 Aggregated Hi-C maps\nFinally, Hi-C map “snippets” (i.e. extracts) are often aggregated together to show an average signal. This analysis is sometimes referred to as APA (Aggregated Plot Analysis).\nAggregated Hi-C maps can be computed over a collection of targets using the aggregate function. These targets can be GRanges (to extract on-diagonal snippets) or GInteractions (to extract off-diagonal snippets). The flankingBins specifies how many matrix bins should be extracted on each side of the targets of interest.\nHere, we compute the aggregated Hi-C snippets of ± 15kb around each chromatin loop listed in loops.\n\nhic &lt;- zoom(hic, 1000)\naggr_loops &lt;- aggregate(hic, targets = loops, flankingBins = 15)\n##  Going through preflight checklist...\n##  Parsing the entire contact matrice as a sparse matrix...\n##  Modeling distance decay...\n##  Filtering for contacts within provided targets...\naggr_loops\n##  `AggrHiCExperiment` object over 148 targets \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: 148 targets \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 961 \n##  scores(4): count balanced expected detrended \n##  slices(4): count balanced expected detrended \n##  topologicalFeatures: targets(148) compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: N/A \n##  metadata(0):\n\naggregate generates a AggrHiCExperiment object, a flavor of HiCExperiment class of objects.\n\n\nAggrHiCExperiment objects have an extra slices slot. This stores a list of arrays, one per scores. Each array is of 3 dimensions, x and y representing the heatmap axes, and z representing the index of the target.\n\nAggrHiCExperiment objects also have a mandatory topologicalFeatures element named targets, storing the genomic loci provided in aggregate.\n\n\nslices(aggr_loops)\n##  List of length 4\n##  names(4): count balanced expected detrended\ndim(slices(aggr_loops, 'count'))\n##  [1]  31  31 148\ntopologicalFeatures(aggr_loops, 'targets')\n##  Pairs object with 148 pairs and 0 metadata columns:\n##                      first            second\n##                  &lt;GRanges&gt;         &lt;GRanges&gt;\n##      [1]     I:14501-44500     I:35501-65500\n##      [2]    I:80501-110500   I:113501-143500\n##      [3]   I:118501-148500   I:142501-172500\n##      [4]    II:33501-63500    II:63501-93500\n##      [5]  II:134501-164500  II:159501-189500\n##      ...               ...               ...\n##    [144] XVI:586501-616500 XVI:606501-636500\n##    [145] XVI:733501-763500 XVI:754501-784500\n##    [146] XVI:758501-788500 XVI:788501-818500\n##    [147] XVI:819501-849500 XVI:844501-874500\n##    [148] XVI:845501-875500 XVI:869501-899500\n\nThe resulting AggrHiCExperiment can be plotted using the same plotMatrix function with the arguments described above.\n\nplotMatrix(\n    aggr_loops, \n    use.scores = 'detrended', \n    scale = 'linear', \n    limits = c(-1, 1), \n    cmap = bgrColors()\n)"
   },
   {
     "objectID": "matrix-centric.html",
@@ -179,7 +186,7 @@
     "href": "matrix-centric.html#operations-in-an-individual-matrix",
     "title": "\n5  Matrix-centric analysis\n",
     "section": "\n5.1 Operations in an individual matrix",
-    "text": "5.1 Operations in an individual matrix\n\n5.1.1 Balancing a raw interaction count map\nHi-C sequencing coverage is systematically affected by multiple confounding factors, e.g.  density of restriction sites, GC%, genome mappability, etc.. Overall, it generally ends up not homogenous throughout the entire genome and this leads to artifacts in un-normalized count matrices.\nTo correct for sequencing coverage heterogeneity of raw count maps, Hi-C data can be normalized using matrix balancing approaches (Cournac et al. (2012), Imakaev et al. (2012)). This is generally done directly on the disk-stored matrices using out-of-memory strategies (e.g. with cooler balance &lt;.cool&gt;). However, if contact matrix files are imported into a HiCExperiment object but no balanced scores are available, in-memory balancing can be performed using the normalize function.\n\nnormalized_hic &lt;- normalize(hic)\nnormalized_hic\n##  `HiCExperiment` object with 471,364 contacts over 407 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 34063 \n##  scores(3): count balanced ICE \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n\nNote\n\n\n\nThe only change done to the HiCExperiment object by the normalize function is the addition of a single extra ICE in scores list. The interactions themselves are unmodified.\n\n\nIt is possible to plot the different scores of the resulting object to visualize the newly computed scores. In this example, ICE scores should be nearly identical to balanced scores, which were originally imported from the disk-stored contact matrix.\n\n\npatchwork::wrap_plots(\n    plotMatrix(normalized_hic, use.scores = 'count', caption = FALSE),\n    plotMatrix(normalized_hic, use.scores = 'balanced', caption = FALSE),\n    plotMatrix(normalized_hic, use.scores = 'ICE', caption = FALSE), \n    nrow = 1\n)\n\n\n\n\n\n\n\n\n\n5.1.2 Computing observed/expected (O/E) map\nThe most prominent feature of a balanced Hi-C matrix is the strong main diagonal. This main diagonal is observed because interactions between immediate adjacent genomic loci are more prone to happen than interactions spanning longer genomic distances. This “expected” behavior is due to the polymer nature of the chromosomes being studied, and can be locally estimated using the distance-dependent interaction frequency (a.k.a. the “distance law”, or P(s)). It can be used to compute an expected matrix on interactions.\nWhen it is desirable to “mask” this polymer behavior to emphasize topological structures formed by chromosomes, one can divide a given balanced matrix by its expected matrix, i.e. calculate the observed/expected (O/E) map. This is sometimes called “detrending”, as it effectively removes the average polymer behavior from the balanced matrix.\nThe detrend function performs this operation on a given HiCExperiment object.\n\ndetrended_hic &lt;- detrend(hic)\ndetrended_hic\n##  `HiCExperiment` object with 471,364 contacts over 407 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 34063 \n##  scores(4): count balanced expected detrended \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n\nNote\n\n\n\nThe only change done to the HiCExperiment object by the detrend function is the addition of two extra scores:\n\nexpected\ndetrended\n\nThe interactions themselves are unmodified.\n\n\nTopological features will be visually more prominent in the O/E detrended Hi-C map.\n\n\npatchwork::wrap_plots(\n    plotMatrix(detrended_hic, use.scores = 'balanced', scale = 'log10', limits = c(-3.5, -1.2), caption = FALSE),\n    plotMatrix(detrended_hic, use.scores = 'expected', scale = 'log10', limits = c(-3.5, -1.2), caption = FALSE),\n    plotMatrix(detrended_hic, use.scores = 'detrended', scale = 'linear', limits = c(-1, 1), cmap = bwrColors(), caption = FALSE), \n    nrow = 1\n)\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nScale for detrended scores\n\n\n\n\n\nexpected scores are in linear scale and ± in the same amplitude than balanced scores;\n\ndetrended scores are in log2 scale, in general approximately centered around 0. When plotting detrended scores, scale = linear should be set to prevent the default log10 scaling.\n\n\n\n\n5.1.3 Computing autocorrelated map\nCorrelation matrices are often calculated from balanced Hi-C matrices. For instance, in genomes composed of eu- and heterochromatin, a correlation matrix can be used to reveal a checkerboard pattern emphasizing the segregation of chromatin into two A/B compartments (Lieberman-Aiden et al. (2009)).\nThe autocorrelate function is used to compute a correlation matrix of a HiCExperiment object. For each pair of interacting loci, the autocorrelated score represents the correlation between their respective interaction profiles with the rest of the genome.\n\nautocorr_hic &lt;- autocorrelate(hic)\n##  \nautocorr_hic\n##  `HiCExperiment` object with 471,364 contacts over 407 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 34063 \n##  scores(5): count balanced expected detrended autocorrelated \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: N/A \n##  metadata(0):\n\nSince these metrics represent correlation scores, they range between -1 and 1. Two loci with an autocorrelated score close to -1 have anti-correlated interaction profiles, while two loci with a autocorrelated score close to 1 are likely to interact with shared targets.\n\nsummary(scores(autocorr_hic, 'autocorrelated'))\n##     Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's \n##  -0.4156  0.0025  0.0504  0.0645  0.1036  1.0000     564\n\nCorrelated and anti-correlated loci will be visually represented in the autocorrelated Hi-C map in red and blue pixels, respectively.\n\n\n\n\n\n\nNote\n\n\n\nHere we have illustrated how to compute an autocorrelation matrix from a HiCExperiment object using the example yeast Hi-C experiment. Bear in mind that this is unusual and not very useful, as yeast chromatin is not segregated in two compartments but rather follows a Rabl conformation (Duan et al. (2010)). An example of autocorrelation map from a vertebrate Hi-C experiment (for which chromatin is segregated in A/B compartments) is shown in Chapter 10.\n\n\n\nplotMatrix(\n    autocorr_hic, \n    use.scores = 'autocorrelated', \n    scale = 'linear', \n    limits = c(-0.4, 0.4), \n    cmap = bgrColors()\n)\n\n\n\n\n\n\n\n\n\n\n\n\n\nScale for autocorrelated scores\n\n\n\n\n\nautocorrelated scores are in linear scale, in general approximately centered around 0. When plotting autocorrelated scores, scale = linear should be set to prevent the default log10 scaling.\n\nlimits should be manually set to c(-x, x) (0 &lt; x &lt;= 1) to ensure that the color range is effectively centered on 0.\n\n\n\n\n5.1.4 Despeckling (smoothing out) a contact map\nShallow-sequenced Hi-C libraries or matrices binned with an overly small bin size sometimes produce “grainy” Hi-C maps with noisy backgrounds. A grainy map may also be obtained when dividing two matrices, e.g. when computing the O/E ratio with detrend. This is particularly true for sparser long-range interactions. To overcome such limitations, HiCExperiment objects can be “despeckled” to smooth out focal speckles.\n\nhic2 &lt;- detrend(hic['II:400000-700000'])\nhic2 &lt;- despeckle(hic2, use.scores = 'detrended', focal.size = 2)\nhic2\n##  `HiCExperiment` object with 168,785 contacts over 150 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II:400,000-700,000\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 11325 \n##  scores(5): count balanced expected detrended detrended.despeckled \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: N/A \n##  metadata(0):\n\nThe added &lt;use.scores&gt;.despeckled scores correspond to scores averaged using a window, whose width is provided with the focal.size argument. This results in a smoother Hi-C heatmap, effectively removing the “speckles” observed at longer range.\n\n\nlibrary(InteractionSet)\nloops &lt;- system.file('extdata', 'S288C-loops.bedpe', package = 'HiCExperiment') |&gt; \n    import() |&gt; \n    makeGInteractionsFromGRangesPairs()\nborders &lt;- system.file('extdata', 'S288C-borders.bed', package = 'HiCExperiment') |&gt; \n    import()\npatchwork::wrap_plots(\n    plotMatrix(hic2, caption = FALSE),\n    plotMatrix(hic2, use.scores = 'detrended', scale = 'linear', limits = c(-1, 1), caption = FALSE),\n    plotMatrix(\n        hic2, \n        use.scores = 'detrended.despeckled', \n        scale = 'linear', \n        limits = c(-1, 1), \n        caption = FALSE, \n        loops = loops, \n        borders = borders\n    ),\n    nrow = 1\n)\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nScale for despeckled scores\n\n\n\ndespeckled scores are in the same scale than the scores they were computed from."
+    "text": "5.1 Operations in an individual matrix\n\n5.1.1 Balancing a raw interaction count map\nHi-C sequencing coverage is systematically affected by multiple confounding factors, e.g.  density of restriction sites, GC%, genome mappability, etc.. Overall, it generally ends up not homogenous throughout the entire genome and this leads to artifacts in un-normalized count matrices.\nTo correct for sequencing coverage heterogeneity of raw count maps, Hi-C data can be normalized using matrix balancing approaches (Cournac et al. (2012), Imakaev et al. (2012)). This is generally done directly on the disk-stored matrices using out-of-memory strategies (e.g. with cooler balance &lt;.cool&gt;). However, if contact matrix files are imported into a HiCExperiment object but no balanced scores are available, in-memory balancing can be performed using the normalize function.\n\nnormalized_hic &lt;- normalize(hic)\nnormalized_hic\n##  `HiCExperiment` object with 471,364 contacts over 407 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 34063 \n##  scores(3): count balanced ICE \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n\nNote\n\n\n\nThe only change done to the HiCExperiment object by the normalize function is the addition of a single extra ICE in scores list. The interactions themselves are unmodified.\n\n\nIt is possible to plot the different scores of the resulting object to visualize the newly computed scores. In this example, ICE scores should be nearly identical to balanced scores, which were originally imported from the disk-stored contact matrix.\n\n\npatchwork::wrap_plots(\n    plotMatrix(normalized_hic, use.scores = 'count', caption = FALSE),\n    plotMatrix(normalized_hic, use.scores = 'balanced', caption = FALSE),\n    plotMatrix(normalized_hic, use.scores = 'ICE', caption = FALSE), \n    nrow = 1\n)\n\n\n\n\n\n\n\n\n\n5.1.2 Computing observed/expected (O/E) map\nThe most prominent feature of a balanced Hi-C matrix is the strong main diagonal. This main diagonal is observed because interactions between immediate adjacent genomic loci are more prone to happen than interactions spanning longer genomic distances. This “expected” behavior is due to the polymer nature of the chromosomes being studied, and can be locally estimated using the distance-dependent interaction frequency (a.k.a. the “distance law”, or P(s)). It can be used to compute an expected matrix on interactions.\nWhen it is desirable to “mask” this polymer behavior to emphasize topological structures formed by chromosomes, one can divide a given balanced matrix by its expected matrix, i.e. calculate the observed/expected (O/E) map. This is sometimes called “detrending”, as it effectively removes the average polymer behavior from the balanced matrix.\nThe detrend function performs this operation on a given HiCExperiment object.\n\ndetrended_hic &lt;- detrend(hic)\ndetrended_hic\n##  `HiCExperiment` object with 471,364 contacts over 407 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 34063 \n##  scores(4): count balanced expected detrended \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: N/A \n##  metadata(0):\n\n\n\n\n\n\n\nNote\n\n\n\nThe only change done to the HiCExperiment object by the detrend function is the addition of two extra scores:\n\nexpected\ndetrended\n\nThe interactions themselves are unmodified.\n\n\nTopological features will be visually more prominent in the O/E detrended Hi-C map.\n\n\npatchwork::wrap_plots(\n    plotMatrix(detrended_hic, use.scores = 'balanced', scale = 'log10', limits = c(-3.5, -1.2), caption = FALSE),\n    plotMatrix(detrended_hic, use.scores = 'expected', scale = 'log10', limits = c(-3.5, -1.2), caption = FALSE),\n    plotMatrix(detrended_hic, use.scores = 'detrended', scale = 'linear', limits = c(-1, 1), cmap = bwrColors(), caption = FALSE), \n    nrow = 1\n)\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nScale for detrended scores\n\n\n\n\n\nexpected scores are in linear scale and ± in the same amplitude than balanced scores;\n\ndetrended scores are in log2 scale, in general approximately centered around 0. When plotting detrended scores, scale = linear should be set to prevent the default log10 scaling.\n\n\n\n\n5.1.3 Computing autocorrelated map\nCorrelation matrices are often calculated from balanced Hi-C matrices. For instance, in genomes composed of eu- and heterochromatin, a correlation matrix can be used to reveal a checkerboard pattern emphasizing the segregation of chromatin into two A/B compartments (Lieberman-Aiden et al. (2009)).\nThe autocorrelate function is used to compute a correlation matrix of a HiCExperiment object. For each pair of interacting loci, the autocorrelated score represents the correlation between their respective interaction profiles with the rest of the genome.\n\nautocorr_hic &lt;- autocorrelate(hic)\n##  \nautocorr_hic\n##  `HiCExperiment` object with 471,364 contacts over 407 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 34063 \n##  scores(5): count balanced expected detrended autocorrelated \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: N/A \n##  metadata(0):\n\nSince these metrics represent correlation scores, they range between -1 and 1. Two loci with an autocorrelated score close to -1 have anti-correlated interaction profiles, while two loci with a autocorrelated score close to 1 are likely to interact with shared targets.\n\nsummary(scores(autocorr_hic, 'autocorrelated'))\n##     Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's \n##  -0.4156  0.0025  0.0504  0.0645  0.1036  1.0000     564\n\nCorrelated and anti-correlated loci will be visually represented in the autocorrelated Hi-C map in red and blue pixels, respectively.\n\n\n\n\n\n\nNote\n\n\n\nHere we have illustrated how to compute an autocorrelation matrix from a HiCExperiment object using the example yeast Hi-C experiment. Bear in mind that this is unusual and not very useful, as yeast chromatin is not segregated in two compartments but rather follows a Rabl conformation (Duan et al. (2010)). An example of autocorrelation map from a vertebrate Hi-C experiment (for which chromatin is segregated in A/B compartments) is shown in Chapter 10.\n\n\n\nplotMatrix(\n    autocorr_hic, \n    use.scores = 'autocorrelated', \n    scale = 'linear', \n    limits = c(-0.4, 0.4), \n    cmap = bgrColors()\n)\n\n\n\n\n\n\n\n\n\n\n\n\n\nScale for autocorrelated scores\n\n\n\n\n\nautocorrelated scores are in linear scale, in general approximately centered around 0. When plotting autocorrelated scores, scale = linear should be set to prevent the default log10 scaling.\n\nlimits should be manually set to c(-x, x) (0 &lt; x &lt;= 1) to ensure that the color range is effectively centered on 0.\n\n\n\n\n5.1.4 Despeckling (smoothing out) a contact map\nShallow-sequenced Hi-C libraries or matrices binned with an overly small bin size sometimes produce “grainy” Hi-C maps with noisy backgrounds. A grainy map may also be obtained when dividing two matrices, e.g. when computing the O/E ratio with detrend. This is particularly true for sparser long-range interactions. To overcome such limitations, HiCExperiment objects can be “despeckled” to smooth out focal speckles.\n\nhic2 &lt;- detrend(hic['II:400000-700000'])\nhic2 &lt;- despeckle(hic2, use.scores = 'detrended', focal.size = 2)\nhic2\n##  `HiCExperiment` object with 168,785 contacts over 150 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II:400,000-700,000\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 2000 \n##  interactions: 11325 \n##  scores(5): count balanced expected detrended detrended.despeckled \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) centromeres(16) \n##  pairsFile: N/A \n##  metadata(0):\n\nThe added &lt;use.scores&gt;.despeckled scores correspond to scores averaged using a window, whose width is provided with the focal.size argument. This results in a smoother Hi-C heatmap, effectively removing the “speckles” observed at longer range.\n\n\nlibrary(InteractionSet)\nloops &lt;- system.file('extdata', 'S288C-loops.bedpe', package = 'HiCExperiment') |&gt; \n    import() |&gt; \n    makeGInteractionsFromGRangesPairs()\nborders &lt;- system.file('extdata', 'S288C-borders.bed', package = 'HiCExperiment') |&gt; \n    import()\npatchwork::wrap_plots(\n    plotMatrix(hic2, caption = FALSE),\n    plotMatrix(hic2, use.scores = 'detrended', scale = 'linear', limits = c(-1, 1), caption = FALSE),\n    plotMatrix(\n        hic2, \n        use.scores = 'detrended.despeckled', \n        scale = 'linear', \n        limits = c(-1, 1), \n        caption = FALSE, \n        loops = loops, \n        borders = borders\n    ),\n    nrow = 1\n)\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nScale for despeckled scores\n\n\n\ndespeckled scores are in the same scale than the scores they were computed from."
   },
   {
     "objectID": "matrix-centric.html#operations-between-multiple-matrices",
@@ -200,7 +207,7 @@
     "href": "interactions-centric.html#distance-laws",
     "title": "\n6  Interactions-centric analysis\n",
     "section": "\n6.1 Distance law(s)",
-    "text": "6.1 Distance law(s)\n\n6.1.1 P(s) from a single .pairs file\nDistance laws are generally computed directly from .pairs files. This is because the .pairs files are at 1-bp resolution whereas the contact matrices (for example from .cool files) are binned at a minimum resolution.\nAn example .pairs file can be fetched from the ExperimentHub database using the HiContactsData package.\n\nlibrary(HiCExperiment)\nlibrary(HiContactsData)\npairsf &lt;- HiContactsData('yeast_wt', 'pairs.gz')\npf &lt;- PairsFile(pairsf)\n\n\npf\n##  PairsFile object\n##  resource: /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753\n\n\n\n\n\n\n\nReminder!\n\n\n\nPairsFile connections can be imported directly into a GInteractions object with import():\n\nimport(pf)\n##  GInteractions object with 471364 interactions and 3 metadata columns:\n##             seqnames1   ranges1     seqnames2   ranges2 |     frag1     frag2\n##                 &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt; | &lt;numeric&gt; &lt;numeric&gt;\n##         [1]        II       105 ---        II     48548 |      1358      1681\n##         [2]        II       113 ---        II     45003 |      1358      1658\n##         [3]        II       119 ---        II    687251 |      1358      5550\n##         [4]        II       160 ---        II     26124 |      1358      1510\n##         [5]        II       169 ---        II     39052 |      1358      1613\n##         ...       ...       ... ...       ...       ... .       ...       ...\n##    [471360]        II    808605 ---        II    809683 |      6316      6320\n##    [471361]        II    808609 ---        II    809917 |      6316      6324\n##    [471362]        II    808617 ---        II    809506 |      6316      6319\n##    [471363]        II    809447 ---        II    809685 |      6319      6321\n##    [471364]        II    809472 ---        II    809675 |      6319      6320\n##              distance\n##             &lt;integer&gt;\n##         [1]     48443\n##         [2]     44890\n##         [3]    687132\n##         [4]     25964\n##         [5]     38883\n##         ...       ...\n##    [471360]      1078\n##    [471361]      1308\n##    [471362]       889\n##    [471363]       238\n##    [471364]       203\n##    -------\n##    regions: 549331 ranges and 0 metadata columns\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\n\n\nWe can compute a P(s) per chromosome from this .pairs file using the distanceLaw function.\n\nlibrary(HiContacts)\nps &lt;- distanceLaw(pf, by_chr = TRUE) \n##  Importing pairs file /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753 in memory. This may take a while...\nps\n##  # A tibble: 115 × 6\n##    chr   binned_distance          p     norm_p norm_p_unity slope\n##    &lt;chr&gt;           &lt;dbl&gt;      &lt;dbl&gt;      &lt;dbl&gt;        &lt;dbl&gt; &lt;dbl&gt;\n##  1 II                 14 0.00000212 0.00000106         2.27  0   \n##  2 II                 16 0.0000170  0.0000170         36.4   1.56\n##  3 II                 17 0.0000361  0.0000180         38.6   1.55\n##  4 II                 19 0.0000424  0.0000212         45.5   1.55\n##  5 II                 21 0.0000467  0.0000233         50.0   1.54\n##  6 II                 23 0.0000870  0.0000290         62.1   1.53\n##  # ℹ 109 more rows\n\n\n\n\n\n\n\nNote\n\n\n\nBecause this is a toy dataset, contacts are only provided for the chromosome II.\n\ntable(ps$chr)\n##  \n##   II \n##  115\n\n\n\nThe plotPs() and plotPsSlope() functions are convenient ggplot2-based functions with pre-configured settings optimized for P(s) visualization.\n\nlibrary(ggplot2)\nplotPs(ps, aes(x = binned_distance, y = norm_p, color = chr))\n##  Warning: Removed 67 rows containing missing values (`geom_line()`).\n\n\n\n\n\n\nplotPsSlope(ps, aes(x = binned_distance, y = slope, color = chr))\n##  Warning: Removed 67 rows containing missing values (`geom_line()`).\n\n\n\n\n\n\n\n\n6.1.2 P(s) for multiple .pairs files\nLet’s first import a second example dataset. We’ll import pairs identified in a eco1 yeast mutant.\n\neco1_pairsf &lt;- HiContactsData('yeast_eco1', 'pairs.gz')\neco1_pf &lt;- PairsFile(eco1_pairsf)\n\n\neco1_ps &lt;- distanceLaw(eco1_pf, by_chr = TRUE) \n##  Importing pairs file /github/home/.cache/R/ExperimentHub/21b95aa8e2b4_7755 in memory. This may take a while...\neco1_ps\n##  # A tibble: 115 × 6\n##    chr   binned_distance          p     norm_p norm_p_unity slope\n##    &lt;chr&gt;           &lt;dbl&gt;      &lt;dbl&gt;      &lt;dbl&gt;        &lt;dbl&gt; &lt;dbl&gt;\n##  1 II                 14 0.00000201 0.00000100        0.660  0   \n##  2 II                 16 0.0000221  0.0000221        14.5    1.46\n##  3 II                 17 0.0000492  0.0000246        16.2    1.46\n##  4 II                 19 0.0000412  0.0000206        13.5    1.45\n##  5 II                 21 0.0000653  0.0000326        21.5    1.45\n##  6 II                 23 0.0000803  0.0000268        17.6    1.44\n##  # ℹ 109 more rows\n\nA little data wrangling can help plotting the distance laws for 2 different samples in the same plot.\n\nlibrary(dplyr)\nmerged_ps &lt;- rbind(\n    ps |&gt; mutate(sample = 'WT'), \n    eco1_ps |&gt; mutate(sample = 'eco1')\n)\nplotPs(merged_ps, aes(x = binned_distance, y = norm_p, color = sample, linetype = chr)) + \n    scale_color_manual(values = c('#c6c6c6', '#ca0000'))\n##  Warning: Removed 134 rows containing missing values (`geom_line()`).\n\n\n\n\n\n\nplotPsSlope(merged_ps, aes(x = binned_distance, y = slope, color = sample, linetype = chr)) + \n    scale_color_manual(values = c('#c6c6c6', '#ca0000'))\n##  Warning: Removed 135 rows containing missing values (`geom_line()`).\n\n\n\n\n\n\n\n\n6.1.3 P(s) from HiCExperiment objects\nAlternatively, distance laws can be computed from binned matrices directly by providing HiCExperiment objects. For deeply sequenced datasets, this can be significantly faster than when using original .pairs files, but the smoothness of the resulting curves will be greatly impacted, notably at short distances.\n\nps_from_hic &lt;- distanceLaw(hic, by_chr = TRUE) \n##  pairsFile not specified. The P(s) curve will be an approximation.\nplotPs(ps_from_hic, aes(x = binned_distance, y = norm_p))\n##  Warning: Removed 9 rows containing missing values (`geom_line()`).\n\n\n\n\n\n\nplotPsSlope(ps_from_hic, aes(x = binned_distance, y = slope))\n##  Warning: Removed 8 rows containing missing values (`geom_line()`)."
+    "text": "6.1 Distance law(s)\n\n6.1.1 P(s) from a single .pairs file\nDistance laws are generally computed directly from .pairs files. This is because the .pairs files are at 1-bp resolution whereas the contact matrices (for example from .cool files) are binned at a minimum resolution.\nAn example .pairs file can be fetched from the ExperimentHub database using the HiContactsData package.\n\nlibrary(HiCExperiment)\nlibrary(HiContactsData)\npairsf &lt;- HiContactsData('yeast_wt', 'pairs.gz')\npf &lt;- PairsFile(pairsf)\n\n\npf\n##  PairsFile object\n##  resource: /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753\n\n\n\n\n\n\n\nReminder!\n\n\n\nPairsFile connections can be imported directly into a GInteractions object with import():\n\nimport(pf)\n##  GInteractions object with 471364 interactions and 3 metadata columns:\n##             seqnames1   ranges1     seqnames2   ranges2 |     frag1     frag2\n##                 &lt;Rle&gt; &lt;IRanges&gt;         &lt;Rle&gt; &lt;IRanges&gt; | &lt;numeric&gt; &lt;numeric&gt;\n##         [1]        II       105 ---        II     48548 |      1358      1681\n##         [2]        II       113 ---        II     45003 |      1358      1658\n##         [3]        II       119 ---        II    687251 |      1358      5550\n##         [4]        II       160 ---        II     26124 |      1358      1510\n##         [5]        II       169 ---        II     39052 |      1358      1613\n##         ...       ...       ... ...       ...       ... .       ...       ...\n##    [471360]        II    808605 ---        II    809683 |      6316      6320\n##    [471361]        II    808609 ---        II    809917 |      6316      6324\n##    [471362]        II    808617 ---        II    809506 |      6316      6319\n##    [471363]        II    809447 ---        II    809685 |      6319      6321\n##    [471364]        II    809472 ---        II    809675 |      6319      6320\n##              distance\n##             &lt;integer&gt;\n##         [1]     48443\n##         [2]     44890\n##         [3]    687132\n##         [4]     25964\n##         [5]     38883\n##         ...       ...\n##    [471360]      1078\n##    [471361]      1308\n##    [471362]       889\n##    [471363]       238\n##    [471364]       203\n##    -------\n##    regions: 549331 ranges and 0 metadata columns\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\n\n\nWe can compute a P(s) per chromosome from this .pairs file using the distanceLaw function.\n\nlibrary(HiContacts)\nps &lt;- distanceLaw(pf, by_chr = TRUE) \n##  Importing pairs file /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753 in memory. This may take a while...\nps\n##  # A tibble: 115 × 6\n##    chr   binned_distance          p     norm_p norm_p_unity slope\n##    &lt;chr&gt;           &lt;dbl&gt;      &lt;dbl&gt;      &lt;dbl&gt;        &lt;dbl&gt; &lt;dbl&gt;\n##  1 II                 14 0.00000212 0.00000106         2.27  0   \n##  2 II                 16 0.0000170  0.0000170         36.4   1.56\n##  3 II                 17 0.0000361  0.0000180         38.6   1.55\n##  4 II                 19 0.0000424  0.0000212         45.5   1.55\n##  5 II                 21 0.0000467  0.0000233         50.0   1.54\n##  6 II                 23 0.0000870  0.0000290         62.1   1.53\n##  # ℹ 109 more rows\n\n\n\n\n\n\n\nNote\n\n\n\nBecause this is a toy dataset, contacts are only provided for the chromosome II.\n\ntable(ps$chr)\n##  \n##   II \n##  115\n\n\n\nThe plotPs() and plotPsSlope() functions are convenient ggplot2-based functions with pre-configured settings optimized for P(s) visualization.\n\nlibrary(ggplot2)\nplotPs(ps, aes(x = binned_distance, y = norm_p, color = chr))\n##  Warning: Removed 67 rows containing missing values (`geom_line()`).\n\n\n\n\n\n\nplotPsSlope(ps, aes(x = binned_distance, y = slope, color = chr))\n##  Warning: Removed 67 rows containing missing values (`geom_line()`).\n\n\n\n\n\n\n\n\n6.1.2 P(s) for multiple .pairs files\nLet’s first import a second example dataset. We’ll import pairs identified in a eco1 yeast mutant.\n\neco1_pairsf &lt;- HiContactsData('yeast_eco1', 'pairs.gz')\neco1_pf &lt;- PairsFile(eco1_pairsf)\n\n\neco1_ps &lt;- distanceLaw(eco1_pf, by_chr = TRUE) \n##  Importing pairs file /github/home/.cache/R/ExperimentHub/21fb251da216_7755 in memory. This may take a while...\neco1_ps\n##  # A tibble: 115 × 6\n##    chr   binned_distance          p     norm_p norm_p_unity slope\n##    &lt;chr&gt;           &lt;dbl&gt;      &lt;dbl&gt;      &lt;dbl&gt;        &lt;dbl&gt; &lt;dbl&gt;\n##  1 II                 14 0.00000201 0.00000100        0.660  0   \n##  2 II                 16 0.0000221  0.0000221        14.5    1.46\n##  3 II                 17 0.0000492  0.0000246        16.2    1.46\n##  4 II                 19 0.0000412  0.0000206        13.5    1.45\n##  5 II                 21 0.0000653  0.0000326        21.5    1.45\n##  6 II                 23 0.0000803  0.0000268        17.6    1.44\n##  # ℹ 109 more rows\n\nA little data wrangling can help plotting the distance laws for 2 different samples in the same plot.\n\nlibrary(dplyr)\nmerged_ps &lt;- rbind(\n    ps |&gt; mutate(sample = 'WT'), \n    eco1_ps |&gt; mutate(sample = 'eco1')\n)\nplotPs(merged_ps, aes(x = binned_distance, y = norm_p, color = sample, linetype = chr)) + \n    scale_color_manual(values = c('#c6c6c6', '#ca0000'))\n##  Warning: Removed 134 rows containing missing values (`geom_line()`).\n\n\n\n\n\n\nplotPsSlope(merged_ps, aes(x = binned_distance, y = slope, color = sample, linetype = chr)) + \n    scale_color_manual(values = c('#c6c6c6', '#ca0000'))\n##  Warning: Removed 135 rows containing missing values (`geom_line()`).\n\n\n\n\n\n\n\n\n6.1.3 P(s) from HiCExperiment objects\nAlternatively, distance laws can be computed from binned matrices directly by providing HiCExperiment objects. For deeply sequenced datasets, this can be significantly faster than when using original .pairs files, but the smoothness of the resulting curves will be greatly impacted, notably at short distances.\n\nps_from_hic &lt;- distanceLaw(hic, by_chr = TRUE) \n##  pairsFile not specified. The P(s) curve will be an approximation.\nplotPs(ps_from_hic, aes(x = binned_distance, y = norm_p))\n##  Warning: Removed 9 rows containing missing values (`geom_line()`).\n\n\n\n\n\n\nplotPsSlope(ps_from_hic, aes(x = binned_distance, y = slope))\n##  Warning: Removed 8 rows containing missing values (`geom_line()`)."
   },
   {
     "objectID": "interactions-centric.html#cistrans-ratios",
@@ -221,7 +228,7 @@
     "href": "interactions-centric.html#scalograms",
     "title": "\n6  Interactions-centric analysis\n",
     "section": "\n6.4 Scalograms",
-    "text": "6.4 Scalograms\nScalograms were introduced in Lioy et al. (2018) to investigate distance-dependent contact frequencies for individual genomic bins along chromosomes.\nTo generate a scalogram, one needs to provide a HiCExperiment object with a valid associated pairsFile.\n\npairsFile(hic) &lt;- pairsf\nscalo &lt;- scalogram(hic) \n##  Importing pairs file /github/home/.cache/R/ExperimentHub/1a594e4de0cf_7753 in memory. This may take a while...\nplotScalogram(scalo |&gt; filter(chr == 'II'), ylim = c(1e3, 1e5))\n\n\n\n\n\n\n\nSeveral scalograms can be plotted together to compare distance-dependent contact frequencies along a given chromosome in different samples.\n\n\neco1_hic &lt;- import(\n    CoolFile(HiContactsData('yeast_eco1', 'mcool')), \n    focus = 'II', \n    resolution = 2000\n)\n##  see ?HiContactsData and browseVignettes('HiContactsData') for documentation\n##  loading from cache\neco1_pairsf &lt;- HiContactsData('yeast_eco1', 'pairs.gz')\n##  see ?HiContactsData and browseVignettes('HiContactsData') for documentation\n##  loading from cache\npairsFile(eco1_hic) &lt;- eco1_pairsf\neco1_scalo &lt;- scalogram(eco1_hic) \n##  Importing pairs file /github/home/.cache/R/ExperimentHub/21b95aa8e2b4_7755 in memory. This may take a while...\nmerged_scalo &lt;- rbind(\n    scalo |&gt; mutate(sample = 'WT'), \n    eco1_scalo |&gt; mutate(sample = 'eco1')\n)\nplotScalogram(merged_scalo |&gt; filter(chr == 'II'), ylim = c(1e3, 1e5)) + \n    facet_grid(~sample)\n\n\n\n\n\n\n\n\nThis example points out the overall longer interactions within the long arm of the chromosome II in an eco1 mutant."
+    "text": "6.4 Scalograms\nScalograms were introduced in Lioy et al. (2018) to investigate distance-dependent contact frequencies for individual genomic bins along chromosomes.\nTo generate a scalogram, one needs to provide a HiCExperiment object with a valid associated pairsFile.\n\npairsFile(hic) &lt;- pairsf\nscalo &lt;- scalogram(hic) \n##  Importing pairs file /github/home/.cache/R/ExperimentHub/1a9a1c034d7_7753 in memory. This may take a while...\nplotScalogram(scalo |&gt; filter(chr == 'II'), ylim = c(1e3, 1e5))\n\n\n\n\n\n\n\nSeveral scalograms can be plotted together to compare distance-dependent contact frequencies along a given chromosome in different samples.\n\n\neco1_hic &lt;- import(\n    CoolFile(HiContactsData('yeast_eco1', 'mcool')), \n    focus = 'II', \n    resolution = 2000\n)\n##  see ?HiContactsData and browseVignettes('HiContactsData') for documentation\n##  loading from cache\neco1_pairsf &lt;- HiContactsData('yeast_eco1', 'pairs.gz')\n##  see ?HiContactsData and browseVignettes('HiContactsData') for documentation\n##  loading from cache\npairsFile(eco1_hic) &lt;- eco1_pairsf\neco1_scalo &lt;- scalogram(eco1_hic) \n##  Importing pairs file /github/home/.cache/R/ExperimentHub/21fb251da216_7755 in memory. This may take a while...\nmerged_scalo &lt;- rbind(\n    scalo |&gt; mutate(sample = 'WT'), \n    eco1_scalo |&gt; mutate(sample = 'eco1')\n)\nplotScalogram(merged_scalo |&gt; filter(chr == 'II'), ylim = c(1e3, 1e5)) + \n    facet_grid(~sample)\n\n\n\n\n\n\n\n\nThis example points out the overall longer interactions within the long arm of the chromosome II in an eco1 mutant."
   },
   {
     "objectID": "topological-features.html",
@@ -242,14 +249,14 @@
     "href": "topological-features.html#topological-domains",
     "title": "\n7  Finding topological features in Hi-C\n",
     "section": "\n7.2 Topological domains",
-    "text": "7.2 Topological domains\nTopological domains (a.k.a. Topologically Associating Domains, TADs, isolated neighborhoods, contact domains, …) refer to local chromosomal segments (e.b. roughly ≤ 1Mb in mammal genomes) which preferentially self-interact, in a constrained manner. They are demarcated by domain boundaries.\n\n\n\n\nThey are generally conserved across cell types and species (Schmitt et al. (2016)), typically correlate with units of DNA replication (Pope et al. (2014)), and could play a role during development (Stadhouders, Filion, and Graf (2019)).\n\n7.2.1 Computing diamond insulation score\nSeveral approaches exist to annotate topological domains (Sefer (2022)). Several packages in R implement some of these functionalities, e.g. spectralTAD or TADcompare.\nHiContacts offers a simple getDiamondInsulation function which computes the diamond insulation score (Crane et al. (2015)). This score quantifies average interaction frequency in an insulation window (of a certain window_size) sliding along contact matrices at a chosen resolution.\n\n# - Compute insulation score\nbpparam &lt;- SerialParam(progressbar = FALSE)\nhic &lt;- zoom(microC, 5000) |&gt; \n    refocus('chr17:60000001-83257441') |&gt;\n    getDiamondInsulation(window_size = 100000, BPPARAM = bpparam) |&gt; \n    getBorders()\n##  Going through preflight checklist...\n##  Scan each window and compute diamond insulation score...\n##  Annotating diamond score prominence for each window...\n\nhic\n##  `HiCExperiment` object with 2,156,222 contacts over 4,652 regions \n##  -------\n##  fileName: \"/usr/local/lib/R/site-library/OHCA/extdata/chr17.mcool\" \n##  focus: \"chr17:60,000,001-83,257,441\" \n##  resolutions(3): 5000 100000 250000\n##  active resolution: 5000 \n##  interactions: 2156044 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(21) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(1): insulation\n\n\n\n\n\n\n\nNote\n\n\n\nThe getDiamondInsulation function can be parallelized over multiple threads by specifying the Bioconductor generic BPPARAM argument.\n\n\n\n\n\n\n\n\nNote\n\n\n\ngetDiamondInsulation() is an endomorphism: it returns the original object, enriched with two new pieces of information:\n\nA borders topologicalFeatures:\n\n\ntopologicalFeatures(hic, \"borders\")\n##  GRanges object with 21 ranges and 1 metadata column:\n##           seqnames            ranges strand |     score\n##              &lt;Rle&gt;         &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt;\n##    strong    chr17 60105001-60110000      * |  0.574760\n##      weak    chr17 60210001-60215000      * |  0.414425\n##      weak    chr17 61415001-61420000      * |  0.346668\n##    strong    chr17 61500001-61505000      * |  0.544336\n##      weak    chr17 62930001-62935000      * |  0.399794\n##       ...      ...               ...    ... .       ...\n##      weak    chr17 78395001-78400000      * |  0.235613\n##      weak    chr17 79065001-79070000      * |  0.236535\n##      weak    chr17 80155001-80160000      * |  0.284855\n##      weak    chr17 81735001-81740000      * |  0.497478\n##    strong    chr17 81840001-81845000      * |  1.395949\n##    -------\n##    seqinfo: 1 sequence from an unspecified genome\n\n\nThe calculated insulation scores stored in metadata:\n\n\nmetadata(hic)$insulation\n##  GRanges object with 4611 ranges and 8 metadata columns:\n##                            seqnames            ranges strand |    bin_id\n##                               &lt;Rle&gt;         &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt;\n##    chr17_60100001_60105000    chr17 60100001-60105000      * |     12020\n##    chr17_60105001_60110000    chr17 60105001-60110000      * |     12021\n##    chr17_60110001_60115000    chr17 60110001-60115000      * |     12022\n##    chr17_60115001_60120000    chr17 60115001-60120000      * |     12023\n##    chr17_60120001_60125000    chr17 60120001-60125000      * |     12024\n##                        ...      ...               ...    ... .       ...\n##    chr17_83130001_83135000    chr17 83130001-83135000      * |     16626\n##    chr17_83135001_83140000    chr17 83135001-83140000      * |     16627\n##    chr17_83140001_83145000    chr17 83140001-83145000      * |     16628\n##    chr17_83145001_83150000    chr17 83145001-83150000      * |     16629\n##    chr17_83150001_83155000    chr17 83150001-83155000      * |     16630\n##                               weight   chr    center     score insulation\n##                            &lt;numeric&gt; &lt;Rle&gt; &lt;integer&gt; &lt;numeric&gt;  &lt;numeric&gt;\n##    chr17_60100001_60105000 0.0406489 chr17  60102500  0.188061  -0.750142\n##    chr17_60105001_60110000 0.0255539 chr17  60107500  0.180860  -0.806466\n##    chr17_60110001_60115000       NaN chr17  60112500  0.196579  -0.686232\n##    chr17_60115001_60120000       NaN chr17  60117500  0.216039  -0.550046\n##    chr17_60120001_60125000       NaN chr17  60122500  0.230035  -0.459489\n##                        ...       ...   ...       ...       ...        ...\n##    chr17_83130001_83135000 0.0314684 chr17  83132500  0.262191  -0.270723\n##    chr17_83135001_83140000 0.0307197 chr17  83137500  0.240779  -0.393632\n##    chr17_83140001_83145000 0.0322810 chr17  83142500  0.219113  -0.529664\n##    chr17_83145001_83150000 0.0280840 chr17  83147500  0.199645  -0.663900\n##    chr17_83150001_83155000 0.0272775 chr17  83152500  0.180434  -0.809873\n##                                  min prominence\n##                            &lt;logical&gt;  &lt;numeric&gt;\n##    chr17_60100001_60105000     FALSE         NA\n##    chr17_60105001_60110000      TRUE    0.57476\n##    chr17_60110001_60115000     FALSE         NA\n##    chr17_60115001_60120000     FALSE         NA\n##    chr17_60120001_60125000     FALSE         NA\n##                        ...       ...        ...\n##    chr17_83130001_83135000     FALSE         NA\n##    chr17_83135001_83140000     FALSE         NA\n##    chr17_83140001_83145000     FALSE         NA\n##    chr17_83145001_83150000     FALSE         NA\n##    chr17_83150001_83155000     FALSE         NA\n##    -------\n##    seqinfo: 1 sequence from an unspecified genome\n\n\n\n\n7.2.2 Exporting insulation scores tracks\nTo save the diamond insulation scores (as a bigwig file) and the borders (as a bed file), the export function can be used:\n\ncoverage(metadata(hic)$insulation, weight = 'insulation') |&gt; export('microC_insulation.bw')\ntopologicalFeatures(hic, \"borders\") |&gt; export('microC_borders.bed')\n\n\n7.2.3 Visualizing chromatin domains\nInsulation tracks should be visualized in a dedicated genome browser.\nThat being said, it is possible to visualize a genome track in R besides the matching Hi-C contact matrix.\n\nhic &lt;- zoom(hic, 100000)\np1 &lt;- plotMatrix(\n    hic, \n    use.scores = 'balanced', \n    limits = c(-3.5, -1),\n    borders = topologicalFeatures(hic, \"borders\"),\n    caption = FALSE\n)\ninsulation &lt;- coverage(metadata(hic)$insulation, weight = 'insulation')[[1]]\ninsulation_df &lt;- tibble(pos = cumsum(runLength(insulation)), insulation = runValue(insulation))\np2 &lt;- ggplot(insulation_df, aes(x = pos, y = insulation)) + \n    geom_area() + \n    theme_void() + \n    coord_cartesian(expand = FALSE) + \n    labs(x = \"Genomic position\", y = \"Diamond insulation score\")\nwrap_plots(p1, p2, ncol = 1, heights = c(10, 1))\n\n\n\n\n\n\n\nLocal minima in the diamond insulation score displayed below the Hi-C contact matrix are identified using the getBorders() function, which automatically estimates a minimum threshold. These local minima correspond to borders and are visually depicted on the Hi-C map by blue diamonds."
+    "text": "7.2 Topological domains\nTopological domains (a.k.a. Topologically Associating Domains, TADs, isolated neighborhoods, contact domains, …) refer to local chromosomal segments (e.b. roughly ≤ 1Mb in mammal genomes) which preferentially self-interact, in a constrained manner. They are demarcated by domain boundaries.\n\n\n\n\nThey are generally conserved across cell types and species (Schmitt et al. (2016)), typically correlate with units of DNA replication (Pope et al. (2014)), and could play a role during development (Stadhouders et al. (2019)).\n\n7.2.1 Computing diamond insulation score\nSeveral approaches exist to annotate topological domains (Sefer (2022)). Several packages in R implement some of these functionalities, e.g. spectralTAD or TADcompare.\nHiContacts offers a simple getDiamondInsulation function which computes the diamond insulation score (Crane et al. (2015)). This score quantifies average interaction frequency in an insulation window (of a certain window_size) sliding along contact matrices at a chosen resolution.\n\n# - Compute insulation score\nbpparam &lt;- SerialParam(progressbar = FALSE)\nhic &lt;- zoom(microC, 5000) |&gt; \n    refocus('chr17:60000001-83257441') |&gt;\n    getDiamondInsulation(window_size = 100000, BPPARAM = bpparam) |&gt; \n    getBorders()\n##  Going through preflight checklist...\n##  Scan each window and compute diamond insulation score...\n##  Annotating diamond score prominence for each window...\n\nhic\n##  `HiCExperiment` object with 2,156,222 contacts over 4,652 regions \n##  -------\n##  fileName: \"/usr/local/lib/R/site-library/OHCA/extdata/chr17.mcool\" \n##  focus: \"chr17:60,000,001-83,257,441\" \n##  resolutions(3): 5000 100000 250000\n##  active resolution: 5000 \n##  interactions: 2156044 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(21) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(1): insulation\n\n\n\n\n\n\n\nNote\n\n\n\nThe getDiamondInsulation function can be parallelized over multiple threads by specifying the Bioconductor generic BPPARAM argument.\n\n\n\n\n\n\n\n\nNote\n\n\n\ngetDiamondInsulation() is an endomorphism: it returns the original object, enriched with two new pieces of information:\n\nA borders topologicalFeatures:\n\n\ntopologicalFeatures(hic, \"borders\")\n##  GRanges object with 21 ranges and 1 metadata column:\n##           seqnames            ranges strand |     score\n##              &lt;Rle&gt;         &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt;\n##    strong    chr17 60105001-60110000      * |  0.574760\n##      weak    chr17 60210001-60215000      * |  0.414425\n##      weak    chr17 61415001-61420000      * |  0.346668\n##    strong    chr17 61500001-61505000      * |  0.544336\n##      weak    chr17 62930001-62935000      * |  0.399794\n##       ...      ...               ...    ... .       ...\n##      weak    chr17 78395001-78400000      * |  0.235613\n##      weak    chr17 79065001-79070000      * |  0.236535\n##      weak    chr17 80155001-80160000      * |  0.284855\n##      weak    chr17 81735001-81740000      * |  0.497478\n##    strong    chr17 81840001-81845000      * |  1.395949\n##    -------\n##    seqinfo: 1 sequence from an unspecified genome\n\n\nThe calculated insulation scores stored in metadata:\n\n\nmetadata(hic)$insulation\n##  GRanges object with 4611 ranges and 8 metadata columns:\n##                            seqnames            ranges strand |    bin_id\n##                               &lt;Rle&gt;         &lt;IRanges&gt;  &lt;Rle&gt; | &lt;numeric&gt;\n##    chr17_60100001_60105000    chr17 60100001-60105000      * |     12020\n##    chr17_60105001_60110000    chr17 60105001-60110000      * |     12021\n##    chr17_60110001_60115000    chr17 60110001-60115000      * |     12022\n##    chr17_60115001_60120000    chr17 60115001-60120000      * |     12023\n##    chr17_60120001_60125000    chr17 60120001-60125000      * |     12024\n##                        ...      ...               ...    ... .       ...\n##    chr17_83130001_83135000    chr17 83130001-83135000      * |     16626\n##    chr17_83135001_83140000    chr17 83135001-83140000      * |     16627\n##    chr17_83140001_83145000    chr17 83140001-83145000      * |     16628\n##    chr17_83145001_83150000    chr17 83145001-83150000      * |     16629\n##    chr17_83150001_83155000    chr17 83150001-83155000      * |     16630\n##                               weight   chr    center     score insulation\n##                            &lt;numeric&gt; &lt;Rle&gt; &lt;integer&gt; &lt;numeric&gt;  &lt;numeric&gt;\n##    chr17_60100001_60105000 0.0406489 chr17  60102500  0.188061  -0.750142\n##    chr17_60105001_60110000 0.0255539 chr17  60107500  0.180860  -0.806466\n##    chr17_60110001_60115000       NaN chr17  60112500  0.196579  -0.686232\n##    chr17_60115001_60120000       NaN chr17  60117500  0.216039  -0.550046\n##    chr17_60120001_60125000       NaN chr17  60122500  0.230035  -0.459489\n##                        ...       ...   ...       ...       ...        ...\n##    chr17_83130001_83135000 0.0314684 chr17  83132500  0.262191  -0.270723\n##    chr17_83135001_83140000 0.0307197 chr17  83137500  0.240779  -0.393632\n##    chr17_83140001_83145000 0.0322810 chr17  83142500  0.219113  -0.529664\n##    chr17_83145001_83150000 0.0280840 chr17  83147500  0.199645  -0.663900\n##    chr17_83150001_83155000 0.0272775 chr17  83152500  0.180434  -0.809873\n##                                  min prominence\n##                            &lt;logical&gt;  &lt;numeric&gt;\n##    chr17_60100001_60105000     FALSE         NA\n##    chr17_60105001_60110000      TRUE    0.57476\n##    chr17_60110001_60115000     FALSE         NA\n##    chr17_60115001_60120000     FALSE         NA\n##    chr17_60120001_60125000     FALSE         NA\n##                        ...       ...        ...\n##    chr17_83130001_83135000     FALSE         NA\n##    chr17_83135001_83140000     FALSE         NA\n##    chr17_83140001_83145000     FALSE         NA\n##    chr17_83145001_83150000     FALSE         NA\n##    chr17_83150001_83155000     FALSE         NA\n##    -------\n##    seqinfo: 1 sequence from an unspecified genome\n\n\n\n\n7.2.2 Exporting insulation scores tracks\nTo save the diamond insulation scores (as a bigwig file) and the borders (as a bed file), the export function can be used:\n\ncoverage(metadata(hic)$insulation, weight = 'insulation') |&gt; export('microC_insulation.bw')\ntopologicalFeatures(hic, \"borders\") |&gt; export('microC_borders.bed')\n\n\n7.2.3 Visualizing chromatin domains\nInsulation tracks should be visualized in a dedicated genome browser.\nThat being said, it is possible to visualize a genome track in R besides the matching Hi-C contact matrix.\n\nhic &lt;- zoom(hic, 100000)\np1 &lt;- plotMatrix(\n    hic, \n    use.scores = 'balanced', \n    limits = c(-3.5, -1),\n    borders = topologicalFeatures(hic, \"borders\"),\n    caption = FALSE\n)\ninsulation &lt;- coverage(metadata(hic)$insulation, weight = 'insulation')[[1]]\ninsulation_df &lt;- tibble(pos = cumsum(runLength(insulation)), insulation = runValue(insulation))\np2 &lt;- ggplot(insulation_df, aes(x = pos, y = insulation)) + \n    geom_area() + \n    theme_void() + \n    coord_cartesian(expand = FALSE) + \n    labs(x = \"Genomic position\", y = \"Diamond insulation score\")\nwrap_plots(p1, p2, ncol = 1, heights = c(10, 1))\n\n\n\n\n\n\n\nLocal minima in the diamond insulation score displayed below the Hi-C contact matrix are identified using the getBorders() function, which automatically estimates a minimum threshold. These local minima correspond to borders and are visually depicted on the Hi-C map by blue diamonds."
   },
   {
     "objectID": "topological-features.html#chromatin-loops",
     "href": "topological-features.html#chromatin-loops",
     "title": "\n7  Finding topological features in Hi-C\n",
     "section": "\n7.3 Chromatin loops",
-    "text": "7.3 Chromatin loops\n\n7.3.1 chromosight\n\nChromatin loops, dots, or contacts, refer to a strong increase of interaction frequency between a pair of two genomic loci. They correspond to focal “dots” on a Hi-C map. Relying on computer vision algorithms, chromosight uses this property to annotate chromatin loops in a Hi-C map (Matthey-Doret et al. (2020)). chromosight is a standalone python package and is made available in R through the HiCool-managed conda environment with the getLoops() function.\n\n7.3.1.1 Identifying loops\n\nhic &lt;- HiCool::getLoops(microC, resolution = 5000)\n\nhic\n## `HiCExperiment` object with 917,156 contacts over 100 regions\n## -------\n## fileName: \"/home/rsg/.cache/R/fourDNData/4d434d8538a0_4DNFI9FVHJZQ.mcool\"\n## focus: \"chr17:63,000,001-63,500,000\"\n## resolutions(13): 1000 2000 ... 5000000 10000000\n## active resolution: 5000\n## interactions: 5047\n## scores(2): count balanced\n## topologicalFeatures: compartments(0) borders(0) loops(66411) viewpoints(0)\n## pairsFile: N/A\n## metadata(1): chromosight_args\n\n\n\n\n\n\n\nNote\n\n\n\ngetLoops() is an endomorphism: it returns the original object, enriched with two new pieces of information:\n\nA loops topologicalFeatures:\n\n\ntopologicalFeatures(hic, \"loops\")\n## GInteractions object with 66411 interactions and 5 metadata columns:\n##           seqnames1           ranges1     seqnames2           ranges2 |   bin_id1   bin_id2     score      ## pvalue    qvalue\n##               &lt;Rle&gt;         &lt;IRanges&gt;         &lt;Rle&gt;         &lt;IRanges&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;   ## &lt;numeric&gt; &lt;numeric&gt;\n##       [1]      chr1     775001-780000 ---      chr1     850001-855000 |       155       170  0.334586 2.## 15995e-05 2.162e-05\n##       [2]      chr1     775001-780000 ---      chr1     865001-870000 |       155       173  0.403336 1.## 62900e-07 1.669e-07\n##       [3]      chr1     865001-870000 ---      chr1     890001-895000 |       173       178  0.337344 1.## 91400e-07 1.957e-07\n##       [4]      chr1     910001-915000 ---      chr1     955001-960000 |       182       191  0.639725 0.## 00000e+00 0.000e+00\n##       [5]      chr1     910001-915000 ---      chr1   1055001-1060000 |       182       211  0.521699 0.## 00000e+00 0.000e+00\n##       ...       ...               ... ...       ...               ... .       ...       ...       ...       ##   ...       ...\n##   [66407]      chrY 19570001-19575000 ---      chrY 19720001-19725000 |    610133    610163  0.315529    3.## 30e-08  3.55e-08\n##   [66408]      chrY 19705001-19710000 ---      chrY 19730001-19735000 |    610160    610165  0.708753    0.## 00e+00  0.00e+00\n##   [66409]      chrY 19765001-19770000 ---      chrY 19800001-19805000 |    610172    610179  0.373635    1.## 10e-09  1.40e-09\n##   [66410]      chrY 20555001-20560000 ---      chrY 20645001-20650000 |    610330    610348  0.603308    0.## 00e+00  0.00e+00\n##   [66411]      chrY 21015001-21020000 ---      chrY 21055001-21060000 |    610422    610430  0.394614    9.## 12e-08  9.45e-08\n##   -------\n##   regions: 84171 ranges and 0 metadata columns\n##   seqinfo: 24 sequences from an unspecified genome; no seqlengths\n\n\nThe arguments used by chromosight, stored in metadata:\n\n\nmetadata(hic)$chromosight_args\n## $`--pattern`\n## [1] \"loops\"\n## \n## $`--dump`\n## [1] \"/data/.cache/R//RtmpSaRwiZ\"\n## \n## $`--inter`\n## [1] FALSE\n## \n## $`--iterations`\n## [1] \"auto\"\n## \n## $`--kernel-config`\n## NULL\n## \n## $`--perc-zero`\n## [1] \"auto\"\n## \n## $`--perc-undetected`\n## [1] \"auto\"\n## \n## $`--tsvd`\n## [1] FALSE\n## \n## $`--win-fmt`\n## [1] \"json\"\n## \n## $`--win-size`\n## [1] \"auto\"\n## \n## $`--no-plotting`\n## [1] TRUE\n## \n## $`--smooth-trend`\n## [1] FALSE\n## \n## $`--norm`\n## [1] \"auto\"\n## \n## $`&lt;contact_map&gt;`\n## [1] \"/home/rsg/.cache/R/fourDNData/4d434d8538a0_4DNFI9FVHJZQ.mcool::/resolutions/5000\"\n## \n## $`--max-dist`\n## [1] \"auto\"\n## \n## $`--min-dist`\n## [1] \"auto\"\n## \n## $`--min-separation`\n## [1] \"auto\"\n## \n## $`--n-mads`\n## [1] 5\n## \n## $`&lt;prefix&gt;`\n## [1] \"chromosight/chromo\"\n## \n## $`--pearson`\n## [1] \"auto\"\n## \n## $`--subsample`\n## [1] \"no\"\n## \n## $`--threads`\n## [1] 1\n\n\n\n\n7.3.1.2 Exporting chromatin loops\n\nloops &lt;- topologicalFeatures(hic, \"loops\")\nloops &lt;- loops[loops$score &gt;= 0.4 & loops$qvalue &lt;= 1e-6]\nGenomicInteractions::export.bedpe(loops, 'loops.bedpe')\n\n\n7.3.1.3 Visualizing chromatin loops\n\n\n\n\n\n\nChromosight users\n\n\n\nIf you are using chromosight directly from the terminal (i.e. outside R), you can import the annotated loops in R as follows:\n\ndf &lt;- readr::read_tsv(\"...\")\nloops &lt;- InteractionSet::GInteractions(\n    anchor1 = GenomicRanges::GRanges(\n        df$chrom1, IRanges::IRanges(df$start1+1, df$end1)\n    ),\n    anchor2 = GenomicRanges::GRanges(\n        df$chrom2, IRanges::IRanges(df$start2+1, df$end2)\n    ),\n    bin_id1 = df$bin1, \n    bin_id2 = df$bin2, \n    score = df$score, \n    pvalue = df$pvalue, \n    qvalue = df$qvalue\n)\n\n\n\n\nplotMatrix(\n    refocus(hic, 'chr17:62500001-63500000') |&gt; zoom(5000), \n    loops = loops,\n    limits = c(-4, -1.2),\n    caption = FALSE\n)\n\n\n\n7.3.2 Other R packages\nA number of other R packages have been developed to identify focal chromatin loops, notably fitHiC (Ay, Bailey, and Noble (2014)), GOTHiC (Mifsud et al. (2017)) or idr2d (Krismer, Guo, and Gifford (2020)). Each fits a slightly different purpose, and we encourage the end user to read companion publications."
+    "text": "7.3 Chromatin loops\n\n7.3.1 chromosight\n\nChromatin loops, dots, or contacts, refer to a strong increase of interaction frequency between a pair of two genomic loci. They correspond to focal “dots” on a Hi-C map. Relying on computer vision algorithms, chromosight uses this property to annotate chromatin loops in a Hi-C map (Matthey-Doret et al. (2020)). chromosight is a standalone python package and is made available in R through the HiCool-managed conda environment with the getLoops() function.\n\n7.3.1.1 Identifying loops\n\nhic &lt;- HiCool::getLoops(microC, resolution = 5000)\n\nhic\n## `HiCExperiment` object with 917,156 contacts over 100 regions\n## -------\n## fileName: \"/home/rsg/.cache/R/fourDNData/4d434d8538a0_4DNFI9FVHJZQ.mcool\"\n## focus: \"chr17:63,000,001-63,500,000\"\n## resolutions(13): 1000 2000 ... 5000000 10000000\n## active resolution: 5000\n## interactions: 5047\n## scores(2): count balanced\n## topologicalFeatures: compartments(0) borders(0) loops(66411) viewpoints(0)\n## pairsFile: N/A\n## metadata(1): chromosight_args\n\n\n\n\n\n\n\nNote\n\n\n\ngetLoops() is an endomorphism: it returns the original object, enriched with two new pieces of information:\n\nA loops topologicalFeatures:\n\n\ntopologicalFeatures(hic, \"loops\")\n## GInteractions object with 66411 interactions and 5 metadata columns:\n##           seqnames1           ranges1     seqnames2           ranges2 |   bin_id1   bin_id2     score      ## pvalue    qvalue\n##               &lt;Rle&gt;         &lt;IRanges&gt;         &lt;Rle&gt;         &lt;IRanges&gt; | &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;   ## &lt;numeric&gt; &lt;numeric&gt;\n##       [1]      chr1     775001-780000 ---      chr1     850001-855000 |       155       170  0.334586 2.## 15995e-05 2.162e-05\n##       [2]      chr1     775001-780000 ---      chr1     865001-870000 |       155       173  0.403336 1.## 62900e-07 1.669e-07\n##       [3]      chr1     865001-870000 ---      chr1     890001-895000 |       173       178  0.337344 1.## 91400e-07 1.957e-07\n##       [4]      chr1     910001-915000 ---      chr1     955001-960000 |       182       191  0.639725 0.## 00000e+00 0.000e+00\n##       [5]      chr1     910001-915000 ---      chr1   1055001-1060000 |       182       211  0.521699 0.## 00000e+00 0.000e+00\n##       ...       ...               ... ...       ...               ... .       ...       ...       ...       ##   ...       ...\n##   [66407]      chrY 19570001-19575000 ---      chrY 19720001-19725000 |    610133    610163  0.315529    3.## 30e-08  3.55e-08\n##   [66408]      chrY 19705001-19710000 ---      chrY 19730001-19735000 |    610160    610165  0.708753    0.## 00e+00  0.00e+00\n##   [66409]      chrY 19765001-19770000 ---      chrY 19800001-19805000 |    610172    610179  0.373635    1.## 10e-09  1.40e-09\n##   [66410]      chrY 20555001-20560000 ---      chrY 20645001-20650000 |    610330    610348  0.603308    0.## 00e+00  0.00e+00\n##   [66411]      chrY 21015001-21020000 ---      chrY 21055001-21060000 |    610422    610430  0.394614    9.## 12e-08  9.45e-08\n##   -------\n##   regions: 84171 ranges and 0 metadata columns\n##   seqinfo: 24 sequences from an unspecified genome; no seqlengths\n\n\nThe arguments used by chromosight, stored in metadata:\n\n\nmetadata(hic)$chromosight_args\n## $`--pattern`\n## [1] \"loops\"\n## \n## $`--dump`\n## [1] \"/data/.cache/R//RtmpSaRwiZ\"\n## \n## $`--inter`\n## [1] FALSE\n## \n## $`--iterations`\n## [1] \"auto\"\n## \n## $`--kernel-config`\n## NULL\n## \n## $`--perc-zero`\n## [1] \"auto\"\n## \n## $`--perc-undetected`\n## [1] \"auto\"\n## \n## $`--tsvd`\n## [1] FALSE\n## \n## $`--win-fmt`\n## [1] \"json\"\n## \n## $`--win-size`\n## [1] \"auto\"\n## \n## $`--no-plotting`\n## [1] TRUE\n## \n## $`--smooth-trend`\n## [1] FALSE\n## \n## $`--norm`\n## [1] \"auto\"\n## \n## $`&lt;contact_map&gt;`\n## [1] \"/home/rsg/.cache/R/fourDNData/4d434d8538a0_4DNFI9FVHJZQ.mcool::/resolutions/5000\"\n## \n## $`--max-dist`\n## [1] \"auto\"\n## \n## $`--min-dist`\n## [1] \"auto\"\n## \n## $`--min-separation`\n## [1] \"auto\"\n## \n## $`--n-mads`\n## [1] 5\n## \n## $`&lt;prefix&gt;`\n## [1] \"chromosight/chromo\"\n## \n## $`--pearson`\n## [1] \"auto\"\n## \n## $`--subsample`\n## [1] \"no\"\n## \n## $`--threads`\n## [1] 1\n\n\n\n\n7.3.1.2 Exporting chromatin loops\n\nloops &lt;- topologicalFeatures(hic, \"loops\")\nloops &lt;- loops[loops$score &gt;= 0.4 & loops$qvalue &lt;= 1e-6]\nGenomicInteractions::export.bedpe(loops, 'loops.bedpe')\n\n\n7.3.1.3 Visualizing chromatin loops\n\n\n\n\n\n\nChromosight users\n\n\n\nIf you are using chromosight directly from the terminal (i.e. outside R), you can import the annotated loops in R as follows:\n\ndf &lt;- readr::read_tsv(\"...\")\nloops &lt;- InteractionSet::GInteractions(\n    anchor1 = GenomicRanges::GRanges(\n        df$chrom1, IRanges::IRanges(df$start1+1, df$end1)\n    ),\n    anchor2 = GenomicRanges::GRanges(\n        df$chrom2, IRanges::IRanges(df$start2+1, df$end2)\n    ),\n    bin_id1 = df$bin1, \n    bin_id2 = df$bin2, \n    score = df$score, \n    pvalue = df$pvalue, \n    qvalue = df$qvalue\n)\n\n\n\n\nplotMatrix(\n    refocus(hic, 'chr17:62500001-63500000') |&gt; zoom(5000), \n    loops = loops,\n    limits = c(-4, -1.2),\n    caption = FALSE\n)\n\n\n\n7.3.2 Other R packages\nA number of other R packages have been developed to identify focal chromatin loops, notably fitHiC (Ay et al. (2014)), GOTHiC (Mifsud et al. (2017)) or idr2d (Krismer et al. (2020)). Each fits a slightly different purpose, and we encourage the end user to read companion publications."
   },
   {
     "objectID": "disseminating.html",
@@ -272,47 +279,54 @@
     "section": "\n8.2 DNA Zoo",
     "text": "8.2 DNA Zoo\nThe DNA Zoo Consortium is a collaborative group whose aim is to correct and refine genome assemblies across the tree of life using Hi-C approaches. As of 2023, they have performed Hi-C across more than 300 animal, plant and fungi species.\nDNAZooData is a package giving programmatic access to these uniformly processed Hi-C contact files, as well as the refined genome assemblies.\nThe DNAZooData() function provides a gateway to DNA Zoo-hosted Hi-C files, fetching and caching relevant contact matrices in .hic format It returns a HicFile object, which can then be imported in memory using import().\n\nlibrary(DNAZooData)\nhead(DNAZooData())\n##                  species                              readme                                                         readme_link original_assembly     new_assembly                                                             new_assembly_link new_assembly_link_status                                                                 hic_link\n##         Acinonyx_jubatus        Acinonyx_jubatus/README.json        https://dnazoo.s3.wasabisys.com/Acinonyx_jubatus/README.json           aciJub1      aciJub1_HiC         https://dnazoo.s3.wasabisys.com/Acinonyx_jubatus/aciJub1_HiC.fasta.gz                      200    https://dnazoo.s3.wasabisys.com/Acinonyx_jubatus/aciJub1.rawchrom.hic\n##       Acropora_millepora      Acropora_millepora/README.json      https://dnazoo.s3.wasabisys.com/Acropora_millepora/README.json       amil_sf_1.1  amil_sf_1.1_HiC   https://dnazoo.s3.wasabisys.com/Acropora_millepora/amil_sf_1.1_HiC.fasta.gz                      200   https://dnazoo.s3.wasabisys.com/Acropora_millepora/amil_sf_1.1_HiC.hic\n##      Addax_nasomaculatus     Addax_nasomaculatus/README.json     https://dnazoo.s3.wasabisys.com/Addax_nasomaculatus/README.json      ASM1959352v1 ASM1959352v1_HiC https://dnazoo.s3.wasabisys.com/Addax_nasomaculatus/ASM1959352v1_HiC.fasta.gz                      200 https://dnazoo.s3.wasabisys.com/Addax_nasomaculatus/ASM1959352v1_HiC.hic\n##            Aedes_aegypti           Aedes_aegypti/README.json           https://dnazoo.s3.wasabisys.com/Aedes_aegypti/README.json        AGWG.draft         AaegL5.0               https://dnazoo.s3.wasabisys.com/Aedes_aegypti/AaegL5.0.fasta.gz                      404                                                                     &lt;NA&gt;\n##    Aedes_aegypti__AaegL4   Aedes_aegypti__AaegL4/README.json   https://dnazoo.s3.wasabisys.com/Aedes_aegypti__AaegL4/README.json            AaegL3           AaegL4         https://dnazoo.s3.wasabisys.com/Aedes_aegypti__AaegL4/AaegL4.fasta.gz                      200         https://dnazoo.s3.wasabisys.com/Aedes_aegypti__AaegL4/AaegL4.hic\n##  Aedes_aegypti__AaegL5.0 Aedes_aegypti__AaegL5.0/README.json https://dnazoo.s3.wasabisys.com/Aedes_aegypti__AaegL5.0/README.json        AGWG.draft         AaegL5.0     https://dnazoo.s3.wasabisys.com/Aedes_aegypti__AaegL5.0/AaegL5.0.fasta.gz                      200     https://dnazoo.s3.wasabisys.com/Aedes_aegypti__AaegL5.0/AaegL5.0.hic\n\nFor example, we can directly fetch a Hi-C dataset generated from a tardigrade sample by specifying the right species argument.\n\nhicfile &lt;- DNAZooData(species = 'Hypsibius_dujardini')\n##  Fetching Hi-C data from DNAZoo\n##  |===================================|  100%\nhicfile\n##  HicFile object\n##  .hic file: /home/rsg/.cache/R/DNAZooData/400d7e2b0145_nHd_3.1_HiC.hic\n##  resolution: 5000\n##  pairs file:\n##  metadata(6): organism draftAssembly ... credits assemblyURL\n\nHere again, the resulting HicFile is populated with metadata parsed from the DNA Zoo data portal.\n\nmetadata(hicfile)$organism\n##  $vernacular\n##  [1] \"Tardigrade\"\n##  \n##  $binomial\n##  [1] \"Hypsibius dujardini\"\n##  \n##  $funFact\n##  [1] \"&lt;i&gt;Hypsibius dujardini&lt;/i&gt; is a species of tardigrade, a tiny microscopic organism. They are also commonly called water bears. This species is found world-wide!\"\n##  \n##  $extraInfo\n##  [1] \"on BioKIDS website\"\n##  \n##  $extraInfoLink\n##  [1] \"http://www.biokids.umich.edu/critters/Hypsibius_dujardini/\"\n##  \n##  $image\n##  [1] \"https://static.wixstatic.com/media/2b9330_82db39c219f24b20a75cb38943aad1fb~mv2.jpg\"\n##  \n##  $imageCredit\n##  [1] \"By Willow Gabriel, Goldstein Lab - https://www.flickr.com/photos/waterbears/1614095719/ Template:Uploader Transferred from en.wikipedia to Commons., CC BY-SA 2.5, https://commons.wikimedia.org/w/index.php?curi\n##  d=2261992\"\n##  \n##  $isChromognomes\n##  [1] \"FALSE\"\n##  \n##  $taxonomy\n##  [1] \"Species:202423-914154-914155-914158-155166-155362-710171-710179-710192-155390-155420\"\n\nHiCFile metadata also points to a URL to directly fetch the genome assembly corrected by the DNA Zoo consortium.\n\nmetadata(hicfile)$assemblyURL\n##  [1] \"https://dnazoo.s3.wasabisys.com/Hypsibius_dujardini/nHd_3.1_HiC.fasta.gz\""
   },
+  {
+    "objectID": "interoperability.html",
+    "href": "interoperability.html",
+    "title": "\n9  Interoperability: using HiCExperiment with other R packages\n",
+    "section": "",
+    "text": "References"
+  },
   {
     "objectID": "interoperability.html#hicrep",
     "href": "interoperability.html#hicrep",
     "title": "\n9  Interoperability: using HiCExperiment with other R packages\n",
     "section": "\n9.1 HiCrep",
-    "text": "9.1 HiCrep\nhicrep is a popular package to compute stratum-adjusted correlations between Hi-C datasets (Yang et al. (2017)). “Stratum” refers to the distance from the main diagonal: with increase distance from the main diagonal, interactions of the DNA polymer are bound to decrease. hicrep computes a “per-stratum” correlation score and computes a weighted average correlation for entire chromosomes.\n\n\n\n\n\n\nInstalling hicrep\n\n\n\nhicrep package has been available from Bioconductor for many years but has been withdrawn from their repositories at some point. You can always install hicrep directly from its GitHub repository as follows:\n\nremotes::install_github('TaoYang-dev/hicrep')\n\n\n\nIn order to use hicrep, we first need to create two HiCExperiment objects.\n\nlibrary(InteractionSet)\nlibrary(HiCExperiment)\nlibrary(HiContactsData)\n\n# ---- This downloads example `.mcool` and `.pairs` files and caches them locally \ncoolf_wt &lt;- HiContactsData('yeast_wt', 'mcool')\ncoolf_eco1 &lt;- HiContactsData('yeast_eco1', 'mcool')\n\n\nhic_wt &lt;- import(coolf_wt, format = 'cool')\nhic_eco1 &lt;- import(coolf_eco1, format = 'cool')\n\nWe can now run the main get.scc function from hicrep. The documentation for this function is available from the console by typing ?hicrep::get.scc. More information is also available from the GitHub page. It informs the end user that the input for this function should be two intra-chromosomal Hi-C raw count matrices in square (optionally sparse) format.\n\nhic_wt\n##  `HiCExperiment` object with 8,757,906 contacts over 12,079 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"whole genome\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 2945692 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\nas.matrix(hic_wt[\"IV\"], use.scores = 'count')[1:10, 1:10]\n##  10 x 10 sparse Matrix of class \"dgTMatrix\"\n##                           \n##   [1,] . 1 . . 1 . . . . .\n##   [2,] 1 . . . . . . . . .\n##   [3,] . . . . . . . . . .\n##   [4,] . . . . . . . . . .\n##   [5,] 1 . . . . . . . 1 .\n##   [6,] . . . . . . . . . .\n##   [7,] . . . . . . . . . .\n##   [8,] . . . . . . . . 1 .\n##   [9,] . . . . 1 . . 1 . .\n##  [10,] . . . . . . . . . .\n\nlibrary(hicrep)\nscc &lt;- get.scc(\n    as.matrix(hic_wt[\"IV\"], use.scores = 'count'), \n    as.matrix(hic_eco1[\"IV\"], use.scores = 'count'), \n    resol = 1000, h = 25, lbr = 5000, ubr = 50000\n)\nscc\n##  $corr\n##   [1] 0.9412784 0.9410680 0.9408082 0.9404796 0.9404544 0.9402584 0.9400710\n##   [8] 0.9398965 0.9397935 0.9397027 0.9396112 0.9393001 0.9393180 0.9390608\n##  [15] 0.9391645 0.9394670 0.9395147 0.9396798 0.9397547 0.9398291 0.9401371\n##  [22] 0.9402369 0.9402251 0.9404188 0.9404327 0.9403101 0.9402634 0.9401683\n##  [29] 0.9401746 0.9394978 0.9391277 0.9381969 0.9371561 0.9357012 0.9342620\n##  [36] 0.9324366 0.9302835 0.9277556 0.9247008 0.9208466 0.9166648 0.9120206\n##  [43] 0.9060828 0.9002430 0.8931754 0.8847777\n##  \n##  $wei\n##   [1] 123.2500 123.1667 123.0833 123.0000 122.9167 122.8333 122.7500 122.6667\n##   [9] 122.5833 122.5000 122.4167 122.3333 122.2500 122.1667 122.0833 122.0000\n##  [17] 121.9167 121.8333 121.7500 121.6667 121.5833 121.5000 121.4167 121.3333\n##  [25] 121.2500 121.1667 121.0833 121.0000 120.9167 120.8333 120.7500 120.6667\n##  [33] 120.5833 120.5000 120.4167 120.3333 120.2500 120.1667 120.0833 120.0000\n##  [41] 119.9167 119.8333 119.7500 119.6667 119.5833 119.5000\n##  \n##  $scc\n##            [,1]\n##  [1,] 0.9334303\n##  \n##  $std\n##  [1] 0.001994845\n\nscc$scc\n##            [,1]\n##  [1,] 0.9334303"
+    "text": "9.1 HiCrep\nhicrep is a popular package to compute stratum-adjusted correlations between Hi-C datasets (Yang et al. (2017)). “Stratum” refers to the distance from the main diagonal: with increase distance from the main diagonal, interactions of the DNA polymer are bound to decrease. hicrep computes a “per-stratum” correlation score and computes a weighted average correlation for entire chromosomes.\n\n\n\n\n\n\nInstalling hicrep\n\n\n\nhicrep package has been available from Bioconductor for many years but has been withdrawn from their repositories at some point. You can always install hicrep directly from its GitHub repository as follows:\n\nremotes::install_github('TaoYang-dev/hicrep')\n\n\n\nIn order to use hicrep, we first need to create two HiCExperiment objects.\n\nlibrary(InteractionSet)\nlibrary(HiCExperiment)\nlibrary(HiContactsData)\n\n# ---- This downloads example `.mcool` and `.pairs` files and caches them locally \ncoolf_wt &lt;- HiContactsData('yeast_wt', 'mcool')\ncoolf_eco1 &lt;- HiContactsData('yeast_eco1', 'mcool')\n\n\nhic_wt &lt;- import(coolf_wt, format = 'cool')\nhic_eco1 &lt;- import(coolf_eco1, format = 'cool')\n\nWe can now run the main get.scc function from hicrep. The documentation for this function is available from the console by typing ?hicrep::get.scc. More information is also available from the GitHub page. It informs the end user that the input for this function should be two intra-chromosomal Hi-C raw count matrices in square (optionally sparse) format.\n\nhic_wt\n##  `HiCExperiment` object with 8,757,906 contacts over 12,079 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"whole genome\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 2945692 \n##  scores(2): count balanced \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) \n##  pairsFile: N/A \n##  metadata(0):\n\nas.matrix(hic_wt[\"IV\"], use.scores = 'count')[1:10, 1:10]\n##  10 x 10 sparse Matrix of class \"dgTMatrix\"\n##                           \n##   [1,] . 1 . . 1 . . . . .\n##   [2,] 1 . . . . . . . . .\n##   [3,] . . . . . . . . . .\n##   [4,] . . . . . . . . . .\n##   [5,] 1 . . . . . . . 1 .\n##   [6,] . . . . . . . . . .\n##   [7,] . . . . . . . . . .\n##   [8,] . . . . . . . . 1 .\n##   [9,] . . . . 1 . . 1 . .\n##  [10,] . . . . . . . . . .\n\nlibrary(hicrep)\nscc &lt;- get.scc(\n    as.matrix(hic_wt[\"IV\"], use.scores = 'count'), \n    as.matrix(hic_eco1[\"IV\"], use.scores = 'count'), \n    resol = 1000, h = 25, lbr = 5000, ubr = 50000\n)\nscc\n##  $corr\n##   [1] 0.9412784 0.9410680 0.9408082 0.9404796 0.9404544 0.9402584 0.9400710\n##   [8] 0.9398965 0.9397935 0.9397027 0.9396112 0.9393001 0.9393180 0.9390608\n##  [15] 0.9391645 0.9394670 0.9395147 0.9396798 0.9397547 0.9398291 0.9401371\n##  [22] 0.9402369 0.9402251 0.9404188 0.9404327 0.9403101 0.9402634 0.9401683\n##  [29] 0.9401746 0.9394978 0.9391277 0.9381969 0.9371561 0.9357012 0.9342620\n##  [36] 0.9324366 0.9302835 0.9277556 0.9247008 0.9208466 0.9166648 0.9120206\n##  [43] 0.9060828 0.9002430 0.8931754 0.8847777\n##  \n##  $wei\n##   [1] 123.2500 123.1667 123.0833 123.0000 122.9167 122.8333 122.7500 122.6667\n##   [9] 122.5833 122.5000 122.4167 122.3333 122.2500 122.1667 122.0833 122.0000\n##  [17] 121.9167 121.8333 121.7500 121.6667 121.5833 121.5000 121.4167 121.3333\n##  [25] 121.2500 121.1667 121.0833 121.0000 120.9167 120.8333 120.7500 120.6667\n##  [33] 120.5833 120.5000 120.4167 120.3333 120.2500 120.1667 120.0833 120.0000\n##  [41] 119.9167 119.8333 119.7500 119.6667 119.5833 119.5000\n##  \n##  $scc\n##            [,1]\n##  [1,] 0.9334303\n##  \n##  $std\n##  [1] 0.001994845\n\nscc$scc\n##            [,1]\n##  [1,] 0.9334303"
   },
   {
     "objectID": "interoperability.html#multihiccompare",
     "href": "interoperability.html#multihiccompare",
     "title": "\n9  Interoperability: using HiCExperiment with other R packages\n",
     "section": "\n9.2 multiHiCcompare",
-    "text": "9.2 multiHiCcompare\nThe multiHiCcompare package provides functions for joint normalization and difference detection in multiple Hi-C datasets (Stansfield, Cresswell, and Dozmorov (2019)). According to its excerpt, to perform differential interaction analysis, it requires a list of raw counts for different samples/replicates, stored in data frames with four columns (chr, start1, start2, count).\nManipulate a HiCExperiment object to coerce it into such structure is straightforward.\n\nlibrary(dplyr)\nlibrary(tidyr)\nlibrary(purrr)\nhics &lt;- list(\n    \"wt\" = import(coolf_wt, format = 'cool'),\n    \"eco1\" = import(coolf_eco1, format = 'cool')\n)\nhics_list &lt;- map(hics, ~ .x['XI'] |&gt; \n    as.data.frame() |&gt;\n    mutate(chr = 1) |&gt; \n    relocate(chr) |&gt;\n    select(chr, start1, start2, count)\n)\nhead(hics_list[[1]])\n##    chr start1 start2 count\n##  1   1      1      1     2\n##  2   1      1   1001     3\n##  3   1      1   2001     3\n##  4   1      1   3001    13\n##  5   1      1   4001     9\n##  6   1      1   5001    13\n\nOnce this list is generated, the classical multiHiCcompare workflow can be applied: first run make_hicexp(), followed by cyclic_loess(), then hic_exactTest() and finally results():\n\nDI &lt;- hics_list |&gt; \n    make_hicexp(\n        data_list = hics_list, \n        groups = factor(c(1, 2))\n    ) |&gt; \n    cyclic_loess() |&gt; \n    hic_exactTest() |&gt; \n    results()\nDI\n##         chr region1 region2 D      logFC    logCPM    p.value     p.adj\n##      1:   1       1    1001 1  0.4279414  6.382927 0.78960192 1.0000000\n##      2:   1       1    3001 3  1.0325237  8.339327 0.06035705 0.9501367\n##      3:   1       1    4001 4  0.6862141  7.597689 0.34723639 1.0000000\n##      4:   1       1    5001 5  0.5124878  7.960339 0.43133791 1.0000000\n##      5:   1       1    6001 6 -0.3568672  8.563374 0.52289982 1.0000000\n##     ---                                                                \n##  22637:   1  663001  666001 3 -1.1680738  7.158551 0.17500113 1.0000000\n##  22638:   1  664001  664001 0  1.4530501  8.536212 0.16535151 1.0000000\n##  22639:   1  664001  665001 1 -0.1014769  8.166275 1.00000000 1.0000000\n##  22640:   1  665001  665001 0 -0.3110054 10.013750 0.60075706 1.0000000\n##  22641:   1  665001  666001 1 -0.4989794  7.750157 0.41481212 1.0000000"
+    "text": "9.2 multiHiCcompare\nThe multiHiCcompare package provides functions for joint normalization and difference detection in multiple Hi-C datasets (Stansfield et al. (2019)). According to its excerpt, to perform differential interaction analysis, it requires a list of raw counts for different samples/replicates, stored in data frames with four columns (chr, start1, start2, count).\nManipulate a HiCExperiment object to coerce it into such structure is straightforward.\n\nlibrary(dplyr)\nlibrary(tidyr)\nlibrary(purrr)\nhics &lt;- list(\n    \"wt\" = import(coolf_wt, format = 'cool'),\n    \"eco1\" = import(coolf_eco1, format = 'cool')\n)\nhics_list &lt;- map(hics, ~ .x['XI'] |&gt; \n    as.data.frame() |&gt;\n    mutate(chr = 1) |&gt; \n    relocate(chr) |&gt;\n    select(chr, start1, start2, count)\n)\nhead(hics_list[[1]])\n##    chr start1 start2 count\n##  1   1      1      1     2\n##  2   1      1   1001     3\n##  3   1      1   2001     3\n##  4   1      1   3001    13\n##  5   1      1   4001     9\n##  6   1      1   5001    13\n\nOnce this list is generated, the classical multiHiCcompare workflow can be applied: first run make_hicexp(), followed by cyclic_loess(), then hic_exactTest() and finally results():\n\nDI &lt;- hics_list |&gt; \n    make_hicexp(\n        data_list = hics_list, \n        groups = factor(c(1, 2))\n    ) |&gt; \n    cyclic_loess() |&gt; \n    hic_exactTest() |&gt; \n    results()\nDI\n##         chr region1 region2 D      logFC    logCPM    p.value     p.adj\n##      1:   1       1    1001 1  0.4279414  6.382927 0.78960192 1.0000000\n##      2:   1       1    3001 3  1.0325237  8.339327 0.06035705 0.9501367\n##      3:   1       1    4001 4  0.6862141  7.597689 0.34723639 1.0000000\n##      4:   1       1    5001 5  0.5124878  7.960339 0.43133791 1.0000000\n##      5:   1       1    6001 6 -0.3568672  8.563374 0.52289982 1.0000000\n##     ---                                                                \n##  22637:   1  663001  666001 3 -1.1680738  7.158551 0.17500113 1.0000000\n##  22638:   1  664001  664001 0  1.4530501  8.536212 0.16535151 1.0000000\n##  22639:   1  664001  665001 1 -0.1014769  8.166275 1.00000000 1.0000000\n##  22640:   1  665001  665001 0 -0.3110054 10.013750 0.60075706 1.0000000\n##  22641:   1  665001  666001 1 -0.4989794  7.750157 0.41481212 1.0000000"
   },
   {
     "objectID": "interoperability.html#topdom",
     "href": "interoperability.html#topdom",
     "title": "\n9  Interoperability: using HiCExperiment with other R packages\n",
     "section": "\n9.3 TopDom",
-    "text": "9.3 TopDom\nThe TopDom method is widely used to annotate topological domains in genomes from Hi-C data (Shin et al. (2016)). The TopDom package was created to implement this method in R (Bengtsson et al. (2020)).\nUnfortunately, the format of the input to TopDom is rather tricky (see ?TopDom::readHiC). The following chunk of code shows how to coerce a HiCExperiment object into a TopDom-compatible object.\n\nlibrary(TopDom)\nhic &lt;- import(coolf_wt, format = 'cool')\nHiCExperiment2TopDom &lt;- function(hic, chr) {\n    data &lt;- list()\n    cm &lt;- as(hic[chr], 'ContactMatrix')\n    data$counts &lt;- as.matrix(cm) |&gt; base::as.matrix()\n    data$counts[is.na(data$counts)] &lt;- 0\n    data$bins &lt;- regions(cm) |&gt; \n        as.data.frame() |&gt; \n        select(seqnames, start, end) |&gt;\n        mutate(seqnames = as.character(seqnames)) |&gt;\n        mutate(id = 1:n(), start = start - 1) |&gt; \n        relocate(id) |&gt; \n        dplyr::rename(chr = seqnames, from.coord = start, to.coord = end)\n    class(data) &lt;- 'TopDomData'\n    return(data)\n}\nhic_topdom &lt;- HiCExperiment2TopDom(hic, \"II\")\nhic_topdom\n##  TopDomData:\n##  bins:\n##  'data.frame':   813 obs. of  4 variables:\n##   $ id        : int  1 2 3 4 5 6 7 8 9 10 ...\n##   $ chr       : chr  \"II\" \"II\" \"II\" \"II\" ...\n##   $ from.coord: num  0 1000 2000 3000 4000 5000 6000 7000 8000 9000 ...\n##   $ to.coord  : int  1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 ...\n##  counts:\n##   num [1:813, 1:813] 0 0 0 0 0 0 0 0 0 0 ...\n\nNow that we have coerced a HiCExperiment object into a TopDom-compatible object, we can use the main TopDom function to annotate topological domains.\n\ndomains &lt;- TopDom::TopDom(hic_topdom, window.size = 5)\ndomains\n##  TopDom:\n##  Parameters:\n##  - window.size: 5\n##  - statFilter: TRUE\n##  binSignal:\n##  'data.frame':   813 obs. of  7 variables:\n##   $ id        : int  1 2 3 4 5 6 7 8 9 10 ...\n##   $ chr       : chr  \"II\" \"II\" \"II\" \"II\" ...\n##   $ from.coord: num  0 1000 2000 3000 4000 5000 6000 7000 8000 9000 ...\n##   $ to.coord  : int  1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 ...\n##   $ local.ext : num  -0.5 -0.5 -0.5 -0.5 -0.5 -0.5 -0.5 -0.5 0 0 ...\n##   $ mean.cf   : num  0 0 0 0 0 ...\n##   $ pvalue    : num  1 1 1 1 1 ...\n##  domain:\n##  'data.frame':   61 obs. of  7 variables:\n##   $ chr       : chr  \"II\" \"II\" \"II\" \"II\" ...\n##   $ from.id   : int  1 9 31 36 47 61 76 82 91 102 ...\n##   $ from.coord: num  0 8000 30000 35000 46000 60000 75000 81000 90000 101000 ...\n##   $ to.id     : int  8 30 35 46 60 75 81 90 101 136 ...\n##   $ to.coord  : num  8000 30000 35000 46000 60000 75000 81000 90000 101000 136000 ...\n##   $ tag       : chr  \"gap\" \"domain\" \"gap\" \"domain\" ...\n##   $ size      : num  8000 22000 5000 11000 14000 15000 6000 9000 11000 35000 ...\n##  bed:\n##  'data.frame':   61 obs. of  4 variables:\n##   $ chrom     : chr  \"II\" \"II\" \"II\" \"II\" ...\n##   $ chromStart: num  0 8000 30000 35000 46000 60000 75000 81000 90000 101000 ...\n##   $ chromEnd  : num  8000 30000 35000 46000 60000 75000 81000 90000 101000 136000 ...\n##   $ name      : chr  \"gap\" \"domain\" \"gap\" \"domain\" ...\n\nThe resulting domains object can be used to extract annotated domains, store them in topologicalFeatures of the original HiCExperiment, and optionally write a bed file to export them in text.\n\ntopologicalFeatures(hic, 'domain') &lt;- domains$bed |&gt; \n    mutate(chromStart = chromStart + 1) |&gt; \n    filter(name == 'domain') |&gt; \n    makeGRangesFromDataFrame()\ntopologicalFeatures(hic, 'domain')\n##  GRanges object with 52 ranges and 0 metadata columns:\n##         seqnames        ranges strand\n##            &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt;\n##     [1]       II    8001-30000      *\n##     [2]       II   35001-46000      *\n##     [3]       II   46001-60000      *\n##     [4]       II   60001-75000      *\n##     [5]       II   75001-81000      *\n##     ...      ...           ...    ...\n##    [48]       II 664001-681000      *\n##    [49]       II 681001-707000      *\n##    [50]       II 707001-714000      *\n##    [51]       II 714001-761000      *\n##    [52]       II 761001-806000      *\n##    -------\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\nrtracklayer::export(topologicalFeatures(hic, 'domain'), 'hic_domains.bed')"
+    "text": "9.3 TopDom\nThe TopDom method is widely used to annotate topological domains in genomes from Hi-C data ((Shin_2016?)). The TopDom package was created to implement this method in R (Bengtsson et al. (2020)).\nUnfortunately, the format of the input to TopDom is rather tricky (see ?TopDom::readHiC). The following chunk of code shows how to coerce a HiCExperiment object into a TopDom-compatible object.\n\nlibrary(TopDom)\nhic &lt;- import(coolf_wt, format = 'cool')\nHiCExperiment2TopDom &lt;- function(hic, chr) {\n    data &lt;- list()\n    cm &lt;- as(hic[chr], 'ContactMatrix')\n    data$counts &lt;- as.matrix(cm) |&gt; base::as.matrix()\n    data$counts[is.na(data$counts)] &lt;- 0\n    data$bins &lt;- regions(cm) |&gt; \n        as.data.frame() |&gt; \n        select(seqnames, start, end) |&gt;\n        mutate(seqnames = as.character(seqnames)) |&gt;\n        mutate(id = 1:n(), start = start - 1) |&gt; \n        relocate(id) |&gt; \n        dplyr::rename(chr = seqnames, from.coord = start, to.coord = end)\n    class(data) &lt;- 'TopDomData'\n    return(data)\n}\nhic_topdom &lt;- HiCExperiment2TopDom(hic, \"II\")\nhic_topdom\n##  TopDomData:\n##  bins:\n##  'data.frame':   813 obs. of  4 variables:\n##   $ id        : int  1 2 3 4 5 6 7 8 9 10 ...\n##   $ chr       : chr  \"II\" \"II\" \"II\" \"II\" ...\n##   $ from.coord: num  0 1000 2000 3000 4000 5000 6000 7000 8000 9000 ...\n##   $ to.coord  : int  1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 ...\n##  counts:\n##   num [1:813, 1:813] 0 0 0 0 0 0 0 0 0 0 ...\n\nNow that we have coerced a HiCExperiment object into a TopDom-compatible object, we can use the main TopDom function to annotate topological domains.\n\ndomains &lt;- TopDom::TopDom(hic_topdom, window.size = 5)\ndomains\n##  TopDom:\n##  Parameters:\n##  - window.size: 5\n##  - statFilter: TRUE\n##  binSignal:\n##  'data.frame':   813 obs. of  7 variables:\n##   $ id        : int  1 2 3 4 5 6 7 8 9 10 ...\n##   $ chr       : chr  \"II\" \"II\" \"II\" \"II\" ...\n##   $ from.coord: num  0 1000 2000 3000 4000 5000 6000 7000 8000 9000 ...\n##   $ to.coord  : int  1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 ...\n##   $ local.ext : num  -0.5 -0.5 -0.5 -0.5 -0.5 -0.5 -0.5 -0.5 0 0 ...\n##   $ mean.cf   : num  0 0 0 0 0 ...\n##   $ pvalue    : num  1 1 1 1 1 ...\n##  domain:\n##  'data.frame':   61 obs. of  7 variables:\n##   $ chr       : chr  \"II\" \"II\" \"II\" \"II\" ...\n##   $ from.id   : int  1 9 31 36 47 61 76 82 91 102 ...\n##   $ from.coord: num  0 8000 30000 35000 46000 60000 75000 81000 90000 101000 ...\n##   $ to.id     : int  8 30 35 46 60 75 81 90 101 136 ...\n##   $ to.coord  : num  8000 30000 35000 46000 60000 75000 81000 90000 101000 136000 ...\n##   $ tag       : chr  \"gap\" \"domain\" \"gap\" \"domain\" ...\n##   $ size      : num  8000 22000 5000 11000 14000 15000 6000 9000 11000 35000 ...\n##  bed:\n##  'data.frame':   61 obs. of  4 variables:\n##   $ chrom     : chr  \"II\" \"II\" \"II\" \"II\" ...\n##   $ chromStart: num  0 8000 30000 35000 46000 60000 75000 81000 90000 101000 ...\n##   $ chromEnd  : num  8000 30000 35000 46000 60000 75000 81000 90000 101000 136000 ...\n##   $ name      : chr  \"gap\" \"domain\" \"gap\" \"domain\" ...\n\nThe resulting domains object can be used to extract annotated domains, store them in topologicalFeatures of the original HiCExperiment, and optionally write a bed file to export them in text.\n\ntopologicalFeatures(hic, 'domain') &lt;- domains$bed |&gt; \n    mutate(chromStart = chromStart + 1) |&gt; \n    filter(name == 'domain') |&gt; \n    makeGRangesFromDataFrame()\ntopologicalFeatures(hic, 'domain')\n##  GRanges object with 52 ranges and 0 metadata columns:\n##         seqnames        ranges strand\n##            &lt;Rle&gt;     &lt;IRanges&gt;  &lt;Rle&gt;\n##     [1]       II    8001-30000      *\n##     [2]       II   35001-46000      *\n##     [3]       II   46001-60000      *\n##     [4]       II   60001-75000      *\n##     [5]       II   75001-81000      *\n##     ...      ...           ...    ...\n##    [48]       II 664001-681000      *\n##    [49]       II 681001-707000      *\n##    [50]       II 707001-714000      *\n##    [51]       II 714001-761000      *\n##    [52]       II 761001-806000      *\n##    -------\n##    seqinfo: 1 sequence from an unspecified genome; no seqlengths\n\nrtracklayer::export(topologicalFeatures(hic, 'domain'), 'hic_domains.bed')"
   },
   {
     "objectID": "interoperability.html#gothic",
     "href": "interoperability.html#gothic",
     "title": "\n9  Interoperability: using HiCExperiment with other R packages\n",
     "section": "\n9.4 GOTHiC",
-    "text": "9.4 GOTHiC\nGOTHiC relies on a cumulative binomial test to detect interactions between distal genomic loci that have significantly more reads than expected by chance in Hi-C experiments (Mifsud et al. (2017)).\n\n\n\n\n\n\nUsing the GOTHiC function\n\n\n\nUnfortunately, the main GOTHiC function require two .bam files as input. These files are often deleted due to their larger size, while the filtered pairs file itself is retained.\nMoreover, the internal nuts and bolts of the main GOTHiC function perform several operations that are not required in modern workflows:\n\n\nFiltering pairs from same restriction fragment; this step is now usually taken care of automatically, e.g. with HiCool Hi-C processing package.\n\nFiltering short-range pairs; the GOTHiC package hard-codes a 10kb lower threshold for minimum pair distance. More advanced optimized filtering approaches have been implemented since then, to circumvent the need for such hard-coded threshold.\n\nBinning pairs; this step is also already taken care of, when working with Hi-C matrices in modern formats, e.g. with .(m)cool files.\n\n\n\nBased on these facts, we can simplify the binomial test function provided by GOTHiC so that it can directly used binned interactions imported as a HiCExperiment object in R.\n\nShow the code for GOTHiC_binomial functionGOTHiC_binomial &lt;- function(x) {\n\n    if (length(trans(x)) != 0) stop(\"Only `cis` interactions can be used here.\")\n    ints &lt;- interactions(x) |&gt;\n        as.data.frame() |&gt; \n        select(seqnames1, start1, seqnames2, start2, count) |&gt;\n        dplyr::rename(chr1 = seqnames1, locus1 = start1, chr2 = seqnames2, locus2 = start2, frequencies = count) |&gt;\n        mutate(locus1 = locus1 - 1, locus2 = locus2 - 1) |&gt;\n        mutate(int1 = paste0(chr1, '_', locus1), int2 = paste0(chr2, '_', locus2))\n    \n    numberOfReadPairs &lt;- sum(ints$frequencies)\n    all_bins &lt;- unique(c(unique(ints$int1), unique(ints$int2)))\n    all_bins &lt;- sort(all_bins)\n    upperhalfBinNumber &lt;- (length(all_bins)^2 - length(all_bins))/2\n\n    cov &lt;- ints |&gt; \n        group_by(int1) |&gt; \n        tally(frequencies) |&gt; \n        full_join(ints |&gt; \n            group_by(int2) |&gt; \n            tally(frequencies), \n            by = c('int1' = 'int2')\n        ) |&gt; \n        rowwise() |&gt; \n        mutate(coverage = sum(n.x, n.y, na.rm = TRUE)) |&gt; \n        ungroup() |&gt;\n        mutate(relative_coverage = coverage/sum(coverage))\n    \n    results &lt;- mutate(ints,\n        cov1 = left_join(ints, select(cov, int1, relative_coverage), by = c('int1' = 'int1'))$relative_coverage, \n        cov2 = left_join(ints, select(cov, int1, relative_coverage), by = c('int2' = 'int1'))$relative_coverage,\n        probability = cov1 * cov2 * 2 * 1/(1 - sum(cov$relative_coverage^2)),\n        predicted = probability * numberOfReadPairs\n    ) |&gt; \n    rowwise() |&gt;\n    mutate(\n        pvalue = binom.test(\n            frequencies, \n            numberOfReadPairs, \n            probability,\n            alternative = \"greater\"\n        )$p.value\n    ) |&gt; \n    ungroup() |&gt; \n    mutate(\n        logFoldChange = log2(frequencies / predicted), \n        qvalue = stats::p.adjust(pvalue, method = \"BH\", n = upperhalfBinNumber)\n    )\n\n    scores(x, \"probability\") &lt;- results$probability\n    scores(x, \"predicted\") &lt;- results$predicted\n    scores(x, \"pvalue\") &lt;- results$pvalue\n    scores(x, \"qvalue\") &lt;- results$qvalue\n    scores(x, \"logFoldChange\") &lt;- results$logFoldChange\n\n    return(x)\n\n} \n\n\n\nres &lt;- GOTHiC_binomial(hic[\"II\"])\nres\n##  `HiCExperiment` object with 471,364 contacts over 802 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a594277bd62_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 74360 \n##  scores(7): count balanced probability predicted pvalue qvalue logFoldChange \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) domain(52) \n##  pairsFile: N/A \n##  metadata(0):\n\ninteractions(res)\n##  GInteractions object with 74360 interactions and 9 metadata columns:\n##            seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##                &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##        [1]        II        1-1000 ---        II     1001-2000 |       231\n##        [2]        II        1-1000 ---        II     5001-6000 |       231\n##        [3]        II        1-1000 ---        II     6001-7000 |       231\n##        [4]        II        1-1000 ---        II     8001-9000 |       231\n##        [5]        II        1-1000 ---        II    9001-10000 |       231\n##        ...       ...           ... ...       ...           ... .       ...\n##    [74356]        II 807001-808000 ---        II 809001-810000 |      1038\n##    [74357]        II 807001-808000 ---        II 810001-811000 |      1038\n##    [74358]        II 808001-809000 ---        II 808001-809000 |      1039\n##    [74359]        II 808001-809000 ---        II 809001-810000 |      1039\n##    [74360]        II 809001-810000 ---        II 809001-810000 |      1040\n##              bin_id2     count  balanced probability  predicted      pvalue\n##            &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;   &lt;numeric&gt;  &lt;numeric&gt;   &lt;numeric&gt;\n##        [1]       232         1       NaN 7.83580e-09 0.00369352 3.68670e-03\n##        [2]       236         2       NaN 2.81318e-08 0.01326033 8.71446e-05\n##        [3]       237         1       NaN 2.02960e-08 0.00956681 9.52120e-03\n##        [4]       239         2       NaN 6.73108e-08 0.03172791 4.92808e-04\n##        [5]       240         3       NaN 7.37336e-08 0.03475538 6.81713e-06\n##        ...       ...       ...       ...         ...        ...         ...\n##    [74356]      1040         8 0.0472023 3.85638e-07  0.1817758 2.51560e-11\n##    [74357]      1041         1       NaN 5.03006e-08  0.0237099 2.34310e-02\n##    [74358]      1039         1       NaN 8.74604e-08  0.0412257 4.03875e-02\n##    [74359]      1040         7       NaN 1.02111e-07  0.0481315 1.13834e-13\n##    [74360]      1040         2 0.0411355 1.19216e-07  0.0561941 1.52097e-03\n##                 qvalue logFoldChange\n##              &lt;numeric&gt;     &lt;numeric&gt;\n##        [1] 0.063385760       8.08079\n##        [2] 0.001926954       7.23674\n##        [3] 0.150288341       6.70775\n##        [4] 0.009806734       5.97810\n##        [5] 0.000173165       6.43158\n##        ...         ...           ...\n##    [74356] 1.07966e-09       5.45977\n##    [74357] 3.38098e-01       5.39837\n##    [74358] 5.49519e-01       4.60031\n##    [74359] 5.77259e-12       7.18423\n##    [74360] 2.79707e-02       5.15344\n##    -------\n##    regions: 802 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome"
+    "text": "9.4 GOTHiC\nGOTHiC relies on a cumulative binomial test to detect interactions between distal genomic loci that have significantly more reads than expected by chance in Hi-C experiments (Mifsud et al. (2017)).\n\n\n\n\n\n\nUsing the GOTHiC function\n\n\n\nUnfortunately, the main GOTHiC function require two .bam files as input. These files are often deleted due to their larger size, while the filtered pairs file itself is retained.\nMoreover, the internal nuts and bolts of the main GOTHiC function perform several operations that are not required in modern workflows:\n\n\nFiltering pairs from same restriction fragment; this step is now usually taken care of automatically, e.g. with HiCool Hi-C processing package.\n\nFiltering short-range pairs; the GOTHiC package hard-codes a 10kb lower threshold for minimum pair distance. More advanced optimized filtering approaches have been implemented since then, to circumvent the need for such hard-coded threshold.\n\nBinning pairs; this step is also already taken care of, when working with Hi-C matrices in modern formats, e.g. with .(m)cool files.\n\n\n\nBased on these facts, we can simplify the binomial test function provided by GOTHiC so that it can directly used binned interactions imported as a HiCExperiment object in R.\n\nShow the code for GOTHiC_binomial functionGOTHiC_binomial &lt;- function(x) {\n\n    if (length(trans(x)) != 0) stop(\"Only `cis` interactions can be used here.\")\n    ints &lt;- interactions(x) |&gt;\n        as.data.frame() |&gt; \n        select(seqnames1, start1, seqnames2, start2, count) |&gt;\n        dplyr::rename(chr1 = seqnames1, locus1 = start1, chr2 = seqnames2, locus2 = start2, frequencies = count) |&gt;\n        mutate(locus1 = locus1 - 1, locus2 = locus2 - 1) |&gt;\n        mutate(int1 = paste0(chr1, '_', locus1), int2 = paste0(chr2, '_', locus2))\n    \n    numberOfReadPairs &lt;- sum(ints$frequencies)\n    all_bins &lt;- unique(c(unique(ints$int1), unique(ints$int2)))\n    all_bins &lt;- sort(all_bins)\n    upperhalfBinNumber &lt;- (length(all_bins)^2 - length(all_bins))/2\n\n    cov &lt;- ints |&gt; \n        group_by(int1) |&gt; \n        tally(frequencies) |&gt; \n        full_join(ints |&gt; \n            group_by(int2) |&gt; \n            tally(frequencies), \n            by = c('int1' = 'int2')\n        ) |&gt; \n        rowwise() |&gt; \n        mutate(coverage = sum(n.x, n.y, na.rm = TRUE)) |&gt; \n        ungroup() |&gt;\n        mutate(relative_coverage = coverage/sum(coverage))\n    \n    results &lt;- mutate(ints,\n        cov1 = left_join(ints, select(cov, int1, relative_coverage), by = c('int1' = 'int1'))$relative_coverage, \n        cov2 = left_join(ints, select(cov, int1, relative_coverage), by = c('int2' = 'int1'))$relative_coverage,\n        probability = cov1 * cov2 * 2 * 1/(1 - sum(cov$relative_coverage^2)),\n        predicted = probability * numberOfReadPairs\n    ) |&gt; \n    rowwise() |&gt;\n    mutate(\n        pvalue = binom.test(\n            frequencies, \n            numberOfReadPairs, \n            probability,\n            alternative = \"greater\"\n        )$p.value\n    ) |&gt; \n    ungroup() |&gt; \n    mutate(\n        logFoldChange = log2(frequencies / predicted), \n        qvalue = stats::p.adjust(pvalue, method = \"BH\", n = upperhalfBinNumber)\n    )\n\n    scores(x, \"probability\") &lt;- results$probability\n    scores(x, \"predicted\") &lt;- results$predicted\n    scores(x, \"pvalue\") &lt;- results$pvalue\n    scores(x, \"qvalue\") &lt;- results$qvalue\n    scores(x, \"logFoldChange\") &lt;- results$logFoldChange\n\n    return(x)\n\n} \n\n\n\nres &lt;- GOTHiC_binomial(hic[\"II\"])\nres\n##  `HiCExperiment` object with 471,364 contacts over 802 regions \n##  -------\n##  fileName: \"/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752\" \n##  focus: \"II\" \n##  resolutions(5): 1000 2000 4000 8000 16000\n##  active resolution: 1000 \n##  interactions: 74360 \n##  scores(7): count balanced probability predicted pvalue qvalue logFoldChange \n##  topologicalFeatures: compartments(0) borders(0) loops(0) viewpoints(0) domain(52) \n##  pairsFile: N/A \n##  metadata(0):\n\ninteractions(res)\n##  GInteractions object with 74360 interactions and 9 metadata columns:\n##            seqnames1       ranges1     seqnames2       ranges2 |   bin_id1\n##                &lt;Rle&gt;     &lt;IRanges&gt;         &lt;Rle&gt;     &lt;IRanges&gt; | &lt;numeric&gt;\n##        [1]        II        1-1000 ---        II     1001-2000 |       231\n##        [2]        II        1-1000 ---        II     5001-6000 |       231\n##        [3]        II        1-1000 ---        II     6001-7000 |       231\n##        [4]        II        1-1000 ---        II     8001-9000 |       231\n##        [5]        II        1-1000 ---        II    9001-10000 |       231\n##        ...       ...           ... ...       ...           ... .       ...\n##    [74356]        II 807001-808000 ---        II 809001-810000 |      1038\n##    [74357]        II 807001-808000 ---        II 810001-811000 |      1038\n##    [74358]        II 808001-809000 ---        II 808001-809000 |      1039\n##    [74359]        II 808001-809000 ---        II 809001-810000 |      1039\n##    [74360]        II 809001-810000 ---        II 809001-810000 |      1040\n##              bin_id2     count  balanced probability  predicted      pvalue\n##            &lt;numeric&gt; &lt;numeric&gt; &lt;numeric&gt;   &lt;numeric&gt;  &lt;numeric&gt;   &lt;numeric&gt;\n##        [1]       232         1       NaN 7.83580e-09 0.00369352 3.68670e-03\n##        [2]       236         2       NaN 2.81318e-08 0.01326033 8.71446e-05\n##        [3]       237         1       NaN 2.02960e-08 0.00956681 9.52120e-03\n##        [4]       239         2       NaN 6.73108e-08 0.03172791 4.92808e-04\n##        [5]       240         3       NaN 7.37336e-08 0.03475538 6.81713e-06\n##        ...       ...       ...       ...         ...        ...         ...\n##    [74356]      1040         8 0.0472023 3.85638e-07  0.1817758 2.51560e-11\n##    [74357]      1041         1       NaN 5.03006e-08  0.0237099 2.34310e-02\n##    [74358]      1039         1       NaN 8.74604e-08  0.0412257 4.03875e-02\n##    [74359]      1040         7       NaN 1.02111e-07  0.0481315 1.13834e-13\n##    [74360]      1040         2 0.0411355 1.19216e-07  0.0561941 1.52097e-03\n##                 qvalue logFoldChange\n##              &lt;numeric&gt;     &lt;numeric&gt;\n##        [1] 0.063385760       8.08079\n##        [2] 0.001926954       7.23674\n##        [3] 0.150288341       6.70775\n##        [4] 0.009806734       5.97810\n##        [5] 0.000173165       6.43158\n##        ...         ...           ...\n##    [74356] 1.07966e-09       5.45977\n##    [74357] 3.38098e-01       5.39837\n##    [74358] 5.49519e-01       4.60031\n##    [74359] 5.77259e-12       7.18423\n##    [74360] 2.79707e-02       5.15344\n##    -------\n##    regions: 802 ranges and 4 metadata columns\n##    seqinfo: 16 sequences from an unspecified genome"
   },
   {
     "objectID": "interoperability.html#references",
     "href": "interoperability.html#references",
     "title": "\n9  Interoperability: using HiCExperiment with other R packages\n",
     "section": "References",
-    "text": "References\n\n\nAbdennur, Nezar, and Leonid A. Mirny. 2020. “Cooler: scalable storage for Hi-C data and other\ngenomically labeled arrays.” Bioinformatics 36\n(1): 311–16. https://doi.org/10.1093/bioinformatics/btz540.\n\n\nAy, Ferhat, Timothy L. Bailey, and William Stafford Noble. 2014.\n“Statistical confidence estimation for Hi-C\ndata reveals regulatory chromatin contacts.” Genome\nRes. 24 (6): 999–1011. https://doi.org/10.1101/gr.160374.113.\n\n\nBengtsson, Henrik, Hanjun Shin, Harris Lazaris, Gangqing Hu, and\nXianghong Zhou. 2020. R Package TopDom: An Efficient and\nDeterministic Method for Identifying Topological Domains in\nGenomes. https://github.com/HenrikBengtsson/TopDom.\n\n\nCournac, Axel, Hervé Marie-Nelly, Martial Marbouty, Romain Koszul, and\nJulien Mozziconacci. 2012. “Normalization of\na chromosomal contact map.” BMC Genomics 13 (1):\n1–13. https://doi.org/10.1186/1471-2164-13-436.\n\n\nCrane, Emily, Qian Bian, Rachel Patton McCord, Bryan R. Lajoie, Bayly S.\nWheeler, Edward J. Ralston, Satoru Uzawa, Job Dekker, and Barbara J.\nMeyer. 2015. “Condensin-driven remodelling of\nX chromosome topology during dosage compensation.”\nNature 523 (July): 240–44. https://doi.org/10.1038/nature14450.\n\n\nDekker, Job, Karsten Rippe, Martijn Dekker, and Nancy Kleckner. 2002.\n“Capturing Chromosome Conformation.”\nScience 295 (5558): 1306–11. https://doi.org/10.1126/science.1067799.\n\n\nDeshpande, Aditya S., Netha Ulahannan, Matthew Pendleton, Xiaoguang Dai,\nLynn Ly, Julie M. Behr, Stefan Schwenk, et al. 2022. “Identifying synergistic high-order 3D chromatin\nconformations from genome-scale nanopore concatemer\nsequencing.” Nat. Biotechnol. 40 (October):\n1488–99. https://doi.org/10.1038/s41587-022-01289-z.\n\n\nDuan, Zhijun, Mirela Andronescu, Kevin Schutz, Sean McIlwain, Yoo Jung\nKim, Choli Lee, Jay Shendure, Stanley Fields, C. Anthony Blau, and\nWilliam S. Noble. 2010. “A three-dimensional\nmodel of the yeast genome.” Nature 465 (May):\n363–67. https://doi.org/10.1038/nature08973.\n\n\nDurand, Neva C., Muhammad S. Shamim, Ido Machol, Suhas S. P. Rao, Miriam\nH. Huntley, Eric S. Lander, and Erez Lieberman Aiden. 2016. “Juicer provides a one-click system for analyzing\nloop-resolution Hi-C experiments.” Cell Systems 3\n(1): 95. https://doi.org/10.1016/j.cels.2016.07.002.\n\n\nGibcus, Johan H., Kumiko Samejima, Anton Goloborodko, Itaru Samejima,\nNatalia Naumova, Johannes Nuebler, Masato T. Kanemaki, et al. 2018.\n“A pathway for mitotic chromosome\nformation.” Science 359 (6376): eaao6135. https://doi.org/10.1126/science.aao6135.\n\n\nHuber, Wolfgang, Vincent J. Carey, Robert Gentleman, Simon Anders, Marc\nCarlson, Benilton S. Carvalho, Hector Corrada Bravo, et al. 2015.\n“Orchestrating high-throughput genomic\nanalysis with Bioconductor.” Nat. Methods 12 (2):\n115–21. https://doi.org/10.1038/nmeth.3252.\n\n\nImakaev, Maxim, Geoffrey Fudenberg, Rachel Patton McCord, Natalia\nNaumova, Anton Goloborodko, Bryan R. Lajoie, Job Dekker, and Leonid A.\nMirny. 2012. “Iterative correction of Hi-C\ndata reveals hallmarks of chromosome organization.”\nNat. Methods 9 (October): 999–1003. https://doi.org/10.1038/nmeth.2148.\n\n\nJ. O., Davies, Oudelaar A. M., Higgs D. R., and Hughes J. R. 2017.\n“How best to identify chromosomal\ninteractions: a comparison of approaches.” Nat.\nMethods 14 (2): 125–34. https://doi.org/10.1038/nmeth.4146.\n\n\nKrietenstein, Nils, Sameer Abraham, Sergey V. Venev, Nezar Abdennur,\nJohan Gibcus, Tsung-Han S. Hsieh, Krishna Mohan Parsi, et al. 2020.\n“Ultrastructural Details of Mammalian\nChromosome Architecture.” Mol. Cell 78 (3):\n554–565.e7. https://doi.org/10.1016/j.molcel.2020.03.003.\n\n\nKrismer, Konstantin, Yuchun Guo, and David K. Gifford. 2020.\n“IDR2D identifies reproducible genomic\ninteractions.” Nucleic Acids Res. 48 (6): e31. https://doi.org/10.1093/nar/gkaa030.\n\n\nLieberman-Aiden, Erez, Nynke L. van Berkum, Louise Williams, Maxim\nImakaev, Tobias Ragoczy, Agnes Telling, Ido Amit, et al. 2009.\n“Comprehensive mapping of long range\ninteractions reveals folding principles of the human\ngenome.” Science (New York, N.Y.) 326 (5950):\n289. https://doi.org/10.1126/science.1181369.\n\n\nLioy, Virginia S., Axel Cournac, Martial Marbouty, Stéphane Duigou,\nJulien Mozziconacci, Olivier Espéli, Frédéric Boccard, and Romain\nKoszul. 2018. “Multiscale Structuring of the\nE. coli Chromosome by Nucleoid-Associated and Condensin\nProteins.” Cell 172 (4): 771–78318. https://doi.org/10.1016/j.cell.2017.12.027.\n\n\nLun, Aaron T. L., Malcolm Perry, and Elizabeth Ing-Simmons. 2016.\n“Infrastructure for genomic interactions:\nBioconductor classes for Hi-C, ChIA-PET and related\nexperiments.” F1000Research 5 (May): 950. https://doi.org/10.12688/f1000research.8759.2.\n\n\nMatthey-Doret, Cyril, Lyam Baudry, Axel Breuer, Rémi Montagne, Nadège\nGuiglielmoni, Vittore Scolari, Etienne Jean, et al. 2020. “Computer vision for pattern detection in chromosome\ncontact maps.” Nat. Commun. 11 (5795): 1–11. https://doi.org/10.1038/s41467-020-19562-7.\n\n\nMifsud, Borbala, Inigo Martincorena, Elodie Darbo, Robert Sugar, Stefan\nSchoenfelder, Peter Fraser, and Nicholas M. Luscombe. 2017. “GOTHiC, a probabilistic model to resolve complex biases\nand to identify real interactions in Hi-C data.” PLoS\nOne 12 (4): e0174744. https://doi.org/10.1371/journal.pone.0174744.\n\n\nOpen2C, Nezar Abdennur, Geoffrey Fudenberg, Ilya M. Flyamer, Aleksandra\nA. Galitsyna, Anton Goloborodko, Maxim Imakaev, and Sergey V. Venev.\n2023. “Pairtools: from sequencing data to\nchromosome contacts.” bioRxiv, February,\n2023.02.13.528389. https://doi.org/10.1101/2023.02.13.528389.\n\n\nPope, Benjamin D., Tyrone Ryba, Vishnu Dileep, Feng Yue, Weisheng Wu,\nOlgert Denas, Daniel L. Vera, et al. 2014. “Topologically associating domains are stable units of\nreplication-timing regulation.” Nature 515\n(7527): 402–5. https://doi.org/10.1038/nature13986.\n\n\nSchmitt, Anthony D., Ming Hu, Inkyung Jung, Zheng Xu, Yunjiang Qiu,\nCatherine L. Tan, Yun Li, et al. 2016. “A\nCompendium of Chromatin Contact Maps Reveals Spatially Active Regions in\nthe Human Genome.” Cell Rep. 17 (8): 2042–59. https://doi.org/10.1016/j.celrep.2016.10.061.\n\n\nSefer, Emre. 2022. “A comparison of\ntopologically associating domain callers over mammals at high\nresolution.” BMC Bioinf. 23 (1): 1–39. https://doi.org/10.1186/s12859-022-04674-2.\n\n\nServant, Nicolas, Nelle Varoquaux, Bryan R. Lajoie, Eric Viara,\nChong-Jian Chen, Jean-Philippe Vert, Edith Heard, Job Dekker, and\nEmmanuel Barillot. 2015. “HiC-Pro: an\noptimized and flexible pipeline for Hi-C data processing.”\nGenome Biol. 16 (1): 1–11. https://doi.org/10.1186/s13059-015-0831-x.\n\n\nShin, Hanjun, Yi Shi, Chao Dai, Harianto Tjong, Ke Gong, Frank Alber,\nand Xianghong Jasmine Zhou. 2016. “TopDom: An Efficient and\nDeterministic Method for Identifying Topological Domains in\nGenomes.” Nucleic Acids Research 44 (7): e70. https://doi.org/10.1093/nar/gkv1505.\n\n\nStadhouders, Ralph, Guillaume J. Filion, and Thomas Graf. 2019.\n“Transcription factors and 3D genome\nconformation in cell-fate decisions.” Nature 569\n(7756): 345–54. https://doi.org/10.1038/s41586-019-1182-7.\n\n\nStansfield, John C., Kellen G. Cresswell, and Mikhail G. Dozmorov. 2019.\n“multiHiCcompare: joint normalization and\ncomparative analysis of complex Hi-C experiments.”\nBioinformatics 35 (17): 2916–23. https://doi.org/10.1093/bioinformatics/btz048.\n\n\nTavares-Cadete, Filipe, Davood Norouzi, Bastiaan Dekker, Yu Liu, and Job\nDekker. 2020. “Multi-contact 3C reveals that\nthe human genome during interphase is largely not\nentangled.” Nat. Struct. Mol. Biol. 27\n(December): 1105–14. https://doi.org/10.1038/s41594-020-0506-5.\n\n\nYang, Tao, Feipeng Zhang, Galip Gürkan Yardımcı, Fan Song, Ross C.\nHardison, William Stafford Noble, Feng Yue, and Qunhua Li. 2017.\n“HiCRep: assessing the reproducibility of\nHi-C data using a stratum-adjusted correlation\ncoefficient.” Genome Res. 27 (11): 1939–49. https://doi.org/10.1101/gr.220640.117."
+    "text": "References"
   },
   {
     "objectID": "interoperability.html#session-info",
     "href": "interoperability.html#session-info",
     "title": "\n9  Interoperability: using HiCExperiment with other R packages\n",
     "section": "Session info",
-    "text": "Session info\n\n##  ─ Session info ────────────────────────────────────────────────────────────\n##   setting  value\n##   version  R version 4.3.1 (2023-06-16)\n##   os       Ubuntu 22.04.3 LTS\n##   system   x86_64, linux-gnu\n##   ui       X11\n##   language (EN)\n##   collate  en_US.UTF-8\n##   ctype    en_US.UTF-8\n##   tz       Etc/UTC\n##   date     2023-09-25\n##   pandoc   3.1.1 @ /usr/local/bin/ (via rmarkdown)\n##  \n##  ─ Packages ────────────────────────────────────────────────────────────────\n##   package                * version   date (UTC) lib source\n##   abind                    1.4-5     2016-07-21 [1] CRAN (R 4.3.1)\n##   aggregation              1.0.1     2018-01-25 [1] CRAN (R 4.3.1)\n##   AnnotationDbi            1.63.2    2023-07-02 [1] Bioconductor\n##   AnnotationHub          * 3.9.2     2023-08-24 [1] Bioconductor\n##   Biobase                * 2.61.0    2023-04-25 [1] Bioconductor\n##   BiocFileCache          * 2.9.1     2023-07-12 [1] Bioconductor\n##   BiocGenerics           * 0.47.0    2023-04-25 [1] Bioconductor\n##   BiocIO                   1.11.0    2023-04-25 [1] Bioconductor\n##   BiocManager              1.30.22   2023-08-08 [1] CRAN (R 4.3.1)\n##   BiocParallel             1.35.4    2023-08-17 [1] Bioconductor\n##   BiocVersion              3.18.0    2023-04-25 [1] Bioconductor\n##   Biostrings               2.69.2    2023-07-02 [1] Bioconductor\n##   bit                      4.0.5     2022-11-15 [1] CRAN (R 4.3.1)\n##   bit64                    4.0.5     2020-08-30 [1] CRAN (R 4.3.1)\n##   bitops                   1.0-7     2021-04-24 [1] CRAN (R 4.3.1)\n##   blob                     1.2.4     2023-03-17 [1] CRAN (R 4.3.1)\n##   cachem                   1.0.8     2023-05-01 [1] CRAN (R 4.3.1)\n##   calibrate                1.7.7     2020-06-19 [1] CRAN (R 4.3.1)\n##   cli                      3.6.1     2023-03-23 [1] CRAN (R 4.3.1)\n##   codetools                0.2-19    2023-02-01 [2] CRAN (R 4.3.1)\n##   colorspace               2.1-0     2023-01-23 [1] CRAN (R 4.3.1)\n##   crayon                   1.5.2     2022-09-29 [1] CRAN (R 4.3.1)\n##   curl                     5.0.2     2023-08-14 [1] CRAN (R 4.3.1)\n##   data.table               1.14.8    2023-02-17 [1] CRAN (R 4.3.1)\n##   DBI                      1.1.3     2022-06-18 [1] CRAN (R 4.3.1)\n##   dbplyr                 * 2.3.3     2023-07-07 [1] CRAN (R 4.3.1)\n##   DelayedArray             0.27.10   2023-07-28 [1] Bioconductor\n##   digest                   0.6.33    2023-07-07 [1] CRAN (R 4.3.1)\n##   dplyr                  * 1.1.3     2023-09-03 [1] CRAN (R 4.3.1)\n##   edgeR                    3.43.8    2023-08-02 [1] Bioconductor\n##   ellipsis                 0.3.2     2021-04-29 [1] CRAN (R 4.3.1)\n##   evaluate                 0.21      2023-05-05 [1] CRAN (R 4.3.1)\n##   ExperimentHub          * 2.9.1     2023-07-12 [1] Bioconductor\n##   fansi                    1.0.4     2023-01-22 [1] CRAN (R 4.3.1)\n##   fastmap                  1.1.1     2023-02-24 [1] CRAN (R 4.3.1)\n##   filelock                 1.0.2     2018-10-05 [1] CRAN (R 4.3.1)\n##   generics                 0.1.3     2022-07-05 [1] CRAN (R 4.3.1)\n##   GenomeInfoDb           * 1.37.4    2023-09-07 [1] Bioconductor\n##   GenomeInfoDbData         1.2.10    2023-09-25 [1] Bioconductor\n##   GenomicAlignments        1.37.0    2023-04-25 [1] Bioconductor\n##   GenomicRanges          * 1.53.1    2023-05-04 [1] Bioconductor\n##   ggplot2                * 3.4.3     2023-08-14 [1] CRAN (R 4.3.1)\n##   glue                     1.6.2     2022-02-24 [1] CRAN (R 4.3.1)\n##   gridExtra                2.3       2017-09-09 [1] CRAN (R 4.3.1)\n##   gtable                   0.3.4     2023-08-21 [1] CRAN (R 4.3.1)\n##   gtools                   3.9.4     2022-11-27 [1] CRAN (R 4.3.1)\n##   HiCcompare               1.23.1    2023-06-04 [1] Bioconductor\n##   HiCExperiment          * 1.1.2     2023-09-04 [1] Bioconductor\n##   HiContactsData         * 1.3.0     2023-04-27 [1] Bioconductor\n##   hicrep                 * 1.12.2    2023-09-25 [1] Github (TaoYang-dev/hicrep@e485dfa)\n##   htmltools                0.5.6     2023-08-10 [1] CRAN (R 4.3.1)\n##   htmlwidgets              1.6.2     2023-03-17 [1] CRAN (R 4.3.1)\n##   httpuv                   1.6.11    2023-05-11 [1] CRAN (R 4.3.1)\n##   httr                     1.4.7     2023-08-15 [1] CRAN (R 4.3.1)\n##   InteractionSet         * 1.29.1    2023-06-14 [1] Bioconductor\n##   interactiveDisplayBase   1.39.0    2023-04-25 [1] Bioconductor\n##   IRanges                * 2.35.2    2023-06-22 [1] Bioconductor\n##   jsonlite                 1.8.7     2023-06-29 [1] CRAN (R 4.3.1)\n##   KEGGREST                 1.41.0    2023-04-25 [1] Bioconductor\n##   KernSmooth               2.23-22   2023-07-10 [1] CRAN (R 4.3.1)\n##   knitr                    1.44      2023-09-11 [1] CRAN (R 4.3.1)\n##   later                    1.3.1     2023-05-02 [1] CRAN (R 4.3.1)\n##   lattice                  0.21-8    2023-04-05 [2] CRAN (R 4.3.1)\n##   lifecycle                1.0.3     2022-10-07 [1] CRAN (R 4.3.1)\n##   limma                    3.57.7    2023-08-02 [1] Bioconductor\n##   locfit                   1.5-9.8   2023-06-11 [1] CRAN (R 4.3.1)\n##   magrittr                 2.0.3     2022-03-30 [1] CRAN (R 4.3.1)\n##   MASS                     7.3-60    2023-05-04 [2] CRAN (R 4.3.1)\n##   Matrix                   1.6-1.1   2023-09-18 [1] CRAN (R 4.3.1)\n##   MatrixGenerics         * 1.13.1    2023-07-25 [1] Bioconductor\n##   matrixStats            * 1.0.0     2023-06-02 [1] CRAN (R 4.3.1)\n##   memoise                  2.0.1     2021-11-26 [1] CRAN (R 4.3.1)\n##   mgcv                     1.9-0     2023-07-11 [1] CRAN (R 4.3.1)\n##   mime                     0.12      2021-09-28 [1] CRAN (R 4.3.1)\n##   multiHiCcompare        * 1.19.1    2023-07-02 [1] Bioconductor\n##   munsell                  0.5.0     2018-06-12 [1] CRAN (R 4.3.1)\n##   nlme                     3.1-163   2023-08-09 [1] CRAN (R 4.3.1)\n##   pbapply                  1.7-2     2023-06-27 [1] CRAN (R 4.3.1)\n##   pheatmap                 1.0.12    2019-01-04 [1] CRAN (R 4.3.1)\n##   pillar                   1.9.0     2023-03-22 [1] CRAN (R 4.3.1)\n##   pkgconfig                2.0.3     2019-09-22 [1] CRAN (R 4.3.1)\n##   plyr                     1.8.8     2022-11-11 [1] CRAN (R 4.3.1)\n##   png                      0.1-8     2022-11-29 [1] CRAN (R 4.3.1)\n##   promises                 1.2.1     2023-08-10 [1] CRAN (R 4.3.1)\n##   purrr                  * 1.0.2     2023-08-10 [1] CRAN (R 4.3.1)\n##   qqman                    0.1.9     2023-08-23 [1] CRAN (R 4.3.1)\n##   R6                       2.5.1     2021-08-19 [1] CRAN (R 4.3.1)\n##   rappdirs                 0.3.3     2021-01-31 [1] CRAN (R 4.3.1)\n##   RColorBrewer             1.1-3     2022-04-03 [1] CRAN (R 4.3.1)\n##   Rcpp                     1.0.11    2023-07-06 [1] CRAN (R 4.3.1)\n##   RCurl                    1.98-1.12 2023-03-27 [1] CRAN (R 4.3.1)\n##   reshape2                 1.4.4     2020-04-09 [1] CRAN (R 4.3.1)\n##   restfulr                 0.0.15    2022-06-16 [1] CRAN (R 4.3.1)\n##   rhdf5                    2.45.1    2023-07-10 [1] Bioconductor\n##   rhdf5filters             1.13.5    2023-07-19 [1] Bioconductor\n##   Rhdf5lib                 1.23.2    2023-09-10 [1] Bioconductor\n##   rjson                    0.2.21    2022-01-09 [1] CRAN (R 4.3.1)\n##   rlang                    1.1.1     2023-04-28 [1] CRAN (R 4.3.1)\n##   rmarkdown                2.25      2023-09-18 [1] CRAN (R 4.3.1)\n##   Rsamtools                2.17.0    2023-04-25 [1] Bioconductor\n##   RSQLite                  2.3.1     2023-04-03 [1] CRAN (R 4.3.1)\n##   rstudioapi               0.15.0    2023-07-07 [1] CRAN (R 4.3.1)\n##   rtracklayer              1.61.1    2023-08-15 [1] Bioconductor\n##   S4Arrays                 1.1.6     2023-08-30 [1] Bioconductor\n##   S4Vectors              * 0.39.2    2023-09-22 [1] Bioconductor\n##   scales                   1.2.1     2022-08-20 [1] CRAN (R 4.3.1)\n##   sessioninfo              1.2.2     2021-12-06 [1] CRAN (R 4.3.1)\n##   shiny                    1.7.5     2023-08-12 [1] CRAN (R 4.3.1)\n##   SparseArray              1.1.12    2023-08-31 [1] Bioconductor\n##   statmod                  1.5.0     2023-01-06 [1] CRAN (R 4.3.1)\n##   strawr                   0.0.91    2023-03-29 [1] CRAN (R 4.3.1)\n##   stringi                  1.7.12    2023-01-11 [1] CRAN (R 4.3.1)\n##   stringr                  1.5.0     2022-12-02 [1] CRAN (R 4.3.1)\n##   SummarizedExperiment   * 1.31.1    2023-05-01 [1] Bioconductor\n##   tibble                   3.2.1     2023-03-20 [1] CRAN (R 4.3.1)\n##   tidyr                  * 1.3.0     2023-01-24 [1] CRAN (R 4.3.1)\n##   tidyselect               1.2.0     2022-10-10 [1] CRAN (R 4.3.1)\n##   TopDom                 * 0.10.1    2021-05-06 [1] CRAN (R 4.3.1)\n##   tzdb                     0.4.0     2023-05-12 [1] CRAN (R 4.3.1)\n##   utf8                     1.2.3     2023-01-31 [1] CRAN (R 4.3.1)\n##   vctrs                    0.6.3     2023-06-14 [1] CRAN (R 4.3.1)\n##   vroom                    1.6.3     2023-04-28 [1] CRAN (R 4.3.1)\n##   withr                    2.5.0     2022-03-03 [1] CRAN (R 4.3.1)\n##   xfun                     0.40      2023-08-09 [1] CRAN (R 4.3.1)\n##   XML                      3.99-0.14 2023-03-19 [1] CRAN (R 4.3.1)\n##   xtable                   1.8-4     2019-04-21 [1] CRAN (R 4.3.1)\n##   XVector                  0.41.1    2023-05-03 [1] Bioconductor\n##   yaml                     2.3.7     2023-01-23 [1] CRAN (R 4.3.1)\n##   zlibbioc                 1.47.0    2023-04-25 [1] Bioconductor\n##  \n##   [1] /usr/local/lib/R/site-library\n##   [2] /usr/local/lib/R/library\n##  \n##  ───────────────────────────────────────────────────────────────────────────"
+    "text": "Session info\n\n##  ─ Session info ────────────────────────────────────────────────────────────\n##   setting  value\n##   version  R version 4.3.1 (2023-06-16)\n##   os       Ubuntu 22.04.3 LTS\n##   system   x86_64, linux-gnu\n##   ui       X11\n##   language (EN)\n##   collate  en_US.UTF-8\n##   ctype    en_US.UTF-8\n##   tz       Etc/UTC\n##   date     2023-10-19\n##   pandoc   3.1.1 @ /usr/local/bin/ (via rmarkdown)\n##  \n##  ─ Packages ────────────────────────────────────────────────────────────────\n##   package                * version   date (UTC) lib source\n##   abind                    1.4-5     2016-07-21 [1] CRAN (R 4.3.1)\n##   aggregation              1.0.1     2018-01-25 [1] CRAN (R 4.3.1)\n##   AnnotationDbi            1.63.2    2023-07-02 [1] Bioconductor\n##   AnnotationHub          * 3.9.2     2023-08-24 [1] Bioconductor\n##   Biobase                * 2.61.0    2023-04-25 [1] Bioconductor\n##   BiocFileCache          * 2.9.1     2023-07-12 [1] Bioconductor\n##   BiocGenerics           * 0.47.0    2023-04-25 [1] Bioconductor\n##   BiocIO                   1.11.0    2023-04-25 [1] Bioconductor\n##   BiocManager              1.30.22   2023-08-08 [1] CRAN (R 4.3.1)\n##   BiocParallel             1.35.4    2023-08-17 [1] Bioconductor\n##   BiocVersion              3.18.0    2023-04-25 [1] Bioconductor\n##   Biostrings               2.69.2    2023-07-02 [1] Bioconductor\n##   bit                      4.0.5     2022-11-15 [1] CRAN (R 4.3.1)\n##   bit64                    4.0.5     2020-08-30 [1] CRAN (R 4.3.1)\n##   bitops                   1.0-7     2021-04-24 [1] CRAN (R 4.3.1)\n##   blob                     1.2.4     2023-03-17 [1] CRAN (R 4.3.1)\n##   cachem                   1.0.8     2023-05-01 [1] CRAN (R 4.3.1)\n##   calibrate                1.7.7     2020-06-19 [1] CRAN (R 4.3.1)\n##   cli                      3.6.1     2023-03-23 [1] CRAN (R 4.3.1)\n##   codetools                0.2-19    2023-02-01 [2] CRAN (R 4.3.1)\n##   colorspace               2.1-0     2023-01-23 [1] CRAN (R 4.3.1)\n##   crayon                   1.5.2     2022-09-29 [1] CRAN (R 4.3.1)\n##   curl                     5.1.0     2023-10-02 [1] CRAN (R 4.3.1)\n##   data.table               1.14.8    2023-02-17 [1] CRAN (R 4.3.1)\n##   DBI                      1.1.3     2022-06-18 [1] CRAN (R 4.3.1)\n##   dbplyr                 * 2.3.4     2023-09-26 [1] CRAN (R 4.3.1)\n##   DelayedArray             0.27.10   2023-07-28 [1] Bioconductor\n##   digest                   0.6.33    2023-07-07 [1] CRAN (R 4.3.1)\n##   dplyr                  * 1.1.3     2023-09-03 [1] CRAN (R 4.3.1)\n##   edgeR                    3.99.3    2023-10-16 [1] Bioconductor\n##   ellipsis                 0.3.2     2021-04-29 [1] CRAN (R 4.3.1)\n##   evaluate                 0.22      2023-09-29 [1] CRAN (R 4.3.1)\n##   ExperimentHub          * 2.9.1     2023-07-12 [1] Bioconductor\n##   fansi                    1.0.5     2023-10-08 [1] CRAN (R 4.3.1)\n##   fastmap                  1.1.1     2023-02-24 [1] CRAN (R 4.3.1)\n##   filelock                 1.0.2     2018-10-05 [1] CRAN (R 4.3.1)\n##   generics                 0.1.3     2022-07-05 [1] CRAN (R 4.3.1)\n##   GenomeInfoDb           * 1.37.6    2023-10-02 [1] Bioconductor\n##   GenomeInfoDbData         1.2.11    2023-10-19 [1] Bioconductor\n##   GenomicAlignments        1.37.0    2023-04-25 [1] Bioconductor\n##   GenomicRanges          * 1.53.2    2023-10-08 [1] Bioconductor\n##   ggplot2                * 3.4.4     2023-10-12 [1] CRAN (R 4.3.1)\n##   glue                     1.6.2     2022-02-24 [1] CRAN (R 4.3.1)\n##   gridExtra                2.3       2017-09-09 [1] CRAN (R 4.3.1)\n##   gtable                   0.3.4     2023-08-21 [1] CRAN (R 4.3.1)\n##   gtools                   3.9.4     2022-11-27 [1] CRAN (R 4.3.1)\n##   HiCcompare               1.23.1    2023-06-04 [1] Bioconductor\n##   HiCExperiment          * 1.1.2     2023-09-04 [1] Bioconductor\n##   HiContactsData         * 1.3.0     2023-04-27 [1] Bioconductor\n##   hicrep                 * 1.12.2    2023-10-19 [1] Github (TaoYang-dev/hicrep@e485dfa)\n##   htmltools                0.5.6.1   2023-10-06 [1] CRAN (R 4.3.1)\n##   htmlwidgets              1.6.2     2023-03-17 [1] CRAN (R 4.3.1)\n##   httpuv                   1.6.11    2023-05-11 [1] CRAN (R 4.3.1)\n##   httr                     1.4.7     2023-08-15 [1] CRAN (R 4.3.1)\n##   InteractionSet         * 1.29.1    2023-06-14 [1] Bioconductor\n##   interactiveDisplayBase   1.39.0    2023-04-25 [1] Bioconductor\n##   IRanges                * 2.35.3    2023-10-12 [1] Bioconductor\n##   jsonlite                 1.8.7     2023-06-29 [1] CRAN (R 4.3.1)\n##   KEGGREST                 1.41.4    2023-09-25 [1] Bioconductor\n##   KernSmooth               2.23-22   2023-07-10 [1] CRAN (R 4.3.1)\n##   knitr                    1.44      2023-09-11 [1] CRAN (R 4.3.1)\n##   later                    1.3.1     2023-05-02 [1] CRAN (R 4.3.1)\n##   lattice                  0.21-9    2023-10-01 [1] CRAN (R 4.3.1)\n##   lifecycle                1.0.3     2022-10-07 [1] CRAN (R 4.3.1)\n##   limma                    3.57.10   2023-10-11 [1] Bioconductor\n##   locfit                   1.5-9.8   2023-06-11 [1] CRAN (R 4.3.1)\n##   magrittr                 2.0.3     2022-03-30 [1] CRAN (R 4.3.1)\n##   MASS                     7.3-60    2023-05-04 [2] CRAN (R 4.3.1)\n##   Matrix                   1.6-1.1   2023-09-18 [1] CRAN (R 4.3.1)\n##   MatrixGenerics         * 1.13.1    2023-07-25 [1] Bioconductor\n##   matrixStats            * 1.0.0     2023-06-02 [1] CRAN (R 4.3.1)\n##   memoise                  2.0.1     2021-11-26 [1] CRAN (R 4.3.1)\n##   mgcv                     1.9-0     2023-07-11 [1] CRAN (R 4.3.1)\n##   mime                     0.12      2021-09-28 [1] CRAN (R 4.3.1)\n##   multiHiCcompare        * 1.19.1    2023-07-02 [1] Bioconductor\n##   munsell                  0.5.0     2018-06-12 [1] CRAN (R 4.3.1)\n##   nlme                     3.1-163   2023-08-09 [1] CRAN (R 4.3.1)\n##   pbapply                  1.7-2     2023-06-27 [1] CRAN (R 4.3.1)\n##   pheatmap                 1.0.12    2019-01-04 [1] CRAN (R 4.3.1)\n##   pillar                   1.9.0     2023-03-22 [1] CRAN (R 4.3.1)\n##   pkgconfig                2.0.3     2019-09-22 [1] CRAN (R 4.3.1)\n##   plyr                     1.8.9     2023-10-02 [1] CRAN (R 4.3.1)\n##   png                      0.1-8     2022-11-29 [1] CRAN (R 4.3.1)\n##   promises                 1.2.1     2023-08-10 [1] CRAN (R 4.3.1)\n##   purrr                  * 1.0.2     2023-08-10 [1] CRAN (R 4.3.1)\n##   qqman                    0.1.9     2023-08-23 [1] CRAN (R 4.3.1)\n##   R6                       2.5.1     2021-08-19 [1] CRAN (R 4.3.1)\n##   rappdirs                 0.3.3     2021-01-31 [1] CRAN (R 4.3.1)\n##   RColorBrewer             1.1-3     2022-04-03 [1] CRAN (R 4.3.1)\n##   Rcpp                     1.0.11    2023-07-06 [1] CRAN (R 4.3.1)\n##   RCurl                    1.98-1.12 2023-03-27 [1] CRAN (R 4.3.1)\n##   reshape2                 1.4.4     2020-04-09 [1] CRAN (R 4.3.1)\n##   restfulr                 0.0.15    2022-06-16 [1] CRAN (R 4.3.1)\n##   rhdf5                    2.45.1    2023-07-10 [1] Bioconductor\n##   rhdf5filters             1.13.5    2023-07-19 [1] Bioconductor\n##   Rhdf5lib                 1.23.2    2023-09-10 [1] Bioconductor\n##   rjson                    0.2.21    2022-01-09 [1] CRAN (R 4.3.1)\n##   rlang                    1.1.1     2023-04-28 [1] CRAN (R 4.3.1)\n##   rmarkdown                2.25      2023-09-18 [1] CRAN (R 4.3.1)\n##   Rsamtools                2.17.0    2023-04-25 [1] Bioconductor\n##   RSQLite                  2.3.1     2023-04-03 [1] CRAN (R 4.3.1)\n##   rstudioapi               0.15.0    2023-07-07 [1] CRAN (R 4.3.1)\n##   rtracklayer              1.61.1    2023-08-15 [1] Bioconductor\n##   S4Arrays                 1.1.6     2023-08-30 [1] Bioconductor\n##   S4Vectors              * 0.39.3    2023-10-11 [1] Bioconductor\n##   scales                   1.2.1     2022-08-20 [1] CRAN (R 4.3.1)\n##   sessioninfo              1.2.2     2021-12-06 [1] CRAN (R 4.3.1)\n##   shiny                    1.7.5.1   2023-10-14 [1] CRAN (R 4.3.1)\n##   SparseArray              1.1.12    2023-08-31 [1] Bioconductor\n##   statmod                  1.5.0     2023-01-06 [1] CRAN (R 4.3.1)\n##   strawr                   0.0.91    2023-03-29 [1] CRAN (R 4.3.1)\n##   stringi                  1.7.12    2023-01-11 [1] CRAN (R 4.3.1)\n##   stringr                  1.5.0     2022-12-02 [1] CRAN (R 4.3.1)\n##   SummarizedExperiment   * 1.31.1    2023-05-01 [1] Bioconductor\n##   tibble                   3.2.1     2023-03-20 [1] CRAN (R 4.3.1)\n##   tidyr                  * 1.3.0     2023-01-24 [1] CRAN (R 4.3.1)\n##   tidyselect               1.2.0     2022-10-10 [1] CRAN (R 4.3.1)\n##   TopDom                 * 0.10.1    2021-05-06 [1] CRAN (R 4.3.1)\n##   tzdb                     0.4.0     2023-05-12 [1] CRAN (R 4.3.1)\n##   utf8                     1.2.3     2023-01-31 [1] CRAN (R 4.3.1)\n##   vctrs                    0.6.4     2023-10-12 [1] CRAN (R 4.3.1)\n##   vroom                    1.6.4     2023-10-02 [1] CRAN (R 4.3.1)\n##   withr                    2.5.1     2023-09-26 [1] CRAN (R 4.3.1)\n##   xfun                     0.40      2023-08-09 [1] CRAN (R 4.3.1)\n##   XML                      3.99-0.14 2023-03-19 [1] CRAN (R 4.3.1)\n##   xtable                   1.8-4     2019-04-21 [1] CRAN (R 4.3.1)\n##   XVector                  0.41.1    2023-05-03 [1] Bioconductor\n##   yaml                     2.3.7     2023-01-23 [1] CRAN (R 4.3.1)\n##   zlibbioc                 1.47.0    2023-04-25 [1] Bioconductor\n##  \n##   [1] /usr/local/lib/R/site-library\n##   [2] /usr/local/lib/R/library\n##  \n##  ───────────────────────────────────────────────────────────────────────────"
   },
   {
     "objectID": "workflow-yeast.html#recovering-data-from-sra",
diff --git a/sitemap.xml b/sitemap.xml
index a1008c3..36e09c8 100644
--- a/sitemap.xml
+++ b/sitemap.xml
@@ -2,58 +2,58 @@
 <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <url>
     <loc>https://js2264.github.io/OHCA/index.html</loc>
-    <lastmod>2023-09-25T17:13:44.372Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.176Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/preamble.html</loc>
-    <lastmod>2023-09-25T17:13:44.388Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.184Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/principles.html</loc>
-    <lastmod>2023-09-25T17:13:44.408Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.224Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/data-representation.html</loc>
-    <lastmod>2023-09-25T17:13:44.548Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.396Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/parsing.html</loc>
-    <lastmod>2023-09-25T17:13:44.596Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.460Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/visualization.html</loc>
-    <lastmod>2023-09-25T17:13:44.624Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.488Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/matrix-centric.html</loc>
-    <lastmod>2023-09-25T17:13:44.648Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.564Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/interactions-centric.html</loc>
-    <lastmod>2023-09-25T17:13:44.716Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.624Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/topological-features.html</loc>
-    <lastmod>2023-09-25T17:13:44.776Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.676Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/disseminating.html</loc>
-    <lastmod>2023-09-25T17:13:44.792Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.700Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/interoperability.html</loc>
-    <lastmod>2023-09-25T17:13:44.836Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.760Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/workflow-yeast.html</loc>
-    <lastmod>2023-09-25T17:13:44.896Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.848Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/workflow-chicken.html</loc>
-    <lastmod>2023-09-25T17:13:44.924Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.884Z</lastmod>
   </url>
   <url>
     <loc>https://js2264.github.io/OHCA/workflow-centros.html</loc>
-    <lastmod>2023-09-25T17:13:44.952Z</lastmod>
+    <lastmod>2023-10-19T10:18:04.924Z</lastmod>
   </url>
 </urlset>
diff --git a/topological-features.html b/topological-features.html
index f967bf0..5132e5d 100644
--- a/topological-features.html
+++ b/topological-features.html
@@ -336,7 +336,7 @@ <h1 class="title">
 <p>Chromosome compartments refer to the segregation of the chromatin into active euchromatin (A compartments) and regulated heterochromatin (B compartment).</p>
 <section id="importing-hi-c-data" class="level3" data-number="7.1.1"><h3 data-number="7.1.1" class="anchored" data-anchor-id="importing-hi-c-data">
 <span class="header-section-number">7.1.1</span> Importing Hi-C data</h3>
-<p>To investigate chromosome compartments, we will fetch a contact matrix generated from a micro-C experiment (from <span class="citation" data-cites="Krietenstein2020May">Krietenstein et al. (<a href="interoperability.html#ref-Krietenstein2020May" role="doc-biblioref">2020</a>)</span>). A subset of the genome-wide dataset is provided in the <code>OHCA</code> package. It contains intra-chromosomal interactions within <code>chr17</code>, binned at <code>5000</code>, <code>100000</code> and <code>250000</code> bp.</p>
+<p>To investigate chromosome compartments, we will fetch a contact matrix generated from a micro-C experiment (from <span class="citation" data-cites="Krietenstein_2020">Krietenstein et al. (<a href="interoperability.html#ref-Krietenstein_2020" role="doc-biblioref">2020</a>)</span>). A subset of the genome-wide dataset is provided in the <code>OHCA</code> package. It contains intra-chromosomal interactions within <code>chr17</code>, binned at <code>5000</code>, <code>100000</code> and <code>250000</code> bp.</p>
 <div class="cell" data-layout-align="center" data-hash="topological-features_cache/html/unnamed-chunk-2_c717a9def34d68713aebd3dc844cb19b">
 <div class="sourceCode" id="cb1"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="kw"><a href="https://rdrr.io/r/base/library.html">library</a></span><span class="op">(</span><span class="va"><a href="https://github.com/js2264/HiCExperiment">HiCExperiment</a></span><span class="op">)</span></span>
 <span><span class="kw"><a href="https://rdrr.io/r/base/library.html">library</a></span><span class="op">(</span><span class="va"><a href="https://github.com/js2264/OHCA">OHCA</a></span><span class="op">)</span></span>
@@ -558,11 +558,11 @@ <h1 class="title">
 <figure class="figure"><p><img src="images/20230403090000.png" class="img-fluid figure-img"></p>
 </figure>
 </div>
-<p>They are generally conserved across cell types and species (<span class="citation" data-cites="Schmitt2016Nov">Schmitt et al. (<a href="interoperability.html#ref-Schmitt2016Nov" role="doc-biblioref">2016</a>)</span>), typically correlate with units of DNA replication (<span class="citation" data-cites="Pope2014Nov">Pope et al. (<a href="interoperability.html#ref-Pope2014Nov" role="doc-biblioref">2014</a>)</span>), and could play a role during development (<span class="citation" data-cites="Stadhouders2019May">Stadhouders, Filion, and Graf (<a href="interoperability.html#ref-Stadhouders2019May" role="doc-biblioref">2019</a>)</span>).</p>
+<p>They are generally conserved across cell types and species (<span class="citation" data-cites="Schmitt_2016">Schmitt et al. (<a href="interoperability.html#ref-Schmitt_2016" role="doc-biblioref">2016</a>)</span>), typically correlate with units of DNA replication (<span class="citation" data-cites="Pope_2014">Pope et al. (<a href="interoperability.html#ref-Pope_2014" role="doc-biblioref">2014</a>)</span>), and could play a role during development (<span class="citation" data-cites="Stadhouders_2019">Stadhouders et al. (<a href="interoperability.html#ref-Stadhouders_2019" role="doc-biblioref">2019</a>)</span>).</p>
 <section id="computing-diamond-insulation-score" class="level3" data-number="7.2.1"><h3 data-number="7.2.1" class="anchored" data-anchor-id="computing-diamond-insulation-score">
 <span class="header-section-number">7.2.1</span> Computing diamond insulation score</h3>
-<p>Several approaches exist to annotate topological domains (<span class="citation" data-cites="Sefer2022Dec">Sefer (<a href="interoperability.html#ref-Sefer2022Dec" role="doc-biblioref">2022</a>)</span>). Several packages in R implement some of these functionalities, e.g.&nbsp;<code>spectralTAD</code> or <code>TADcompare</code>.</p>
-<p><code>HiContacts</code> offers a simple <code>getDiamondInsulation</code> function which computes the diamond insulation score (<span class="citation" data-cites="Crane2015Jul">Crane et al. (<a href="interoperability.html#ref-Crane2015Jul" role="doc-biblioref">2015</a>)</span>). This score quantifies average interaction frequency in an insulation window (of a certain <code>window_size</code>) sliding along contact matrices at a chosen <code>resolution</code>.</p>
+<p>Several approaches exist to annotate topological domains (<span class="citation" data-cites="Sefer_2022">Sefer (<a href="interoperability.html#ref-Sefer_2022" role="doc-biblioref">2022</a>)</span>). Several packages in R implement some of these functionalities, e.g.&nbsp;<code>spectralTAD</code> or <code>TADcompare</code>.</p>
+<p><code>HiContacts</code> offers a simple <code>getDiamondInsulation</code> function which computes the diamond insulation score (<span class="citation" data-cites="Crane_2015">Crane et al. (<a href="interoperability.html#ref-Crane_2015" role="doc-biblioref">2015</a>)</span>). This score quantifies average interaction frequency in an insulation window (of a certain <code>window_size</code>) sliding along contact matrices at a chosen <code>resolution</code>.</p>
 <div class="cell" data-layout-align="center" data-hash="topological-features_cache/html/unnamed-chunk-9_6009129c9a0127aa688befd84e1c2839">
 <div class="sourceCode" id="cb8"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="co"># - Compute insulation score</span></span>
 <span><span class="va">bpparam</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/pkg/BiocParallel/man/SerialParam-class.html">SerialParam</a></span><span class="op">(</span>progressbar <span class="op">=</span> <span class="cn">FALSE</span><span class="op">)</span></span>
@@ -724,7 +724,7 @@ <h1 class="title">
 <section id="chromosight" class="level3" data-number="7.3.1"><h3 data-number="7.3.1" class="anchored" data-anchor-id="chromosight">
 <span class="header-section-number">7.3.1</span> <code>chromosight</code>
 </h3>
-<p>Chromatin loops, dots, or contacts, refer to a strong increase of interaction frequency between a pair of two genomic loci. They correspond to focal “dots” on a Hi-C map. Relying on computer vision algorithms, <code>chromosight</code> uses this property to annotate chromatin loops in a Hi-C map (<span class="citation" data-cites="MattheyDoret2020Nov">Matthey-Doret et al. (<a href="interoperability.html#ref-MattheyDoret2020Nov" role="doc-biblioref">2020</a>)</span>). <code>chromosight</code> is a standalone <code>python</code> package and is made available in R through the <code>HiCool</code>-managed conda environment with the <code><a href="https://rdrr.io/pkg/HiCool/man/getLoops.html">getLoops()</a></code> function.</p>
+<p>Chromatin loops, dots, or contacts, refer to a strong increase of interaction frequency between a pair of two genomic loci. They correspond to focal “dots” on a Hi-C map. Relying on computer vision algorithms, <code>chromosight</code> uses this property to annotate chromatin loops in a Hi-C map (<span class="citation" data-cites="Matthey_Doret_2020">Matthey-Doret et al. (<a href="interoperability.html#ref-Matthey_Doret_2020" role="doc-biblioref">2020</a>)</span>). <code>chromosight</code> is a standalone <code>python</code> package and is made available in R through the <code>HiCool</code>-managed conda environment with the <code><a href="https://rdrr.io/pkg/HiCool/man/getLoops.html">getLoops()</a></code> function.</p>
 <section id="identifying-loops" class="level4" data-number="7.3.1.1"><h4 data-number="7.3.1.1" class="anchored" data-anchor-id="identifying-loops">
 <span class="header-section-number">7.3.1.1</span> Identifying loops</h4>
 <div class="cell" data-layout-align="center" data-hash="topological-features_cache/html/unnamed-chunk-15_ff63f7ae1d1ec0dcd5cb4c8ce3732019">
@@ -899,40 +899,40 @@ <h1 class="title">
 <p><img src="images/20230403134800.png" class="img-fluid"></p>
 </section></section><section id="other-r-packages" class="level3" data-number="7.3.2"><h3 data-number="7.3.2" class="anchored" data-anchor-id="other-r-packages">
 <span class="header-section-number">7.3.2</span> Other R packages</h3>
-<p>A number of other R packages have been developed to identify focal chromatin loops, notably <code>fitHiC</code> (<span class="citation" data-cites="Ay2014Feb">Ay, Bailey, and Noble (<a href="interoperability.html#ref-Ay2014Feb" role="doc-biblioref">2014</a>)</span>), <code>GOTHiC</code> (<span class="citation" data-cites="Mifsud2017Apr">Mifsud et al. (<a href="interoperability.html#ref-Mifsud2017Apr" role="doc-biblioref">2017</a>)</span>) or <code>idr2d</code> (<span class="citation" data-cites="Krismer2020Apr">Krismer, Guo, and Gifford (<a href="interoperability.html#ref-Krismer2020Apr" role="doc-biblioref">2020</a>)</span>). Each fits a slightly different purpose, and we encourage the end user to read companion publications.</p>
+<p>A number of other R packages have been developed to identify focal chromatin loops, notably <code>fitHiC</code> (<span class="citation" data-cites="Ay_2014">Ay et al. (<a href="interoperability.html#ref-Ay_2014" role="doc-biblioref">2014</a>)</span>), <code>GOTHiC</code> (<span class="citation" data-cites="Mifsud_2017">Mifsud et al. (<a href="interoperability.html#ref-Mifsud_2017" role="doc-biblioref">2017</a>)</span>) or <code>idr2d</code> (<span class="citation" data-cites="Krismer_2020">Krismer et al. (<a href="interoperability.html#ref-Krismer_2020" role="doc-biblioref">2020</a>)</span>). Each fits a slightly different purpose, and we encourage the end user to read companion publications.</p>
 
 
 </section></section><section id="bibliography" class="level1 unnumbered"><h1 class="unnumbered">References</h1>
-<div id="refs" class="references csl-bib-body hanging-indent" role="list" style="display: none">
-<div id="ref-Ay2014Feb" class="csl-entry" role="listitem">
-Ay, Ferhat, Timothy L. Bailey, and William Stafford Noble. 2014. <span>“<span class="nocase">Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts</span>.”</span> <em>Genome Res.</em> 24 (6): 999–1011. <a href="https://doi.org/10.1101/gr.160374.113">https://doi.org/10.1101/gr.160374.113</a>.
+<div id="refs" class="references csl-bib-body hanging-indent" data-line-spacing="2" role="list" style="display: none">
+<div id="ref-Ay_2014" class="csl-entry" role="listitem">
+Ay, F., Bailey, T. L., &amp; Noble, W. S. (2014). Statistical confidence estimation for hi-c data reveals regulatory chromatin contacts. <em>Genome Research</em>, <em>24</em>(6), 999–1011. <a href="https://doi.org/10.1101/gr.160374.113">https://doi.org/10.1101/gr.160374.113</a>
 </div>
-<div id="ref-Crane2015Jul" class="csl-entry" role="listitem">
-Crane, Emily, Qian Bian, Rachel Patton McCord, Bryan R. Lajoie, Bayly S. Wheeler, Edward J. Ralston, Satoru Uzawa, Job Dekker, and Barbara J. Meyer. 2015. <span>“<span class="nocase">Condensin-driven remodelling of X chromosome topology during dosage compensation</span>.”</span> <em>Nature</em> 523 (July): 240–44. <a href="https://doi.org/10.1038/nature14450">https://doi.org/10.1038/nature14450</a>.
+<div id="ref-Crane_2015" class="csl-entry" role="listitem">
+Crane, E., Bian, Q., McCord, R. P., Lajoie, B. R., Wheeler, B. S., Ralston, E. J., Uzawa, S., Dekker, J., &amp; Meyer, B. J. (2015). Condensin-driven remodelling of x chromosome topology during dosage compensation. <em>Nature</em>, <em>523</em>(7559), 240–244. <a href="https://doi.org/10.1038/nature14450">https://doi.org/10.1038/nature14450</a>
 </div>
-<div id="ref-Krietenstein2020May" class="csl-entry" role="listitem">
-Krietenstein, Nils, Sameer Abraham, Sergey V. Venev, Nezar Abdennur, Johan Gibcus, Tsung-Han S. Hsieh, Krishna Mohan Parsi, et al. 2020. <span>“<span class="nocase">Ultrastructural Details of Mammalian Chromosome Architecture</span>.”</span> <em>Mol. Cell</em> 78 (3): 554–565.e7. <a href="https://doi.org/10.1016/j.molcel.2020.03.003">https://doi.org/10.1016/j.molcel.2020.03.003</a>.
+<div id="ref-Krietenstein_2020" class="csl-entry" role="listitem">
+Krietenstein, N., Abraham, S., Venev, S. V., Abdennur, N., Gibcus, J., Hsieh, T.-H. S., Parsi, K. M., Yang, L., Maehr, R., Mirny, L. A., Dekker, J., &amp; Rando, O. J. (2020). Ultrastructural details of mammalian chromosome architecture. <em>Molecular Cell</em>, <em>78</em>(3), 554–565.e7. <a href="https://doi.org/10.1016/j.molcel.2020.03.003">https://doi.org/10.1016/j.molcel.2020.03.003</a>
 </div>
-<div id="ref-Krismer2020Apr" class="csl-entry" role="listitem">
-Krismer, Konstantin, Yuchun Guo, and David K. Gifford. 2020. <span>“<span class="nocase">IDR2D identifies reproducible genomic interactions</span>.”</span> <em>Nucleic Acids Res.</em> 48 (6): e31. <a href="https://doi.org/10.1093/nar/gkaa030">https://doi.org/10.1093/nar/gkaa030</a>.
+<div id="ref-Krismer_2020" class="csl-entry" role="listitem">
+Krismer, K., Guo, Y., &amp; Gifford, D. K. (2020). <span>IDR</span>2D identifies reproducible genomic interactions. <em>Nucleic Acids Research</em>, <em>48</em>(6), e31–e31. <a href="https://doi.org/10.1093/nar/gkaa030">https://doi.org/10.1093/nar/gkaa030</a>
 </div>
-<div id="ref-MattheyDoret2020Nov" class="csl-entry" role="listitem">
-Matthey-Doret, Cyril, Lyam Baudry, Axel Breuer, Rémi Montagne, Nadège Guiglielmoni, Vittore Scolari, Etienne Jean, et al. 2020. <span>“<span class="nocase">Computer vision for pattern detection in chromosome contact maps</span>.”</span> <em>Nat. Commun.</em> 11 (5795): 1–11. <a href="https://doi.org/10.1038/s41467-020-19562-7">https://doi.org/10.1038/s41467-020-19562-7</a>.
+<div id="ref-Matthey_Doret_2020" class="csl-entry" role="listitem">
+Matthey-Doret, C., Baudry, L., Breuer, A., Montagne, R., Guiglielmoni, N., Scolari, V., Jean, E., Campeas, A., Chanut, P. H., Oriol, E., Méot, A., Politis, L., Vigouroux, A., Moreau, P., Koszul, R., &amp; Cournac, A. (2020). Computer vision for pattern detection in chromosome contact maps. <em>Nature Communications</em>, <em>11</em>(1). <a href="https://doi.org/10.1038/s41467-020-19562-7">https://doi.org/10.1038/s41467-020-19562-7</a>
 </div>
-<div id="ref-Mifsud2017Apr" class="csl-entry" role="listitem">
-Mifsud, Borbala, Inigo Martincorena, Elodie Darbo, Robert Sugar, Stefan Schoenfelder, Peter Fraser, and Nicholas M. Luscombe. 2017. <span>“<span class="nocase">GOTHiC, a probabilistic model to resolve complex biases and to identify real interactions in Hi-C data</span>.”</span> <em>PLoS One</em> 12 (4): e0174744. <a href="https://doi.org/10.1371/journal.pone.0174744">https://doi.org/10.1371/journal.pone.0174744</a>.
+<div id="ref-Mifsud_2017" class="csl-entry" role="listitem">
+Mifsud, B., Martincorena, I., Darbo, E., Sugar, R., Schoenfelder, S., Fraser, P., &amp; Luscombe, N. M. (2017). <span>GOTHiC</span>, a probabilistic model to resolve complex biases and to identify real interactions in hi-c data. <em><span>PLOS</span> <span>ONE</span></em>, <em>12</em>(4), e0174744. <a href="https://doi.org/10.1371/journal.pone.0174744">https://doi.org/10.1371/journal.pone.0174744</a>
 </div>
-<div id="ref-Pope2014Nov" class="csl-entry" role="listitem">
-Pope, Benjamin D., Tyrone Ryba, Vishnu Dileep, Feng Yue, Weisheng Wu, Olgert Denas, Daniel L. Vera, et al. 2014. <span>“<span class="nocase">Topologically associating domains are stable units of replication-timing regulation</span>.”</span> <em>Nature</em> 515 (7527): 402–5. <a href="https://doi.org/10.1038/nature13986">https://doi.org/10.1038/nature13986</a>.
+<div id="ref-Pope_2014" class="csl-entry" role="listitem">
+Pope, B. D., Ryba, T., Dileep, V., Yue, F., Wu, W., Denas, O., Vera, D. L., Wang, Y., Hansen, R. S., Canfield, T. K., Thurman, R. E., Cheng, Y., Gülsoy, G., Dennis, J. H., Snyder, M. P., Stamatoyannopoulos, J. A., Taylor, J., Hardison, R. C., Kahveci, T., … Gilbert, D. M. (2014). Topologically associating domains are stable units of replication-timing regulation. <em>Nature</em>, <em>515</em>(7527), 402–405. <a href="https://doi.org/10.1038/nature13986">https://doi.org/10.1038/nature13986</a>
 </div>
-<div id="ref-Schmitt2016Nov" class="csl-entry" role="listitem">
-Schmitt, Anthony D., Ming Hu, Inkyung Jung, Zheng Xu, Yunjiang Qiu, Catherine L. Tan, Yun Li, et al. 2016. <span>“<span class="nocase">A Compendium of Chromatin Contact Maps Reveals Spatially Active Regions in the Human Genome</span>.”</span> <em>Cell Rep.</em> 17 (8): 2042–59. <a href="https://doi.org/10.1016/j.celrep.2016.10.061">https://doi.org/10.1016/j.celrep.2016.10.061</a>.
+<div id="ref-Schmitt_2016" class="csl-entry" role="listitem">
+Schmitt, A. D., Hu, M., Jung, I., Xu, Z., Qiu, Y., Tan, C. L., Li, Y., Lin, S., Lin, Y., Barr, C. L., &amp; Ren, B. (2016). A compendium of chromatin contact maps reveals spatially active regions in the human genome. <em>Cell Reports</em>, <em>17</em>(8), 2042–2059. <a href="https://doi.org/10.1016/j.celrep.2016.10.061">https://doi.org/10.1016/j.celrep.2016.10.061</a>
 </div>
-<div id="ref-Sefer2022Dec" class="csl-entry" role="listitem">
-Sefer, Emre. 2022. <span>“<span class="nocase">A comparison of topologically associating domain callers over mammals at high resolution</span>.”</span> <em>BMC Bioinf.</em> 23 (1): 1–39. <a href="https://doi.org/10.1186/s12859-022-04674-2">https://doi.org/10.1186/s12859-022-04674-2</a>.
+<div id="ref-Sefer_2022" class="csl-entry" role="listitem">
+Sefer, E. (2022). A comparison of topologically associating domain callers over mammals at high resolution. <em><span>BMC</span> Bioinformatics</em>, <em>23</em>(1). <a href="https://doi.org/10.1186/s12859-022-04674-2">https://doi.org/10.1186/s12859-022-04674-2</a>
 </div>
-<div id="ref-Stadhouders2019May" class="csl-entry" role="listitem">
-Stadhouders, Ralph, Guillaume J. Filion, and Thomas Graf. 2019. <span>“<span class="nocase">Transcription factors and 3D genome conformation in cell-fate decisions</span>.”</span> <em>Nature</em> 569 (7756): 345–54. <a href="https://doi.org/10.1038/s41586-019-1182-7">https://doi.org/10.1038/s41586-019-1182-7</a>.
+<div id="ref-Stadhouders_2019" class="csl-entry" role="listitem">
+Stadhouders, R., Filion, G. J., &amp; Graf, T. (2019). Transcription factors and 3D genome conformation in cell-fate decisions. <em>Nature</em>, <em>569</em>(7756), 345–354. <a href="https://doi.org/10.1038/s41586-019-1182-7">https://doi.org/10.1038/s41586-019-1182-7</a>
 </div>
 </div>
 </section></main><!-- /main --><script id="quarto-html-after-body" type="application/javascript">
diff --git a/visualization.html b/visualization.html
index 2438015..e478ff6 100644
--- a/visualization.html
+++ b/visualization.html
@@ -356,7 +356,7 @@ <h1 class="title">
 <div class="sourceCode" id="cb2"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="va">hic</span></span>
 <span><span class="co">##  `HiCExperiment` object with 303,545 contacts over 289 regions </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: "V" </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 2000 </span></span>
@@ -524,7 +524,7 @@ <h1 class="title">
 <section id="overlaying-topological-features" class="level3" data-number="4.3.1"><h3 data-number="4.3.1" class="anchored" data-anchor-id="overlaying-topological-features">
 <span class="header-section-number">4.3.1</span> Overlaying topological features</h3>
 <p>Topological features (e.g.&nbsp;chromatin loops, domain borders, A/B compartments, e.g.&nbsp;…) are often displayed over a Hi-C heatmap.</p>
-<p>To illustrate how to do this, let’s import pre-computed chromatin loops in <code>R</code>. These loops have been identified using <code>chromosight</code> (<span class="citation" data-cites="MattheyDoret2020Nov">Matthey-Doret et al. (<a href="interoperability.html#ref-MattheyDoret2020Nov" role="doc-biblioref">2020</a>)</span>) on the contact matrix which we imported interactions from.</p>
+<p>To illustrate how to do this, let’s import pre-computed chromatin loops in <code>R</code>. These loops have been identified using <code>chromosight</code> (<span class="citation" data-cites="Matthey_Doret_2020">Matthey-Doret et al. (<a href="interoperability.html#ref-Matthey_Doret_2020" role="doc-biblioref">2020</a>)</span>) on the contact matrix which we imported interactions from.</p>
 <div class="cell" data-layout-align="center" data-hash="visualization_cache/html/unnamed-chunk-12_626707573521c07887c7beaa97e3d25e">
 <div class="sourceCode" id="cb12"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="kw"><a href="https://rdrr.io/r/base/library.html">library</a></span><span class="op">(</span><span class="va">rtracklayer</span><span class="op">)</span></span>
 <span><span class="kw"><a href="https://rdrr.io/r/base/library.html">library</a></span><span class="op">(</span><span class="va">InteractionSet</span><span class="op">)</span></span>
@@ -597,7 +597,7 @@ <h1 class="title">
 <span><span class="va">aggr_loops</span></span>
 <span><span class="co">##  `AggrHiCExperiment` object over 148 targets </span></span>
 <span><span class="co">##  -------</span></span>
-<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a594277bd62_7752" </span></span>
+<span><span class="co">##  fileName: "/github/home/.cache/R/ExperimentHub/1a9a4dc30249_7752" </span></span>
 <span><span class="co">##  focus: 148 targets </span></span>
 <span><span class="co">##  resolutions(5): 1000 2000 4000 8000 16000</span></span>
 <span><span class="co">##  active resolution: 1000 </span></span>
@@ -656,9 +656,9 @@ <h1 class="title">
 
 
 </section></section><section id="bibliography" class="level1 unnumbered"><h1 class="unnumbered">References</h1>
-<div id="refs" class="references csl-bib-body hanging-indent" role="list" style="display: none">
-<div id="ref-MattheyDoret2020Nov" class="csl-entry" role="listitem">
-Matthey-Doret, Cyril, Lyam Baudry, Axel Breuer, Rémi Montagne, Nadège Guiglielmoni, Vittore Scolari, Etienne Jean, et al. 2020. <span>“<span class="nocase">Computer vision for pattern detection in chromosome contact maps</span>.”</span> <em>Nat. Commun.</em> 11 (5795): 1–11. <a href="https://doi.org/10.1038/s41467-020-19562-7">https://doi.org/10.1038/s41467-020-19562-7</a>.
+<div id="refs" class="references csl-bib-body hanging-indent" data-line-spacing="2" role="list" style="display: none">
+<div id="ref-Matthey_Doret_2020" class="csl-entry" role="listitem">
+Matthey-Doret, C., Baudry, L., Breuer, A., Montagne, R., Guiglielmoni, N., Scolari, V., Jean, E., Campeas, A., Chanut, P. H., Oriol, E., Méot, A., Politis, L., Vigouroux, A., Moreau, P., Koszul, R., &amp; Cournac, A. (2020). Computer vision for pattern detection in chromosome contact maps. <em>Nature Communications</em>, <em>11</em>(1). <a href="https://doi.org/10.1038/s41467-020-19562-7">https://doi.org/10.1038/s41467-020-19562-7</a>
 </div>
 </div>
 </section></main><!-- /main --><script id="quarto-html-after-body" type="application/javascript">
diff --git a/visualization_files/figure-html/unnamed-chunk-10-1.png b/visualization_files/figure-html/unnamed-chunk-10-1.png
index 472ce9d..ced2797 100644
Binary files a/visualization_files/figure-html/unnamed-chunk-10-1.png and b/visualization_files/figure-html/unnamed-chunk-10-1.png differ
diff --git a/visualization_files/figure-html/unnamed-chunk-11-1.png b/visualization_files/figure-html/unnamed-chunk-11-1.png
index ea99475..bb1c991 100644
Binary files a/visualization_files/figure-html/unnamed-chunk-11-1.png and b/visualization_files/figure-html/unnamed-chunk-11-1.png differ
diff --git a/visualization_files/figure-html/unnamed-chunk-14-1.png b/visualization_files/figure-html/unnamed-chunk-14-1.png
index 6038be3..dbf28cf 100644
Binary files a/visualization_files/figure-html/unnamed-chunk-14-1.png and b/visualization_files/figure-html/unnamed-chunk-14-1.png differ
diff --git a/visualization_files/figure-html/unnamed-chunk-17-1.png b/visualization_files/figure-html/unnamed-chunk-17-1.png
index 062e49a..8013c44 100644
Binary files a/visualization_files/figure-html/unnamed-chunk-17-1.png and b/visualization_files/figure-html/unnamed-chunk-17-1.png differ
diff --git a/visualization_files/figure-html/unnamed-chunk-4-1.png b/visualization_files/figure-html/unnamed-chunk-4-1.png
index 057913c..f58ad23 100644
Binary files a/visualization_files/figure-html/unnamed-chunk-4-1.png and b/visualization_files/figure-html/unnamed-chunk-4-1.png differ
diff --git a/visualization_files/figure-html/unnamed-chunk-5-1.png b/visualization_files/figure-html/unnamed-chunk-5-1.png
index 2be357b..554b584 100644
Binary files a/visualization_files/figure-html/unnamed-chunk-5-1.png and b/visualization_files/figure-html/unnamed-chunk-5-1.png differ
diff --git a/visualization_files/figure-html/unnamed-chunk-7-1.png b/visualization_files/figure-html/unnamed-chunk-7-1.png
index 508b204..e5003a1 100644
Binary files a/visualization_files/figure-html/unnamed-chunk-7-1.png and b/visualization_files/figure-html/unnamed-chunk-7-1.png differ
diff --git a/visualization_files/figure-html/unnamed-chunk-8-1.png b/visualization_files/figure-html/unnamed-chunk-8-1.png
index 56b433f..9b46ae4 100644
Binary files a/visualization_files/figure-html/unnamed-chunk-8-1.png and b/visualization_files/figure-html/unnamed-chunk-8-1.png differ
diff --git a/visualization_files/figure-html/unnamed-chunk-8-2.png b/visualization_files/figure-html/unnamed-chunk-8-2.png
index fad029e..a4c13a3 100644
Binary files a/visualization_files/figure-html/unnamed-chunk-8-2.png and b/visualization_files/figure-html/unnamed-chunk-8-2.png differ
diff --git a/visualization_files/figure-html/unnamed-chunk-9-1.png b/visualization_files/figure-html/unnamed-chunk-9-1.png
index 8be3a26..103a57b 100644
Binary files a/visualization_files/figure-html/unnamed-chunk-9-1.png and b/visualization_files/figure-html/unnamed-chunk-9-1.png differ
diff --git a/workflow-chicken.html b/workflow-chicken.html
index 1dff2b0..55cb0be 100644
--- a/workflow-chicken.html
+++ b/workflow-chicken.html
@@ -321,7 +321,7 @@ <h1 class="title">
 </div>
 </div>
 <div class="callout-body-container callout-body">
-<p>We leverage five chicken datasets in this notebook, published in <span class="citation" data-cites="Gibcus2018Feb">Gibcus et al. (<a href="interoperability.html#ref-Gibcus2018Feb" role="doc-biblioref">2018</a>)</span>. They are all available from the 4DN data portal using the <code>fourDNData</code> package.</p>
+<p>We leverage five chicken datasets in this notebook, published in <span class="citation" data-cites="Gibcus_2018">Gibcus et al. (<a href="interoperability.html#ref-Gibcus_2018" role="doc-biblioref">2018</a>)</span>. They are all available from the 4DN data portal using the <code>fourDNData</code> package.</p>
 <ul>
 <li>
 <code>4DNES9LEZXN7</code>: chicken cell culture blocked in G2</li>
@@ -338,7 +338,7 @@ <h1 class="title">
 </div>
 <section id="importing-data" class="level2" data-number="11.1"><h2 data-number="11.1" class="anchored" data-anchor-id="importing-data">
 <span class="header-section-number">11.1</span> Importing data</h2>
-<p>The 4DN consortium provides access to the datasets published in <span class="citation" data-cites="Gibcus2018Feb">Gibcus et al. (<a href="interoperability.html#ref-Gibcus2018Feb" role="doc-biblioref">2018</a>)</span>. in <code>R</code>, they can be obtained thanks to the <code>fourDNData</code> gateway package.</p>
+<p>The 4DN consortium provides access to the datasets published in <span class="citation" data-cites="Gibcus_2018">Gibcus et al. (<a href="interoperability.html#ref-Gibcus_2018" role="doc-biblioref">2018</a>)</span>. in <code>R</code>, they can be obtained thanks to the <code>fourDNData</code> gateway package.</p>
 <div class="callout callout-style-default callout-warning callout-titled">
 <div class="callout-header d-flex align-content-center">
 <div class="callout-icon-container">
@@ -554,9 +554,9 @@ <h1 class="title">
 
 
 </section><section id="bibliography" class="level1 unnumbered"><h1 class="unnumbered">References</h1>
-<div id="refs" class="references csl-bib-body hanging-indent" role="list" style="display: none">
-<div id="ref-Gibcus2018Feb" class="csl-entry" role="listitem">
-Gibcus, Johan H., Kumiko Samejima, Anton Goloborodko, Itaru Samejima, Natalia Naumova, Johannes Nuebler, Masato T. Kanemaki, et al. 2018. <span>“<span class="nocase">A pathway for mitotic chromosome formation</span>.”</span> <em>Science</em> 359 (6376): eaao6135. <a href="https://doi.org/10.1126/science.aao6135">https://doi.org/10.1126/science.aao6135</a>.
+<div id="refs" class="references csl-bib-body hanging-indent" data-line-spacing="2" role="list" style="display: none">
+<div id="ref-Gibcus_2018" class="csl-entry" role="listitem">
+Gibcus, J. H., Samejima, K., Goloborodko, A., Samejima, I., Naumova, N., Nuebler, J., Kanemaki, M. T., Xie, L., Paulson, J. R., Earnshaw, W. C., Mirny, L. A., &amp; Dekker, J. (2018). A pathway for mitotic chromosome formation. <em>Science</em>, <em>359</em>(6376). <a href="https://doi.org/10.1126/science.aao6135">https://doi.org/10.1126/science.aao6135</a>
 </div>
 </div>
 </section></main><!-- /main --><script id="quarto-html-after-body" type="application/javascript">