This repository has been archived by the owner on Nov 26, 2024. It is now read-only.

A simplified Pretext workflow specs #3

Open
nekrut opened this issue Oct 24, 2024 · 7 comments

Comments

@nekrut
Collaborator

nekrut commented Oct 24, 2024

A simplified workflow should take the following inputs:

  • hap1
  • hap2
  • hap1 name
  • hap2 name
  • HiFi reads
  • HiC F reads
  • HiC R reads
  • Telomere (units)

It generates a PretextMap snapshot and a dataset suitable for running PretextView.
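
The input list above could be captured as a small typed record, for example when scripting a workflow invocation. This is only a sketch: the class name, field names, and checks are hypothetical and not part of any Galaxy or Pretext API, the read and assembly fields are assumed to be simple dataset identifiers or paths, and "Telomere (units)" is assumed to mean the telomere repeat unit as a DNA string (for example the vertebrate repeat TTAGGG).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PretextWorkflowInputs:
    """Hypothetical container for the eight inputs listed in the spec."""

    hap1: str               # haplotype 1 assembly (dataset id or path, assumed)
    hap2: str               # haplotype 2 assembly (dataset id or path, assumed)
    hap1_name: str          # label used for haplotype 1
    hap2_name: str          # label used for haplotype 2
    hifi_reads: str         # HiFi reads
    hic_forward_reads: str  # HiC F reads
    hic_reverse_reads: str  # HiC R reads
    telomere_unit: str      # assumed: telomere repeat unit, e.g. "TTAGGG"

    def __post_init__(self):
        # The two haplotype labels must differ so outputs can be told apart.
        if self.hap1_name == self.hap2_name:
            raise ValueError("hap1 name and hap2 name must be different")
        # Assumed check: the repeat unit is a non-empty string over A, C, G, T.
        unit = self.telomere_unit.upper()
        if not unit or set(unit) - set("ACGT"):
            raise ValueError("telomere unit must be a non-empty DNA string")
```

Keeping the haplotype names as explicit inputs makes it possible to label the per-haplotype outputs consistently downstream.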

@nekrut nekrut converted this from a draft issue Oct 24, 2024
@fubar2
Member

fubar2 commented Oct 27, 2024

@nekrut @Delphine-L

I've been using this subworkflow, derived from Delphine's version, at https://vgp.usegalaxy.org/u/fubar/w/ebphib-pretext-hic-2 . It probably works fine as a standalone workflow, but I cannot get the parent EBPhib-big workflow to finish on vgp because of the samtools view out-of-memory (OOM) problem that I reported as a bug on 20 and 24 October. It was team meeting time, so the reports probably got lost somewhere.

If samtools view can be given more RAM and I can get the whole workflow to finish, I could show you the end result: JBrowse2 with the cool files as HiC tracks.

The subworkflow creates a pretextmap for viewing and cool tracks for JBrowse2, for each haplotype separately and for both together.
I cannot get the coverage bigwig to show in PretextView after adding it to the pretextmap, so I'm not sure telomeres will work either. They're available in JBrowse2 with the new HiC tracks anyway.

Image

It also creates a PAF file containing all the contact pairs from the final merged BAM, for the new and amazing interactive HiC Jupyter notebook viewer.

Top row is H1-H1 (call it cis for the haplotypes) then H2-H2 pairs and bottom row is H1-H2 trans pairs.

When first opened they show all the data - about 14 million pairs.
Clicking anywhere on a plot shows the coordinates in the row above.
The notebook normally produces larger images but I squeezed them down to fit into this family photo.
Here's mUroPar1, the parka squirrel Arima HiC data.

Image
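
The split into H1-H1, H2-H2, and H1-H2 pairs described above can be sketched as follows. This is a minimal illustration and not the notebook's actual code: it assumes the PAF uses the standard column layout (query name in column 1, target name in column 6) and that contig names start with a haplotype prefix such as `H1` or `H2`. The function names and prefixes are hypothetical.

```python
from collections import Counter


def classify_pair(query_name, target_name, hap1_prefix="H1", hap2_prefix="H2"):
    """Label one contact pair as H1-H1, H2-H2, H1-H2, or unassigned."""

    def hap_of(name):
        # Assumed naming convention: contig names begin with the haplotype label.
        if name.startswith(hap1_prefix):
            return "H1"
        if name.startswith(hap2_prefix):
            return "H2"
        return None

    a, b = hap_of(query_name), hap_of(target_name)
    if a is None or b is None:
        return "unassigned"
    if a == b:
        return f"{a}-{a}"  # cis: both ends on the same haplotype
    return "H1-H2"         # trans: ends on different haplotypes


def count_pairs(paf_lines, hap1_prefix="H1", hap2_prefix="H2"):
    """Count contact pairs per class from an iterable of PAF lines."""
    counts = Counter()
    for line in paf_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 6:
            continue  # skip blank or truncated lines
        counts[classify_pair(fields[0], fields[5], hap1_prefix, hap2_prefix)] += 1
    return counts
```

With a real file, something like `count_pairs(open("pairs.paf"))` (file name hypothetical) would give the number of pairs that land in each of the three rows of the plot.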

@fubar2 fubar2 moved this to In Progress in TreeValGal / EGAPx tasks Oct 27, 2024
@fubar2
Member

fubar2 commented Oct 27, 2024

T2T HG002 is running in https://usegalaxy.org/u/fubar/h/ebphib-pretext-noextra-hg002
It is very slow right now. Only a couple of trivial steps completed in the last 3 hours.
Is vgp.usegalaxy.org ok?

@fubar2
Member

fubar2 commented Oct 27, 2024

@Smeds

Smeds commented Oct 28, 2024

I have an updated version of the Pretext workflow that Delphine worked on (galaxyproject/iwc#584).

@mvdbeek
Member

mvdbeek commented Oct 28, 2024

Can I move this issue over to the IWC? I think we should mark this repo as deprecated.

@fubar2
Member

fubar2 commented Oct 29, 2024

@mvdbeek: which one do you want deprecated?

@mvdbeek
Member

mvdbeek commented Oct 29, 2024

This repository. This discussion should happen on the IWC.
