Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Benb/use metadata as source of family table load #936

Merged
merged 2 commits into from
Oct 28, 2024

Conversation

bpblanken
Copy link
Collaborator

No description provided.

@bpblanken bpblanken requested a review from a team as a code owner October 28, 2024 14:29
@bpblanken bpblanken changed the base branch from main to dev October 28, 2024 14:29

# Fetch family guids from pedigree
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The specific failure was a family existing in the pedigree, failing a check and thus being excluded from the rest of the pipeline, but being present on the project table from a different callset.

family_guids_in_project_table & families_guids_in_pedigree
def run(self):
ht = hl.read_matrix_table(
remapped_and_subsetted_callset_path(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

just sanity checking that this works for AnVIL loading where we don't have remaps (or usually subsets)

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah the same file gets generated, we just run a "subset" that has all families in it.

@bpblanken bpblanken merged commit ed4364f into dev Oct 28, 2024
3 checks passed
bpblanken added a commit that referenced this pull request Nov 5, 2024
* add task to write relatedness check to tsv (#930)

* add task to write relatedness check to tsv

* fix requirements

* relatedness_check_table_path

* add relatedness check file path to metadata.json

* Benb/use metadata as source of family table load (#936)

* use run metadata as source of family table load

* ruff

* Support gcs dirs in rsync (#932)

* Support gcs dirs in rsync

* ws

* Gencode refactor to remove gcs (#934)

* Gencode refactor to remove gcs

* Fix

* additional semi join (#947)

* metadata parameters refactor (#946)

* metadata parameters refactor

* fix missing param

* tweak

* missed one

* last one

* fix test

* last few bugfixes

* fix

* bump

* missed one

* change parameter type due to confusing bug

* push

* enum

* Parse clinvar version from header (#949)

* Parse clinvar version from header

* responses activate

* fix test

* Dependency reordering so that `ValidateCallsetTask` runs before updating the reference data. (#950)

* Parse clinvar version from header

* Dependency reordering for reference data updates and validation

* ruff

* missed one

* Revert relatedness changes

* push

* Fix import issue

* Fix sample type

* ruff

* Fix import mocking

* imports

* responses activate

* fix test

* Tweaks

* comment

* Benb/check parsed clinvar version in complete (#951)

* Parse clinvar version from header

* First pass

* Bump hail tables to https

* correct dataset/dataset types

* Fix clinvar mito

* Fix combined

* Dependency reordering for reference data updates and validation

* ruff

* missed one

* Revert relatedness changes

* push

* Fix import issue

* Fix sample type

* ruff

* Fix import mocking

* imports

* Missed one

* First mocking pass

* Finish mocks in reference data

* responses activate

* ruff

* commas

* fix test

* Update compare_globals.py

* import

---------

Co-authored-by: Julia Klugherz <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants