Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat!: add more support for queries in gnomad_vcf_to_protein #529

Merged
merged 15 commits into from
Feb 7, 2024

Conversation

korikuzma
Copy link
Member

@korikuzma korikuzma commented Jan 11, 2024

close #362

@wesleygoar still in draft, but your feedback would be nice

@korikuzma korikuzma added enhancement New feature or request priority:medium Medium priority labels Jan 11, 2024
@korikuzma korikuzma requested a review from wesleygoar January 11, 2024 15:00
@korikuzma korikuzma self-assigned this Jan 11, 2024
@wesleygoar
Copy link

Screenshot 2024-01-11 at 12 48 10 PM
Example from vcf in varhouse

@wesleygoar
Copy link

image (12)
annotated by varhouse as...

Copy link

@wesleygoar wesleygoar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This all seems correct to me.

@wesleygoar
Copy link

@korikuzma we should consider asking @ahwagner to review as well to confirm.

@korikuzma
Copy link
Member Author

Since this endpoint needs gene-normalizer data, I'm wondering if we should use ProteinSequenceConsequence. Right now, I have the catvar pydantic models in metakb (it made development easier). Maybe we should move them here or in a different place?

"""Define response for gnomad vcf to protein service"""

gene_context: Optional[core_models.Gene] = None
vrs_ref_allele_seq: Optional[StrictStr] = None
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I could have just added this to variation extensions field, but thought it'd be easier for varcat if we had a field to retrieve

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am always pro making varcat better (easier to develop). @ahwagner any objections?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, this is good info to include. I would like there to be a gene_context field on the VRS 2.x SequenceReference class but it doesn't exist yet, so we can leave that as a downstream issue.

@korikuzma korikuzma requested a review from wesleygoar February 1, 2024 16:53
@wesleygoar
Copy link

Since this endpoint needs gene-normalizer data, I'm wondering if we should use ProteinSequenceConsequence. Right now, I have the catvar pydantic models in metakb (it made development easier). Maybe we should move them here or in a different place?

Hmm. I will default to where @ahwagner thinks is best for them since he runs the whole show.

wesleygoar
wesleygoar previously approved these changes Feb 1, 2024
Base automatically changed from issue-408 to staging February 5, 2024 12:41
@korikuzma korikuzma dismissed wesleygoar’s stale review February 5, 2024 12:41

The base branch was changed.

korikuzma added a commit that referenced this pull request Feb 5, 2024
* Updates cool-seq-tool + ga4gh.vrs versions. This mainly had to do with fixing inter-residue positions. cool-seq-tool previous versions returned 0-based coordinates. 
* Temporarily removes gnomad_vcf_to_protein, which will be added back in #529
@korikuzma korikuzma changed the base branch from staging to main February 5, 2024 13:11
@korikuzma
Copy link
Member Author

@ahwagner Do you have time to review this?

ahwagner
ahwagner previously approved these changes Feb 7, 2024
Copy link
Member

@ahwagner ahwagner left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall outstanding work. 👍
I noticed a few minor docstring clarifications and thread comments.

I spot tested a few of the provided tests, they check out.

"""Define response for gnomad vcf to protein service"""

gene_context: Optional[core_models.Gene] = None
vrs_ref_allele_seq: Optional[StrictStr] = None
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, this is good info to include. I would like there to be a gene_context field on the VRS 2.x SequenceReference class but it doesn't exist yet, so we can leave that as a downstream issue.

variation/gnomad_vcf_to_protein_variation.py Outdated Show resolved Hide resolved
tests/test_gnomad_vcf_to_protein.py Outdated Show resolved Hide resolved
tests/test_gnomad_vcf_to_protein.py Outdated Show resolved Hide resolved
tests/test_gnomad_vcf_to_protein.py Outdated Show resolved Hide resolved
tests/test_gnomad_vcf_to_protein.py Outdated Show resolved Hide resolved
tests/test_gnomad_vcf_to_protein.py Outdated Show resolved Hide resolved
tests/test_gnomad_vcf_to_protein.py Show resolved Hide resolved
@wesleygoar
Copy link

@korikuzma I am pretty sure I have reviewed this at least once if not more. Is there any reason for me to review again?

@korikuzma
Copy link
Member Author

@korikuzma I am pretty sure I have reviewed this at least once if not more. Is there any reason for me to review again?

@wesleygoar no, when changes get made the reviews are dismissed. I only re-request reviews when I need things to be reviewed again.

@korikuzma korikuzma marked this pull request as ready for review February 7, 2024 20:53
@korikuzma korikuzma merged commit cdde139 into main Feb 7, 2024
8 of 10 checks passed
@korikuzma korikuzma deleted the issue-362-2-alpha branch February 7, 2024 20:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request priority:medium Medium priority
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add more support for deletions/insertions gnomad_vcf_to_protein
3 participants