Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create germlineResource for GRCh37 #25

Closed
maxulysse opened this issue Aug 16, 2019 · 5 comments
Closed

Create germlineResource for GRCh37 #25

maxulysse opened this issue Aug 16, 2019 · 5 comments
Assignees
Labels
enhancement New feature or request

Comments

@maxulysse
Copy link
Member

We don't have any germlineResource for GRCh37
Can you please create one?
cf #24

@maxulysse maxulysse added the enhancement New feature or request label Aug 16, 2019
@maxulysse maxulysse mentioned this issue Aug 16, 2019
8 tasks
@maxulysse maxulysse mentioned this issue Aug 26, 2019
5 tasks
@szilvajuhos
Copy link
Contributor

Now we have germlineResource VCFs for iGenomes on munin at

  • GRCh37 (Homo_sapiens/GATK/GRCh37/Annotation/GermlineResource):
    • gnomAD.r2.1.1.GRCh37.PASS.AC.AF.only.vcf.gz (and the .tbi)
    • also a dummy_PON.gnomAD.GRCh37.WGS.AF.GT.01.vcf.gz (and the .tbi)
  • GRCh38 (Homo_sapiens/GATK/GRCh38/Annotation/GermlineResource)
    • gnomAD.r2.1.1.GRCh38.PASS.AC.AF.only.vcf.gz (.tbi)
    • dummy_PON.gnomAD.GRCh38.WGS.AF.GT.01.vcf.gz (.tbi)

The gnomAD file for Mutect2 --germline-resource have been generated like getting rid of calls from gnomAD missing the PASS field, also writing out only the AC and AF infos to make file smaller.

The dummy PON is a subset of this subset above: only calls with AF > 0.1 were included. This file should not be used for real filtering.

@maxulysse pls review, update iGenomes and close once ready.

@nandan75
Copy link

Hi ,

I have been trying to find downloadable links for the file (to be used with sarek)

GRCh38 (Homo_sapiens/GATK/GRCh38/Annotation/GermlineResource)
gnomAD.r2.1.1.GRCh38.PASS.AC.AF.only.vcf.gz (.tbi)

Can someone point me to the latest working link? I have tried searching a lot on goiogle but failed!

Appreciate it.

regards,

Nandan

@szilvajuhos
Copy link
Contributor

Hi, AFAIR I have made those files for tests, should reside somewhere at BTB's storage. @maxulysse maybe can dig it up, but would not be that difficult to re-create them using awk (printing out ounly entries with the PASS flag, and only the AC and AF filelds.)

@maxulysse
Copy link
Member Author

@nandan75 AWS igenomes ressources are findable here: https://ewels.github.io/AWS-iGenomes/
The following command should work to download each files:

aws s3 --no-sign-request --region eu-west-1 cp s3://ngi-igenomes/igenomes/Homo_sapiens/GATK/GRCh38/Annotation/GermlineResource/gnomAD.r2.1.1.GRCh38.PASS.AC.AF.only.vcf.gz ./references/Homo_sapiens/GATK/GRCh38/Annotation/GermlineResource/.
aws s3 --no-sign-request --region eu-west-1 cp s3://ngi-igenomes/igenomes/Homo_sapiens/GATK/GRCh38/Annotation/GermlineResource/gnomAD.r2.1.1.GRCh38.PASS.AC.AF.only.vcf.gz.tbi ./references/Homo_sapiens/GATK/GRCh38/Annotation/GermlineResource/.

@nandan75
Copy link

nandan75 commented Jan 12, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants