-
Notifications
You must be signed in to change notification settings - Fork 3
Home
Christian Parobek edited this page Feb 4, 2015
·
37 revisions
I will document my analysis through this wiki. The associated scripts (and results?) will be kept in this repo.
###Getting Additional Populations:
- getting otherPops
###Steps to good SNP-call data:
-
variantCalling - documented as part of the gatk_pipeline repo and the
HaplotypeCaller
updates are documented here. - weakestLinks - remove the lowest-coverage samples prior to downstream analysis. Gives us a shot at getting full haplotypes for all SNPs.
- variantFiltering - modeled loosely after Manske et al.
###Population Differentiation:
- PopGenome - the "Swiss Army Knife" package for population genetic analyses.
-
vcf2structure - VCF file format must be converted to
STRUCTURE
format for bothSTRUCTURE
andadegenet
analysis. structure
-
adegenet
- this is for PCA inR
- recombination
- Derrick's dupFinder
- Structural Variant Analysis
###Selective Sweeps:
- hapFLK - Andrew found this and recommended. Looks like it is sensitive and specific for sweeps even in complicated demographic backgrounds, because it makes extended haplotypes, then does Fst on those, somehow.