Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WC_FST_FiniteSample_*ploids_2AllelesB_No*Corr*() for multiple loci #6

Open
l-yampolsky opened this issue Jul 26, 2016 · 2 comments
Open

Comments

@l-yampolsky
Copy link

Dear Mike and Katie,

I am trying to run OutFLANK for a large number of loci (obviously). So far I have only used it with my externally calculated values of $FST, $T1, $T2, $FSTNoCorr, $T1NoCorr and $T2NoCorr, but now I feel it would be convenient to use built-in functions to calculate these. I understand that the input for these functions is an array with data about a single locus, with a row for each population and a column for each genotype. I am trying to do it for 90K SNP loci; clearly I don't want to create 90K input files. I have no doubt that you also have a script, unix or R, to run these functions for multiple loci, preferably from a single file where populations and loci are identified by two index columns - yet I can't find it anywhere on your github site. Any suggestions?

Thanks!

Lev

@DrK-Lo
Copy link
Collaborator

DrK-Lo commented Jul 26, 2016

Hi Lev,
You can use the MakeDiploidFSTMat() function if you have diploid data. See the user manual for the format of the data. If you have haploid data, you will have to use a different function for the Fst calculation. (as described in the manual). We don't have a wrapper for this, but you can look at the code inside the MakeDiploidFSTMat() function to see how we implemented it. Let us know if this helps!

@l-yampolsky
Copy link
Author

l-yampolsky commented Jul 27, 2016

Thanks! No, I cannot use MakeDiploidFSTMat() for the data in question, because it's poolSeq data. So, it's 1) haploid and 2) no individuals, just allele frequencies.
May be the easiest way would be to know exactly which versiond of the Fst formula are used to calculate numerator and denominator with and without correction. I would then continue to feed externally calculated Fst's into OutFLANK, but will be certain that I am doing the right thing. I've been doing this in the past, but I am worrying I don't have a benchmark to make sure I am not misusing OutFLANK.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants