Skip to content

CBC-UCONN/RADseq_tutorial_grayling

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

72 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RADseq tutorial

This tutorial demonstrates a few ways to get and filter genotypes using restriction-site associated DNA. This readme is currently skeletal, but the code is more or less complete and demonstrates how to:

  1. Get genotypes using stacks in de novo mode, step by step.
  2. Get genotypes using stacks in reference mapping mode.
  3. Get genotypes using the variant caller Freebayes with reference-mapped sequence.
  4. Filter and compare variant call sets.
  5. Do some basic QC-oriented analysis in R.

Each script should be run from the /scripts directory and will create its own output directories as necessary.

The dataset is from a fish, the Arctic grayling, and includes several hundred individuals distributed across three watersheds on Alaska's North slope. It has some fun issues. A bunch of samples are actually from another species. Graylings are salmonids and so have an old-ish genome duplication with residual tetrasomy which leads to some issues with paralogs.

Unfortunately, the data are unpublished at the moment, and so are only available to users of UConn's Xanadu cluster. We will update the code to retrieve the data from NCBI once they are deposited there.

The scripts are written to use SLURM, as configured for UConn's Xanadu cluster. If you are using our cluster, you should be able to clone this repository and simply run the scripts sequentially to complete the analysis. The raw data are housed in a directory accessible to all users. If you are a workshop participant from outside the University, you can also clone and run the scripts sequentially, but you will need to update the partition to "mcbstudent" instead of "general".

If you are not using Xanadu, and you are at all comfortable with bash coding or SLURM you should be able to adapt these scripts to your home server, your own laptop, or for your own dataset. If you're uncomfortable with those things, picking apart and figuring out code like this is a great way to learn!

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Shell 59.9%
  • R 40.1%