Skip to content

Commit

Permalink
Add info from the figshare tarball to LICENSE.md
Browse files Browse the repository at this point in the history
  • Loading branch information
tbooth committed Sep 26, 2023
1 parent 3e7637c commit 7744946
Showing 1 changed file with 57 additions and 2 deletions.
59 changes: 57 additions & 2 deletions LICENSE.md
Original file line number Diff line number Diff line change
Expand Up @@ -49,8 +49,7 @@ Notices:

Except where otherwise noted, the example programs and other software
provided by Software Carpentry and Data Carpentry are made available under the
[OSI][osi]-approved
[MIT license][mit-license].
[OSI][osi]-approved [MIT license][mit-license].

Permission is hereby granted, free of charge, to any person obtaining
a copy of this software and associated documentation files (the
Expand All @@ -71,6 +70,62 @@ LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION
WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

## Sample Dataset

This information specifically applies to the contents of the
`data-for-snakemake-novice-bioinformatics.tar.xz` file which you will need to download as part of
the standard setup for this course. This info is also included within the tar file
itself.

### Short read data

These originate from a dataset lodged in [ArrayExpress at EBI][aa]:

[ https://www.ebi.ac.uk/biostudies/arrayexpress/studies/E-MTAB-4044
](https://www.ebi.ac.uk/biostudies/arrayexpress/studies/E-MTAB-4044)

As per the [BioStudies terms of use][bs] "New datasets in BioStudies are released into the public
domain under the terms of a Creative Commons Zero (CC0) waiver". This means you may legally use
the files for any purpose without attribution. However, if you do re-use the dataset it is basic
academic courtesy to state the accession *E-MTAB-4044*, and cite the paper:

> Lahtvee PJ, Kumar R, Hallström BM, Nielsen J.
> Adaptation to different types of stress converge on mitochondrial metabolism.
> Mol Biol Cell. 2016;27(15):2505-2514. doi:10.1091/mbc.E16-03-0187
The subset of the dataset included here was selected by the authors of the Snakemake lesson.

### Yeast transcriptome

This was downloaded from the [Ensembl yeast database][eyd], specifically this location:

[ https://ftp.ensembl.org/pub/release-100/fasta/saccharomyces_cerevisiae/cdna/
](https://ftp.ensembl.org/pub/release-100/fasta/saccharomyces_cerevisiae/cdna/)

The description provided by Ensembl states that:

> The data on this site is a direct import of the Saccharomyces Genome Database (SGD) dataset for
> the Saccharomyces cerevisiae S288C genome. The assembly provided is R64-1-1, supplemented with
> additional cross-references and annotations. The protein-coding and non-coding gene model
> annotation was imported from SGD in April 2018. This gene set is based on Liachko et al. 2013
The [SGD website][sgd] provides full information on appropriate citation of their genome builds,
but the appropriate papers to cite regarding this reference sequence are:

> Engel SR, Dietrich FS, Fisk DG, Binkley G, Balakrishnan R, Costanzo MC, Dwight SS, Hitz BC,
> Karra K, Nash RS, Weng S, Wong ED, Lloyd P, Skrzypek MS, Miyasato SR, Simison M, Cherry JM. (2013)
> The Reference Genome Sequence of Saccharomyces cerevisiae: Then and Now. G3 (Bethesda).
> 2013 Dec 27. pii: g3.113.008995v1. doi: 10.1534/g3.113.008995. [PMID: 24374639]
> Liachko I, Youngblood RA, Keich U, Dunham MJ.
> High-resolution mapping, characterization, and optimization of autonomously replicating sequences
> in yeast. Genome Res. 2013;23(4):698-704. doi:10.1101/gr.144659.112
[aa]: https://www.ebi.ac.uk/biostudies/arrayexpress
[bs]: https://www.ebi.ac.uk/biostudies/help
[eyd]: https://www.ensembl.org/Saccharomyces_cerevisiae/Info/Index
[sgd]: https://sites.google.com/view/yeastgenome-help/about/how-to-cite-sgd

## Trademark

"Software Carpentry" and "Data Carpentry" and their respective logos
Expand Down

0 comments on commit 7744946

Please sign in to comment.