Skip to content

Latest commit

 

History

History
146 lines (124 loc) · 7.46 KB

File metadata and controls

146 lines (124 loc) · 7.46 KB

Schedule

The course schedule consists of 1 afternoon (Tuesday) and 3 "full" days (Wednesday - Friday) in the first two weeks, with 1.5 days in the last week.

The "full days" run from 09:00 - 17:00, with a break from 12:00 - 13:30.

Conceptually, mornings are predominantly related to introductions, presentations and discussion of the previous days, and are afternoons reserved for independent work on the examples and tasks.

Introduction, File Formats & Genome Browsers (Michael Baudis)

2018-09-18 (Tue), 13-17
  • general introduction into the topic (slides)
  • schedule adjustment
  • guidance about course room and computer use (Tina Siegenthaler)
  • reading:
    • 1000 Genomes paper
    • The sequence of sequencers paper
  • tasks:
    • Genome Storage Space & Cost, e.g. required for 1000 Genomes
      • WES & WGS
      • Different file formats
        • SAM
        • BAM
        • VCF
        • FASTA
      • Associated costs
      • Cost factors
      • Raw Storage costs
2018-09-19 (Wed), 09-17
2018-09-20 (Thu), 09-17
2018-09-21 (Fri), 09-17

Tools & Programmatic Solutions (Izaskun Mallona)

2018-09-25 (Tue), 13-17
  • How are UCSC Genome Browser data stored? Why?
  • Genomics data management: automation
    • Computer basics: plain text files, Unix terminal
    • Reproducibility
    • Systems set up (data download and software installs)
2018-09-26 (Wed), 09-17
  • Unix for bioinformatics
    • Chapter 1: What is UNIX
    • Chapter 2: The UNIX filesystem
    • Chapter 3: UNIX shell - first steps
    • Chapter 4: UNIX shell - filesystem commands
    • Chapter 5: UNIX shell - working with files
2018-09-27 (Thu), 09-17
  • Overview of the standard genomics data formats (I)
    • FASTA
    • FASTQ
    • SAM
    • BED
  • Basic file processing for bioinformatics
    • awk, cut
2018-09-28 (Fri), 09-17
  • Overview of the standard genomics data formats (II)
    • GFF/GTF
    • BEDgraphs
    • Wiggle files
    • VCFs
  • Indexed genomic data formats
  • Exercises

Genome Variants to Modified Proteins (Elif Ozkirimli Olmez)

2018-10-02 (Tue), 13-17
2018-10-03 (Wed), 09-17
2018-10-04 (Thu), 09-17
2018-10-05 (Fri), 09-17
  • Morning: Presentations on your protein
    • Biological relevance of your protein
    • Experimental details/methods
    • 2 key findings
    • Position of mutations on protein structure (structure figure)
    • Discussion
  • Afternoon: BLAST task

Review, feedback & test (Michael Baudis)

  • 2018-10-09 (Tue), 13-17 (slides)
    • Ontologies for metadata annotations (very brief introduction)
    • Privacy, security, society - implications of availability & possible re-identification of genome data
      • long range familial identification
      • principles of Beacon-style re-identification attack
      • "ease" of field sequencing (MinIon etc.)
  • 2018-10-10 (Wed), 09-14:30
    • preparation/recap time in the morning
    • Written exam (13:00 - 14:30)
      • multiple choice and free questions