A collection of educational content and software for learning the beauty of bioinformatics.
I started a new journey at Freenome as a Bioinformatics Research Engineer.
In September 2019, I started working as a software engineer at DNAnexus. During my first week I joined an internal group called Science Frontiers, which is the equivalent of 20% Projects at Google. Within a few weeks, I began collaborating with a bioinformatician/microbiome specialist on one of his projects involving the clustering of microbial data extracted from the Human Microbiome Project. Humbled by the vastness of this field of research, the dedication of our researchers, and the incredible applications of bioinformatics in general, a deep curiosity formed within me to understand the fundamentals of this science. I began by the picking the brains of the many scientists at the company and reading laymen books such as The Gene, whilst making progress in both my work at the company as well as the Science Frontiers project. Then COVID-19 hit. With essentially no social life, I thought this would be as good a time as any to give myself an unofficial degree in bioinformatics, perhaps something that would quickly amount to the equivalent of an undergraduate degree at a notable university.
- De Bruijn graphs
- Cyclopeptide sequencing
- Smith-Waterman, Needleman-Wunch algorithm, and Hirschberg algorithms
- Multiple sequence alignment
- PAM / BLOSUM scoring matrices
- Random Breakage, Fragile Breakage, and Whole Genome Duplication models
- Synteny block graph / 2-break distance/sorting
- Additive phylogeny
- Unweighted Pair Group Method with Arithmetic Mean (UPGMA)
- Neighbor-Joining
- Maximum Parsimony
- Clustering gene expression matrices: k-means, soft k-means, and hierarchical clustering
- DNAnexus (thanks for the employment)
- Galaxy
- Glow
- BoaG
- EMBL-EBI
- Consensus
- MEME
- Samtools
- BLAST
- EMBOSS Water/Needle
- Clustal Omega
- Samtools
- FastQC
- Scythe
- Sickle
- MultiQC
- Quast
- MEGA
- FASTA/FASTQ
- VCF
- SAM
- GenBank
- UCSD Bioinformatics Algorithms Specialization (Coursera)
- John Hopkins Genomic Data Science (Coursera)
- Bioinformatics: Tools for Genome Analysis (John Hopkins AS.410.635.82)
- Bioinformatics Algorithms
- Bioinformatics with Python Cookbook
- Molecular Biology: Principles of Genome Function
- Molecular Population Genetics
- An Introduction to Population Genetics
- The Gene by Siddhartha Mukherjee
- Stepik
- Rosalind
- Biostars
- 1964: Molecules as Documents of Evolutionary History
- 1984: Lengths of chromosomal segments conserved since divergence of man and mouse (Nadeau et. al.)
- 1987: The neighbor-joining method
- 1990: Basic local alignment search tool (Altschul et.al.)
- 2003: Human Genome Project
- 2008: 1000 Genomes Project
- Where are the fragile regions located? What causes fragility?
- DNAnexus
- Illumina
- Pacific Biosciences
- GRAIL
- Freenome
...