The Microbe Directory v2.0

The ultimate microbe database

The Microbe Directory (TMD) is a collective research effort to profile and annotate more than 68,000 microbial species, including Bacteria, Archaea, Viruses, Fungi, and Algae.

TMD aims to:

  • Provide a curated list of microbes spanning Archaea, Bacteria, Eukarya, and Viruses.
  • Compile microbial data from different databases and studies into a single resource.
  • Give a phenotypic and ecological description of microbial species.
  • Annotate the microbiomes where taxa have been identified.
  • Provide a community portal to add data and annotate new microbes.
  • Offer a database that is both machine and human readable.
  • Make microbial data handy to everyone!

The Database

Different features are important for different types of microbe. It doesn't make much sense to talk about the Gram stain of a virus or the capsid symmetry of a bacterium. To make the data as relevant as possible, we have split The Microbe Directory into three groups.

Virus

  1. Genetic material: Viruses have either RNA or DNA as their genetic material.
  2. Strand: The nucleic acid may be single stranded (ss) or double stranded (ds).
  3. Capsid symmetry: The way in which the capsid units are arranged.
    • Helical
    • Icosahedral
    • Complex
  4. Envelope: The outer layer of a virus that protects the nucleic acid. Viruses without an envelope are called naked.
  5. Is it a pathogen? If yes, which is its host?
    • Human
    • Animal
    • Plant
    • Bacteria
    • Fungi
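
The viral fields above could be held in a small record type. The following is only a sketch: the class name `VirusRecord`, its field names, and the validation rules are illustrative assumptions and are not taken from the TMD code base.

```python
from dataclasses import dataclass
from typing import Optional

# Allowed values copied from the lists above.
CAPSID_SYMMETRIES = {"Helical", "Icosahedral", "Complex"}
HOSTS = {"Human", "Animal", "Plant", "Bacteria", "Fungi"}


@dataclass
class VirusRecord:
    """Hypothetical container for one row of viral annotations."""

    name: str
    genetic_material: str  # "RNA" or "DNA"
    strand: str  # "ss" or "ds"
    capsid_symmetry: str
    enveloped: bool  # False means the virus is naked
    pathogen_host: Optional[str] = None  # None means not annotated as a pathogen

    def __post_init__(self):
        # Reject values that fall outside the categories listed above.
        if self.genetic_material not in {"RNA", "DNA"}:
            raise ValueError(f"unknown genetic material: {self.genetic_material}")
        if self.strand not in {"ss", "ds"}:
            raise ValueError(f"unknown strand type: {self.strand}")
        if self.capsid_symmetry not in CAPSID_SYMMETRIES:
            raise ValueError(f"unknown capsid symmetry: {self.capsid_symmetry}")
        if self.pathogen_host is not None and self.pathogen_host not in HOSTS:
            raise ValueError(f"unknown host: {self.pathogen_host}")

    @property
    def genome(self) -> str:
        """Short genome label such as 'ssRNA' or 'dsDNA'."""
        return self.strand + self.genetic_material


tmv = VirusRecord(
    name="Tobacco mosaic virus",
    genetic_material="RNA",
    strand="ss",
    capsid_symmetry="Helical",
    enveloped=False,
    pathogen_host="Plant",
)
print(tmv.genome)  # ssRNA
```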

Bacteria and Archaea Only

  1. Gram stain: Used to distinguish and classify bacterial species into two large groups: Gram-positive and Gram-negative.
  2. Antimicrobial resistance (AMR): Antimicrobial resistance occurs naturally over time, usually through genetic changes. However, the misuse and overuse of antimicrobials is accelerating this process.
  3. Type of metabolism: The nutrition mode of microbes according to the sources of energy and carbon needed for living, growth and reproduction. All sorts of combinations may exist in nature.
    • Primary source of energy:
      • Phototrophs: Light is absorbed in photoreceptors and transformed into chemical energy.
      • Chemotrophs: Bond energy is released from a chemical compound.
    • Primary source of reducing equivalents:
      • Organotrophs: Organic compounds are used as electron donors.
      • Lithotrophs: Inorganic compounds are used as electron donors.
    • Primary source of carbon:
      • Heterotrophs: Organic compounds are metabolized to get carbon for growth and development.
      • Autotrophs: Carbon dioxide (CO2) is used as the source of carbon.
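
Because the three metabolic categories combine freely, a compound term can be built by joining one prefix from each. The sketch below illustrates that naming convention; the function name `trophic_mode` and the lookup keys are hypothetical and are not part of TMD.

```python
# Prefix tables for the three categories listed above.
# The dictionary keys are illustrative labels, not TMD field values.
ENERGY = {"light": "photo", "chemical": "chemo"}
ELECTRON_DONOR = {"organic": "organo", "inorganic": "litho"}
CARBON = {"organic": "hetero", "co2": "auto"}


def trophic_mode(energy: str, electron_donor: str, carbon: str) -> str:
    """Join the energy, electron-donor and carbon prefixes into one term."""
    return ENERGY[energy] + ELECTRON_DONOR[electron_donor] + CARBON[carbon] + "troph"


print(trophic_mode("light", "inorganic", "co2"))  # photolithoautotroph
print(trophic_mode("chemical", "organic", "organic"))  # chemoorganoheterotroph
```

With two options per category, this yields eight possible combinations.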

Bacteria, Archaea and Eukarya

  1. Biofilm forming: Biofilms are multicellular communities held together by a self-produced extracellular matrix. Biofilms impact humans in many ways as they can form in natural, medical, and industrial settings.
  2. Spore forming: Spores, also referred to as endospores, are the dormant form of vegetative microbes and are highly resistant to physical and chemical influences.
  3. Microbiome: Host or environment where microbes are usually found.
    • Host: Microbes might be commensal or pathogenic to their host. Commensal microbes are found to be crucial to the survival of their hosts.
    • Soil: Microbes are essential for soils. They are main drivers of nutrient cycles in soils, decompose organic matter, promote plant growth and control pests and diseases.
      • Tundra
      • Grassland
      • Croplands
      • Forest
        • Tropical
        • Temperate
        • Boreal
    • Extreme: Microbes that live in habitats considered hard to survive in because of extreme conditions such as temperature, limited access to energy sources, or high pressure.
      • Desert
      • Polar
      • Deep ocean
      • Space
    • Water: Water can support the growth of many types of microorganisms. Microbes are main drivers of biogeochemical processes and nutrient cycling.
      • Ocean
      • Fresh
      • Mangrove
      • Sediments
  4. Is it a pathogen? If yes, which is its host?
  5. Extremophile: A microbe that thrives in physically or geochemically extreme conditions that are detrimental to most life on Earth. Microbes that grow best under moderate conditions are called mesophiles.
  6. If extremophile, which type?
    • Acidophile: Microbes that live in acidic systems with pH -0.06 to 4.0.
    • Alkaliphile: Microbes capable of survival in alkaline environments with pH 8.5–11.
    • Halophile: Microbes that thrive in high salt concentrations.
    • Metallotolerant: Microbes that survive in environments with a high concentration of dissolved heavy metals.
    • Barophile: Also called piezophiles, these microbes thrive at high pressures, such as in the deep sea.
    • Psychrophile: Also called cryophiles, these microbes are capable of growth at low temperatures, ranging from −20°C to 10°C.
    • Radioresistant: Microbes capable of withstanding high levels of ionizing radiation.
    • Thermophile: Microbes that live at high temperatures between 41°C and 122°C.
    • Xerophile: Microbes that grow and reproduce in conditions with a low availability of water.
    • Hypolith: Organisms that live underneath rocks in cold deserts.
    • Oligotroph: Microbes capable of growth in nutritionally limited environments.
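
The numeric ranges quoted above (pH and temperature) can be turned into a simple labelling rule. This is a minimal sketch assuming a microbe is described by one representative growth pH and temperature; the function name and that simplification are assumptions, and TMD's own annotations are curated rather than computed this way.

```python
from typing import List


def extremophile_types(ph: float, temp_c: float) -> List[str]:
    """Return the labels whose ranges (as quoted above) contain the given values."""
    labels = []
    if -0.06 <= ph <= 4.0:
        labels.append("Acidophile")
    if 8.5 <= ph <= 11.0:
        labels.append("Alkaliphile")
    if -20.0 <= temp_c <= 10.0:
        labels.append("Psychrophile")
    if 41.0 <= temp_c <= 122.0:
        labels.append("Thermophile")
    return labels


print(extremophile_types(ph=2.0, temp_c=80.0))  # ['Acidophile', 'Thermophile']
print(extremophile_types(ph=7.0, temp_c=25.0))  # []
```

An empty list means none of the four ranges applies. The remaining types (halophile, barophile, and so on) have no numeric range in the list above, so they are left out of the sketch.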

Data Sources

The Microbe Directory collates data from a number of other databases. Some databases directly provide information about microbes. These databases include annotations for a number of different types of microbial traits.

Studies

The Microbe Directory also includes collated results from a number of projects on microbial communities. These studies are condensed into summary results describing the settings where a microbe may be found.



* [MetaSUB](http://metasub.org): Molecular profile of cities around the globe to improve their design, functionality, and impact on health. 
* [Earth Microbiome Project](http://www.earthmicrobiome.org): Characterization of microbial communities around the globe.
* [TARA Oceans](http://ocean-microbiome.embl.de/companion.html): Metagenomic study of ocean samples in epipelagic and mesopelagic waters across the globe.
* [Soil bacterial and fungal communities across a pH gradient in an arable soil](https://qiita.ucsd.edu/study/description/94): Soils collected across a long-term liming experiment (pH 4.0-8.3).
* [The ecology of the phyllosphere](https://qiita.ucsd.edu/study/description/396): Bacterial communities from leaves of 56 tree species in Boulder, Colorado, USA.
* [Characterization of Airborne Microbial Communities at a High-Elevation Site and Their Potential To Act as Atmospheric Ice Nuclei](http://dx.doi.org/10.1128/AEM.00447-09): Atmospheric microbial abundance, community composition, and ice nucleation at a high-elevation site in northwestern Colorado.
* [Microbial community composition in a lowland tropical rain forest- Costa Rica](https://doi.org/10.1016/j.soilbio.2010.08.011): Plot-scale manipulations of organic matter inputs to soils correlate with shifts in microbial community composition in a lowland tropical rain forest.
* [Microbial communities on money](https://qiita.ucsd.edu/study/description/375)

---

## Installation and Use

**The Microbe Directory** may be accessed as a set of CSV files. We also provide an API for programmatic access to **The Microbe Directory**. This API includes several statistical functions for comparing microbial communities based on their annotated traits.
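
As a rough illustration of working with the CSV files directly, the sketch below reads a trait table with Python's standard `csv` module and compares two communities by the fraction of taxa carrying a trait. The column names, the sample rows, and both helper functions are assumptions made for the example; it does not use or describe the `microbe_directory` API, and the real CSV layout may differ.

```python
import csv
import io

# Hypothetical excerpt; the real TMD CSV files may use different column names.
SAMPLE_CSV = """scientific_name,gram_stain,spore_forming
Bacillus subtilis,Positive,True
Escherichia coli,Negative,False
Staphylococcus aureus,Positive,False
"""


def load_traits(handle):
    """Map each scientific name to its row of annotations."""
    return {row["scientific_name"]: row for row in csv.DictReader(handle)}


def trait_fraction(traits, community, column, value):
    """Fraction of annotated taxa in `community` whose `column` equals `value`."""
    annotated = [traits[name] for name in community if name in traits]
    if not annotated:
        return 0.0
    return sum(row[column] == value for row in annotated) / len(annotated)


traits = load_traits(io.StringIO(SAMPLE_CSV))
sample_a = ["Bacillus subtilis", "Staphylococcus aureus"]
sample_b = ["Escherichia coli", "Staphylococcus aureus"]
print(trait_fraction(traits, sample_a, "gram_stain", "Positive"))  # 1.0
print(trait_fraction(traits, sample_b, "gram_stain", "Positive"))  # 0.5
```

To read a file on disk instead, pass an open file handle to `load_traits` in place of the `io.StringIO` object.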

### Installation

From PyPI

pip install microbe_directory


From source

git clone https://github.com/dcdanko/MD2
cd MD2
python setup.py install


### Building TMD Tables from Source Databases

TMD uses `make` to build tables; see the Makefile for details.

make clean  # delete the current tables
make test   # run unit tests
make all    # make all tables
make bact   # make bacteria/archaea table
make euks   # make eukaryotic table
make virus  # make viral table

The following outputs the taxonomy of all available Bacteria and Viruses from the NCBI dmp files. The table consists of the scientific name and classification from phylum-species level along with the unique taxonomic id.
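
For readers unfamiliar with the NCBI dump format, the sketch below shows one way such a taxonomy row could be assembled. It relies on the taxdump convention that fields are separated by a tab, a pipe, and a tab, with each line ending in a tab and a pipe. The sample rows are abbreviated stand-ins rather than real file contents, and the function names are hypothetical; this is not the code TMD itself runs.

```python
# Abbreviated stand-ins for NCBI's nodes.dmp and names.dmp (sample rows only;
# the real nodes.dmp carries more columns than the three kept here).
NODES_DMP = (
    "1224\t|\t2\t|\tphylum\t|\n"
    "1236\t|\t1224\t|\tclass\t|\n"
    "91347\t|\t1236\t|\torder\t|\n"
    "543\t|\t91347\t|\tfamily\t|\n"
    "561\t|\t543\t|\tgenus\t|\n"
    "562\t|\t561\t|\tspecies\t|\n"
)
NAMES_DMP = (
    "1224\t|\tPseudomonadota\t|\t\t|\tscientific name\t|\n"
    "1236\t|\tGammaproteobacteria\t|\t\t|\tscientific name\t|\n"
    "91347\t|\tEnterobacterales\t|\t\t|\tscientific name\t|\n"
    "543\t|\tEnterobacteriaceae\t|\t\t|\tscientific name\t|\n"
    "561\t|\tEscherichia\t|\t\t|\tscientific name\t|\n"
    "562\t|\tEscherichia coli\t|\t\t|\tscientific name\t|\n"
    "562\t|\tE. coli\t|\t\t|\tcommon name\t|\n"
)

RANKS = ["phylum", "class", "order", "family", "genus", "species"]


def parse_dmp(text):
    """Split each dump line into its fields."""
    rows = []
    for line in text.splitlines():
        if line.endswith("\t|"):
            line = line[:-2]  # drop the line terminator
        rows.append(line.split("\t|\t"))
    return rows


def taxonomy_row(tax_id, nodes, names):
    """Walk up the parent links and record the name found at each wanted rank."""
    row = {"tax_id": tax_id, "scientific_name": names.get(tax_id, "")}
    row.update({rank: "" for rank in RANKS})
    current = tax_id
    while current in nodes:
        parent, rank = nodes[current]
        if rank in RANKS:
            row[rank] = names.get(current, "")
        if parent == current:  # the root node is its own parent
            break
        current = parent
    return row


# tax_id -> (parent tax_id, rank), and tax_id -> scientific name
nodes = {r[0]: (r[1], r[2]) for r in parse_dmp(NODES_DMP)}
names = {r[0]: r[1] for r in parse_dmp(NAMES_DMP) if r[3] == "scientific name"}

row = taxonomy_row("562", nodes, names)
print(row["scientific_name"], row["genus"], row["phylum"])
```

Only entries of class `scientific name` are kept from the names file, so synonyms and common names do not overwrite the canonical name.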

## License and Use

All original material in TMD is provided under the MIT License. Some of the source databases may have restrictions on commercial use.