microorganisms.Rd
A data set containing the microbial taxonomy of six kingdoms from the Catalogue of Life. MO codes can be looked up using as.mo
.
microorganisms
A data.frame
with 56,672 observations and 14 variables:
mo
ID of microorganism as used by this package
col_id
Catalogue of Life ID
fullname
Full name, like "Echerichia coli"
kingdom
Taxonomic kingdom of the microorganism
phylum
Taxonomic phylum of the microorganism
class
Taxonomic class of the microorganism
order
Taxonomic order of the microorganism
family
Taxonomic family of the microorganism
genus
Taxonomic genus of the microorganism
species
Taxonomic species of the microorganism
subspecies
Taxonomic subspecies of the microorganism
rank
Taxonomic rank of the microorganism, like "species"
or "genus"
ref
Author(s) and year of concerning scientific publication
species_id
ID of the species as used by the Catalogue of Life
Catalogue of Life: Annual Checklist (public online database), www.catalogueoflife.org.
Manually added were:
9 species of Streptococcus (beta haemolytic groups A, B, C, D, F, G, H, K and unspecified)
2 species of Staphylococcus (coagulase-negative [CoNS] and coagulase-positive [CoPS])
2 other undefined (unknown Gram negatives and unknown Gram positives)
This package contains the complete taxonomic tree of almost all microorganisms from the authoritative and comprehensive Catalogue of Life (http://www.catalogueoflife.org). This data is updated annually - check the included version with catalogue_of_life_version
.
Included are:
All ~55,000 (sub)species from the kingdoms of Archaea, Bacteria, Protozoa and Viruses
All ~3,000 (sub)species from these orders of the kingdom of Fungi: Eurotiales, Onygenales, Pneumocystales, Saccharomycetales and Schizosaccharomycetales. The kingdom of Fungi is a very large taxon with almost 300,000 different (sub)species, of which most are not microbial (but rather macroscopic, like mushrooms). Because of this, not all fungi fit the scope of this package and including everything would tremendously slow down our algorithms too. By only including the aforementioned taxonomic orders, the most relevant (sub)species are covered (like all species of Aspergillus, Candida, Pneumocystis, Saccharomyces and Trichophyton).
All ~15,000 previously accepted names of included (sub)species that have been taxonomically renamed
The complete taxonomic tree of all included (sub)species: from kingdom to subspecies
The responsible author(s) and year of scientific publication
The Catalogue of Life (http://www.catalogueoflife.org) is the most comprehensive and authoritative global index of species currently available. It holds essential information on the names, relationships and distributions of over 1.6 million species. The Catalogue of Life is used to support the major biodiversity and conservation information services such as the Global Biodiversity Information Facility (GBIF), Encyclopedia of Life (EoL) and the International Union for Conservation of Nature Red List. It is recognised by the Convention on Biological Diversity as a significant component of the Global Taxonomy Initiative and a contribution to Target 1 of the Global Strategy for Plant Conservation.
The syntax used to transform the original data to a cleansed R format, can be found here: https://gitlab.com/msberends/AMR/blob/master/reproduction_of_microorganisms.R.
On our website https://msberends.gitlab.io/AMR you can find a comprehensive tutorial about how to conduct AMR analysis, the complete documentation of all functions (which reads a lot easier than here in R) and an example analysis using WHONET data.