AMR/R/data.R

# ==================================================================== #
# TITLE                                                                #
# Antimicrobial Resistance (AMR) Data Analysis for R                   #
#                                                                      #
# SOURCE                                                               #
# https://github.com/msberends/AMR                                     #
#                                                                      #
# LICENCE                                                              #
# (c) 2018-2021 Berends MS, Luz CF et al.                              #
# Developed at the University of Groningen, the Netherlands, in        #
# collaboration with non-profit organisations Certe Medical            #
# Diagnostics & Advice, and University Medical Center Groningen.       # 
#                                                                      #
# This R package is free software; you can freely use and distribute   #
# it for both personal and commercial purposes under the terms of the  #
# GNU General Public License version 2.0 (GNU GPL-2), as published by  #
# the Free Software Foundation.                                        #
# We created this package for both routine data analysis and academic  #
# research and it was publicly released in the hope that it will be    #
# useful, but it comes WITHOUT ANY WARRANTY OR LIABILITY.              #
#                                                                      #
# Visit our website for the full manual and a complete tutorial about  #
# how to conduct AMR data analysis: https://msberends.github.io/AMR/   #
# ==================================================================== #

#' Data Sets with `r format(nrow(antibiotics) + nrow(antivirals), big.mark = ",")` Antimicrobial Drugs
#'
#' Two data sets containing all antibiotics/antimycotics and antivirals. Use [as.ab()] or one of the [`ab_*`][ab_property()] functions to retrieve values from the [antibiotics] data set. Three identifiers are included in this data set: an antibiotic ID (`ab`, primarily used in this package) as defined by WHONET/EARS-Net, an ATC code (`atc`) as defined by the WHO, and a Compound ID (`cid`) as found in PubChem. Other properties in this data set are derived from one or more of these codes. Note that some drugs have multiple ATC codes.
#' @format
#' ## For the [antibiotics] data set: a [data.frame] with `r nrow(antibiotics)` observations and `r ncol(antibiotics)` variables:
#' - `ab`\cr Antibiotic ID as used in this package (such as `AMC`), using the official EARS-Net (European Antimicrobial Resistance Surveillance Network) codes where available
#' - `cid`\cr Compound ID as found in PubChem
#' - `name`\cr Official name as used by WHONET/EARS-Net or the WHO
#' - `group`\cr A short and concise group name, based on WHONET and WHOCC definitions
#' - `atc`\cr ATC codes (Anatomical Therapeutic Chemical) as defined by the WHOCC, like `J01CR02`
#' - `atc_group1`\cr Official pharmacological subgroup (3rd level ATC code) as defined by the WHOCC, like `"Macrolides, lincosamides and streptogramins"`
#' - `atc_group2`\cr Official chemical subgroup (4th level ATC code) as defined by the WHOCC, like `"Macrolides"`
#' - `abbr`\cr List of abbreviations as used in many countries, also for antibiotic susceptibility testing (AST)
#' - `synonyms`\cr Synonyms (often trade names) of a drug, as found in PubChem based on their compound ID
#' - `oral_ddd`\cr Defined Daily Dose (DDD), oral treatment, currently available for `r sum(!is.na(antibiotics$oral_ddd))` drugs
#' - `oral_units`\cr Units of `oral_ddd`
#' - `iv_ddd`\cr Defined Daily Dose (DDD), parenteral (intravenous) treatment, currently available for `r sum(!is.na(antibiotics$iv_ddd))` drugs
#' - `iv_units`\cr Units of `iv_ddd`
#' - `loinc`\cr All LOINC codes (Logical Observation Identifiers Names and Codes) associated with the name of the antimicrobial agent. Use [ab_loinc()] to retrieve them quickly, see [ab_property()].
#' 
#' ## For the [antivirals] data set: a [data.frame] with `r nrow(antivirals)` observations and `r ncol(antivirals)` variables:
#' - `atc`\cr ATC codes (Anatomical Therapeutic Chemical) as defined by the WHOCC
#' - `cid`\cr Compound ID as found in PubChem
#' - `name`\cr Official name as used by WHONET/EARS-Net or the WHO
#' - `atc_group`\cr Official pharmacological subgroup (3rd level ATC code) as defined by the WHOCC
#' - `synonyms`\cr Synonyms (often trade names) of a drug, as found in PubChem based on their compound ID
#' - `oral_ddd`\cr Defined Daily Dose (DDD), oral treatment
#' - `oral_units`\cr Units of `oral_ddd`
#' - `iv_ddd`\cr Defined Daily Dose (DDD), parenteral treatment
#' - `iv_units`\cr Units of `iv_ddd`
#' @details Properties that are based on an ATC code are only available when an ATC is available. These properties are: `atc_group1`, `atc_group2`, `oral_ddd`, `oral_units`, `iv_ddd` and `iv_units`.
#'
#' Synonyms (i.e. trade names) were derived from the Compound ID (`cid`) and consequently only available where a CID is available.
#' 
#' ## Direct download
#' These data sets are available as 'flat files' for use even without \R - you can find the files here:
#' 
#' * <https://github.com/msberends/AMR/raw/main/data-raw/antibiotics.txt>
#' * <https://github.com/msberends/AMR/raw/main/data-raw/antivirals.txt>
#' 
#' Files in \R format (with preserved data structure) can be found here:
#' 
#' * <https://github.com/msberends/AMR/raw/main/data/antibiotics.rda>
#' * <https://github.com/msberends/AMR/raw/main/data/antivirals.rda>
#' @source World Health Organization (WHO) Collaborating Centre for Drug Statistics Methodology (WHOCC): <https://www.whocc.no/atc_ddd_index/>
#'
#' WHONET 2019 software: <http://www.whonet.org/software.html>
#'
#' European Commission Public Health PHARMACEUTICALS - COMMUNITY REGISTER: <https://ec.europa.eu/health/documents/community-register/html/reg_hum_atc.htm>
#' @inheritSection AMR Reference Data Publicly Available
#' @inheritSection WHOCC WHOCC
#' @inheritSection AMR Read more on Our Website!
#' @seealso [microorganisms], [intrinsic_resistant]
"antibiotics"

#' @rdname antibiotics
"antivirals"

#' Data Set with `r format(nrow(microorganisms), big.mark = ",")` Microorganisms
#'
#' A data set containing the full microbial taxonomy (**last updated: `r CATALOGUE_OF_LIFE$yearmonth_LPSN`**) of `r nr2char(length(unique(microorganisms$kingdom[!microorganisms$kingdom %like% "unknown"])))` kingdoms from the Catalogue of Life (CoL) and the List of Prokaryotic names with Standing in Nomenclature (LPSN). MO codes can be looked up using [as.mo()].
#' @inheritSection catalogue_of_life Catalogue of Life
#' @format A [data.frame] with `r format(nrow(microorganisms), big.mark = ",")` observations and `r ncol(microorganisms)` variables:
#' - `mo`\cr ID of microorganism as used by this package
#' - `fullname`\cr Full name, like `"Escherichia coli"`
#' - `kingdom`, `phylum`, `class`, `order`, `family`, `genus`, `species`, `subspecies`\cr Taxonomic rank of the microorganism
#' - `rank`\cr Text of the taxonomic rank of the microorganism, like `"species"` or `"genus"`
#' - `ref`\cr Author(s) and year of concerning scientific publication
#' - `species_id`\cr ID of the species as used by the Catalogue of Life
#' - `source`\cr Either `r vector_or(microorganisms$source)` (see *Source*)
#' - `prevalence`\cr Prevalence of the microorganism, see [as.mo()]
#' - `snomed`\cr Systematized Nomenclature of Medicine (SNOMED) code of the microorganism, according to the `r SNOMED_VERSION$current_source` (see *Source*). Use [mo_snomed()] to retrieve it quickly, see [mo_property()].
#' @details 
#' Please note that entries are only based on the Catalogue of Life and the LPSN (see below). Since these sources incorporate entries based on (recent) publications in the International Journal of Systematic and Evolutionary Microbiology (IJSEM), it can happen that the year of publication is sometimes later than one might expect.
#' 
#' For example, *Staphylococcus pettenkoferi* was described for the first time in Diagnostic Microbiology and Infectious Disease in 2002 (\doi{10.1016/s0732-8893(02)00399-1}), but it was not before 2007 that a publication in IJSEM followed (\doi{10.1099/ijs.0.64381-0}). Consequently, the `AMR` package returns 2007 for `mo_year("S. pettenkoferi")`.
#' 
#' ## Manual additions
#' For convenience, some entries were added manually:
#' 
#' - 11 entries of *Streptococcus* (beta-haemolytic: groups A, B, C, D, F, G, H, K and unspecified; other: viridans, milleri)
#' - 2 entries of *Staphylococcus* (coagulase-negative (CoNS) and coagulase-positive (CoPS))
#' - 3 entries of *Trichomonas* (*T. vaginalis*, and its family and genus)
#' - 1 entry of *Candida* (*C.  krusei*), that is not (yet) in the Catalogue of Life
#' - 1 entry of *Blastocystis* (*B.  hominis*), although it officially does not exist (Noel *et al.* 2005, PMID 15634993)
#' - 1 entry of *Moraxella* (*M. catarrhalis*), which was formally named *Branhamella catarrhalis* (Catlin, 1970) though this change was never accepted within the field of clinical microbiology
#' - 5 other 'undefined' entries (unknown, unknown Gram negatives, unknown Gram positives, unknown yeast and unknown fungus)
#' - 6 families under the Enterobacterales order, according to Adeolu *et al.* (2016, PMID 27620848), that are not (yet) in the Catalogue of Life
#' 
#' ## Direct download
#' This data set is available as 'flat file' for use even without \R - you can find the file here:
#' 
#' * <https://github.com/msberends/AMR/raw/main/data-raw/microorganisms.txt>
#' 
#' The file in \R format (with preserved data structure) can be found here:
#' 
#' * <https://github.com/msberends/AMR/raw/main/data/microorganisms.rda>
#' @section About the Records from LPSN (see *Source*):
#' The List of Prokaryotic names with Standing in Nomenclature (LPSN) provides comprehensive information on the nomenclature of prokaryotes. LPSN is a free to use service founded by Jean P. Euzeby in 1997 and later on maintained by Aidan C. Parte.
#' 
#' As of February 2020, the regularly augmented LPSN database at DSMZ is the basis of the new LPSN service. The new database was implemented for the Type-Strain Genome Server and augmented in 2018 to store all kinds of nomenclatural information. Data from the previous version of LPSN and from the Prokaryotic Nomenclature Up-to-date (PNU) service were imported into the new system. PNU had been established in 1993 as a service of the Leibniz Institute DSMZ, and was curated by Norbert Weiss, Manfred Kracht and Dorothea Gleim.
#' @source 
#' `r gsub("{year}", CATALOGUE_OF_LIFE$year, CATALOGUE_OF_LIFE$version, fixed = TRUE)` as currently implemented in this `AMR` package:
#' 
#' * Annual Checklist (public online taxonomic database), <http://www.catalogueoflife.org>
#' 
#' List of Prokaryotic names with Standing in Nomenclature (`r CATALOGUE_OF_LIFE$yearmonth_LPSN`) as currently implemented in this `AMR` package:
#' 
#' * Parte, A.C., Sarda Carbasse, J., Meier-Kolthoff, J.P., Reimer, L.C. and Goker, M. (2020). List of Prokaryotic names with Standing in Nomenclature (LPSN) moves to the DSMZ. International Journal of Systematic and Evolutionary Microbiology, 70, 5607-5612; \doi{10.1099/ijsem.0.004332}
#' * Parte, A.C. (2018). LPSN — List of Prokaryotic names with Standing in Nomenclature (bacterio.net), 20 years on. International Journal of Systematic and Evolutionary Microbiology, 68, 1825-1829; \doi{10.1099/ijsem.0.002786}
#' * Parte, A.C. (2014). LPSN — List of Prokaryotic names with Standing in Nomenclature. Nucleic Acids Research, 42, Issue D1, D613–D616; \doi{10.1093/nar/gkt1111}
#' * Euzeby, J.P. (1997). List of Bacterial Names with Standing in Nomenclature: a Folder Available on the Internet. International Journal of Systematic Bacteriology, 47, 590-592; \doi{10.1099/00207713-47-2-590}
#' 
#' `r SNOMED_VERSION$current_source` as currently implemented in this `AMR` package:
#' 
#' * Retrieved from the `r SNOMED_VERSION$title`, OID `r SNOMED_VERSION$current_oid`, version `r SNOMED_VERSION$current_version`; url: <`r SNOMED_VERSION$url`>
#' @inheritSection AMR Reference Data Publicly Available
#' @inheritSection AMR Read more on Our Website!
#' @seealso [as.mo()], [mo_property()], [microorganisms.codes], [intrinsic_resistant]
"microorganisms"

#' Data Set with Previously Accepted Taxonomic Names
#'
#' A data set containing old (previously valid or accepted) taxonomic names according to the Catalogue of Life. This data set is used internally by [as.mo()].
#' @inheritSection catalogue_of_life Catalogue of Life
#' @format A [data.frame] with `r format(nrow(microorganisms.old), big.mark = ",")` observations and `r ncol(microorganisms.old)` variables:
#' - `fullname`\cr Old full taxonomic name of the microorganism
#' - `fullname_new`\cr New full taxonomic name of the microorganism
#' - `ref`\cr Author(s) and year of concerning scientific publication
#' - `prevalence`\cr Prevalence of the microorganism, see [as.mo()]
#' @source Catalogue of Life: Annual Checklist (public online taxonomic database), <http://www.catalogueoflife.org> (check included annual version with [catalogue_of_life_version()]).
#' 
#' Parte, A.C. (2018). LPSN — List of Prokaryotic names with Standing in Nomenclature (bacterio.net), 20 years on. International Journal of Systematic and Evolutionary Microbiology, 68, 1825-1829; \doi{10.1099/ijsem.0.002786}
#' @inheritSection AMR Reference Data Publicly Available
#' @inheritSection AMR Read more on Our Website!
#' @seealso [as.mo()] [mo_property()] [microorganisms]
"microorganisms.old"

#' Data Set with `r format(nrow(microorganisms.codes), big.mark = ",")` Common Microorganism Codes
#'
#' A data set containing commonly used codes for microorganisms, from laboratory systems and WHONET. Define your own with [set_mo_source()]. They will all be searched when using [as.mo()] and consequently all the [`mo_*`][mo_property()] functions.
#' @format A [data.frame] with `r format(nrow(microorganisms.codes), big.mark = ",")` observations and `r ncol(microorganisms.codes)` variables:
#' - `code`\cr Commonly used code of a microorganism
#' - `mo`\cr ID of the microorganism in the [microorganisms] data set
#' @inheritSection AMR Reference Data Publicly Available
#' @inheritSection catalogue_of_life Catalogue of Life
#' @inheritSection AMR Read more on Our Website!
#' @seealso [as.mo()] [microorganisms]
"microorganisms.codes"

#' Data Set with `r format(nrow(example_isolates), big.mark = ",")` Example Isolates
#'
#' A data set containing `r format(nrow(example_isolates), big.mark = ",")` microbial isolates with their full antibiograms. The data set reflects reality and can be used to practice AMR data analysis. For examples, please read [the tutorial on our website](https://msberends.github.io/AMR/articles/AMR.html).
#' @format A [data.frame] with `r format(nrow(example_isolates), big.mark = ",")` observations and `r ncol(example_isolates)` variables:
#' - `date`\cr date of receipt at the laboratory
#' - `hospital_id`\cr ID of the hospital, from A to D
#' - `ward_icu`\cr [logical] to determine if ward is an intensive care unit
#' - `ward_clinical`\cr [logical] to determine if ward is a regular clinical ward
#' - `ward_outpatient`\cr [logical] to determine if ward is an outpatient clinic
#' - `age`\cr age of the patient
#' - `gender`\cr gender of the patient
#' - `patient_id`\cr ID of the patient
#' - `mo`\cr ID of microorganism created with [as.mo()], see also [microorganisms]
#' - `PEN:RIF`\cr `r sum(vapply(FUN.VALUE = logical(1), example_isolates, is.rsi))` different antibiotics with class [`rsi`] (see [as.rsi()]); these column names occur in the [antibiotics] data set and can be translated with [ab_name()]
#' @inheritSection AMR Reference Data Publicly Available
#' @inheritSection AMR Read more on Our Website!
"example_isolates"

#' Data Set with Unclean Data
#'
#' A data set containing `r format(nrow(example_isolates_unclean), big.mark = ",")` microbial isolates that are not cleaned up and consequently not ready for AMR data analysis. This data set can be used for practice.
#' @format A [data.frame] with `r format(nrow(example_isolates_unclean), big.mark = ",")` observations and `r ncol(example_isolates_unclean)` variables:
#' - `patient_id`\cr ID of the patient
#' - `date`\cr date of receipt at the laboratory
#' - `hospital`\cr ID of the hospital, from A to C
#' - `bacteria`\cr info about microorganism that can be transformed with [as.mo()], see also [microorganisms]
#' - `AMX:GEN`\cr 4 different antibiotics that have to be transformed with [as.rsi()]
#' @inheritSection AMR Reference Data Publicly Available
#' @inheritSection AMR Read more on Our Website!
"example_isolates_unclean"

#' Data Set with `r format(nrow(WHONET), big.mark = ",")` Isolates - WHONET Example
#'
#' This example data set has the exact same structure as an export file from WHONET. Such files can be used with this package, as this example data set shows. The antibiotic results are from our [example_isolates] data set. All patient names are created using online surname generators and are only in place for practice purposes.
#' @format A [data.frame] with `r format(nrow(WHONET), big.mark = ",")` observations and `r ncol(WHONET)` variables:
#' - `Identification number`\cr ID of the sample
#' - `Specimen number`\cr ID of the specimen
#' - `Organism`\cr Name of the microorganism. Before analysis, you should transform this to a valid microbial class, using [as.mo()].
#' - `Country`\cr Country of origin
#' - `Laboratory`\cr Name of laboratory
#' - `Last name`\cr Fictitious last name of patient
#' - `First name`\cr Fictitious initial of patient
#' - `Sex`\cr Fictitious gender of patient
#' - `Age`\cr Fictitious age of patient
#' - `Age category`\cr Age group, can also be looked up using [age_groups()]
#' - `Date of admission`\cr [Date] of hospital admission
#' - `Specimen date`\cr [Date] when specimen was received at laboratory
#' - `Specimen type`\cr Specimen type or group
#' - `Specimen type (Numeric)`\cr Translation of `"Specimen type"`
#' - `Reason`\cr Reason of request with Differential Diagnosis
#' - `Isolate number`\cr ID of isolate
#' - `Organism type`\cr Type of microorganism, can also be looked up using [mo_type()]
#' - `Serotype`\cr Serotype of microorganism
#' - `Beta-lactamase`\cr Microorganism produces beta-lactamase?
#' - `ESBL`\cr Microorganism produces extended spectrum beta-lactamase?
#' - `Carbapenemase`\cr Microorganism produces carbapenemase?
#' - `MRSA screening test`\cr Microorganism is possible MRSA?
#' - `Inducible clindamycin resistance`\cr Clindamycin can be induced?
#' - `Comment`\cr Other comments
#' - `Date of data entry`\cr [Date] this data was entered in WHONET
#' - `AMP_ND10:CIP_EE`\cr `r sum(vapply(FUN.VALUE = logical(1), WHONET, is.rsi))` different antibiotics. You can lookup the abbreviations in the [antibiotics] data set, or use e.g. [`ab_name("AMP")`][ab_name()] to get the official name immediately. Before analysis, you should transform this to a valid antibiotic class, using [as.rsi()].
#' @inheritSection AMR Reference Data Publicly Available
#' @inheritSection AMR Read more on Our Website!
"WHONET"

#' Data Set for R/SI Interpretation
#'
#' Data set containing reference data to interpret MIC and disk diffusion to R/SI values, according to international guidelines. Currently implemented guidelines are EUCAST (`r min(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "EUCAST")$guideline)))`-`r max(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "EUCAST")$guideline)))`) and CLSI (`r min(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "CLSI")$guideline)))`-`r max(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "CLSI")$guideline)))`). Use [as.rsi()] to transform MICs or disks measurements to R/SI values.
#' @format A [data.frame] with `r format(nrow(rsi_translation), big.mark = ",")` observations and `r ncol(rsi_translation)` variables:
#' - `guideline`\cr Name of the guideline
#' - `method`\cr Either `r vector_or(rsi_translation$method)`
#' - `site`\cr Body site, e.g. "Oral" or "Respiratory"
#' - `mo`\cr Microbial ID, see [as.mo()]
#' - `rank_index`\cr Taxonomic rank index of `mo` from 1 (subspecies/infraspecies) to 5 (unknown microorganism)
#' - `ab`\cr Antibiotic ID, see [as.ab()]
#' - `ref_tbl`\cr Info about where the guideline rule can be found
#' - `disk_dose`\cr Dose of the used disk diffusion method
#' - `breakpoint_S`\cr Lowest MIC value or highest number of millimetres that leads to "S"
#' - `breakpoint_R`\cr Highest MIC value or lowest number of millimetres that leads to "R"
#' - `uti`\cr A [logical] value (`TRUE`/`FALSE`) to indicate whether the rule applies to a urinary tract infection (UTI)
#' @details The repository of this `AMR` package contains a file comprising this exact data set: <https://github.com/msberends/AMR/blob/main/data-raw/rsi_translation.txt>. This file **allows for machine reading EUCAST and CLSI guidelines**, which is almost impossible with the Excel and PDF files distributed by EUCAST and CLSI. The file is updated automatically and the `mo` and `ab` columns have been transformed to contain the full official names instead of codes.
#' @inheritSection AMR Reference Data Publicly Available
#' @inheritSection AMR Read more on Our Website!
#' @seealso [intrinsic_resistant]
"rsi_translation"

#' Data Set with Bacterial Intrinsic Resistance
#'
#' Data set containing defined intrinsic resistance by EUCAST of all bug-drug combinations.
#' @format A [data.frame] with `r format(nrow(intrinsic_resistant), big.mark = ",")` observations and `r ncol(intrinsic_resistant)` variables:
#' - `mo`\cr Microorganism ID
#' - `ab`\cr Antibiotic ID
#' @details The repository of this `AMR` package contains a file comprising this data set with full taxonomic and antibiotic names: <https://github.com/msberends/AMR/blob/main/data-raw/intrinsic_resistant.txt>. This file **allows for machine reading EUCAST guidelines about intrinsic resistance**, which is almost impossible with the Excel and PDF files distributed by EUCAST. The file is updated automatically.
#' 
#' This data set is based on `r format_eucast_version_nr(3.3)`.
#' @inheritSection AMR Reference Data Publicly Available
#' @inheritSection AMR Read more on Our Website!
#' @examples
#' subset(intrinsic_resistant,
#'        antibiotic == "Vancomycin" & microorganism %like% "Enterococcus")$microorganism
#' #> [1] "Enterococcus casseliflavus" "Enterococcus gallinarum"
#' 
#' \donttest{
#' if (require("dplyr")) {
#'   intrinsic_resistant %>%
#'     filter(antibiotic == "Vancomycin" & microorganism %like% "Enterococcus") %>% 
#'     pull(microorganism)
#'   #> [1] "Enterococcus casseliflavus" "Enterococcus gallinarum"
#' }
#' }
"intrinsic_resistant"

#' Data Set with Treatment Dosages as Defined by EUCAST
#'
#' EUCAST breakpoints used in this package are based on the dosages in this data set. They can be retrieved with [eucast_dosage()].
#' @format A [data.frame] with `r format(nrow(dosage), big.mark = ",")` observations and `r ncol(dosage)` variables:
#' - `ab`\cr Antibiotic ID as used in this package (such as `AMC`), using the official EARS-Net (European Antimicrobial Resistance Surveillance Network) codes where available
#' - `name`\cr Official name of the antimicrobial agent as used by WHONET/EARS-Net or the WHO
#' - `type`\cr Type of the dosage, either `r vector_or(dosage$type)`
#' - `dose`\cr Dose, such as "2 g" or "25 mg/kg"
#' - `dose_times`\cr Number of times a dose must be administered
#' - `administration`\cr Route of administration, either `r vector_or(dosage$administration)`
#' - `notes`\cr Additional dosage notes
#' - `original_txt`\cr Original text in the PDF file of EUCAST
#' - `eucast_version`\cr Version number of the EUCAST Clinical Breakpoints guideline to which these dosages apply
#' @details `r format_eucast_version_nr(11.0)` are based on the dosages in this data set.
#' @inheritSection AMR Reference Data Publicly Available
#' @inheritSection AMR Read more on Our Website!
"dosage"
-												first commit

											
										
										
											2018-02-21 11:52:31 +01:00
+								# ==================================================================== #
 								# TITLE                                                                #
-												(v1.5.0.9014) only_rsi_columns, is.rsi.eligible improvement

											
										
										
											2021-02-02 23:57:35 +01:00
+								# Antimicrobial Resistance (AMR) Data Analysis for R                   #
-												first commit

											
										
										
											2018-02-21 11:52:31 +01:00
+								#                                                                      #
-												big website update, licence txt update

											
										
										
											2019-01-02 23:24:07 +01:00
+								# SOURCE                                                               #
-												(v1.2.0.9026) move to github

											
										
										
											2020-07-08 14:48:06 +02:00
+								# https://github.com/msberends/AMR                                     #
-												first commit

											
										
										
											2018-02-21 11:52:31 +01:00
+								#                                                                      #
 								# LICENCE                                                              #
-												(v1.4.0.9047) unit tests

											
										
										
											2020-12-27 00:30:28 +01:00
+								# (c) 2018-2021 Berends MS, Luz CF et al.                              #
-												(v1.4.0) matching score update

											
										
										
											2020-10-08 11:16:03 +02:00
+								# Developed at the University of Groningen, the Netherlands, in        #
 								# collaboration with non-profit organisations Certe Medical            #
 								# Diagnostics & Advice, and University Medical Center Groningen.       #
-												first commit

											
										
										
											2018-02-21 11:52:31 +01:00
+								#                                                                      #
-												big website update, licence txt update

											
										
										
											2019-01-02 23:24:07 +01:00
+								# This R package is free software; you can freely use and distribute   #
 								# it for both personal and commercial purposes under the terms of the  #
 								# GNU General Public License version 2.0 (GNU GPL-2), as published by  #
 								# the Free Software Foundation.                                        #
-												(v0.9.0.9008) Happy new year! Add lifecycles

											
										
										
											2020-01-05 17:22:09 +01:00
+								# We created this package for both routine data analysis and academic  #
 								# research and it was publicly released in the hope that it will be    #
 								# useful, but it comes WITHOUT ANY WARRANTY OR LIABILITY.              #
-												(v1.4.0) matching score update

											
										
										
											2020-10-08 11:16:03 +02:00
+								#                                                                      #
 								# Visit our website for the full manual and a complete tutorial about  #
-												(v1.5.0.9014) only_rsi_columns, is.rsi.eligible improvement

											
										
										
											2021-02-02 23:57:35 +01:00
+								# how to conduct AMR data analysis: https://msberends.github.io/AMR/   #
-												first commit

											
										
										
											2018-02-21 11:52:31 +01:00
+								# ==================================================================== #
-												(v1.7.1.9026) updated DDDs

											
										
										
											2021-08-19 23:43:02 +02:00
+								#' Data Sets with `r format(nrow(antibiotics) + nrow(antivirals), big.mark = ",")` Antimicrobial Drugs
-												first commit

											
										
										
											2018-02-21 11:52:31 +01:00
+								#'
-												(v1.7.1.9023) Removed filter_ functions, new set_ab_names(), ATC code update, ab selector update, fixes #46 and fixed #47

											
										
										
											2021-08-16 21:54:34 +02:00
+								#' Two data sets containing all antibiotics/antimycotics and antivirals. Use [as.ab()] or one of the [`ab_*`][ab_property()] functions to retrieve values from the [antibiotics] data set. Three identifiers are included in this data set: an antibiotic ID (`ab`, primarily used in this package) as defined by WHONET/EARS-Net, an ATC code (`atc`) as defined by the WHO, and a Compound ID (`cid`) as found in PubChem. Other properties in this data set are derived from one or more of these codes. Note that some drugs have multiple ATC codes.
-												(v0.8.0.9034) add cid to antivirals

											
										
										
											2019-11-23 12:39:57 +01:00
+								#' @format
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' ## For the [antibiotics] data set: a [data.frame] with `r nrow(antibiotics)` observations and `r ncol(antibiotics)` variables:
-												(v1.4.0.9041) updates based on review

											
										
										
											2020-12-17 16:22:25 +01:00
+								#' - `ab`\cr Antibiotic ID as used in this package (such as `AMC`), using the official EARS-Net (European Antimicrobial Resistance Surveillance Network) codes where available
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `cid`\cr Compound ID as found in PubChem
 								#' - `name`\cr Official name as used by WHONET/EARS-Net or the WHO
 								#' - `group`\cr A short and concise group name, based on WHONET and WHOCC definitions
-												(v1.7.1.9023) Removed filter_ functions, new set_ab_names(), ATC code update, ab selector update, fixes #46 and fixed #47

											
										
										
											2021-08-16 21:54:34 +02:00
+								#' - `atc`\cr ATC codes (Anatomical Therapeutic Chemical) as defined by the WHOCC, like `J01CR02`
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `atc_group1`\cr Official pharmacological subgroup (3rd level ATC code) as defined by the WHOCC, like `"Macrolides, lincosamides and streptogramins"`
 								#' - `atc_group2`\cr Official chemical subgroup (4th level ATC code) as defined by the WHOCC, like `"Macrolides"`
 								#' - `abbr`\cr List of abbreviations as used in many countries, also for antibiotic susceptibility testing (AST)
 								#' - `synonyms`\cr Synonyms (often trade names) of a drug, as found in PubChem based on their compound ID
-												(v1.7.1.9026) updated DDDs

											
										
										
											2021-08-19 23:43:02 +02:00
+								#' - `oral_ddd`\cr Defined Daily Dose (DDD), oral treatment, currently available for `r sum(!is.na(antibiotics$oral_ddd))` drugs
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `oral_units`\cr Units of `oral_ddd`
-												(v1.7.1.9026) updated DDDs

											
										
										
											2021-08-19 23:43:02 +02:00
+								#' - `iv_ddd`\cr Defined Daily Dose (DDD), parenteral (intravenous) treatment, currently available for `r sum(!is.na(antibiotics$iv_ddd))` drugs
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `iv_units`\cr Units of `iv_ddd`
-												(v0.9.0.9013) Support for LOINC codes

											
										
										
											2020-01-26 20:38:54 +01:00
+								#' - `loinc`\cr All LOINC codes (Logical Observation Identifiers Names and Codes) associated with the name of the antimicrobial agent. Use [ab_loinc()] to retrieve them quickly, see [ab_property()].
-												(v0.8.0.9034) add cid to antivirals

											
										
										
											2019-11-23 12:39:57 +01:00
+								#'
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' ## For the [antivirals] data set: a [data.frame] with `r nrow(antivirals)` observations and `r ncol(antivirals)` variables:
-												(v1.7.1.9023) Removed filter_ functions, new set_ab_names(), ATC code update, ab selector update, fixes #46 and fixed #47

											
										
										
											2021-08-16 21:54:34 +02:00
+								#' - `atc`\cr ATC codes (Anatomical Therapeutic Chemical) as defined by the WHOCC
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `cid`\cr Compound ID as found in PubChem
 								#' - `name`\cr Official name as used by WHONET/EARS-Net or the WHO
 								#' - `atc_group`\cr Official pharmacological subgroup (3rd level ATC code) as defined by the WHOCC
 								#' - `synonyms`\cr Synonyms (often trade names) of a drug, as found in PubChem based on their compound ID
 								#' - `oral_ddd`\cr Defined Daily Dose (DDD), oral treatment
 								#' - `oral_units`\cr Units of `oral_ddd`
 								#' - `iv_ddd`\cr Defined Daily Dose (DDD), parenteral treatment
 								#' - `iv_units`\cr Units of `iv_ddd`
 								#' @details Properties that are based on an ATC code are only available when an ATC is available. These properties are: `atc_group1`, `atc_group2`, `oral_ddd`, `oral_units`, `iv_ddd` and `iv_units`.
-												(v0.8.0.9034) add cid to antivirals

											
										
										
											2019-11-23 12:39:57 +01:00
+								#'
-												(v1.7.1.9026) updated DDDs

											
										
										
											2021-08-19 23:43:02 +02:00
+								#' Synonyms (i.e. trade names) were derived from the Compound ID (`cid`) and consequently only available where a CID is available.
-												(v0.9.0.9019) website update

											
										
										
											2020-02-01 15:09:36 +01:00
+								#'
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' ## Direct download
-												(v1.4.0.9042) auto dark theme website

											
										
										
											2020-12-21 22:46:29 +01:00
+								#' These data sets are available as 'flat files' for use even without \R - you can find the files here:
-												(v0.9.0.9019) website update

											
										
										
											2020-02-01 15:09:36 +01:00
+								#'
-												(v1.7.1.9051) updated taxonomy, updated git branch name

											
										
										
											2021-10-06 13:23:57 +02:00
+								#' * <https://github.com/msberends/AMR/raw/main/data-raw/antibiotics.txt>
 								#' * <https://github.com/msberends/AMR/raw/main/data-raw/antivirals.txt>
-												(v1.0.1.9008) RIVM abbreviations for drugs

											
										
										
											2020-04-14 20:38:09 +02:00
+								#'
-												(v1.4.0.9042) auto dark theme website

											
										
										
											2020-12-21 22:46:29 +01:00
+								#' Files in \R format (with preserved data structure) can be found here:
-												(v1.0.1.9008) RIVM abbreviations for drugs

											
										
										
											2020-04-14 20:38:09 +02:00
+								#'
-												(v1.7.1.9051) updated taxonomy, updated git branch name

											
										
										
											2021-10-06 13:23:57 +02:00
+								#' * <https://github.com/msberends/AMR/raw/main/data/antibiotics.rda>
 								#' * <https://github.com/msberends/AMR/raw/main/data/antivirals.rda>
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' @source World Health Organization (WHO) Collaborating Centre for Drug Statistics Methodology (WHOCC): <https://www.whocc.no/atc_ddd_index/>
-												(v0.8.0.9034) add cid to antivirals

											
										
										
											2019-11-23 12:39:57 +01:00
+								#'
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' WHONET 2019 software: <http://www.whonet.org/software.html>
-												(v0.8.0.9034) add cid to antivirals

											
										
										
											2019-11-23 12:39:57 +01:00
+								#'
-												v1.7.0

											
										
										
											2021-05-24 15:29:17 +02:00
+								#' European Commission Public Health PHARMACEUTICALS - COMMUNITY REGISTER: <https://ec.europa.eu/health/documents/community-register/html/reg_hum_atc.htm>
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Reference Data Publicly Available
-												(v0.8.0.9033) antivirals data set, cleanup

											
										
										
											2019-11-18 12:10:47 +01:00
+								#' @inheritSection WHOCC WHOCC
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Read more on Our Website!
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
+								#' @seealso [microorganisms], [intrinsic_resistant]
-												(v0.8.0.9034) add cid to antivirals

											
										
										
											2019-11-23 12:39:57 +01:00
+								"antibiotics"
 								#' @rdname antibiotics
-												(v0.8.0.9033) antivirals data set, cleanup

											
										
										
											2019-11-18 12:10:47 +01:00
+								"antivirals"
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' Data Set with `r format(nrow(microorganisms), big.mark = ",")` Microorganisms
-												first commit

											
										
										
											2018-02-21 11:52:31 +01:00
+								#'
-												(v1.7.1.9051) updated taxonomy, updated git branch name

											
										
										
											2021-10-06 13:23:57 +02:00
+								#' A data set containing the full microbial taxonomy (**last updated: `r CATALOGUE_OF_LIFE$yearmonth_LPSN`**) of `r nr2char(length(unique(microorganisms$kingdom[!microorganisms$kingdom %like% "unknown"])))` kingdoms from the Catalogue of Life (CoL) and the List of Prokaryotic names with Standing in Nomenclature (LPSN). MO codes can be looked up using [as.mo()].
-												Catalogue of life

											
										
										
											2019-02-20 00:04:48 +01:00
+								#' @inheritSection catalogue_of_life Catalogue of Life
-												(v1.3.0.9022) mo_matching_score(), poorman update, as.rsi() fix

											
										
										
											2020-09-18 16:05:53 +02:00
+								#' @format A [data.frame] with `r format(nrow(microorganisms), big.mark = ",")` observations and `r ncol(microorganisms)` variables:
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `mo`\cr ID of microorganism as used by this package
 								#' - `fullname`\cr Full name, like `"Escherichia coli"`
 								#' - `kingdom`, `phylum`, `class`, `order`, `family`, `genus`, `species`, `subspecies`\cr Taxonomic rank of the microorganism
 								#' - `rank`\cr Text of the taxonomic rank of the microorganism, like `"species"` or `"genus"`
 								#' - `ref`\cr Author(s) and year of concerning scientific publication
 								#' - `species_id`\cr ID of the species as used by the Catalogue of Life
-												(v1.5.0.9028) Updated taxonomy until March 2021

											
										
										
											2021-03-04 23:28:32 +01:00
+								#' - `source`\cr Either `r vector_or(microorganisms$source)` (see *Source*)
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `prevalence`\cr Prevalence of the microorganism, see [as.mo()]
-												(v1.5.0.9041) SNOMED update

											
										
										
											2021-03-11 21:42:30 +01:00
+								#' - `snomed`\cr Systematized Nomenclature of Medicine (SNOMED) code of the microorganism, according to the `r SNOMED_VERSION$current_source` (see *Source*). Use [mo_snomed()] to retrieve it quickly, see [mo_property()].
-												(v.1.5.0.9000) implementation of EUCAST rules v11 (2021)

											
										
										
											2021-01-12 22:08:04 +01:00
+								#' @details
 								#' Please note that entries are only based on the Catalogue of Life and the LPSN (see below). Since these sources incorporate entries based on (recent) publications in the International Journal of Systematic and Evolutionary Microbiology (IJSEM), it can happen that the year of publication is sometimes later than one might expect.
 								#'
-												(v1.6.0.9007) documentation custom eucast rules, progress bar as.mo

											
										
										
											2021-04-20 10:46:17 +02:00
+								#' For example, *Staphylococcus pettenkoferi* was described for the first time in Diagnostic Microbiology and Infectious Disease in 2002 (\doi{10.1016/s0732-8893(02)00399-1}), but it was not before 2007 that a publication in IJSEM followed (\doi{10.1099/ijs.0.64381-0}). Consequently, the `AMR` package returns 2007 for `mo_year("S. pettenkoferi")`.
-												(v.1.5.0.9000) implementation of EUCAST rules v11 (2021)

											
										
										
											2021-01-12 22:08:04 +01:00
+								#'
-												(v1.5.0.9028) Updated taxonomy until March 2021

											
										
										
											2021-03-04 23:28:32 +01:00
+								#' ## Manual additions
-												(v.1.5.0.9000) implementation of EUCAST rules v11 (2021)

											
										
										
											2021-01-12 22:08:04 +01:00
+								#' For convenience, some entries were added manually:
 								#'
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - 11 entries of *Streptococcus* (beta-haemolytic: groups A, B, C, D, F, G, H, K and unspecified; other: viridans, milleri)
-												(v0.8.0.9037) complete documentation rewrite

											
										
										
											2019-11-28 23:00:37 +01:00
+								#' - 2 entries of *Staphylococcus* (coagulase-negative (CoNS) and coagulase-positive (CoPS))
-												(v1.7.1.9051) updated taxonomy, updated git branch name

											
										
										
											2021-10-06 13:23:57 +02:00
+								#' - 3 entries of *Trichomonas* (*T. vaginalis*, and its family and genus)
 								#' - 1 entry of *Candida* (*C.  krusei*), that is not (yet) in the Catalogue of Life
 								#' - 1 entry of *Blastocystis* (*B.  hominis*), although it officially does not exist (Noel *et al.* 2005, PMID 15634993)
 								#' - 1 entry of *Moraxella* (*M. catarrhalis*), which was formally named *Branhamella catarrhalis* (Catlin, 1970) though this change was never accepted within the field of clinical microbiology
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - 5 other 'undefined' entries (unknown, unknown Gram negatives, unknown Gram positives, unknown yeast and unknown fungus)
-												(v1.0.1.9005) as.mo() improvements

											
										
										
											2020-04-13 21:09:56 +02:00
+								#' - 6 families under the Enterobacterales order, according to Adeolu *et al.* (2016, PMID 27620848), that are not (yet) in the Catalogue of Life
-												(v0.9.0.9019) website update

											
										
										
											2020-02-01 15:09:36 +01:00
+								#'
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' ## Direct download
-												(v1.4.0.9042) auto dark theme website

											
										
										
											2020-12-21 22:46:29 +01:00
+								#' This data set is available as 'flat file' for use even without \R - you can find the file here:
-												(v0.9.0.9019) website update

											
										
										
											2020-02-01 15:09:36 +01:00
+								#'
-												(v1.7.1.9051) updated taxonomy, updated git branch name

											
										
										
											2021-10-06 13:23:57 +02:00
+								#' * <https://github.com/msberends/AMR/raw/main/data-raw/microorganisms.txt>
-												(v1.0.1.9008) RIVM abbreviations for drugs

											
										
										
											2020-04-14 20:38:09 +02:00
+								#'
-												(v1.4.0.9042) auto dark theme website

											
										
										
											2020-12-21 22:46:29 +01:00
+								#' The file in \R format (with preserved data structure) can be found here:
-												(v1.0.1.9008) RIVM abbreviations for drugs

											
										
										
											2020-04-14 20:38:09 +02:00
+								#'
-												(v1.7.1.9051) updated taxonomy, updated git branch name

											
										
										
											2021-10-06 13:23:57 +02:00
+								#' * <https://github.com/msberends/AMR/raw/main/data/microorganisms.rda>
-												(v1.5.0.9028) Updated taxonomy until March 2021

											
										
										
											2021-03-04 23:28:32 +01:00
+								#' @section About the Records from LPSN (see *Source*):
 								#' The List of Prokaryotic names with Standing in Nomenclature (LPSN) provides comprehensive information on the nomenclature of prokaryotes. LPSN is a free to use service founded by Jean P. Euzeby in 1997 and later on maintained by Aidan C. Parte.
-												(v1.2.0.9035) as.mo() speed improvement

											
										
										
											2020-07-22 10:24:23 +02:00
+								#'
-												(v1.5.0.9028) Updated taxonomy until March 2021

											
										
										
											2021-03-04 23:28:32 +01:00
+								#' As of February 2020, the regularly augmented LPSN database at DSMZ is the basis of the new LPSN service. The new database was implemented for the Type-Strain Genome Server and augmented in 2018 to store all kinds of nomenclatural information. Data from the previous version of LPSN and from the Prokaryotic Nomenclature Up-to-date (PNU) service were imported into the new system. PNU had been established in 1993 as a service of the Leibniz Institute DSMZ, and was curated by Norbert Weiss, Manfred Kracht and Dorothea Gleim.
 								#' @source
-												(v1.5.0.9041) SNOMED update

											
										
										
											2021-03-11 21:42:30 +01:00
+								#' `r gsub("{year}", CATALOGUE_OF_LIFE$year, CATALOGUE_OF_LIFE$version, fixed = TRUE)` as currently implemented in this `AMR` package:
-												(v1.1.0.9020) updated taxonomy

											
										
										
											2020-05-27 16:37:49 +02:00
+								#'
-												(v1.5.0.9028) Updated taxonomy until March 2021

											
										
										
											2021-03-04 23:28:32 +01:00
+								#' * Annual Checklist (public online taxonomic database), <http://www.catalogueoflife.org>
 								#'
-												(v1.5.0.9041) SNOMED update

											
										
										
											2021-03-11 21:42:30 +01:00
+								#' List of Prokaryotic names with Standing in Nomenclature (`r CATALOGUE_OF_LIFE$yearmonth_LPSN`) as currently implemented in this `AMR` package:
-												(v1.5.0.9028) Updated taxonomy until March 2021

											
										
										
											2021-03-04 23:28:32 +01:00
+								#'
 								#' * Parte, A.C., Sarda Carbasse, J., Meier-Kolthoff, J.P., Reimer, L.C. and Goker, M. (2020). List of Prokaryotic names with Standing in Nomenclature (LPSN) moves to the DSMZ. International Journal of Systematic and Evolutionary Microbiology, 70, 5607-5612; \doi{10.1099/ijsem.0.004332}
 								#' * Parte, A.C. (2018). LPSN — List of Prokaryotic names with Standing in Nomenclature (bacterio.net), 20 years on. International Journal of Systematic and Evolutionary Microbiology, 68, 1825-1829; \doi{10.1099/ijsem.0.002786}
 								#' * Parte, A.C. (2014). LPSN — List of Prokaryotic names with Standing in Nomenclature. Nucleic Acids Research, 42, Issue D1, D613–D616; \doi{10.1093/nar/gkt1111}
 								#' * Euzeby, J.P. (1997). List of Bacterial Names with Standing in Nomenclature: a Folder Available on the Internet. International Journal of Systematic Bacteriology, 47, 590-592; \doi{10.1099/00207713-47-2-590}
-												(v1.5.0.9041) SNOMED update

											
										
										
											2021-03-11 21:42:30 +01:00
+								#'
 								#' `r SNOMED_VERSION$current_source` as currently implemented in this `AMR` package:
 								#'
 								#' * Retrieved from the `r SNOMED_VERSION$title`, OID `r SNOMED_VERSION$current_oid`, version `r SNOMED_VERSION$current_version`; url: <`r SNOMED_VERSION$url`>
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Reference Data Publicly Available
 								#' @inheritSection AMR Read more on Our Website!
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
+								#' @seealso [as.mo()], [mo_property()], [microorganisms.codes], [intrinsic_resistant]
-												- For functions `first_isolate`, `EUCAST_rules` the antibiotic column names are case-insensitive
- Functions `first_isolate`, `EUCAST_rules` and `rsi_predict` supports tidyverse-like evaluation of parameters (no need to quote columns them anymore)
- Functions `clipboard_import` and `clipboard_export` as helper functions to quickly copy and paste from/to software like Excel and SPSS
- Renamed dataset `bactlist` to `microorganisms`

											
										
										
											2018-03-23 14:46:02 +01:00
+								"microorganisms"
-												first commit

											
										
										
											2018-02-21 11:52:31 +01:00
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' Data Set with Previously Accepted Taxonomic Names
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
+								#'
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' A data set containing old (previously valid or accepted) taxonomic names according to the Catalogue of Life. This data set is used internally by [as.mo()].
-												Catalogue of life

											
										
										
											2019-02-20 00:04:48 +01:00
+								#' @inheritSection catalogue_of_life Catalogue of Life
-												(v1.3.0.9022) mo_matching_score(), poorman update, as.rsi() fix

											
										
										
											2020-09-18 16:05:53 +02:00
+								#' @format A [data.frame] with `r format(nrow(microorganisms.old), big.mark = ",")` observations and `r ncol(microorganisms.old)` variables:
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `fullname`\cr Old full taxonomic name of the microorganism
-												(v1.1.0.9020) updated taxonomy

											
										
										
											2020-05-27 16:37:49 +02:00
+								#' - `fullname_new`\cr New full taxonomic name of the microorganism
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `ref`\cr Author(s) and year of concerning scientific publication
 								#' - `prevalence`\cr Prevalence of the microorganism, see [as.mo()]
 								#' @source Catalogue of Life: Annual Checklist (public online taxonomic database), <http://www.catalogueoflife.org> (check included annual version with [catalogue_of_life_version()]).
-												(v1.1.0.9020) updated taxonomy

											
										
										
											2020-05-27 16:37:49 +02:00
+								#'
-												v1.5.0

											
										
										
											2021-01-06 11:16:17 +01:00
+								#' Parte, A.C. (2018). LPSN — List of Prokaryotic names with Standing in Nomenclature (bacterio.net), 20 years on. International Journal of Systematic and Evolutionary Microbiology, 68, 1825-1829; \doi{10.1099/ijsem.0.002786}
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Reference Data Publicly Available
 								#' @inheritSection AMR Read more on Our Website!
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' @seealso [as.mo()] [mo_property()] [microorganisms]
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
+								"microorganisms.old"
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' Data Set with `r format(nrow(microorganisms.codes), big.mark = ",")` Common Microorganism Codes
-												first commit

											
										
										
											2018-02-21 11:52:31 +01:00
+								#'
-												(v0.9.0.9020) as.mo() improvement

											
										
										
											2020-02-09 22:04:29 +01:00
+								#' A data set containing commonly used codes for microorganisms, from laboratory systems and WHONET. Define your own with [set_mo_source()]. They will all be searched when using [as.mo()] and consequently all the [`mo_*`][mo_property()] functions.
-												(v1.3.0.9022) mo_matching_score(), poorman update, as.rsi() fix

											
										
										
											2020-09-18 16:05:53 +02:00
+								#' @format A [data.frame] with `r format(nrow(microorganisms.codes), big.mark = ",")` observations and `r ncol(microorganisms.codes)` variables:
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `code`\cr Commonly used code of a microorganism
 								#' - `mo`\cr ID of the microorganism in the [microorganisms] data set
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Reference Data Publicly Available
-												Catalogue of life

											
										
										
											2019-02-20 00:04:48 +01:00
+								#' @inheritSection catalogue_of_life Catalogue of Life
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Read more on Our Website!
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' @seealso [as.mo()] [microorganisms]
-												set_mo_source

											
										
										
											2019-01-21 15:53:01 +01:00
+								"microorganisms.codes"
-												added septic_patients

											
										
										
											2018-02-27 20:01:02 +01:00
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' Data Set with `r format(nrow(example_isolates), big.mark = ",")` Example Isolates
-												added septic_patients

											
										
										
											2018-02-27 20:01:02 +01:00
+								#'
-												(v1.5.0.9014) only_rsi_columns, is.rsi.eligible improvement

											
										
										
											2021-02-02 23:57:35 +01:00
+								#' A data set containing `r format(nrow(example_isolates), big.mark = ",")` microbial isolates with their full antibiograms. The data set reflects reality and can be used to practice AMR data analysis. For examples, please read [the tutorial on our website](https://msberends.github.io/AMR/articles/AMR.html).
-												(v1.3.0.9022) mo_matching_score(), poorman update, as.rsi() fix

											
										
										
											2020-09-18 16:05:53 +02:00
+								#' @format A [data.frame] with `r format(nrow(example_isolates), big.mark = ",")` observations and `r ncol(example_isolates)` variables:
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `date`\cr date of receipt at the laboratory
 								#' - `hospital_id`\cr ID of the hospital, from A to D
-												(v1.6.0.9021) join functions update

											
										
										
											2021-05-12 18:15:03 +02:00
+								#' - `ward_icu`\cr [logical] to determine if ward is an intensive care unit
 								#' - `ward_clinical`\cr [logical] to determine if ward is a regular clinical ward
 								#' - `ward_outpatient`\cr [logical] to determine if ward is an outpatient clinic
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `age`\cr age of the patient
 								#' - `gender`\cr gender of the patient
 								#' - `patient_id`\cr ID of the patient
 								#' - `mo`\cr ID of microorganism created with [as.mo()], see also [microorganisms]
-												(v1.4.0.9052) replaced all sapply's with type-safe vapply's

											
										
										
											2020-12-28 22:24:33 +01:00
+								#' - `PEN:RIF`\cr `r sum(vapply(FUN.VALUE = logical(1), example_isolates, is.rsi))` different antibiotics with class [`rsi`] (see [as.rsi()]); these column names occur in the [antibiotics] data set and can be translated with [ab_name()]
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Reference Data Publicly Available
 								#' @inheritSection AMR Read more on Our Website!
-												(v0.7.1.9063) septic_patients -> example_isolates

											
										
										
											2019-08-27 16:45:42 +02:00
+								"example_isolates"
-												speed improvement as.mo, freq title

											
										
										
											2018-10-31 12:10:49 +01:00
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' Data Set with Unclean Data
-												(v1.0.0.9004) add example_isolaten_unclean

											
										
										
											2020-02-21 16:05:19 +01:00
+								#'
-												(v1.5.0.9014) only_rsi_columns, is.rsi.eligible improvement

											
										
										
											2021-02-02 23:57:35 +01:00
+								#' A data set containing `r format(nrow(example_isolates_unclean), big.mark = ",")` microbial isolates that are not cleaned up and consequently not ready for AMR data analysis. This data set can be used for practice.
-												(v1.3.0.9022) mo_matching_score(), poorman update, as.rsi() fix

											
										
										
											2020-09-18 16:05:53 +02:00
+								#' @format A [data.frame] with `r format(nrow(example_isolates_unclean), big.mark = ",")` observations and `r ncol(example_isolates_unclean)` variables:
-												(v1.0.0.9004) add example_isolaten_unclean

											
										
										
											2020-02-21 16:05:19 +01:00
+								#' - `patient_id`\cr ID of the patient
 								#' - `date`\cr date of receipt at the laboratory
 								#' - `hospital`\cr ID of the hospital, from A to C
 								#' - `bacteria`\cr info about microorganism that can be transformed with [as.mo()], see also [microorganisms]
 								#' - `AMX:GEN`\cr 4 different antibiotics that have to be transformed with [as.rsi()]
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Reference Data Publicly Available
 								#' @inheritSection AMR Read more on Our Website!
-												(v1.0.0.9004) add example_isolaten_unclean

											
										
										
											2020-02-21 16:05:19 +01:00
+								"example_isolates_unclean"
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' Data Set with `r format(nrow(WHONET), big.mark = ",")` Isolates - WHONET Example
-												freq update

											
										
										
											2019-01-29 20:20:09 +01:00
+								#'
-												(v1.3.0.9016) mo_uncertainties() overhaul

											
										
										
											2020-09-12 08:49:01 +02:00
+								#' This example data set has the exact same structure as an export file from WHONET. Such files can be used with this package, as this example data set shows. The antibiotic results are from our [example_isolates] data set. All patient names are created using online surname generators and are only in place for practice purposes.
-												(v1.3.0.9022) mo_matching_score(), poorman update, as.rsi() fix

											
										
										
											2020-09-18 16:05:53 +02:00
+								#' @format A [data.frame] with `r format(nrow(WHONET), big.mark = ",")` observations and `r ncol(WHONET)` variables:
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `Identification number`\cr ID of the sample
 								#' - `Specimen number`\cr ID of the specimen
 								#' - `Organism`\cr Name of the microorganism. Before analysis, you should transform this to a valid microbial class, using [as.mo()].
 								#' - `Country`\cr Country of origin
 								#' - `Laboratory`\cr Name of laboratory
-												(v1.3.0.9014) as.mo() speed improvement

											
										
										
											2020-09-03 12:31:48 +02:00
+								#' - `Last name`\cr Fictitious last name of patient
 								#' - `First name`\cr Fictitious initial of patient
 								#' - `Sex`\cr Fictitious gender of patient
 								#' - `Age`\cr Fictitious age of patient
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `Age category`\cr Age group, can also be looked up using [age_groups()]
-												(v1.6.0.9021) join functions update

											
										
										
											2021-05-12 18:15:03 +02:00
+								#' - `Date of admission`\cr [Date] of hospital admission
 								#' - `Specimen date`\cr [Date] when specimen was received at laboratory
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `Specimen type`\cr Specimen type or group
 								#' - `Specimen type (Numeric)`\cr Translation of `"Specimen type"`
 								#' - `Reason`\cr Reason of request with Differential Diagnosis
 								#' - `Isolate number`\cr ID of isolate
 								#' - `Organism type`\cr Type of microorganism, can also be looked up using [mo_type()]
 								#' - `Serotype`\cr Serotype of microorganism
 								#' - `Beta-lactamase`\cr Microorganism produces beta-lactamase?
 								#' - `ESBL`\cr Microorganism produces extended spectrum beta-lactamase?
 								#' - `Carbapenemase`\cr Microorganism produces carbapenemase?
 								#' - `MRSA screening test`\cr Microorganism is possible MRSA?
 								#' - `Inducible clindamycin resistance`\cr Clindamycin can be induced?
 								#' - `Comment`\cr Other comments
-												(v1.6.0.9021) join functions update

											
										
										
											2021-05-12 18:15:03 +02:00
+								#' - `Date of data entry`\cr [Date] this data was entered in WHONET
-												(v1.4.0.9052) replaced all sapply's with type-safe vapply's

											
										
										
											2020-12-28 22:24:33 +01:00
+								#' - `AMP_ND10:CIP_EE`\cr `r sum(vapply(FUN.VALUE = logical(1), WHONET, is.rsi))` different antibiotics. You can lookup the abbreviations in the [antibiotics] data set, or use e.g. [`ab_name("AMP")`][ab_name()] to get the official name immediately. Before analysis, you should transform this to a valid antibiotic class, using [as.rsi()].
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Reference Data Publicly Available
 								#' @inheritSection AMR Read more on Our Website!
-												freq update

											
										
										
											2019-01-29 20:20:09 +01:00
+								"WHONET"
-												new EUCAST rules algorithm

											
										
										
											2019-04-05 18:47:39 +02:00
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' Data Set for R/SI Interpretation
-												new antibiotics

											
										
										
											2019-05-10 16:44:59 +02:00
+								#'
-												(v1.7.0.9001) CLSI 2020 guideline

											
										
										
											2021-06-01 15:33:06 +02:00
+								#' Data set containing reference data to interpret MIC and disk diffusion to R/SI values, according to international guidelines. Currently implemented guidelines are EUCAST (`r min(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "EUCAST")$guideline)))`-`r max(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "EUCAST")$guideline)))`) and CLSI (`r min(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "CLSI")$guideline)))`-`r max(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "CLSI")$guideline)))`). Use [as.rsi()] to transform MICs or disks measurements to R/SI values.
-												(v1.3.0.9022) mo_matching_score(), poorman update, as.rsi() fix

											
										
										
											2020-09-18 16:05:53 +02:00
+								#' @format A [data.frame] with `r format(nrow(rsi_translation), big.mark = ",")` observations and `r ncol(rsi_translation)` variables:
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `guideline`\cr Name of the guideline
-												(v1.5.0.9028) Updated taxonomy until March 2021

											
										
										
											2021-03-04 23:28:32 +01:00
+								#' - `method`\cr Either `r vector_or(rsi_translation$method)`
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `site`\cr Body site, e.g. "Oral" or "Respiratory"
 								#' - `mo`\cr Microbial ID, see [as.mo()]
-												(v1.7.1.9070) Better WHONET support

											
										
										
											2021-12-13 10:18:28 +01:00
+								#' - `rank_index`\cr Taxonomic rank index of `mo` from 1 (subspecies/infraspecies) to 5 (unknown microorganism)
-												(v0.8.0.9036) complete documentation rewrite

											
										
										
											2019-11-28 22:32:17 +01:00
+								#' - `ab`\cr Antibiotic ID, see [as.ab()]
 								#' - `ref_tbl`\cr Info about where the guideline rule can be found
 								#' - `disk_dose`\cr Dose of the used disk diffusion method
-												(v0.9.0.9026) update documentation

											
										
										
											2020-02-17 14:38:01 +01:00
+								#' - `breakpoint_S`\cr Lowest MIC value or highest number of millimetres that leads to "S"
 								#' - `breakpoint_R`\cr Highest MIC value or lowest number of millimetres that leads to "R"
-												(v1.6.0.9021) join functions update

											
										
										
											2021-05-12 18:15:03 +02:00
+								#' - `uti`\cr A [logical] value (`TRUE`/`FALSE`) to indicate whether the rule applies to a urinary tract infection (UTI)
-												(v1.7.1.9070) Better WHONET support

											
										
										
											2021-12-13 10:18:28 +01:00
+								#' @details The repository of this `AMR` package contains a file comprising this exact data set: <https://github.com/msberends/AMR/blob/main/data-raw/rsi_translation.txt>. This file **allows for machine reading EUCAST and CLSI guidelines**, which is almost impossible with the Excel and PDF files distributed by EUCAST and CLSI. The file is updated automatically and the `mo` and `ab` columns have been transformed to contain the full official names instead of codes.
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Reference Data Publicly Available
 								#' @inheritSection AMR Read more on Our Website!
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
+								#' @seealso [intrinsic_resistant]
-												new antibiotics

											
										
										
											2019-05-10 16:44:59 +02:00
+								"rsi_translation"
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' Data Set with Bacterial Intrinsic Resistance
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
+								#'
 								#' Data set containing defined intrinsic resistance by EUCAST of all bug-drug combinations.
-												(v1.3.0.9022) mo_matching_score(), poorman update, as.rsi() fix

											
										
										
											2020-09-18 16:05:53 +02:00
+								#' @format A [data.frame] with `r format(nrow(intrinsic_resistant), big.mark = ",")` observations and `r ncol(intrinsic_resistant)` variables:
-												(v1.7.1.9073) as.rsi() fix for UTIs

											
										
										
											2021-12-14 21:47:14 +01:00
+								#' - `mo`\cr Microorganism ID
 								#' - `ab`\cr Antibiotic ID
 								#' @details The repository of this `AMR` package contains a file comprising this data set with full taxonomic and antibiotic names: <https://github.com/msberends/AMR/blob/main/data-raw/intrinsic_resistant.txt>. This file **allows for machine reading EUCAST guidelines about intrinsic resistance**, which is almost impossible with the Excel and PDF files distributed by EUCAST. The file is updated automatically.
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
+								#'
-												(v1.7.1.9064) eucast 3.3 for mdro(), major change to repeated calling

											
										
										
											2021-12-11 13:41:31 +01:00
+								#' This data set is based on `r format_eucast_version_nr(3.3)`.
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Reference Data Publicly Available
 								#' @inheritSection AMR Read more on Our Website!
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
+								#' @examples
-												(v1.7.1.9064) eucast 3.3 for mdro(), major change to repeated calling

											
										
										
											2021-12-11 13:41:31 +01:00
+								#' subset(intrinsic_resistant,
 								#'        antibiotic == "Vancomycin" & microorganism %like% "Enterococcus")$microorganism
 								#' #> [1] "Enterococcus casseliflavus" "Enterococcus gallinarum"
 								#'
-												(v1.6.0.9063) prepare new release

											
										
										
											2021-05-24 09:00:11 +02:00
+								#' \donttest{
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
+								#' if (require("dplyr")) {
 								#'   intrinsic_resistant %>%
-												(v1.7.1.9064) eucast 3.3 for mdro(), major change to repeated calling

											
										
										
											2021-12-11 13:41:31 +01:00
+								#'     filter(antibiotic == "Vancomycin" & microorganism %like% "Enterococcus") %>%
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
+								#'     pull(microorganism)
-												(v1.7.1.9064) eucast 3.3 for mdro(), major change to repeated calling

											
										
										
											2021-12-11 13:41:31 +01:00
+								#'   #> [1] "Enterococcus casseliflavus" "Enterococcus gallinarum"
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
+								#' }
-												(v1.6.0.9063) prepare new release

											
										
										
											2021-05-24 09:00:11 +02:00
+								#' }
-												(v1.3.0.9002) intrinsic_resistant data set

											
										
										
											2020-08-14 13:36:10 +02:00
+								"intrinsic_resistant"
-												(v.1.5.0.9000) implementation of EUCAST rules v11 (2021)

											
										
										
											2021-01-12 22:08:04 +01:00
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' Data Set with Treatment Dosages as Defined by EUCAST
-												(v.1.5.0.9000) implementation of EUCAST rules v11 (2021)

											
										
										
											2021-01-12 22:08:04 +01:00
+								#'
 								#' EUCAST breakpoints used in this package are based on the dosages in this data set. They can be retrieved with [eucast_dosage()].
 								#' @format A [data.frame] with `r format(nrow(dosage), big.mark = ",")` observations and `r ncol(dosage)` variables:
 								#' - `ab`\cr Antibiotic ID as used in this package (such as `AMC`), using the official EARS-Net (European Antimicrobial Resistance Surveillance Network) codes where available
 								#' - `name`\cr Official name of the antimicrobial agent as used by WHONET/EARS-Net or the WHO
 								#' - `type`\cr Type of the dosage, either `r vector_or(dosage$type)`
 								#' - `dose`\cr Dose, such as "2 g" or "25 mg/kg"
-												(v1.5.0.9001) more informative argument errors

											
										
										
											2021-01-14 14:41:44 +01:00
+								#' - `dose_times`\cr Number of times a dose must be administered
-												(v.1.5.0.9000) implementation of EUCAST rules v11 (2021)

											
										
										
											2021-01-12 22:08:04 +01:00
+								#' - `administration`\cr Route of administration, either `r vector_or(dosage$administration)`
 								#' - `notes`\cr Additional dosage notes
 								#' - `original_txt`\cr Original text in the PDF file of EUCAST
 								#' - `eucast_version`\cr Version number of the EUCAST Clinical Breakpoints guideline to which these dosages apply
 								#' @details `r format_eucast_version_nr(11.0)` are based on the dosages in this data set.
-												(v1.5.0.9006) major documentation update

											
										
										
											2021-01-18 16:57:56 +01:00
+								#' @inheritSection AMR Reference Data Publicly Available
 								#' @inheritSection AMR Read more on Our Website!
-												(v.1.5.0.9000) implementation of EUCAST rules v11 (2021)

											
										
										
											2021-01-12 22:08:04 +01:00
+								"dosage"