---
title: "Data sets for download / own use"
date: '`r format(Sys.Date(), "%d %B %Y")`'
output: 
  rmarkdown::html_vignette:
    toc: true
    toc_depth: 1
vignette: >
  %\VignetteIndexEntry{Data sets for download / own use}
  %\VignetteEncoding{UTF-8}
  %\VignetteEngine{knitr::rmarkdown}
editor_options: 
  chunk_output_type: console
---

```{r setup, include = FALSE, results = 'markup'}
knitr::opts_chunk$set(
  warning = FALSE,
  collapse = TRUE,
  comment = "#",
  fig.width = 7.5,
  fig.height = 5
)

library(AMR)
library(dplyr)

options(knitr.kable.NA = '')

structure_txt <- function(dataset) {
  paste0("A data set with ",
         format(nrow(dataset), big.mark = ","), " rows and ", 
         ncol(dataset), " columns, containing the following column names:  \n",
         AMR:::vector_or(colnames(dataset), quotes = "*", last_sep = " and ", sort = FALSE), ".")
}

download_txt <- function(filename) {
  msg <- paste0("It was last updated on ", 
                trimws(format(file.mtime(paste0("../data/", filename, ".rda")), "%e %B %Y %H:%M:%S %Z", tz = "UTC")), 
                ". Find more info about the structure of this data set [here](https://msberends.github.io/AMR/reference/", ifelse(filename == "antivirals", "antibiotics", filename), ".html).\n")
  github_base <- "https://github.com/msberends/AMR/raw/master/data-raw/"
  filename <- paste0("../data-raw/", filename)
  txt <- paste0(filename, ".txt")
  rds <- paste0(filename, ".rds")
  spss <- paste0(filename, ".sav")
  stata <- paste0(filename, ".dta")
  sas <- paste0(filename, ".sas")
  excel <- paste0(filename, ".xlsx")
  create_txt <- function(filename, type, software) {
    # link to the bare file name, so the URL does not contain '../data-raw/' twice
    paste0("* Download as [", software, " file](", github_base, basename(filename), ") (", AMR:::formatted_filesize(filename), ")  \n")
  }

  if (any(file.exists(rds),
          file.exists(excel),
          file.exists(txt),
          file.exists(sas),
          file.exists(spss),
          file.exists(stata))) {
    msg <- c(msg, "\n**Direct download links:**\n\n")
  }
  if (file.exists(rds)) msg <- c(msg, create_txt(rds, "rds", "R"))
  if (file.exists(excel)) msg <- c(msg, create_txt(excel, "xlsx", "Excel"))
  if (file.exists(txt)) msg <- c(msg, create_txt(txt, "txt", "plain text"))
  if (file.exists(sas)) msg <- c(msg, create_txt(sas, "sas", "SAS"))
  if (file.exists(spss)) msg <- c(msg, create_txt(spss, "sav", "SPSS"))
  if (file.exists(stata)) msg <- c(msg, create_txt(stata, "dta", "Stata"))
  paste0(msg, collapse = "")
}

print_df <- function(x, rows = 6) {
  x %>% 
    head(n = rows) %>% 
    mutate_all(function(x) {
      if (is.list(x)) {
        sapply(x, function(y) {
          if (length(y) > 3) {
            paste0(paste(y[1:3], collapse = ", "), ", ...")
          } else if (length(y) == 0 || all(is.na(y))) {
            ""
          } else {
            paste(y, collapse = ", ")
          }
        })
      } else {
        x
      }
    }) %>%
    knitr::kable(align = "c")
}

```

All reference data (about microorganisms, antibiotics, R/SI interpretation, EUCAST rules, etc.) in this `AMR` package are reliable, up-to-date and freely available. We continually export our data sets to formats for use in R, SPSS, SAS, Stata and Excel. We also supply tab-separated files that are machine-readable and suitable for input in any software program, such as laboratory information systems.

On this page, we explain how to download them and what the structure of the data sets looks like.
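
As a minimal sketch of how one of the tab-separated exports could be read back into R: the helper name `read_amr_txt()` and the two-row demo file are hypothetical and only serve as illustration. The commented-out URL follows the pattern of the direct download links on this page (the `data-raw` folder on GitHub, plus the data set name and `.txt`).

```r
# Hypothetical helper (not part of the AMR package): read a tab-separated
# export into a data frame, keeping text columns as character vectors.
read_amr_txt <- function(path) {
  utils::read.delim(path, sep = "\t", stringsAsFactors = FALSE)
}

# Toy file that mimics the layout: a header row and tab-separated fields
demo_file <- tempfile(fileext = ".txt")
writeLines(c("mo\tfullname",
             "B_ESCHR_COLI\tEscherichia coli",
             "B_STPHY_AURS\tStaphylococcus aureus"),
           demo_file)
demo <- read_amr_txt(demo_file)

# For a real export, pass the download link instead (needs internet access):
# microorganisms_txt <- read_amr_txt(
#   "https://github.com/msberends/AMR/raw/master/data-raw/microorganisms.txt")
```

The same `read.delim()` call should work for any of the plain text files listed below, since they all share the tab-separated layout.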

<p class="dataset-within-r">If you are reading this page from within R, please <a href="https://msberends.github.io/AMR/articles/datasets.html">visit our website</a>, which is automatically updated with every code change.</p>

## Microorganisms (currently accepted names)

`r structure_txt(microorganisms)`

This data set is available in R as `microorganisms`, after you load the `AMR` package.

`r download_txt("microorganisms")`

**NOTE: The exported files for SAS, SPSS and Stata do not contain SNOMED codes, as their file size would otherwise exceed 100 MB, the file size limit of GitHub.** Advice? Use R instead.
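
The R format keeps columns that hold several values per row. The sketch below assumes that SNOMED codes are stored as such a list column, which the list handling in this page's table helper suggests but which is not confirmed here. The toy data frame and its codes are placeholders, not real SNOMED codes.

```r
# Toy stand-in for a data set with a list column; the codes are placeholders
toy <- data.frame(mo = c("B_ESCHR_COLI", "B_STPHY_AURS"),
                  stringsAsFactors = FALSE)
toy$snomed <- list(c("000001", "000002", "000003"), character(0))

# An .rds file stores the R object as-is, including the list column
rds_file <- tempfile(fileext = ".rds")
saveRDS(toy, rds_file)
restored <- readRDS(rds_file)

is.list(restored$snomed)  # the list column survives the round trip
lengths(restored$snomed)  # number of codes per row

# After downloading the R file from the links above, the same call applies:
# microorganisms_rds <- readRDS("microorganisms.rds")
```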

### Source

Our full taxonomy of microorganisms is based on these authoritative and comprehensive sources:

* [Catalogue of Life](http://www.catalogueoflife.org) (included version: `r AMR:::CATALOGUE_OF_LIFE$year`)
* [List of Prokaryotic names with Standing in Nomenclature](https://lpsn.dsmz.de) (LPSN, last updated: `r AMR:::CATALOGUE_OF_LIFE$yearmonth_LPSN`)
* `r AMR:::SNOMED_VERSION$current_source`, retrieved from the [`r AMR:::SNOMED_VERSION$title`](`r AMR:::SNOMED_VERSION$url`), OID `r AMR:::SNOMED_VERSION$current_oid`, version `r AMR:::SNOMED_VERSION$current_version`

### Example content

Included (sub)species per taxonomic kingdom:

```{r, echo = FALSE}
microorganisms %>% 
  pull(kingdom) %>% 
  table() %>% 
  as.data.frame() %>% 
  mutate(Freq = format(Freq, big.mark = ",")) %>% 
  setNames(c("Kingdom", "Number of (sub)species")) %>% 
  print_df()
```

Example rows when filtering on genus *Escherichia*:

```{r, echo = FALSE}
microorganisms %>%
  filter(genus == "Escherichia") %>% 
  print_df()
```

## Microorganisms (previously accepted names)

`r structure_txt(microorganisms.old)`

**Note:** remember that the *'ref'* column contains the scientific reference to the old taxonomic entries, i.e. of column *'fullname'*. For the scientific reference of the new names, i.e. of column *'fullname_new'*, see the `microorganisms` data set.
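
To illustrate how the *'fullname'* and *'fullname_new'* columns relate, here is a minimal sketch in base R. The two-row data frame is a toy stand-in for the real data set, and the helper `current_name()` is hypothetical, not a function of the `AMR` package.

```r
# Toy stand-in with the two name columns; the real data set has more
# columns and many more rows.
old_names <- data.frame(
  fullname     = c("Escherichia blattae", "Escherichia adecarboxylata"),
  fullname_new = c("Shimwellia blattae", "Leclercia adecarboxylata"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: return the currently accepted name for each input,
# leaving names that are not listed as old names unchanged.
current_name <- function(x, lookup = old_names) {
  idx <- match(x, lookup$fullname)
  ifelse(is.na(idx), x, lookup$fullname_new[idx])
}

current_name(c("Escherichia blattae", "Escherichia coli"))
```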

This data set is available in R as `microorganisms.old`, after you load the `AMR` package.

`r download_txt("microorganisms.old")`

### Source

This data set contains old, previously accepted taxonomic names. The data sources are the same as for the `microorganisms` data set:

* [Catalogue of Life](http://www.catalogueoflife.org) (included version: `r AMR:::CATALOGUE_OF_LIFE$year`)
* [List of Prokaryotic names with Standing in Nomenclature](https://lpsn.dsmz.de) (LPSN, last updated: `r AMR:::CATALOGUE_OF_LIFE$yearmonth_LPSN`)

### Example content

Example rows when filtering on *Escherichia*:

```{r, echo = FALSE}
microorganisms.old %>%
  filter(fullname %like% "^Escherichia") %>% 
  print_df()
```


## Antibiotic agents

`r structure_txt(antibiotics)`

This data set is available in R as `antibiotics`, after you load the `AMR` package.

`r download_txt("antibiotics")`

### Source

This data set contains all EARS-Net and ATC codes gathered from WHO and WHONET, and all compound IDs from PubChem. It also contains all brand names (synonyms) as found on PubChem and Defined Daily Doses (DDDs) for oral and parenteral administration.

* [ATC/DDD index from WHO Collaborating Centre for Drug Statistics Methodology](https://www.whocc.no/atc_ddd_index/) (note: this may not be used for commercial purposes, but is freely available from the WHO CC website for personal use)
* [PubChem by the US National Library of Medicine](https://pubchem.ncbi.nlm.nih.gov)
* [WHONET software 2019](https://whonet.org)

### Example content

```{r, echo = FALSE}
antibiotics %>%
  filter(ab %in% colnames(example_isolates)) %>% 
  print_df()
```


## Antiviral agents

`r structure_txt(antivirals)`

This data set is available in R as `antivirals`, after you load the `AMR` package.

`r download_txt("antivirals")`

### Source

This data set contains all ATC codes gathered from WHO and all compound IDs from PubChem. It also contains all brand names (synonyms) as found on PubChem and Defined Daily Doses (DDDs) for oral and parenteral administration.

* [ATC/DDD index from WHO Collaborating Centre for Drug Statistics Methodology](https://www.whocc.no/atc_ddd_index/) (note: this may not be used for commercial purposes, but is freely available from the WHO CC website for personal use)
* [PubChem by the US National Library of Medicine](https://pubchem.ncbi.nlm.nih.gov)

### Example content

```{r, echo = FALSE}
antivirals %>%
  print_df()
```


## Intrinsic bacterial resistance

`r structure_txt(intrinsic_resistant)`

This data set is available in R as `intrinsic_resistant`, after you load the `AMR` package.

`r download_txt("intrinsic_resistant")`

### Source

This data set contains all intrinsic resistance defined by EUCAST for all bug-drug combinations, and is based on `r AMR:::format_eucast_version_nr("3.2")`.
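
A bug-drug table like this can be used as a simple lookup. The sketch below is only an illustration: it assumes one row per combination with the columns *'microorganism'* and *'antibiotic'* (the second column name is an assumption), uses a two-row toy table instead of the real data set, and the helper `is_listed()` is hypothetical.

```r
# Toy stand-in; assumption: one row per bug-drug combination
toy_intrinsic <- data.frame(
  microorganism = c("Enterobacter cloacae", "Enterobacter cloacae"),
  antibiotic    = c("Ampicillin", "Cefoxitin"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: is this combination listed as intrinsically resistant?
is_listed <- function(mo, ab, tbl = toy_intrinsic) {
  any(tbl$microorganism == mo & tbl$antibiotic == ab)
}

is_listed("Enterobacter cloacae", "Ampicillin")
is_listed("Escherichia coli", "Ampicillin")
```

With the toy table, the first call finds a matching row and the second does not. Against the real data set, the result depends on the included EUCAST version.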

### Example content

Example rows when filtering on *Enterobacter cloacae*:

```{r, echo = FALSE}
intrinsic_resistant %>%
  filter(microorganism == "Enterobacter cloacae") %>% 
  print_df(rows = Inf)
```


## Interpretation from MIC values / disk diameters to R/SI

`r structure_txt(rsi_translation)`

This data set is available in R as `rsi_translation`, after you load the `AMR` package.

`r download_txt("rsi_translation")`

### Source

This data set contains interpretation rules for MIC values and disk diffusion diameters. Included guidelines are CLSI (`r min(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "CLSI")$guideline)))`-`r max(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "CLSI")$guideline)))`) and EUCAST (`r min(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "EUCAST")$guideline)))`-`r max(as.integer(gsub("[^0-9]", "", subset(rsi_translation, guideline %like% "EUCAST")$guideline)))`).
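
The year ranges in the sentence above are derived by stripping everything except digits from the guideline labels. A minimal base R sketch of that idea follows, assuming each label is a guideline name followed by a four-digit year. The labels are toy values, `year_range()` is a hypothetical helper, and `grepl()` stands in for the `%like%` operator of the `AMR` package.

```r
# Toy guideline labels (assumed format: name followed by a year)
guideline <- c("EUCAST 2019", "CLSI 2018", "EUCAST 2021", "CLSI 2020")

# Hypothetical helper: earliest and latest year for one guideline family
year_range <- function(labels, family) {
  selected <- labels[grepl(family, labels, fixed = TRUE)]
  years <- as.integer(gsub("[^0-9]", "", selected))  # strip all non-digits
  range(years)
}

year_range(guideline, "CLSI")    # 2018 2020
year_range(guideline, "EUCAST")  # 2019 2021
```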

### Example content

```{r, echo = FALSE}
rsi_translation %>% 
  mutate(ab = ab_name(ab), mo = mo_name(mo)) %>% 
  print_df()
```


## Dosage guidelines from EUCAST

`r structure_txt(dosage)`

This data set is available in R as `dosage`, after you load the `AMR` package.

`r download_txt("dosage")`

### Source

EUCAST breakpoints used in this package are based on the dosages in this data set.

Currently included dosages in the data set are meant for: `r AMR:::format_eucast_version_nr(unique(dosage$eucast_version))`.

### Example content

```{r, echo = FALSE}
dosage %>% 
  print_df()
```