AMR 1.8.2.9068
(this beta version will eventually become v2.0! We’re happy to reach a new major milestone soon!)
This is a new major release of the AMR package, with great new additions but also some breaking changes for current users. These are all listed below.
- Microbiological taxonomy (microorganismsdata set) updated to 2022 and now based on LPSN and GBIF
- Much increased algorithms to translate user input to valid taxonomy, e.g. by using recent scientific work about per-species human pathogenicity
- Clinical breakpoints added for EUCAST 2022 and CLSI 2022
- 20 new antibiotics added and updated all DDDs and ATC codes
- Extended support for antiviral agents (antiviralsdata set), with many new functions
- Now available in 16 languages
- Many new interesting functions, such as rsi_confidence_interval()andmean_amr_distance()
- Many small bug fixes
New
Interpretation of MIC and disk diffusion values
EUCAST 2022 and CLSI 2022 guidelines have been added for as.rsi(). EUCAST 2022 (v12.0) is now the new default guideline for all MIC and disks diffusion interpretations, and for eucast_rules() to apply EUCAST Expert Rules. The default guideline (EUCAST) can now be changed with the new AMR_guideline option, such as: options(AMR_guideline = "CLSI 2020").
Interpretation guidelines older than 10 years were removed, the oldest now included guidelines of EUCAST and CLSI are from 2013.
Supported languages
We added support for the following languages: Chinese, Greek, Japanese, Polish, Turkish and Ukrainian. All antibiotic names are now available in these languages, and the AMR package will automatically determine a supported language based on the user system language.
We are very grateful for the valuable input by our colleagues from other countries. The AMR package is now available in 16 languages and according to download stats used in almost all countries in the world!
Microbiological taxonomy
The microorganisms no longer relies on the Catalogue of Life, but on the List of Prokaryotic names with Standing in Nomenclature (LPSN) and is supplemented with the ‘backbone taxonomy’ from the Global Biodiversity Information Facility (GBIF). The structure of this data set has changed to include separate LPSN and GBIF identifiers. Almost all previous MO codes were retained. It contains over 1,400 taxonomic names from 2022.
We previously relied on our own experience to categorise species into pathogenic groups, but we were very happy to encounter the very recent work of Bartlett et al. (2022, DOI 10.1099/mic.0.001269) who extensively studied medical-scientific literature to categorise all bacterial species into groups. See mo_matching_score() on how their work was incorporated into the prevalence column of the microorganisms data set. Using their results, the as.mo() and all mo_*() functions are now much better capable of converting user input to valid taxonomic records.
We also made the following changes regarding the included taxonomy or microorganisms functions:
- Updated full microbiological taxonomy according to the latest daily LPSN data set (December 2022) and latest yearly GBIF taxonomy backbone (November 2022)
- Support for all 1,515 city-like serovars of Salmonella, such as Salmonella Goldcoast. Formally, these are serovars belonging to the S. enterica species, but they are reported with only the name of the genus and the city. For this reason, the serovars are in the subspeciescolumn of themicroorganismsdata set and “enterica” is in thespeciescolumn, but the full name does not contain the species name (enterica).
- All new algorithm for as.mo()(and thus allmo_*()functions) while still following our original set-up as described in our recently published JSS paper (DOI 10.18637/jss.v104.i03).- A new argument keep_synonymsallows to not correct for updated taxonomy, in favour of the now deleted argumentallow_uncertain
- It has increased tremendously in speed and returns generally more consequent results
- Sequential coercion is now extremely fast as results are stored to the package environment, although coercion of unknown values must be run once per session. Previous results can be reset/removed with the new mo_reset_session()function.
- Support for microorganism codes of the ASIan Antimicrobial Resistance Surveillance Network (ASIARS-Net)
- The MO matching score algorithm (mo_matching_score()) now counts deletions and substitutions as 2 instead of 1, which impacts the outcome ofas.mo()and anymo_*()function
 
- A new argument 
- 
Removed all species of the taxonomic kingdom Chromista from the package. This was done for multiple reasons:
- CRAN allows packages to be around 5 MB maximum, some packages are exempted but this package is not one of them
- Chromista are not relevant when it comes to antimicrobial resistance, thus lacking the primary scope of this package
- Chromista are almost never clinically relevant, thus lacking the secondary scope of this package
 
- The microorganisms.olddata set was removed, and all previously accepted names are now included in themicroorganismsdata set. A new columnstatuscontains"accepted"for currently accepted names and"synonym"for taxonomic synonyms; currently invalid names. All previously accepted names now have a microorganisms ID and - if available - an LPSN, GBIF and SNOMED CT identifier.
Antibiotic agents and selectors
The new function add_custom_antimicrobials() allows users to add custom antimicrobial codes and names to the AMR package.
The antibiotics data set was greatly updated:
- The following 20 antibiotics have been added (also includes the new J01RA ATC group): azithromycin/fluconazole/secnidazole (AFC), cefepime/amikacin (CFA), cefixime/ornidazole (CEO), ceftriaxone/beta-lactamase inhibitor (CEB), ciprofloxacin/metronidazole (CIM), ciprofloxacin/ornidazole (CIO), ciprofloxacin/tinidazole (CIT), furazidin (FUR), isoniazid/sulfamethoxazole/trimethoprim/pyridoxine (IST), lascufloxacin (LSC), levofloxacin/ornidazole (LEO), nemonoxacin (NEM), norfloxacin/metronidazole (NME), norfloxacin/tinidazole (NTI), ofloxacin/ornidazole (OOR), oteseconazole (OTE), rifampicin/ethambutol/isoniazid (REI), sarecycline (SRC), tetracycline/oleandomycin (TOL), and thioacetazone (TAT)
- Added some missing ATC codes
- Updated DDDs and PubChem Compound IDs
- Updated some antibiotic name spelling, now used by WHOCC (such as cephalexin -> cefalexin, and phenethicillin -> pheneticillin)
- Antibiotic code “CEI” for ceftolozane/tazobactam has been replaced with “CZT” to comply with EARS-Net and WHONET 2022. The old code will still work in all cases when using as.ab()or any of theab_*()functions.
- Support for antimicrobial interpretation of anaerobic bacteria, by adding a ‘placeholder’ code B_ANAERto themicroorganismsdata set and adding the breakpoints of anaerobics to thersi_interpretationdata set, which is used byas.rsi()for interpretion of MIC and disk diffusion values
Also, we added support for using antibiotic selectors in scoped dplyr verbs (with or without using vars()), such as in: ... %>% summarise_at(aminoglycosides(), resistance), please see resistance() for examples.
Antiviral agents
We now added extensive support for antiviral agents! For the first time, the AMR package has extensive support for antiviral drugs and to work with their names, codes and other data in any way.
- The antiviralsdata set has been extended with 18 new drugs (also from the new J05AJ ATC group) and now also contains antiviral identifiers and LOINC codes
- A new data type av(antivirals) has been added, which is functionally similar toabfor antibiotics
- Functions as.av(),av_name(),av_atc(),av_synonyms(),av_from_text()have all been added as siblings to theirab_*()equivalents
Other new functions
- Function rsi_confidence_interval()to add confidence intervals in AMR calculation. This is now also included inrsi_df()andproportion_df().
- Function mean_amr_distance()to calculate the mean AMR distance. The mean AMR distance is a normalised numeric value to compare AMR test results and can help to identify similar isolates, without comparing antibiograms by hand.
- Function rsi_interpretation_history()to view the history of previous runs ofas.rsi(). This returns a ‘logbook’ with the selected guideline, reference table and specific interpretation of each row in a data set on whichas.rsi()was run.
- Function mo_current()to get the currently valid taxonomic name of a microorganism
- Function add_custom_antimicrobials()to add custom antimicrobial codes and names to theAMRpackage
Changes
- Argument combine_IRhas been removed from this package (affecting functionscount_df(),proportion_df(), andrsi_df()and some plotting functions), since it was replaced withcombine_SIthree years ago
- Using unitsinab_ddd(..., units = "...")had been deprecated for some time and is now not supported anymore. Useab_ddd_units()instead.
- Support for data.frame-enhancing R packages, more specifically:data.table::data.table,janitor::tabyl,tibble::tibble, andtsibble::tsibble. AMR package functions that have a data set as output (such asrsi_df()andbug_drug_combinations()), will now return the same data type as the input.
- All data sets in this package are now a tibble, instead of base Rdata.frames. Older R versions are still supported, even if they do not supporttibbles.
- Our data sets are now also continually exported to Apache Feather and Apache Parquet formats. You can find more info in this article on our website.
- For as.rsi():- Fixed certain EUCAST breakpoints for MIC values
- Allow NAvalues (e.g.as.rsi(as.disk(NA), ...))
- Fix for bug-drug combinations with multiple breakpoints for different body sites
- Interpretation from MIC and disk zones is now more informative about availability of breakpoints and more robust
 
- Removed the as.integer()method for MIC values, since MIC are not integer values and runningtable()on MIC values consequently failed for not being able to retrieve the level position (as that’s how normallyas.integer()onfactors work)
- Fixed determination of Gram stains (mo_gramstain()), since the taxonomic phyla Actinobacteria, Chloroflexi, Firmicutes, and Tenericutes have been renamed to respectively Actinomycetota, Chloroflexota, Bacillota, and Mycoplasmatota in 2021
- 
droplevels()on MIC will now return a commonfactorat default and will lose themicclass. Usedroplevels(..., as.mic = TRUE)to keep themicclass.
- Small fix for using ab_from_text()
- Fixes for reading in text files using set_mo_source(), which now also allows the source file to contain valid taxonomic names instead of only valid microorganism ID of this package
- Fixed a bug for mdro()when using similar column names with the Magiorakos guideline
- Using any random_*()function (such asrandom_mic()) is now possible by directly calling the package without loading it first:AMR::random_mic(10)
- Extended support for the vctrspackage, used internally by the tidyverse. This allows to change values of classmic,disk,rsi,moandabin tibbles, and to use antibiotic selectors for selecting/filtering, e.g.df[carbapenems() == "R", ]
- Fix for using info = FALSEinmdro()
- For all interpretation guidelines using as.rsi()on amoxicillin, the rules for ampicillin will be used if amoxicillin rules are not available
- Fix for using ab_atc()on non-existing ATC codes
- Black and white message texts are now reversed in colour if using an RStudio dark theme
- 
mo_snomed()now returns classcharacter, notnumericanymore (to make long SNOMED codes readable)
- Fix for using as.ab()onNAvalues
- Updated support for all WHONET 2022 microorganism codes
- Antimicrobial interpretation ‘SDD’ (susceptible dose-dependent, coined by CLSI) will be interpreted as ‘I’ to comply with EUCAST’s ‘I’ in as.rsi()
- Fix for mo_shortname()in case of higher taxonomic ranks (order, class, phylum)
- Cleaning columns with as.rsi(),as.mic(), oras.disk()will now show the column name in the warning for invalid results
Other
- New website to make use of the new Bootstrap 5 and pkgdown 2.0. The website now contains results for all examples and will be automatically regenerated with every change to our repository, using GitHub Actions
- Added Peter Dutey-Magni, Dmytro Mykhailenko, Anton Mymrikov, and Jonas Salm as contributors, to thank them for their valuable input
- All R and Rmd files in this project are now styled using the stylerpackage
- Set scalar conditional expressions (&&and||) where possible to comply with the upcoming R 4.3
- An enormous lot of code cleaning, fixing some small bugs on the way