AMR/NEWS.md

# AMR 0.6.1

#### Changed
* Fixed a critical bug when using `eucast_rules()` with `verbose = TRUE`
* Coercion of microbial IDs are now written to the package namespace instead of the user's home folder, to comply with the CRAN policy

# AMR 0.6.0

**New website!**

We've got a new website: [https://msberends.gitlab.io/AMR](https://msberends.gitlab.io/AMR/) (built with the great [`pkgdown`](https://pkgdown.r-lib.org/))

* Contains the complete manual of this package and all of its functions with an explanation of their parameters
* Contains a comprehensive tutorial about how to conduct antimicrobial resistance analysis, import data from WHONET or SPSS and many more.

#### New
* **BREAKING**: removed deprecated functions, parameters and references to 'bactid'. Use `as.mo()` to identify an MO code.
* Catalogue of Life as a new taxonomic source for data about microorganisms, which also contains all ITIS data we used previously. The `microorganisms` data set now contains:
  * All ~55,000 (sub)species from the kingdoms of Archaea, Bacteria and Protozoa
  * All ~3,000 (sub)species from these orders of the kingdom of Fungi: Eurotiales, Onygenales, Pneumocystales, Saccharomycetales and Schizosaccharomycetales (covering at least like all species of *Aspergillus*, *Candida*, *Pneumocystis*, *Saccharomyces* and *Trichophyton*)
  * All ~2,000 (sub)species from ~100 other relevant genera, from the kingdoms of Animalia and Plantae (like *Strongyloides* and *Taenia*)
  * All ~15,000 previously accepted names of included (sub)species that have been taxonomically renamed
  * The responsible author(s) and year of scientific publication
  
    This data is updated annually - check the included version with the new function `catalogue_of_life_version()`.
  * Due to this change, some `mo` codes changed (e.g. *Streptococcus* changed from `B_STRPTC` to `B_STRPT`). A translation table is  used internally to support older microorganism IDs, so users will not notice this difference.
  * New function `mo_rank()` for the taxonomic rank (genus, species, infraspecies, etc.)
  * New function `mo_url()` to get the direct URL of a species from the Catalogue of Life
* Support for data from [WHONET](https://whonet.org/) and [EARS-Net](https://ecdc.europa.eu/en/about-us/partnerships-and-networks/disease-and-laboratory-networks/ears-net) (European Antimicrobial Resistance Surveillance Network):
  * Exported files from WHONET can be read and used in this package. For functions like `first_isolate()` and `eucast_rules()`, all parameters will be filled in automatically.
  * This package now knows all antibiotic abbrevations by EARS-Net (which are also being used by WHONET) - the `antibiotics` data set now contains a column `ears_net`.
  * The function `as.mo()` now knows all WHONET species abbreviations too, because almost 2,000 microbial abbreviations were added to the `microorganisms.codes` data set.
* New filters for antimicrobial classes. Use these functions to filter isolates on results in one of more antibiotics from a specific class:
  ```r
  filter_aminoglycosides()
  filter_carbapenems()
  filter_cephalosporins()
  filter_1st_cephalosporins()
  filter_2nd_cephalosporins()
  filter_3rd_cephalosporins()
  filter_4th_cephalosporins()
  filter_fluoroquinolones()
  filter_glycopeptides()
  filter_macrolides()
  filter_tetracyclines()
  ```
  The `antibiotics` data set will be searched, after which the input data will be checked for column names with a value in any abbreviations, codes or official names found in the `antibiotics` data set.
  For example:
  ```r
  septic_patients %>% filter_glycopeptides(result = "R")
  # Filtering on glycopeptide antibacterials: any of `vanc` or `teic` is R
  septic_patients %>% filter_glycopeptides(result = "R", scope = "all")
  # Filtering on glycopeptide antibacterials: all of `vanc` and `teic` is R
  ```
* All `ab_*` functions are deprecated and replaced by `atc_*` functions:
  ```r
  ab_property -> atc_property()
  ab_name -> atc_name()
  ab_official -> atc_official()
  ab_trivial_nl -> atc_trivial_nl()
  ab_certe -> atc_certe()
  ab_umcg -> atc_umcg()
  ab_tradenames -> atc_tradenames()
  ```
  These functions use `as.atc()` internally. The old `atc_property` has been renamed `atc_online_property()`. This is done for two reasons: firstly, not all ATC codes are of antibiotics (ab) but can also be of antivirals or antifungals. Secondly, the input must have class `atc` or must be coerable to this class. Properties of these classes should start with the same class name, analogous to `as.mo()` and e.g. `mo_genus`.
* New functions `set_mo_source()` and `get_mo_source()` to use your own predefined MO codes as input for `as.mo()` and consequently all `mo_*` functions
* Support for the upcoming [`dplyr`](https://dplyr.tidyverse.org) version 0.8.0
* New function `guess_ab_col()` to find an antibiotic column in a table
* New function `mo_failures()` to review values that could not be coerced to a valid MO code, using `as.mo()`. This latter function will now only show a maximum of 10 uncoerced values and will refer to `mo_failures()`.
* New function `mo_uncertainties()` to review values that could be coerced to a valid MO code using `as.mo()`, but with uncertainty.
* New function `mo_renamed()` to get a list of all returned values from `as.mo()` that have had taxonomic renaming
* New function `age()` to calculate the (patients) age in years
* New function `age_groups()` to split ages into custom or predefined groups (like children or elderly). This allows for easier demographic antimicrobial resistance analysis per age group.
* New function `ggplot_rsi_predict()` as well as the base R `plot()` function can now be used for resistance prediction calculated with `resistance_predict()`:
  ```r
  x <- resistance_predict(septic_patients, col_ab = "amox")
  plot(x)
  ggplot_rsi_predict(x)
  ```
* Functions `filter_first_isolate()` and `filter_first_weighted_isolate()` to shorten and fasten filtering on data sets with antimicrobial results, e.g.:
  ```r
  septic_patients %>% filter_first_isolate(...)
  # or
  filter_first_isolate(septic_patients, ...)
  ```
  is equal to:
  ```r
  septic_patients %>%
    mutate(only_firsts = first_isolate(septic_patients, ...)) %>%
    filter(only_firsts == TRUE) %>%
    select(-only_firsts)
  ```
* New function `availability()` to check the number of available (non-empty) results in a `data.frame`
* New vignettes about how to conduct AMR analysis, predict antimicrobial resistance, use the *G*-test and more. These are also available (and even easier readable) on our website: https://msberends.gitlab.io/AMR.

#### Changed
* Function `eucast_rules()`:
  * Updated EUCAST Clinical breakpoints to [version 9.0 of 1 January 2019](http://www.eucast.org/clinical_breakpoints/), the data set `septic_patients` now reflects these changes
  * Fixed a critical bug where some rules that depend on previous applied rules would not be applied adequately
  * Emphasised in manual that penicillin is meant as benzylpenicillin (ATC [J01CE01](https://www.whocc.no/atc_ddd_index/?code=J01CE01))
  * New info is returned when running this function, stating exactly what has been changed or added. Use `eucast_rules(..., verbose = TRUE)` to get a data set with all changed per bug and drug combination.
* Removed data sets `microorganisms.oldDT`, `microorganisms.prevDT`, `microorganisms.unprevDT` and `microorganismsDT` since they were no longer needed and only contained info already available in the `microorganisms` data set
* Added 65 antibiotics to the `antibiotics` data set, from the [Pharmaceuticals Community Register](http://ec.europa.eu/health/documents/community-register/html/atc.htm) of the European Commission
* Removed columns `atc_group1_nl` and `atc_group2_nl` from the `antibiotics` data set
* Functions `atc_ddd()` and `atc_groups()` have been renamed `atc_online_ddd()` and `atc_online_groups()`. The old functions are deprecated and will be removed in a future version.
* Function `guess_mo()` is now deprecated in favour of `as.mo()` and will be removed in future versions
* Function `guess_atc()` is now deprecated in favour of `as.atc()` and will be removed in future versions
* Improvements for `as.mo()`:
  * Now handles incorrect spelling, like `i` instead of `y` and `f` instead of `ph`:
    ```r
    # mo_fullname() uses as.mo() internally
    
    mo_fullname("Sthafilokockus aaureuz")
    #> [1] "Staphylococcus aureus"
    
    mo_fullname("S. klossi")
    #> [1] "Staphylococcus kloosii"
    ```
  * Uncertainty of the algorithm is now divided into four levels, 0 to 3, where the default `allow_uncertain = TRUE` is equal to uncertainty level 2. Run `?as.mo` for more info about these levels.
    ```r
    # equal:
    as.mo(..., allow_uncertain = TRUE)
    as.mo(..., allow_uncertain = 2)
    
    # also equal:
    as.mo(..., allow_uncertain = FALSE)
    as.mo(..., allow_uncertain = 0)
    ```
    Using `as.mo(..., allow_uncertain = 3)` could lead to very unreliable results.
  * Implemented the latest publication of Becker *et al.* (2019), for categorising coagulase-negative *Staphylococci*
  * All microbial IDs that found are now saved to a local file `~/.Rhistory_mo`. Use the new function `clean_mo_history()` to delete this file, which resets the algorithms.
  * Incoercible results will now be considered 'unknown', MO code `UNKNOWN`. On foreign systems, properties of these will be translated to all languages already previously supported: German, Dutch, French, Italian, Spanish and Portuguese:
    ```r
    mo_genus("qwerty", language = "es")
    # Warning: 
    # one unique value (^= 100.0%) could not be coerced and is considered 'unknown': "qwerty". Use mo_failures() to review it.
    #> [1] "(género desconocido)"
    ```
  * Fix for vector containing only empty values
  * Finds better results when input is in other languages
  * Better handling for subspecies
  * Better handling for *Salmonellae*, especially the 'city like' serovars like *Salmonella London*
  * Understanding of highly virulent *E. coli* strains like EIEC, EPEC and STEC
  * There will be looked for uncertain results at default - these results will be returned with an informative warning
  * Manual (help page) now contains more info about the algorithms
  * Progress bar will be shown when it takes more than 3 seconds to get results
  * Support for formatted console text
  * Console will return the percentage of uncoercable input
* Function `first_isolate()`:
  * Fixed a bug where distances between dates would not be calculated right - in the `septic_patients` data set this yielded a difference of 0.15% more isolates
  * Will now use a column named like "patid" for the patient ID (parameter `col_patientid`), when this parameter was left blank
  * Will now use a column named like "key(...)ab" or "key(...)antibiotics" for the key antibiotics (parameter `col_keyantibiotics()`), when this parameter was left blank
  * Removed parameter `output_logical`, the function will now always return a logical value
  * Renamed parameter `filter_specimen` to `specimen_group`, although using `filter_specimen` will still work
* A note to the manual pages of the `portion` functions, that low counts can influence the outcome and that the `portion` functions may camouflage this, since they only return the portion (albeit being dependent on the `minimum` parameter)
* Merged data sets `microorganisms.certe` and `microorganisms.umcg` into `microorganisms.codes`
* Function `mo_taxonomy()` now contains the kingdom too
* Reduce false positives for `is.rsi.eligible()` using the new `threshold` parameter
* New colours for `scale_rsi_colours()`
* Summaries of class `mo` will now return the top 3 and the unique count, e.g. using `summary(mo)`
* Small text updates to summaries of class `rsi` and `mic`
* Function `as.rsi()`:
  * Now gives a warning when inputting MIC values
  * Now accepts high and low resistance: `"HIGH S"` will return `S`
* Frequency tables (`freq()` function):
  * Support for tidyverse quasiquotation! Now you can create frequency tables of function outcomes:
    ```r
    # Determine genus of microorganisms (mo) in `septic_patients` data set:
    # OLD WAY
    septic_patients %>%
      mutate(genus = mo_genus(mo)) %>%
      freq(genus)
    # NEW WAY
    septic_patients %>% 
      freq(mo_genus(mo))
    
    # Even supports grouping variables:
    septic_patients %>%
      group_by(gender) %>% 
      freq(mo_genus(mo))
    ```
  * Header info is now available as a list, with the `header` function
  * The parameter `header` is now set to `TRUE` at default, even for markdown
  * Added header info for class `mo` to show unique count of families, genera and species
  * Now honours the `decimal.mark` setting, which just like `format` defaults to `getOption("OutDec")`
  * The new `big.mark` parameter will at default be `","` when `decimal.mark = "."` and `"."` otherwise
  * Fix for header text where all observations are `NA`
  * New parameter `droplevels` to exclude empty factor levels when input is a factor
  * Factor levels will be in header when present in input data (maximum of 5)
  * Fix for using `select()` on frequency tables
* Function `scale_y_percent()` now contains the `limits` parameter
* Automatic parameter filling for `mdro()`, `key_antibiotics()` and `eucast_rules()`
* Updated examples for resistance prediction (`resistance_predict()` function)
* Fix for `as.mic()` to support more values ending in (several) zeroes
* if using different lengths of pattern and x in `%like%`, it will now return the call

#### Other
* Updated licence text to emphasise GPL 2.0 and that this is an R package.

# AMR 0.5.0

#### New
* Repository moved to GitLab: https://gitlab.com/msberends/AMR
* Function `count_all` to get all available isolates (that like all `portion_*` and `count_*` functions also supports `summarise` and `group_by`), the old `n_rsi` is now an alias of `count_all`
* Function `get_locale` to determine language for language-dependent output for some `mo_*` functions. This is now the default value for their `language` parameter, by which the system language will be used at default.
* Data sets `microorganismsDT`, `microorganisms.prevDT`, `microorganisms.unprevDT` and `microorganisms.oldDT` to improve the speed of `as.mo`. They are for reference only, since they are primarily for internal use of `as.mo`.
* Function `read.4D` to read from the 4D database of the MMB department of the UMCG
* Functions `mo_authors` and `mo_year` to get specific values about the scientific reference of a taxonomic entry

#### Changed
* Functions `MDRO`, `BRMO`, `MRGN` and `EUCAST_exceptional_phenotypes` were renamed to `mdro`, `brmo`, `mrgn` and `eucast_exceptional_phenotypes`
* `EUCAST_rules` was renamed to `eucast_rules`, the old function still exists as a deprecated function
* Big changes to the `eucast_rules` function:
  * Now also applies rules from the EUCAST 'Breakpoint tables for bacteria', version 8.1, 2018, http://www.eucast.org/clinical_breakpoints/ (see Source of the function)
  * New parameter `rules` to specify which rules should be applied (expert rules, breakpoints, others or all)
  * New parameter `verbose` which can be set to `TRUE` to get very specific messages about which columns and rows were affected
  * Better error handling when rules cannot be applied (i.e. new values could not be inserted)
  * The number of affected values will now only be measured once per row/column combination
  * Data set `septic_patients` now reflects these changes
  * Added parameter `pipe` for piperacillin (J01CA12), also to the `mdro` function
  * Small fixes to EUCAST clinical breakpoint rules
* Added column `kingdom` to the microorganisms data set, and function `mo_kingdom` to look up values
* Tremendous speed improvement for `as.mo` (and subsequently all `mo_*` functions), as empty values wil be ignored *a priori*
* Fewer than 3 characters as input for `as.mo` will return NA
* Function `as.mo` (and all `mo_*` wrappers) now supports genus abbreviations with "species" attached
  ```r
  as.mo("E. species")        # B_ESCHR
  mo_fullname("E. spp.")     # "Escherichia species"
  as.mo("S. spp")            # B_STPHY
  mo_fullname("S. species")  # "Staphylococcus species"
  ```
* Added parameter `combine_IR` (TRUE/FALSE) to functions `portion_df` and `count_df`, to indicate that all values of I and R must be merged into one, so the output only consists of S vs. IR (susceptible vs. non-susceptible)
* Fix for `portion_*(..., as_percent = TRUE)` when minimal number of isolates would not be met
* Added parameter `also_single_tested` for `portion_*` and `count_*` functions to also include cases where not all antibiotics were tested but at least one of the tested antibiotics includes the target antimicribial interpretation, see `?portion`
* Using `portion_*` functions now throws a warning when total available isolate is below parameter `minimum`
* Functions `as.mo`, `as.rsi`, `as.mic`, `as.atc` and `freq` will not set package name as attribute anymore
* Frequency tables - `freq()`:
  * Support for grouping variables, test with:
    ```r
    septic_patients %>% 
      group_by(hospital_id) %>% 
      freq(gender)
    ```
  * Support for (un)selecting columns:
    ```r
    septic_patients %>% 
      freq(hospital_id) %>% 
      select(-count, -cum_count) # only get item, percent, cum_percent
    ```
  * Check for `hms::is.hms`
  * Now prints in markdown at default in non-interactive sessions
  * No longer adds the factor level column and sorts factors on count again
  * Support for class `difftime`
  * New parameter `na`, to choose which character to print for empty values
  * New parameter `header` to turn the header info off (default when `markdown = TRUE`)
  * New parameter `title` to manually setbthe title of the frequency table
* `first_isolate` now tries to find columns to use as input when parameters are left blank
* Improvements for MDRO algorithm (function `mdro`)
* Data set `septic_patients` is now a `data.frame`, not a tibble anymore
* Removed diacritics from all authors (columns `microorganisms$ref` and `microorganisms.old$ref`) to comply with CRAN policy to only allow ASCII characters
* Fix for `mo_property` not working properly
* Fix for `eucast_rules` where some Streptococci would become ceftazidime R in EUCAST rule 4.5
* Support for named vectors of class `mo`, useful for `top_freq()`
* `ggplot_rsi` and `scale_y_percent` have `breaks` parameter
* AI improvements for `as.mo`:
  * `"CRS"` -> *Stenotrophomonas maltophilia*
  * `"CRSM"` -> *Stenotrophomonas maltophilia*
  * `"MSSA"` -> *Staphylococcus aureus*
  * `"MSSE"` -> *Staphylococcus epidermidis*
* Fix for `join` functions
* Speed improvement for `is.rsi.eligible`, now 15-20 times faster
* In `g.test`, when `sum(x)` is below 1000 or any of the expected values is below 5, Fisher's Exact Test will be suggested
* `ab_name` will try to fall back on `as.atc` when no results are found
* Removed the addin to view data sets
* Percentages will now will rounded more logically (e.g. in `freq` function)

#### Other
* New dependency on package `crayon`, to support formatted text in the console
* Dependency `tidyr` is now mandatory (went to `Import` field) since `portion_df` and `count_df` rely on it
* Updated vignettes to comply with README


# AMR 0.4.0

#### New
* The data set `microorganisms` now contains **all microbial taxonomic data from ITIS** (kingdoms Bacteria, Fungi and Protozoa), the Integrated Taxonomy Information System, available via https://itis.gov. The data set now contains more than 18,000 microorganisms with all known bacteria, fungi and protozoa according ITIS with genus, species, subspecies, family, order, class, phylum and subkingdom. The new data set `microorganisms.old` contains all previously known taxonomic names from those kingdoms.
* New functions based on the existing function `mo_property`:
  * Taxonomic names: `mo_phylum`, `mo_class`, `mo_order`, `mo_family`, `mo_genus`, `mo_species`, `mo_subspecies`
  * Semantic names: `mo_fullname`, `mo_shortname`
  * Microbial properties: `mo_type`, `mo_gramstain`
  * Author and year: `mo_ref`
  
  They also come with support for German, Dutch, French, Italian, Spanish and Portuguese:
  ```r
  mo_gramstain("E. coli")
  # [1] "Gram negative"
  mo_gramstain("E. coli", language = "de") # German
  # [1] "Gramnegativ"
  mo_gramstain("E. coli", language = "es") # Spanish
  # [1] "Gram negativo"
  mo_fullname("S. group A", language = "pt") # Portuguese
  # [1] "Streptococcus grupo A"
  ```
  
  Furthermore, former taxonomic names will give a note about the current taxonomic name:
  ```r
  mo_gramstain("Esc blattae")
  # Note: 'Escherichia blattae' (Burgess et al., 1973) was renamed 'Shimwellia blattae' (Priest and Barker, 2010)
  # [1] "Gram negative"
  ```
* Functions `count_R`, `count_IR`, `count_I`, `count_SI` and `count_S` to selectively count resistant or susceptible isolates
  * Extra function `count_df` (which works like `portion_df`) to get all counts of S, I and R of a data set with antibiotic columns, with support for grouped variables
* Function `is.rsi.eligible` to check for columns that have valid antimicrobial results, but do not have the `rsi` class yet. Transform the columns of your raw data with: `data %>% mutate_if(is.rsi.eligible, as.rsi)`
* Functions `as.mo` and `is.mo` as replacements for `as.bactid` and `is.bactid` (since the `microoganisms` data set not only contains bacteria). These last two functions are deprecated and will be removed in a future release. The `as.mo` function determines microbial IDs using intelligent rules:
  ```r
  as.mo("E. coli")
  # [1] B_ESCHR_COL
  as.mo("MRSA")
  # [1] B_STPHY_AUR
  as.mo("S group A")
  # [1] B_STRPTC_GRA
  ```
  And with great speed too - on a quite regular Linux server from 2007 it takes us less than 0.02 seconds to transform 25,000 items:
  ```r
  thousands_of_E_colis <- rep("E. coli", 25000)
  microbenchmark::microbenchmark(as.mo(thousands_of_E_colis), unit = "s")
  # Unit: seconds
  #         min       median         max  neval
  #  0.01817717  0.01843957  0.03878077    100
  ```
* Added parameter `reference_df` for `as.mo`, so users can supply their own microbial IDs, name or codes as a reference table
* Renamed all previous references to `bactid` to `mo`, like:
  * Column names inputs of `EUCAST_rules`, `first_isolate` and `key_antibiotics`
  * Column names of datasets `microorganisms` and `septic_patients`
  * All old syntaxes will still work with this version, but will throw warnings
* Function `labels_rsi_count` to print datalabels on a RSI `ggplot2` model
* Functions `as.atc` and `is.atc` to transform/look up antibiotic ATC codes as defined by the WHO. The existing function `guess_atc` is now an alias of `as.atc`.

* Function `ab_property` and its aliases: `ab_name`, `ab_tradenames`, `ab_certe`, `ab_umcg` and `ab_trivial_nl`
* Introduction to AMR as a vignette
* Removed clipboard functions as it violated the CRAN policy
* Renamed `septic_patients$sex` to `septic_patients$gender`

#### Changed
* Added three antimicrobial agents to the `antibiotics` data set: Terbinafine (D01BA02), Rifaximin (A07AA11) and Isoconazole (D01AC05)
* Added 163 trade names to the `antibiotics` data set, it now contains 298 different trade names in total, e.g.:
  ```r
  ab_official("Bactroban")
  # [1] "Mupirocin"
  ab_name(c("Bactroban", "Amoxil", "Zithromax", "Floxapen"))
  # [1] "Mupirocin" "Amoxicillin" "Azithromycin" "Flucloxacillin"
  ab_atc(c("Bactroban", "Amoxil", "Zithromax", "Floxapen"))
  # [1] "R01AX06" "J01CA04" "J01FA10" "J01CF05"
  ```
* For `first_isolate`, rows will be ignored when there's no species available
* Function `ratio` is now deprecated and will be removed in a future release, as it is not really the scope of this package
* Fix for `as.mic` for values ending in zeroes after a real number
* Small fix where *B. fragilis* would not be found in the `microorganisms.umcg` data set
* Added `prevalence` column to the `microorganisms` data set
* Added parameters `minimum` and `as_percent` to `portion_df`
* Support for quasiquotation in the functions series `count_*` and `portions_*`, and `n_rsi`. This allows to check for more than 2 vectors or columns.
  ```r
  septic_patients %>% select(amox, cipr) %>% count_IR()
  # which is the same as:
  septic_patients %>% count_IR(amox, cipr)
  
  septic_patients %>% portion_S(amcl)
  septic_patients %>% portion_S(amcl, gent)
  septic_patients %>% portion_S(amcl, gent, pita)
  ```
* Edited `ggplot_rsi` and `geom_rsi` so they can cope with `count_df`. The new `fun` parameter has value `portion_df` at default, but can be set to `count_df`.
* Fix for `ggplot_rsi` when the `ggplot2` package was not loaded
* Added datalabels function `labels_rsi_count` to `ggplot_rsi`
* Added possibility to set any parameter to `geom_rsi` (and `ggplot_rsi`) so you can set your own preferences
* Fix for joins, where predefined suffices would not be honoured
* Added parameter `quote` to the `freq` function
* Added generic function `diff` for frequency tables
* Added longest en shortest character length in the frequency table (`freq`) header of class `character`
* Support for types (classes) list and matrix for `freq`
  ```r
  my_matrix = with(septic_patients, matrix(c(age, gender), ncol = 2))
  freq(my_matrix)
  ```
  For lists, subsetting is possible:
  ```r
  my_list = list(age = septic_patients$age, gender = septic_patients$gender)
  my_list %>% freq(age)
  my_list %>% freq(gender)
  ```

#### Other
* More unit tests to ensure better integrity of functions

# AMR 0.3.0

#### New
* **BREAKING**: `rsi_df` was removed in favour of new functions `portion_R`, `portion_IR`, `portion_I`, `portion_SI` and `portion_S` to selectively calculate resistance or susceptibility. These functions are 20 to 30 times faster than the old `rsi` function. The old function still works, but is deprecated.
  * New function `portion_df` to get all portions of S, I and R of a data set with antibiotic columns, with support for grouped variables
* **BREAKING**: the methodology for determining first weighted isolates was changed. The antibiotics that are compared between isolates (call *key antibiotics*) to include more first isolates (afterwards called first *weighted* isolates) are now as follows:
  * Universal: amoxicillin, amoxicillin/clavlanic acid, cefuroxime, piperacillin/tazobactam, ciprofloxacin,  trimethoprim/sulfamethoxazole
  * Gram-positive: vancomycin, teicoplanin, tetracycline, erythromycin, oxacillin, rifampicin
  * Gram-negative: gentamicin, tobramycin, colistin, cefotaxime, ceftazidime, meropenem
* Support for `ggplot2`
  * New functions `geom_rsi`, `facet_rsi`, `scale_y_percent`, `scale_rsi_colours` and `theme_rsi`
  * New wrapper function `ggplot_rsi` to apply all above functions on a data set:
    * `septic_patients %>% select(tobr, gent) %>% ggplot_rsi` will show portions of S, I and R immediately in a pretty plot
    * Support for grouped variables, see `?ggplot_rsi`
* Determining bacterial ID:
  * New functions `as.bactid` and `is.bactid` to transform/ look up microbial ID's.
  * The existing function `guess_bactid` is now an alias of `as.bactid`
  * New Becker classification for *Staphylococcus* to categorise them into Coagulase Negative *Staphylococci* (CoNS) and Coagulase Positve *Staphylococci* (CoPS)
  * New Lancefield classification for *Streptococcus* to categorise them into Lancefield groups
* For convience, new descriptive statistical functions `kurtosis` and `skewness` that are lacking in base R - they are generic functions and have support for vectors, data.frames and matrices
* Function `g.test` to perform the Χ<sup>2</sup> distributed [*G*-test](https://en.wikipedia.org/wiki/G-test), which use is the same as `chisq.test`
* ~~Function `ratio` to transform a vector of values to a preset ratio~~
  * ~~For example: `ratio(c(10, 500, 10), ratio = "1:2:1")` would return `130, 260, 130`~~
* Support for Addins menu in RStudio to quickly insert `%in%` or `%like%` (and give them keyboard shortcuts), or to view the datasets that come with this package
* Function `p.symbol` to transform p values to their related symbols: `0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1`
* Functions `clipboard_import` and `clipboard_export` as helper functions to quickly copy and paste from/to software like Excel and SPSS. These functions use the `clipr` package, but are a little altered to also support headless Linux servers (so you can use it in RStudio Server)
* New for frequency tables (function `freq`):
  * A vignette to explain its usage
  * Support for `rsi` (antimicrobial resistance) to use as input
  * Support for `table` to use as input: `freq(table(x, y))`
  * Support for existing functions `hist` and `plot` to use a frequency table as input: `hist(freq(df$age))`
  * Support for `as.vector`, `as.data.frame`, `as_tibble` and `format`
  * Support for quasiquotation: `freq(mydata, mycolumn)` is the same as `mydata %>% freq(mycolumn)`
  * Function `top_freq` function to return the top/below *n* items as vector
  * Header of frequency tables now also show Mean Absolute Deviaton (MAD) and Interquartile Range (IQR)
  * Possibility to globally set the default for the amount of items to print, with `options(max.print.freq = n)` where *n* is your preset value

#### Changed
* Improvements for forecasting with `resistance_predict` and added more examples
* More antibiotics added as parameters for EUCAST rules
* Updated version of the `septic_patients` data set to better reflect the reality
* Pretty printing for tibbles removed as it is not really the scope of this package
* Printing of `mic` and `rsi` classes now returns all values - use `freq` to check distributions
* Improved speed of key antibiotics comparison for determining first isolates
* Column names for the `key_antibiotics` function are now generic: 6 for broadspectrum ABs, 6 for Gram-positive specific and 6 for Gram-negative specific ABs
* Speed improvement for the `abname` function
* `%like%` now supports multiple patterns
* Frequency tables are now actual `data.frame`s with altered console printing to make it look like a frequency table. Because of this, the parameter `toConsole` is not longer needed.
* Fix for `freq` where the class of an item would be lost
* Small translational improvements to the `septic_patients` dataset and the column `bactid` now has the new class `"bactid"`
* Small improvements to the `microorganisms` dataset (especially for *Salmonella*) and the column `bactid` now has the new class `"bactid"`
* Combined MIC/RSI values will now be coerced by the `rsi` and `mic` functions:
  * `as.rsi("<=0.002; S")` will return `S`
  * `as.mic("<=0.002; S")` will return `<=0.002`
* Now possible to coerce MIC values with a space between operator and value, i.e. `as.mic("<= 0.002")` now works
* Classes `rsi` and `mic` do not add the attribute `package.version` anymore
* Added `"groups"` option for `atc_property(..., property)`. It will return a vector of the ATC hierarchy as defined by the [WHO](https://www.whocc.no/atc/structure_and_principles/). The new function `atc_groups` is a convenient wrapper around this.
* Build-in host check for `atc_property` as it requires the host set by `url` to be responsive
* Improved `first_isolate` algorithm to exclude isolates where bacteria ID or genus is unavailable
* Fix for warning *hybrid evaluation forced for row_number* ([`924b62`](https://github.com/tidyverse/dplyr/commit/924b62)) from the `dplyr` package v0.7.5 and above
* Support for empty values and for 1 or 2 columns as input for `guess_bactid` (now called `as.bactid`)
  * So `yourdata %>% select(genus, species) %>% as.bactid()` now also works
* Other small fixes

#### Other
* Added integration tests (check if everything works as expected) for all releases of R 3.1 and higher
  * Linux and macOS: https://travis-ci.org/msberends/AMR
  * Windows: https://ci.appveyor.com/project/msberends/amr
* Added thesis advisors to DESCRIPTION file

# AMR 0.2.0

#### New
* Full support for Windows, Linux and macOS
* Full support for old R versions, only R-3.0.0 (April 2013) or later is needed (needed packages may have other dependencies)
* Function `n_rsi` to count cases where antibiotic test results were available, to be used in conjunction with `dplyr::summarise`, see ?rsi
* Function `guess_bactid` to **determine the ID** of a microorganism based on genus/species or known abbreviations like MRSA
* Function `guess_atc` to **determine the ATC** of an antibiotic based on name, trade name, or known abbreviations
* Function `freq` to create **frequency tables**, with additional info in a header
* Function `MDRO` to **determine Multi Drug Resistant Organisms (MDRO)** with support for country-specific guidelines.
  * [Exceptional resistances defined by EUCAST](http://www.eucast.org/expert_rules_and_intrinsic_resistance) are also supported instead of countries alone
  * Functions `BRMO` and `MRGN` are wrappers for Dutch and German guidelines, respectively
* New algorithm to determine weighted isolates, can now be `"points"` or `"keyantibiotics"`, see `?first_isolate`
* New print format for `tibble`s and `data.table`s

#### Changed
* Fixed `rsi` class for vectors that contain only invalid antimicrobial interpretations
* Renamed dataset `ablist` to `antibiotics`
* Renamed dataset `bactlist` to `microorganisms`
* Added common abbreviations and trade names to the `antibiotics` dataset
* Added more microorganisms to the `microorganisms` dataset
* Added analysis examples on help page of dataset `septic_patients`
* Added support for character vector in `join` functions
* Added warnings when a join results in more rows after than before the join
* Altered `%like%` to make it case insensitive
* For parameters of functions `first_isolate` and `EUCAST_rules` column names are now case-insensitive
* Functions `as.rsi` and `as.mic` now add the package name and version as attributes

#### Other
* Expanded `README.md` with more examples
* Added [ORCID](https://orcid.org) of authors to DESCRIPTION file
* Added unit testing with the `testthat` package
* Added build tests for Linux and macOS using Travis CI (https://travis-ci.org/msberends/AMR)
* Added line coverage checking using CodeCov (https://codecov.io/gh/msberends/AMR/tree/master/R)

# AMR 0.1.1

* `EUCAST_rules` applies for amoxicillin even if ampicillin is missing
* Edited column names to comply with GLIMS, the laboratory information system
* Added more valid MIC values
* Renamed 'Daily Defined Dose' to 'Defined Daily Dose'
* Added barplots for `rsi` and `mic` classes

# AMR 0.1.0

* First submission to CRAN.
-												v0.6.1

											
										
										
											2019-03-28 21:33:28 +01:00
+								# AMR 0.6.1
 								#### Changed
 								* Fixed a critical bug when using `eucast_rules()` with `verbose = TRUE`
 								* Coercion of microbial IDs are now written to the package namespace instead of the user's home folder, to comply with the CRAN policy
-												v0.6.0

											
										
										
											2019-03-27 11:22:36 +01:00
+								# AMR 0.6.0
-												better as.mo handling

											
										
										
											2018-12-06 14:36:39 +01:00
-												Catalogue of Life, replaces ITIS

											
										
										
											2019-02-18 02:33:37 +01:00
+								**New website!**
 								We've got a new website: [https://msberends.gitlab.io/AMR](https://msberends.gitlab.io/AMR/) (built with the great [`pkgdown`](https://pkgdown.r-lib.org/))
 								* Contains the complete manual of this package and all of its functions with an explanation of their parameters
 								* Contains a comprehensive tutorial about how to conduct antimicrobial resistance analysis, import data from WHONET or SPSS and many more.
-												better as.mo handling

											
										
										
											2018-12-06 14:36:39 +01:00
+								#### New
-												eucast rules fix, 1st isolate fix, website update

											
										
										
											2018-12-31 01:48:53 +01:00
+								* **BREAKING**: removed deprecated functions, parameters and references to 'bactid'. Use `as.mo()` to identify an MO code.
-												Catalogue of life

											
										
										
											2019-02-20 00:04:48 +01:00
+								* Catalogue of Life as a new taxonomic source for data about microorganisms, which also contains all ITIS data we used previously. The `microorganisms` data set now contains:
-												v0.6.0

											
										
										
											2019-03-27 11:22:36 +01:00
+								  * All ~55,000 (sub)species from the kingdoms of Archaea, Bacteria and Protozoa
-												as.mo improvement

											
										
										
											2019-02-26 12:33:26 +01:00
+								  * All ~3,000 (sub)species from these orders of the kingdom of Fungi: Eurotiales, Onygenales, Pneumocystales, Saccharomycetales and Schizosaccharomycetales (covering at least like all species of *Aspergillus*, *Candida*, *Pneumocystis*, *Saccharomyces* and *Trichophyton*)
-												rlang dependency, new fungi

											
										
										
											2019-02-28 13:56:28 +01:00
+								  * All ~2,000 (sub)species from ~100 other relevant genera, from the kingdoms of Animalia and Plantae (like *Strongyloides* and *Taenia*)
-												unit tests

											
										
										
											2019-02-21 23:32:30 +01:00
+								  * All ~15,000 previously accepted names of included (sub)species that have been taxonomically renamed
-												Catalogue of life

											
										
										
											2019-02-20 00:04:48 +01:00
+								  * The responsible author(s) and year of scientific publication
-												unit tests

											
										
										
											2019-02-21 23:32:30 +01:00
-												as.mo improvement

											
										
										
											2019-02-26 12:33:26 +01:00
+								    This data is updated annually - check the included version with the new function `catalogue_of_life_version()`.
-												Catalogue of life

											
										
										
											2019-02-20 00:04:48 +01:00
+								  * Due to this change, some `mo` codes changed (e.g. *Streptococcus* changed from `B_STRPTC` to `B_STRPT`). A translation table is  used internally to support older microorganism IDs, so users will not notice this difference.
-												filter_ab_class fix

											
										
										
											2019-03-06 09:32:48 +01:00
+								  * New function `mo_rank()` for the taxonomic rank (genus, species, infraspecies, etc.)
 								  * New function `mo_url()` to get the direct URL of a species from the Catalogue of Life
-												WHONET/EARS-Net support

											
										
										
											2019-01-29 00:06:50 +01:00
+								* Support for data from [WHONET](https://whonet.org/) and [EARS-Net](https://ecdc.europa.eu/en/about-us/partnerships-and-networks/disease-and-laboratory-networks/ears-net) (European Antimicrobial Resistance Surveillance Network):
 								  * Exported files from WHONET can be read and used in this package. For functions like `first_isolate()` and `eucast_rules()`, all parameters will be filled in automatically.
 								  * This package now knows all antibiotic abbrevations by EARS-Net (which are also being used by WHONET) - the `antibiotics` data set now contains a column `ears_net`.
-												uncertainty levels, new WHONET codes

											
										
										
											2019-03-12 12:19:27 +01:00
+								  * The function `as.mo()` now knows all WHONET species abbreviations too, because almost 2,000 microbial abbreviations were added to the `microorganisms.codes` data set.
-												filter_ab_class fix

											
										
										
											2019-03-06 09:32:48 +01:00
+								* New filters for antimicrobial classes. Use these functions to filter isolates on results in one of more antibiotics from a specific class:
 								  ```r
 								  filter_aminoglycosides()
 								  filter_carbapenems()
 								  filter_cephalosporins()
 								  filter_1st_cephalosporins()
 								  filter_2nd_cephalosporins()
 								  filter_3rd_cephalosporins()
 								  filter_4th_cephalosporins()
 								  filter_fluoroquinolones()
 								  filter_glycopeptides()
 								  filter_macrolides()
 								  filter_tetracyclines()
 								  ```
 								  The `antibiotics` data set will be searched, after which the input data will be checked for column names with a value in any abbreviations, codes or official names found in the `antibiotics` data set.
 								  For example:
 								  ```r
 								  septic_patients %>% filter_glycopeptides(result = "R")
 								  # Filtering on glycopeptide antibacterials: any of `vanc` or `teic` is R
 								  septic_patients %>% filter_glycopeptides(result = "R", scope = "all")
 								  # Filtering on glycopeptide antibacterials: all of `vanc` and `teic` is R
 								  ```
-												atc_ functions

											
										
										
											2019-01-26 23:22:56 +01:00
+								* All `ab_*` functions are deprecated and replaced by `atc_*` functions:
 								  ```r
 								  ab_property -> atc_property()
 								  ab_name -> atc_name()
 								  ab_official -> atc_official()
 								  ab_trivial_nl -> atc_trivial_nl()
 								  ab_certe -> atc_certe()
 								  ab_umcg -> atc_umcg()
 								  ab_tradenames -> atc_tradenames()
 								  ```
 								  These functions use `as.atc()` internally. The old `atc_property` has been renamed `atc_online_property()`. This is done for two reasons: firstly, not all ATC codes are of antibiotics (ab) but can also be of antivirals or antifungals. Secondly, the input must have class `atc` or must be coerable to this class. Properties of these classes should start with the same class name, analogous to `as.mo()` and e.g. `mo_genus`.
-												set_mo_source

											
										
										
											2019-01-21 15:53:01 +01:00
+								* New functions `set_mo_source()` and `get_mo_source()` to use your own predefined MO codes as input for `as.mo()` and consequently all `mo_*` functions
-												direct warning if failing as.mo

											
										
										
											2019-01-12 16:45:20 +01:00
+								* Support for the upcoming [`dplyr`](https://dplyr.tidyverse.org) version 0.8.0
-												select() fix for freq

											
										
										
											2019-01-17 12:08:04 +01:00
+								* New function `guess_ab_col()` to find an antibiotic column in a table
-												as.mo

											
										
										
											2019-01-21 21:24:40 +01:00
+								* New function `mo_failures()` to review values that could not be coerced to a valid MO code, using `as.mo()`. This latter function will now only show a maximum of 10 uncoerced values and will refer to `mo_failures()`.
-												mo codes for WHONET

											
										
										
											2019-02-08 16:06:54 +01:00
+								* New function `mo_uncertainties()` to review values that could be coerced to a valid MO code using `as.mo()`, but with uncertainty.
-												select() fix for freq

											
										
										
											2019-01-17 12:08:04 +01:00
+								* New function `mo_renamed()` to get a list of all returned values from `as.mo()` that have had taxonomic renaming
 								* New function `age()` to calculate the (patients) age in years
 								* New function `age_groups()` to split ages into custom or predefined groups (like children or elderly). This allows for easier demographic antimicrobial resistance analysis per age group.
 								* New function `ggplot_rsi_predict()` as well as the base R `plot()` function can now be used for resistance prediction calculated with `resistance_predict()`:
-												resistance predict

											
										
										
											2019-01-15 12:45:24 +01:00
+								  ```r
 								  x <- resistance_predict(septic_patients, col_ab = "amox")
 								  plot(x)
 								  ggplot_rsi_predict(x)
 								  ```
-												eucast rules fix, 1st isolate fix, website update

											
										
										
											2018-12-31 01:48:53 +01:00
+								* Functions `filter_first_isolate()` and `filter_first_weighted_isolate()` to shorten and fasten filtering on data sets with antimicrobial results, e.g.:
-												dplyr 0.8.0 support, fixes #7

											
										
										
											2018-12-22 22:39:34 +01:00
+								  ```r
-												direct warning if failing as.mo

											
										
										
											2019-01-12 16:45:20 +01:00
+								  septic_patients %>% filter_first_isolate(...)
-												dplyr 0.8.0 support, fixes #7

											
										
										
											2018-12-22 22:39:34 +01:00
+								  # or
-												direct warning if failing as.mo

											
										
										
											2019-01-12 16:45:20 +01:00
+								  filter_first_isolate(septic_patients, ...)
-												dplyr 0.8.0 support, fixes #7

											
										
										
											2018-12-22 22:39:34 +01:00
+								  ```
-												new website, freq updates

											
										
										
											2018-12-29 22:24:19 +01:00
+								  is equal to:
-												dplyr 0.8.0 support, fixes #7

											
										
										
											2018-12-22 22:39:34 +01:00
+								  ```r
 								  septic_patients %>%
 								    mutate(only_firsts = first_isolate(septic_patients, ...)) %>%
 								    filter(only_firsts == TRUE) %>%
 								    select(-only_firsts)
 								  ```
-												mo codes for WHONET

											
										
										
											2019-02-08 16:06:54 +01:00
+								* New function `availability()` to check the number of available (non-empty) results in a `data.frame`
-												big website update, licence txt update

											
										
										
											2019-01-02 23:24:07 +01:00
+								* New vignettes about how to conduct AMR analysis, predict antimicrobial resistance, use the *G*-test and more. These are also available (and even easier readable) on our website: https://msberends.gitlab.io/AMR.
-												better as.mo handling

											
										
										
											2018-12-06 14:36:39 +01:00
 								#### Changed
-												mo codes for WHONET

											
										
										
											2019-02-08 16:06:54 +01:00
+								* Function `eucast_rules()`:
 								  * Updated EUCAST Clinical breakpoints to [version 9.0 of 1 January 2019](http://www.eucast.org/clinical_breakpoints/), the data set `septic_patients` now reflects these changes
 								  * Fixed a critical bug where some rules that depend on previous applied rules would not be applied adequately
 								  * Emphasised in manual that penicillin is meant as benzylpenicillin (ATC [J01CE01](https://www.whocc.no/atc_ddd_index/?code=J01CE01))
 								  * New info is returned when running this function, stating exactly what has been changed or added. Use `eucast_rules(..., verbose = TRUE)` to get a data set with all changed per bug and drug combination.
-												Catalogue of Life, replaces ITIS

											
										
										
											2019-02-18 02:33:37 +01:00
+								* Removed data sets `microorganisms.oldDT`, `microorganisms.prevDT`, `microorganisms.unprevDT` and `microorganismsDT` since they were no longer needed and only contained info already available in the `microorganisms` data set
-												WHO update, antibiotics update

											
										
										
											2019-01-25 13:18:41 +01:00
+								* Added 65 antibiotics to the `antibiotics` data set, from the [Pharmaceuticals Community Register](http://ec.europa.eu/health/documents/community-register/html/atc.htm) of the European Commission
 								* Removed columns `atc_group1_nl` and `atc_group2_nl` from the `antibiotics` data set
-												quasiquotation for freq()

											
										
										
											2019-01-28 11:20:32 +01:00
+								* Functions `atc_ddd()` and `atc_groups()` have been renamed `atc_online_ddd()` and `atc_online_groups()`. The old functions are deprecated and will be removed in a future version.
-												re-add atc_ddd and atc_groups

											
										
										
											2019-01-27 13:33:43 +01:00
+								* Function `guess_mo()` is now deprecated in favour of `as.mo()` and will be removed in future versions
 								* Function `guess_atc()` is now deprecated in favour of `as.atc()` and will be removed in future versions
-												uncertainty levels, new WHONET codes

											
										
										
											2019-03-12 12:19:27 +01:00
+								* Improvements for `as.mo()`:
-												memory for as.mo()

											
										
										
											2019-03-15 13:57:25 +01:00
+								  * Now handles incorrect spelling, like `i` instead of `y` and `f` instead of `ph`:
 								    ```r
 								    # mo_fullname() uses as.mo() internally
 								    mo_fullname("Sthafilokockus aaureuz")
 								    #> [1] "Staphylococcus aureus"
 								    mo_fullname("S. klossi")
 								    #> [1] "Staphylococcus kloosii"
 								    ```
-												uncertainty levels, new WHONET codes

											
										
										
											2019-03-12 12:19:27 +01:00
+								  * Uncertainty of the algorithm is now divided into four levels, 0 to 3, where the default `allow_uncertain = TRUE` is equal to uncertainty level 2. Run `?as.mo` for more info about these levels.
-												memory for as.mo()

											
										
										
											2019-03-15 13:57:25 +01:00
+								    ```r
 								    # equal:
 								    as.mo(..., allow_uncertain = TRUE)
 								    as.mo(..., allow_uncertain = 2)
 								    # also equal:
 								    as.mo(..., allow_uncertain = FALSE)
 								    as.mo(..., allow_uncertain = 0)
 								    ```
 								    Using `as.mo(..., allow_uncertain = 3)` could lead to very unreliable results.
-												v0.6.0

											
										
										
											2019-03-27 11:22:36 +01:00
+								  * Implemented the latest publication of Becker *et al.* (2019), for categorising coagulase-negative *Staphylococci*
 								  * All microbial IDs that found are now saved to a local file `~/.Rhistory_mo`. Use the new function `clean_mo_history()` to delete this file, which resets the algorithms.
-												uncertainty levels, new WHONET codes

											
										
										
											2019-03-12 12:19:27 +01:00
+								  * Incoercible results will now be considered 'unknown', MO code `UNKNOWN`. On foreign systems, properties of these will be translated to all languages already previously supported: German, Dutch, French, Italian, Spanish and Portuguese:
-												memory for as.mo()

											
										
										
											2019-03-15 13:57:25 +01:00
+								    ```r
 								    mo_genus("qwerty", language = "es")
 								    # Warning:
 								    # one unique value (^= 100.0%) could not be coerced and is considered 'unknown': "qwerty". Use mo_failures() to review it.
 								    #> [1] "(género desconocido)"
 								    ```
-												EUCAST update, as.mo bugfix for empty vlaues

											
										
										
											2019-01-08 16:23:45 +01:00
+								  * Fix for vector containing only empty values
-												better as.mo handling

											
										
										
											2018-12-06 14:36:39 +01:00
+								  * Finds better results when input is in other languages
 								  * Better handling for subspecies
-												unknown codes, rsi fix

											
										
										
											2019-03-02 22:47:04 +01:00
+								  * Better handling for *Salmonellae*, especially the 'city like' serovars like *Salmonella London*
-												mo codes for WHONET

											
										
										
											2019-02-08 16:06:54 +01:00
+								  * Understanding of highly virulent *E. coli* strains like EIEC, EPEC and STEC
-												freq - decimals

											
										
										
											2018-12-10 10:13:40 +01:00
+								  * There will be looked for uncertain results at default - these results will be returned with an informative warning
-												as.mo improvement

											
										
										
											2019-02-26 12:33:26 +01:00
+								  * Manual (help page) now contains more info about the algorithms
-												freq - decimals

											
										
										
											2018-12-10 10:13:40 +01:00
+								  * Progress bar will be shown when it takes more than 3 seconds to get results
-												reorganised notes and warnings

											
										
										
											2018-12-14 10:52:20 +01:00
+								  * Support for formatted console text
-												WHO update, antibiotics update

											
										
										
											2019-01-25 13:18:41 +01:00
+								  * Console will return the percentage of uncoercable input
-												eucast rules fix, 1st isolate fix, website update

											
										
										
											2018-12-31 01:48:53 +01:00
+								* Function `first_isolate()`:
 								  * Fixed a bug where distances between dates would not be calculated right - in the `septic_patients` data set this yielded a difference of 0.15% more isolates
-												keyab automatic

											
										
										
											2018-12-10 15:14:29 +01:00
+								  * Will now use a column named like "patid" for the patient ID (parameter `col_patientid`), when this parameter was left blank
-												eucast rules fix, 1st isolate fix, website update

											
										
										
											2018-12-31 01:48:53 +01:00
+								  * Will now use a column named like "key(...)ab" or "key(...)antibiotics" for the key antibiotics (parameter `col_keyantibiotics()`), when this parameter was left blank
-												dplyr 0.8.0 support, fixes #7

											
										
										
											2018-12-22 22:39:34 +01:00
+								  * Removed parameter `output_logical`, the function will now always return a logical value
 								  * Renamed parameter `filter_specimen` to `specimen_group`, although using `filter_specimen` will still work
-												age and age_groups

											
										
										
											2018-12-15 22:40:07 +01:00
+								* A note to the manual pages of the `portion` functions, that low counts can influence the outcome and that the `portion` functions may camouflage this, since they only return the portion (albeit being dependent on the `minimum` parameter)
-												set_mo_source

											
										
										
											2019-01-21 15:53:01 +01:00
+								* Merged data sets `microorganisms.certe` and `microorganisms.umcg` into `microorganisms.codes`
-												eucast rules fix, 1st isolate fix, website update

											
										
										
											2018-12-31 01:48:53 +01:00
+								* Function `mo_taxonomy()` now contains the kingdom too
-												is.rsi.eligible update

											
										
										
											2019-02-04 12:24:07 +01:00
+								* Reduce false positives for `is.rsi.eligible()` using the new `threshold` parameter
-												WHONET/EARS-Net support

											
										
										
											2019-01-29 00:06:50 +01:00
+								* New colours for `scale_rsi_colours()`
-												AI improvements

											
										
										
											2018-12-07 12:04:55 +01:00
+								* Summaries of class `mo` will now return the top 3 and the unique count, e.g. using `summary(mo)`
 								* Small text updates to summaries of class `rsi` and `mic`
-												unknown codes, rsi fix

											
										
										
											2019-03-02 22:47:04 +01:00
+								* Function `as.rsi()`:
 								  * Now gives a warning when inputting MIC values
 								  * Now accepts high and low resistance: `"HIGH S"` will return `S`
-												eucast rules fix, 1st isolate fix, website update

											
										
										
											2018-12-31 01:48:53 +01:00
+								* Frequency tables (`freq()` function):
-												WHONET/EARS-Net support

											
										
										
											2019-01-29 00:06:50 +01:00
+								  * Support for tidyverse quasiquotation! Now you can create frequency tables of function outcomes:
-												quasiquotation for freq()

											
										
										
											2019-01-28 11:20:32 +01:00
+								    ```r
 								    # Determine genus of microorganisms (mo) in `septic_patients` data set:
 								    # OLD WAY
 								    septic_patients %>%
 								      mutate(genus = mo_genus(mo)) %>%
 								      freq(genus)
 								    # NEW WAY
 								    septic_patients %>%
 								      freq(mo_genus(mo))
 								    # Even supports grouping variables:
 								    septic_patients %>%
 								      group_by(gender) %>%
 								      freq(mo_genus(mo))
 								    ```
-												new website, freq updates

											
										
										
											2018-12-29 22:24:19 +01:00
+								  * Header info is now available as a list, with the `header` function
-												freq fix

											
										
										
											2019-01-30 16:00:55 +01:00
+								  * The parameter `header` is now set to `TRUE` at default, even for markdown
-												freq - decimals

											
										
										
											2018-12-10 10:13:40 +01:00
+								  * Added header info for class `mo` to show unique count of families, genera and species
 								  * Now honours the `decimal.mark` setting, which just like `format` defaults to `getOption("OutDec")`
 								  * The new `big.mark` parameter will at default be `","` when `decimal.mark = "."` and `"."` otherwise
-												freq header fix for NAs

											
										
										
											2018-12-14 10:08:51 +01:00
+								  * Fix for header text where all observations are `NA`
-												dplyr 0.8.0 support, fixes #7

											
										
										
											2018-12-22 22:39:34 +01:00
+								  * New parameter `droplevels` to exclude empty factor levels when input is a factor
-												select() fix for freq

											
										
										
											2019-01-17 12:08:04 +01:00
+								  * Factor levels will be in header when present in input data (maximum of 5)
 								  * Fix for using `select()` on frequency tables
-												eucast rules fix, 1st isolate fix, website update

											
										
										
											2018-12-31 01:48:53 +01:00
+								* Function `scale_y_percent()` now contains the `limits` parameter
 								* Automatic parameter filling for `mdro()`, `key_antibiotics()` and `eucast_rules()`
 								* Updated examples for resistance prediction (`resistance_predict()` function)
 								* Fix for `as.mic()` to support more values ending in (several) zeroes
-												uncertainty levels, new WHONET codes

											
										
										
											2019-03-12 12:19:27 +01:00
+								* if using different lengths of pattern and x in `%like%`, it will now return the call
-												better as.mo handling

											
										
										
											2018-12-06 14:36:39 +01:00
-												limits for scale_y_percent - Licence update

											
										
										
											2018-12-16 22:45:12 +01:00
+								#### Other
 								* Updated licence text to emphasise GPL 2.0 and that this is an R package.
-												better as.mo handling

											
										
										
											2018-12-06 14:36:39 +01:00
-												new website, freq updates

											
										
										
											2018-12-29 22:24:19 +01:00
+								# AMR 0.5.0
-												CRAN fixes for release 0.4.0
https://cran.r-project.org/web/checks/check_results_AMR.html

											
										
										
											2018-10-09 13:53:33 +02:00
 								#### New
-												switch to gitlab

											
										
										
											2018-10-23 16:49:40 +02:00
+								* Repository moved to GitLab: https://gitlab.com/msberends/AMR
-												count_all and some fixes

											
										
										
											2018-10-12 16:35:18 +02:00
+								* Function `count_all` to get all available isolates (that like all `portion_*` and `count_*` functions also supports `summarise` and `group_by`), the old `n_rsi` is now an alias of `count_all`
-												new function get_locale

											
										
										
											2018-11-05 13:20:32 +01:00
+								* Function `get_locale` to determine language for language-dependent output for some `mo_*` functions. This is now the default value for their `language` parameter, by which the system language will be used at default.
-												speed improvement as.mo, freq title

											
										
										
											2018-10-31 12:10:49 +01:00
+								* Data sets `microorganismsDT`, `microorganisms.prevDT`, `microorganisms.unprevDT` and `microorganisms.oldDT` to improve the speed of `as.mo`. They are for reference only, since they are primarily for internal use of `as.mo`.
-												read.4D improvements

											
										
										
											2018-11-15 12:42:35 +01:00
+								* Function `read.4D` to read from the 4D database of the MMB department of the UMCG
-												new kingdom

											
										
										
											2018-11-09 13:11:54 +01:00
+								* Functions `mo_authors` and `mo_year` to get specific values about the scientific reference of a taxonomic entry
-												CRAN fixes for release 0.4.0
https://cran.r-project.org/web/checks/check_results_AMR.html

											
										
										
											2018-10-09 13:53:33 +02:00
 								#### Changed
-												MDRO update

											
										
										
											2018-11-16 20:50:50 +01:00
+								* Functions `MDRO`, `BRMO`, `MRGN` and `EUCAST_exceptional_phenotypes` were renamed to `mdro`, `brmo`, `mrgn` and `eucast_exceptional_phenotypes`
 								* `EUCAST_rules` was renamed to `eucast_rules`, the old function still exists as a deprecated function
 								* Big changes to the `eucast_rules` function:
-												param rules for EUCAST

											
										
										
											2018-10-18 12:10:10 +02:00
+								  * Now also applies rules from the EUCAST 'Breakpoint tables for bacteria', version 8.1, 2018, http://www.eucast.org/clinical_breakpoints/ (see Source of the function)
 								  * New parameter `rules` to specify which rules should be applied (expert rules, breakpoints, others or all)
 								  * New parameter `verbose` which can be set to `TRUE` to get very specific messages about which columns and rows were affected
 								  * Better error handling when rules cannot be applied (i.e. new values could not be inserted)
-												speed improvement as.mo, freq title

											
										
										
											2018-10-31 12:10:49 +01:00
+								  * The number of affected values will now only be measured once per row/column combination
-												new EUCAST rules: clinical breakpoints

											
										
										
											2018-10-17 17:32:34 +02:00
+								  * Data set `septic_patients` now reflects these changes
-												MDRO update

											
										
										
											2018-11-16 20:50:50 +01:00
+								  * Added parameter `pipe` for piperacillin (J01CA12), also to the `mdro` function
-												eucast updates

											
										
										
											2018-11-01 17:06:08 +01:00
+								  * Small fixes to EUCAST clinical breakpoint rules
-												new kingdom

											
										
										
											2018-11-09 13:11:54 +01:00
+								* Added column `kingdom` to the microorganisms data set, and function `mo_kingdom` to look up values
-												speed improvement as.mo, freq title

											
										
										
											2018-10-31 12:10:49 +01:00
+								* Tremendous speed improvement for `as.mo` (and subsequently all `mo_*` functions), as empty values wil be ignored *a priori*
-												new verbose

											
										
										
											2018-10-19 00:17:03 +02:00
+								* Fewer than 3 characters as input for `as.mo` will return NA
-												support A. species for as.mo, cleanup

											
										
										
											2018-11-24 20:25:09 +01:00
+								* Function `as.mo` (and all `mo_*` wrappers) now supports genus abbreviations with "species" attached
 								  ```r
 								  as.mo("E. species")        # B_ESCHR
 								  mo_fullname("E. spp.")     # "Escherichia species"
 								  as.mo("S. spp")            # B_STPHY
 								  mo_fullname("S. species")  # "Staphylococcus species"
 								  ```
-												parameter combine_IR

											
										
										
											2018-10-16 09:59:31 +02:00
+								* Added parameter `combine_IR` (TRUE/FALSE) to functions `portion_df` and `count_df`, to indicate that all values of I and R must be merged into one, so the output only consists of S vs. IR (susceptible vs. non-susceptible)
-												speed improvement as.mo, freq title

											
										
										
											2018-10-31 12:10:49 +01:00
+								* Fix for `portion_*(..., as_percent = TRUE)` when minimal number of isolates would not be met
-												fix for as.mo, added also_single_tested

											
										
										
											2018-10-19 13:53:31 +02:00
+								* Added parameter `also_single_tested` for `portion_*` and `count_*` functions to also include cases where not all antibiotics were tested but at least one of the tested antibiotics includes the target antimicribial interpretation, see `?portion`
-												count_all and some fixes

											
										
										
											2018-10-12 16:35:18 +02:00
+								* Using `portion_*` functions now throws a warning when total available isolate is below parameter `minimum`
-												mdro and 1st isolate improvements

											
										
										
											2018-10-23 11:15:05 +02:00
+								* Functions `as.mo`, `as.rsi`, `as.mic`, `as.atc` and `freq` will not set package name as attribute anymore
 								* Frequency tables - `freq()`:
-												grouping var for freq

											
										
										
											2018-11-06 16:41:59 +01:00
+								  * Support for grouping variables, test with:
 								    ```r
 								    septic_patients %>%
 								      group_by(hospital_id) %>%
 								      freq(gender)
 								    ```
-												unit test read.4d, unselecting freq cols

											
										
										
											2018-11-19 13:00:22 +01:00
+								  * Support for (un)selecting columns:
 								    ```r
 								    septic_patients %>%
 								      freq(hospital_id) %>%
 								      select(-count, -cum_count) # only get item, percent, cum_percent
 								    ```
-												eucast update

											
										
										
											2018-11-01 20:23:33 +01:00
+								  * Check for `hms::is.hms`
-												mdro and 1st isolate improvements

											
										
										
											2018-10-23 11:15:05 +02:00
+								  * Now prints in markdown at default in non-interactive sessions
 								  * No longer adds the factor level column and sorts factors on count again
 								  * Support for class `difftime`
-												support A. species for as.mo, cleanup

											
										
										
											2018-11-24 20:25:09 +01:00
+								  * New parameter `na`, to choose which character to print for empty values
 								  * New parameter `header` to turn the header info off (default when `markdown = TRUE`)
 								  * New parameter `title` to manually setbthe title of the frequency table
-												mdro and 1st isolate improvements

											
										
										
											2018-10-23 11:15:05 +02:00
+								* `first_isolate` now tries to find columns to use as input when parameters are left blank
-												unit test read.4d, unselecting freq cols

											
										
										
											2018-11-19 13:00:22 +01:00
+								* Improvements for MDRO algorithm (function `mdro`)
-												new EUCAST rules: clinical breakpoints

											
										
										
											2018-10-17 17:32:34 +02:00
+								* Data set `septic_patients` is now a `data.frame`, not a tibble anymore
-												CRAN fixes for release 0.4.0
https://cran.r-project.org/web/checks/check_results_AMR.html

											
										
										
											2018-10-09 13:53:33 +02:00
+								* Removed diacritics from all authors (columns `microorganisms$ref` and `microorganisms.old$ref`) to comply with CRAN policy to only allow ASCII characters
-												update NEWS

											
										
										
											2018-10-10 10:50:19 +02:00
+								* Fix for `mo_property` not working properly
-												MDRO update

											
										
										
											2018-11-16 20:50:50 +01:00
+								* Fix for `eucast_rules` where some Streptococci would become ceftazidime R in EUCAST rule 4.5
-												count_all and some fixes

											
										
										
											2018-10-12 16:35:18 +02:00
+								* Support for named vectors of class `mo`, useful for `top_freq()`
-												breaks param, tidyr dep change, freq markdown

											
										
										
											2018-10-22 12:32:59 +02:00
+								* `ggplot_rsi` and `scale_y_percent` have `breaks` parameter
-												count_all and some fixes

											
										
										
											2018-10-12 16:35:18 +02:00
+								* AI improvements for `as.mo`:
 								  * `"CRS"` -> *Stenotrophomonas maltophilia*
 								  * `"CRSM"` -> *Stenotrophomonas maltophilia*
 								  * `"MSSA"` -> *Staphylococcus aureus*
 								  * `"MSSE"` -> *Staphylococcus epidermidis*
 								* Fix for `join` functions
-												speed improvement is.rsi.eligible

											
										
										
											2018-11-02 14:55:29 +01:00
+								* Speed improvement for `is.rsi.eligible`, now 15-20 times faster
-												breaks param, tidyr dep change, freq markdown

											
										
										
											2018-10-22 12:32:59 +02:00
+								* In `g.test`, when `sum(x)` is below 1000 or any of the expected values is below 5, Fisher's Exact Test will be suggested
 								* `ab_name` will try to fall back on `as.atc` when no results are found
-												read.4D improvements

											
										
										
											2018-11-15 12:42:35 +01:00
+								* Removed the addin to view data sets
-												support A. species for as.mo, cleanup

											
										
										
											2018-11-24 20:25:09 +01:00
+								* Percentages will now will rounded more logically (e.g. in `freq` function)
-												update NEWS

											
										
										
											2018-10-10 10:50:19 +02:00
 								#### Other
-												new EUCAST rules: clinical breakpoints

											
										
										
											2018-10-17 17:32:34 +02:00
+								* New dependency on package `crayon`, to support formatted text in the console
-												breaks param, tidyr dep change, freq markdown

											
										
										
											2018-10-22 12:32:59 +02:00
+								* Dependency `tidyr` is now mandatory (went to `Import` field) since `portion_df` and `count_df` rely on it
-												update NEWS

											
										
										
											2018-10-10 10:50:19 +02:00
+								* Updated vignettes to comply with README
-												CRAN fixes for release 0.4.0
https://cran.r-project.org/web/checks/check_results_AMR.html

											
										
										
											2018-10-09 13:53:33 +02:00
-												new website, freq updates

											
										
										
											2018-12-29 22:24:19 +01:00
+								# AMR 0.4.0
-												count_* functions

											
										
										
											2018-08-22 00:02:26 +02:00
 								#### New
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
+								* The data set `microorganisms` now contains **all microbial taxonomic data from ITIS** (kingdoms Bacteria, Fungi and Protozoa), the Integrated Taxonomy Information System, available via https://itis.gov. The data set now contains more than 18,000 microorganisms with all known bacteria, fungi and protozoa according ITIS with genus, species, subspecies, family, order, class, phylum and subkingdom. The new data set `microorganisms.old` contains all previously known taxonomic names from those kingdoms.
-												authors from ITIS, diff for freq

											
										
										
											2018-10-01 11:39:43 +02:00
+								* New functions based on the existing function `mo_property`:
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
+								  * Taxonomic names: `mo_phylum`, `mo_class`, `mo_order`, `mo_family`, `mo_genus`, `mo_species`, `mo_subspecies`
 								  * Semantic names: `mo_fullname`, `mo_shortname`
-												authors from ITIS, diff for freq

											
										
										
											2018-10-01 11:39:43 +02:00
+								  * Microbial properties: `mo_type`, `mo_gramstain`
-												renamed year columns to ref

											
										
										
											2018-10-01 14:44:40 +02:00
+								  * Author and year: `mo_ref`
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
-												diff for freq, fix for mo_shortname

											
										
										
											2018-09-29 21:54:32 +02:00
+								  They also come with support for German, Dutch, French, Italian, Spanish and Portuguese:
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
+								  ```r
 								  mo_gramstain("E. coli")
 								  # [1] "Gram negative"
-												authors from ITIS, diff for freq

											
										
										
											2018-10-01 11:39:43 +02:00
+								  mo_gramstain("E. coli", language = "de") # German
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
+								  # [1] "Gramnegativ"
-												authors from ITIS, diff for freq

											
										
										
											2018-10-01 11:39:43 +02:00
+								  mo_gramstain("E. coli", language = "es") # Spanish
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
+								  # [1] "Gram negativo"
-												diff for freq, fix for mo_shortname

											
										
										
											2018-09-29 21:54:32 +02:00
+								  mo_fullname("S. group A", language = "pt") # Portuguese
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
+								  # [1] "Streptococcus grupo A"
 								  ```
-												renamed year columns to ref

											
										
										
											2018-10-01 14:44:40 +02:00
+								  Furthermore, former taxonomic names will give a note about the current taxonomic name:
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
+								  ```r
-												renamed year columns to ref

											
										
										
											2018-10-01 14:44:40 +02:00
+								  mo_gramstain("Esc blattae")
-												authors from ITIS, diff for freq

											
										
										
											2018-10-01 11:39:43 +02:00
+								  # Note: 'Escherichia blattae' (Burgess et al., 1973) was renamed 'Shimwellia blattae' (Priest and Barker, 2010)
-												renamed year columns to ref

											
										
										
											2018-10-01 14:44:40 +02:00
+								  # [1] "Gram negative"
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
+								  ```
-												quasiquotation, alpha for geom_rsi

											
										
										
											2018-08-23 00:40:36 +02:00
+								* Functions `count_R`, `count_IR`, `count_I`, `count_SI` and `count_S` to selectively count resistant or susceptible isolates
-new trade names, added ab_tradenames

											
										
										
											2018-08-29 12:27:37 +02:00
+								  * Extra function `count_df` (which works like `portion_df`) to get all counts of S, I and R of a data set with antibiotic columns, with support for grouped variables
-												small change in mutate example
											
										
										
											2018-08-22 12:55:05 +02:00
+								* Function `is.rsi.eligible` to check for columns that have valid antimicrobial results, but do not have the `rsi` class yet. Transform the columns of your raw data with: `data %>% mutate_if(is.rsi.eligible, as.rsi)`
-												small as.mo fix

											
										
										
											2019-03-06 14:39:02 +01:00
+								* Functions `as.mo` and `is.mo` as replacements for `as.bactid` and `is.bactid` (since the `microoganisms` data set not only contains bacteria). These last two functions are deprecated and will be removed in a future release. The `as.mo` function determines microbial IDs using intelligent rules:
-												algorithm improvement

											
										
										
											2018-09-14 10:31:21 +02:00
+								  ```r
 								  as.mo("E. coli")
-												diff for freq, fix for mo_shortname

											
										
										
											2018-09-29 21:54:32 +02:00
+								  # [1] B_ESCHR_COL
-												algorithm improvement

											
										
										
											2018-09-14 10:31:21 +02:00
+								  as.mo("MRSA")
-												diff for freq, fix for mo_shortname

											
										
										
											2018-09-29 21:54:32 +02:00
+								  # [1] B_STPHY_AUR
-												algorithm improvement

											
										
										
											2018-09-14 10:31:21 +02:00
+								  as.mo("S group A")
-												diff for freq, fix for mo_shortname

											
										
										
											2018-09-29 21:54:32 +02:00
+								  # [1] B_STRPTC_GRA
-												algorithm improvement

											
										
										
											2018-09-14 10:31:21 +02:00
+								  ```
-												authors from ITIS, diff for freq

											
										
										
											2018-10-01 11:39:43 +02:00
+								  And with great speed too - on a quite regular Linux server from 2007 it takes us less than 0.02 seconds to transform 25,000 items:
-												algorithm improvement

											
										
										
											2018-09-14 10:31:21 +02:00
+								  ```r
 								  thousands_of_E_colis <- rep("E. coli", 25000)
 								  microbenchmark::microbenchmark(as.mo(thousands_of_E_colis), unit = "s")
 								  # Unit: seconds
-												small improvement for is.rsi.eligible, more unit tests

											
										
										
											2018-09-14 11:54:01 +02:00
+								  #         min       median         max  neval
-												authors from ITIS, diff for freq

											
										
										
											2018-10-01 11:39:43 +02:00
+								  #  0.01817717  0.01843957  0.03878077    100
-												algorithm improvement

											
										
										
											2018-09-14 10:31:21 +02:00
+								  ```
-												authors from ITIS, diff for freq

											
										
										
											2018-10-01 11:39:43 +02:00
+								* Added parameter `reference_df` for `as.mo`, so users can supply their own microbial IDs, name or codes as a reference table
-												replaced bactid by mo

											
										
										
											2018-08-31 13:36:19 +02:00
+								* Renamed all previous references to `bactid` to `mo`, like:
 								  * Column names inputs of `EUCAST_rules`, `first_isolate` and `key_antibiotics`
 								  * Column names of datasets `microorganisms` and `septic_patients`
 								  * All old syntaxes will still work with this version, but will throw warnings
-												add labels_rsi_count

											
										
										
											2018-09-16 22:11:17 +02:00
+								* Function `labels_rsi_count` to print datalabels on a RSI `ggplot2` model
-												atc and bactid functions, readme update

											
										
										
											2018-08-25 22:01:14 +02:00
+								* Functions `as.atc` and `is.atc` to transform/look up antibiotic ATC codes as defined by the WHO. The existing function `guess_atc` is now an alias of `as.atc`.
-												first inclusion of ITIS data

											
										
										
											2018-09-24 23:33:29 +02:00
-												added prevalence column and alterted as.mo algorith to use it, added ab_name as alias

											
										
										
											2018-09-16 16:43:29 +02:00
+								* Function `ab_property` and its aliases: `ab_name`, `ab_tradenames`, `ab_certe`, `ab_umcg` and `ab_trivial_nl`
-new trade names, added ab_tradenames

											
										
										
											2018-08-29 12:27:37 +02:00
+								* Introduction to AMR as a vignette
-												diff for freq, fix for mo_shortname

											
										
										
											2018-09-29 21:54:32 +02:00
+								* Removed clipboard functions as it violated the CRAN policy
 								* Renamed `septic_patients$sex` to `septic_patients$gender`
-												count_* functions

											
										
										
											2018-08-22 00:02:26 +02:00
 								#### Changed
-new trade names, added ab_tradenames

											
										
										
											2018-08-29 12:27:37 +02:00
+								* Added three antimicrobial agents to the `antibiotics` data set: Terbinafine (D01BA02), Rifaximin (A07AA11) and Isoconazole (D01AC05)
 								* Added 163 trade names to the `antibiotics` data set, it now contains 298 different trade names in total, e.g.:
 								  ```r
 								  ab_official("Bactroban")
 								  # [1] "Mupirocin"
-												added prevalence column and alterted as.mo algorith to use it, added ab_name as alias

											
										
										
											2018-09-16 16:43:29 +02:00
+								  ab_name(c("Bactroban", "Amoxil", "Zithromax", "Floxapen"))
-new trade names, added ab_tradenames

											
										
										
											2018-08-29 12:27:37 +02:00
+								  # [1] "Mupirocin" "Amoxicillin" "Azithromycin" "Flucloxacillin"
 								  ab_atc(c("Bactroban", "Amoxil", "Zithromax", "Floxapen"))
 								  # [1] "R01AX06" "J01CA04" "J01FA10" "J01CF05"
 								  ```
-												support for portuguese, language determination based on system

											
										
										
											2018-09-08 16:06:47 +02:00
+								* For `first_isolate`, rows will be ignored when there's no species available
-												replaced bactid by mo

											
										
										
											2018-08-31 13:36:19 +02:00
+								* Function `ratio` is now deprecated and will be removed in a future release, as it is not really the scope of this package
-												ggplot_rsi example update, more unit tests

											
										
										
											2018-08-29 16:25:57 +02:00
+								* Fix for `as.mic` for values ending in zeroes after a real number
-												small improvement for is.rsi.eligible, more unit tests

											
										
										
											2018-09-14 11:54:01 +02:00
+								* Small fix where *B. fragilis* would not be found in the `microorganisms.umcg` data set
-												added prevalence column and alterted as.mo algorith to use it, added ab_name as alias

											
										
										
											2018-09-16 16:43:29 +02:00
+								* Added `prevalence` column to the `microorganisms` data set
-												count_* functions

											
										
										
											2018-08-22 00:02:26 +02:00
+								* Added parameters `minimum` and `as_percent` to `portion_df`
-												unit tests

											
										
										
											2018-08-23 01:01:50 +02:00
+								* Support for quasiquotation in the functions series `count_*` and `portions_*`, and `n_rsi`. This allows to check for more than 2 vectors or columns.
-												ab_* functions, mo_* functions, 180 new microorganisms, speed improvement for bactid

											
										
										
											2018-08-28 13:51:13 +02:00
+								  ```r
-												news update

											
										
										
											2018-09-03 11:22:28 +02:00
+								  septic_patients %>% select(amox, cipr) %>% count_IR()
 								  # which is the same as:
 								  septic_patients %>% count_IR(amox, cipr)
-												ab_* functions, mo_* functions, 180 new microorganisms, speed improvement for bactid

											
										
										
											2018-08-28 13:51:13 +02:00
+								  septic_patients %>% portion_S(amcl)
 								  septic_patients %>% portion_S(amcl, gent)
 								  septic_patients %>% portion_S(amcl, gent, pita)
 								  ```
-												count_* functions

											
										
										
											2018-08-22 00:02:26 +02:00
+								* Edited `ggplot_rsi` and `geom_rsi` so they can cope with `count_df`. The new `fun` parameter has value `portion_df` at default, but can be set to `count_df`.
-												quasiquotation, alpha for geom_rsi

											
										
										
											2018-08-23 00:40:36 +02:00
+								* Fix for `ggplot_rsi` when the `ggplot2` package was not loaded
-												add labels_rsi_count

											
										
										
											2018-09-16 22:11:17 +02:00
+								* Added datalabels function `labels_rsi_count` to `ggplot_rsi`
-												geom_rsi - any parameter

											
										
										
											2018-08-23 21:27:15 +02:00
+								* Added possibility to set any parameter to `geom_rsi` (and `ggplot_rsi`) so you can set your own preferences
-												fix for joins

											
										
										
											2018-09-03 10:04:49 +02:00
+								* Fix for joins, where predefined suffices would not be honoured
-												support for French and Italian, added quote to freq

											
										
										
											2018-09-10 15:45:25 +02:00
+								* Added parameter `quote` to the `freq` function
-												authors from ITIS, diff for freq

											
										
										
											2018-10-01 11:39:43 +02:00
+								* Added generic function `diff` for frequency tables
-												add shortest and longest to freq for characters

											
										
										
											2018-09-17 09:42:09 +02:00
+								* Added longest en shortest character length in the frequency table (`freq`) header of class `character`
-												support for French and Italian, added quote to freq

											
										
										
											2018-09-10 15:45:25 +02:00
+								* Support for types (classes) list and matrix for `freq`
-												removed ratio, better rsi_calc, update for freq

											
										
										
											2018-08-24 11:08:20 +02:00
+								  ```r
-												diff for freq, fix for mo_shortname

											
										
										
											2018-09-29 21:54:32 +02:00
+								  my_matrix = with(septic_patients, matrix(c(age, gender), ncol = 2))
-												removed ratio, better rsi_calc, update for freq

											
										
										
											2018-08-24 11:08:20 +02:00
+								  freq(my_matrix)
 								  ```
-												small improvement for is.rsi.eligible, more unit tests

											
										
										
											2018-09-14 11:54:01 +02:00
+								  For lists, subsetting is possible:
 								  ```r
-												diff for freq, fix for mo_shortname

											
										
										
											2018-09-29 21:54:32 +02:00
+								  my_list = list(age = septic_patients$age, gender = septic_patients$gender)
-												small improvement for is.rsi.eligible, more unit tests

											
										
										
											2018-09-14 11:54:01 +02:00
+								  my_list %>% freq(age)
-												diff for freq, fix for mo_shortname

											
										
										
											2018-09-29 21:54:32 +02:00
+								  my_list %>% freq(gender)
-												small improvement for is.rsi.eligible, more unit tests

											
										
										
											2018-09-14 11:54:01 +02:00
+								  ```
-												ggplot_rsi example update, more unit tests

											
										
										
											2018-08-29 16:25:57 +02:00
-												removed ratio, better rsi_calc, update for freq

											
										
										
											2018-08-24 11:08:20 +02:00
+								#### Other
 								* More unit tests to ensure better integrity of functions
-												count_* functions

											
										
										
											2018-08-22 00:02:26 +02:00
-												new website, freq updates

											
										
										
											2018-12-29 22:24:19 +01:00
+								# AMR 0.3.0
-												more unit tests

											
										
										
											2018-07-30 00:57:49 +02:00
-												top_freq

											
										
										
											2018-06-20 14:47:37 +02:00
+								#### New
-												new ggplot enhancement

											
										
										
											2018-08-11 21:30:00 +02:00
+								* **BREAKING**: `rsi_df` was removed in favour of new functions `portion_R`, `portion_IR`, `portion_I`, `portion_SI` and `portion_S` to selectively calculate resistance or susceptibility. These functions are 20 to 30 times faster than the old `rsi` function. The old function still works, but is deprecated.
-												ggplot_rsi improvements

											
										
										
											2018-08-13 16:42:37 +02:00
+								  * New function `portion_df` to get all portions of S, I and R of a data set with antibiotic columns, with support for grouped variables
-												keyab fixes

											
										
										
											2018-07-17 19:51:09 +02:00
+								* **BREAKING**: the methodology for determining first weighted isolates was changed. The antibiotics that are compared between isolates (call *key antibiotics*) to include more first isolates (afterwards called first *weighted* isolates) are now as follows:
-												update to septic_patients, speed improvements

											
										
										
											2018-07-25 14:17:04 +02:00
+								  * Universal: amoxicillin, amoxicillin/clavlanic acid, cefuroxime, piperacillin/tazobactam, ciprofloxacin,  trimethoprim/sulfamethoxazole
 								  * Gram-positive: vancomycin, teicoplanin, tetracycline, erythromycin, oxacillin, rifampicin
 								  * Gram-negative: gentamicin, tobramycin, colistin, cefotaxime, ceftazidime, meropenem
-												new ggplot enhancement

											
										
										
											2018-08-11 21:30:00 +02:00
+								* Support for `ggplot2`
 								  * New functions `geom_rsi`, `facet_rsi`, `scale_y_percent`, `scale_rsi_colours` and `theme_rsi`
 								  * New wrapper function `ggplot_rsi` to apply all above functions on a data set:
 								    * `septic_patients %>% select(tobr, gent) %>% ggplot_rsi` will show portions of S, I and R immediately in a pretty plot
-												ggplot_rsi improvements

											
										
										
											2018-08-13 16:42:37 +02:00
+								    * Support for grouped variables, see `?ggplot_rsi`
-												Becker classification
Lancefield classification
Added Lancefield groups to `microorganisms` data set

											
										
										
											2018-08-02 13:15:45 +02:00
+								* Determining bacterial ID:
 								  * New functions `as.bactid` and `is.bactid` to transform/ look up microbial ID's.
 								  * The existing function `guess_bactid` is now an alias of `as.bactid`
 								  * New Becker classification for *Staphylococcus* to categorise them into Coagulase Negative *Staphylococci* (CoNS) and Coagulase Positve *Staphylococci* (CoPS)
 								  * New Lancefield classification for *Streptococcus* to categorise them into Lancefield groups
-												Welcome C++!

											
										
										
											2018-07-13 17:23:46 +02:00
+								* For convience, new descriptive statistical functions `kurtosis` and `skewness` that are lacking in base R - they are generic functions and have support for vectors, data.frames and matrices
-												keyab fixes

											
										
										
											2018-07-17 19:51:09 +02:00
+								* Function `g.test` to perform the Χ<sup>2</sup> distributed [*G*-test](https://en.wikipedia.org/wiki/G-test), which use is the same as `chisq.test`
-												few extra tests

											
										
										
											2018-08-24 14:18:38 +02:00
+								* ~~Function `ratio` to transform a vector of values to a preset ratio~~
 								  * ~~For example: `ratio(c(10, 500, 10), ratio = "1:2:1")` would return `130, 260, 130`~~
-												keyab fixes

											
										
										
											2018-07-17 19:51:09 +02:00
+								* Support for Addins menu in RStudio to quickly insert `%in%` or `%like%` (and give them keyboard shortcuts), or to view the datasets that come with this package
 								* Function `p.symbol` to transform p values to their related symbols: `0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1`
 								* Functions `clipboard_import` and `clipboard_export` as helper functions to quickly copy and paste from/to software like Excel and SPSS. These functions use the `clipr` package, but are a little altered to also support headless Linux servers (so you can use it in RStudio Server)
-												kurtosis, skewness, start with ML

											
										
										
											2018-07-08 22:14:55 +02:00
+								* New for frequency tables (function `freq`):
 								  * A vignette to explain its usage
-												rsi for freq

											
										
										
											2018-08-01 22:37:28 +02:00
+								  * Support for `rsi` (antimicrobial resistance) to use as input
-												freq: support for table

											
										
										
											2018-07-09 14:02:58 +02:00
+								  * Support for `table` to use as input: `freq(table(x, y))`
-												kurtosis, skewness, start with ML

											
										
										
											2018-07-08 22:14:55 +02:00
+								  * Support for existing functions `hist` and `plot` to use a frequency table as input: `hist(freq(df$age))`
-												support format.freq

											
										
										
											2018-07-16 16:41:48 +02:00
+								  * Support for `as.vector`, `as.data.frame`, `as_tibble` and `format`
-												kurtosis, skewness, start with ML

											
										
										
											2018-07-08 22:14:55 +02:00
+								  * Support for quasiquotation: `freq(mydata, mycolumn)` is the same as `mydata %>% freq(mycolumn)`
 								  * Function `top_freq` function to return the top/below *n* items as vector
 								  * Header of frequency tables now also show Mean Absolute Deviaton (MAD) and Interquartile Range (IQR)
 								  * Possibility to globally set the default for the amount of items to print, with `options(max.print.freq = n)` where *n* is your preset value
-												fix for printing tibbles, improve guess_bactid

											
										
										
											2018-06-08 12:06:54 +02:00
-												top_freq

											
										
										
											2018-06-20 14:47:37 +02:00
+								#### Changed
-												abname improvement, small fixes

											
										
										
											2018-08-13 11:00:53 +02:00
+								* Improvements for forecasting with `resistance_predict` and added more examples
 								* More antibiotics added as parameters for EUCAST rules
-												remove reshape2 dependency

											
										
										
											2018-07-30 01:18:40 +02:00
+								* Updated version of the `septic_patients` data set to better reflect the reality
-												remove print function, out of scope

											
										
										
											2018-07-11 20:12:19 +02:00
+								* Pretty printing for tibbles removed as it is not really the scope of this package
-												rsi for freq

											
										
										
											2018-08-01 22:37:28 +02:00
+								* Printing of `mic` and `rsi` classes now returns all values - use `freq` to check distributions
-												speed improvements

											
										
										
											2018-07-15 22:56:41 +02:00
+								* Improved speed of key antibiotics comparison for determining first isolates
-												new col names for `key_antibiotics`

											
										
										
											2018-07-19 15:11:23 +02:00
+								* Column names for the `key_antibiotics` function are now generic: 6 for broadspectrum ABs, 6 for Gram-positive specific and 6 for Gram-negative specific ABs
-												abname improvement, small fixes

											
										
										
											2018-08-13 11:00:53 +02:00
+								* Speed improvement for the `abname` function
-												addins and small improvements to microorganisms dataset

											
										
										
											2018-07-04 17:20:03 +02:00
+								* `%like%` now supports multiple patterns
-												new g.test() and edited freq()

											
										
										
											2018-07-01 21:40:37 +02:00
+								* Frequency tables are now actual `data.frame`s with altered console printing to make it look like a frequency table. Because of this, the parameter `toConsole` is not longer needed.
-												new class bactid

											
										
										
											2018-07-23 14:14:03 +02:00
+								* Fix for `freq` where the class of an item would be lost
 								* Small translational improvements to the `septic_patients` dataset and the column `bactid` now has the new class `"bactid"`
 								* Small improvements to the `microorganisms` dataset (especially for *Salmonella*) and the column `bactid` now has the new class `"bactid"`
-												extra unit tests, add row.names to freq

											
										
										
											2018-06-19 15:20:14 +02:00
+								* Combined MIC/RSI values will now be coerced by the `rsi` and `mic` functions:
 								  * `as.rsi("<=0.002; S")` will return `S`
 								  * `as.mic("<=0.002; S")` will return `<=0.002`
 								* Now possible to coerce MIC values with a space between operator and value, i.e. `as.mic("<= 0.002")` now works
-												update to septic_patients, speed improvements

											
										
										
											2018-07-25 14:17:04 +02:00
+								* Classes `rsi` and `mic` do not add the attribute `package.version` anymore
-												extra unit tests, add row.names to freq

											
										
										
											2018-06-19 15:20:14 +02:00
+								* Added `"groups"` option for `atc_property(..., property)`. It will return a vector of the ATC hierarchy as defined by the [WHO](https://www.whocc.no/atc/structure_and_principles/). The new function `atc_groups` is a convenient wrapper around this.
 								* Build-in host check for `atc_property` as it requires the host set by `url` to be responsive
-												atc_groups

											
										
										
											2018-06-19 10:05:38 +02:00
+								* Improved `first_isolate` algorithm to exclude isolates where bacteria ID or genus is unavailable
 								* Fix for warning *hybrid evaluation forced for row_number* ([`924b62`](https://github.com/tidyverse/dplyr/commit/924b62)) from the `dplyr` package v0.7.5 and above
-												new class bactid

											
										
										
											2018-07-23 14:14:03 +02:00
+								* Support for empty values and for 1 or 2 columns as input for `guess_bactid` (now called `as.bactid`)
 								  * So `yourdata %>% select(genus, species) %>% as.bactid()` now also works
-												more mic classes

											
										
										
											2018-07-28 10:48:27 +02:00
+								* Other small fixes
-												added vignette of freq

											
										
										
											2018-05-09 11:44:46 +02:00
-												new g.test, extra unit tests

											
										
										
											2018-07-10 12:27:07 +02:00
+								#### Other
-												new AppVeyor environment

											
										
										
											2018-08-14 08:57:17 +02:00
+								* Added integration tests (check if everything works as expected) for all releases of R 3.1 and higher
 								  * Linux and macOS: https://travis-ci.org/msberends/AMR
 								  * Windows: https://ci.appveyor.com/project/msberends/amr
-												thesis advisors

											
										
										
											2018-08-03 09:59:39 +02:00
+								* Added thesis advisors to DESCRIPTION file
-												new g.test, extra unit tests

											
										
										
											2018-07-10 12:27:07 +02:00
-												new website, freq updates

											
										
										
											2018-12-29 22:24:19 +01:00
+								# AMR 0.2.0
-												more unit tests

											
										
										
											2018-07-30 00:57:49 +02:00
-												MDRO, freq tables, new print format for tibbles

											
										
										
											2018-04-18 12:24:54 +02:00
+								#### New
-												Try to support older R versions

											
										
										
											2018-04-18 15:19:00 +02:00
+								* Full support for Windows, Linux and macOS
-												EUCAST rules for MDRO

											
										
										
											2018-04-25 15:33:58 +02:00
+								* Full support for old R versions, only R-3.0.0 (April 2013) or later is needed (needed packages may have other dependencies)
-												Added function `n_rsi`

											
										
										
											2018-05-02 14:56:25 +02:00
+								* Function `n_rsi` to count cases where antibiotic test results were available, to be used in conjunction with `dplyr::summarise`, see ?rsi
-												EUCAST rules for MDRO

											
										
										
											2018-04-25 15:33:58 +02:00
+								* Function `guess_bactid` to **determine the ID** of a microorganism based on genus/species or known abbreviations like MRSA
 								* Function `guess_atc` to **determine the ATC** of an antibiotic based on name, trade name, or known abbreviations
 								* Function `freq` to create **frequency tables**, with additional info in a header
 								* Function `MDRO` to **determine Multi Drug Resistant Organisms (MDRO)** with support for country-specific guidelines.
 								  * [Exceptional resistances defined by EUCAST](http://www.eucast.org/expert_rules_and_intrinsic_resistance) are also supported instead of countries alone
 								  * Functions `BRMO` and `MRGN` are wrappers for Dutch and German guidelines, respectively
-												Try to support older R versions

											
										
										
											2018-04-18 15:19:00 +02:00
+								* New algorithm to determine weighted isolates, can now be `"points"` or `"keyantibiotics"`, see `?first_isolate`
-												EUCAST rules for MDRO

											
										
										
											2018-04-25 15:33:58 +02:00
+								* New print format for `tibble`s and `data.table`s
-												MDRO, freq tables, new print format for tibbles

											
										
										
											2018-04-18 12:24:54 +02:00
 								#### Changed
-												Added function `n_rsi`

											
										
										
											2018-05-02 14:56:25 +02:00
+								* Fixed `rsi` class for vectors that contain only invalid antimicrobial interpretations
-												Try to support older R versions

											
										
										
											2018-04-18 15:19:00 +02:00
+								* Renamed dataset `ablist` to `antibiotics`
 								* Renamed dataset `bactlist` to `microorganisms`
-												EUCAST rules for MDRO

											
										
										
											2018-04-25 15:33:58 +02:00
+								* Added common abbreviations and trade names to the `antibiotics` dataset
 								* Added more microorganisms to the `microorganisms` dataset
-												Try to support older R versions

											
										
										
											2018-04-18 15:19:00 +02:00
+								* Added analysis examples on help page of dataset `septic_patients`
 								* Added support for character vector in `join` functions
 								* Added warnings when a join results in more rows after than before the join
 								* Altered `%like%` to make it case insensitive
-												EUCAST rules for MDRO

											
										
										
											2018-04-25 15:33:58 +02:00
+								* For parameters of functions `first_isolate` and `EUCAST_rules` column names are now case-insensitive
 								* Functions `as.rsi` and `as.mic` now add the package name and version as attributes
-												MDRO, freq tables, new print format for tibbles

											
										
										
											2018-04-18 12:24:54 +02:00
 								#### Other
-												EUCAST rules for MDRO

											
										
										
											2018-04-25 15:33:58 +02:00
+								* Expanded `README.md` with more examples
 								* Added [ORCID](https://orcid.org) of authors to DESCRIPTION file
-												Try to support older R versions

											
										
										
											2018-04-18 15:19:00 +02:00
+								* Added unit testing with the `testthat` package
 								* Added build tests for Linux and macOS using Travis CI (https://travis-ci.org/msberends/AMR)
-												EUCAST rules for MDRO

											
										
										
											2018-04-25 15:33:58 +02:00
+								* Added line coverage checking using CodeCov (https://codecov.io/gh/msberends/AMR/tree/master/R)
-												MDRO, freq tables, new print format for tibbles

											
										
										
											2018-04-18 12:24:54 +02:00
-												new website, freq updates

											
										
										
											2018-12-29 22:24:19 +01:00
+								# AMR 0.1.1
-												more unit tests

											
										
										
											2018-07-30 00:57:49 +02:00
-												Try to support older R versions

											
										
										
											2018-04-18 15:19:00 +02:00
+								* `EUCAST_rules` applies for amoxicillin even if ampicillin is missing
 								* Edited column names to comply with GLIMS, the laboratory information system
 								* Added more valid MIC values
 								* Renamed 'Daily Defined Dose' to 'Defined Daily Dose'
 								* Added barplots for `rsi` and `mic` classes
-												MDRO, freq tables, new print format for tibbles

											
										
										
											2018-04-18 12:24:54 +02:00
-												new website, freq updates

											
										
										
											2018-12-29 22:24:19 +01:00
+								# AMR 0.1.0
-												more unit tests

											
										
										
											2018-07-30 00:57:49 +02:00
-												Try to support older R versions

											
										
										
											2018-04-18 15:19:00 +02:00
+								* First submission to CRAN.