25 KiB
Executable File
0.5.0.90xx (latest development version)
New
- BREAKING: removed deprecated functions, parameters and references to 'bactid'. Use
as.moto identify an MO code. - Support for
dplyrversion 0.8.0 - Function
mo_failuresto review values that could not be coerced to a valid MO code, usingas.mo. This latter function will now only show a maximum of 25 uncoerced values. - Function
mo_renamedto get a list of all returned values fromas.mothat have had taxonomic renaming - Function
ageto calculate the (patients) age in years - Function
age_groupsto split ages into custom or predefined groups (like children or elderly). This allows for easier demographic antimicrobial resistance analysis per age group. - Functions
filter_first_isolateandfilter_first_weighted_isolate()to shorten and fasten filtering on data sets with antimicrobial results, e.g.:is the same as:septic_patients %>% filter_first_isolate() # or filter_first_isolate(septic_patients)septic_patients %>% mutate(only_firsts = first_isolate(septic_patients, ...)) %>% filter(only_firsts == TRUE) %>% select(-only_firsts)
Changed
- Improvements for
as.mo:- Finds better results when input is in other languages
- Better handling for subspecies
- Better handling for Salmonellae
- There will be looked for uncertain results at default - these results will be returned with an informative warning
- Manual now contains more info about the algorithms
- Progress bar will be shown when it takes more than 3 seconds to get results
- Support for formatted console text
- Function
first_isolate:- Will now use a column named like "patid" for the patient ID (parameter
col_patientid), when this parameter was left blank - Will now use a column named like "key(...)ab" or "key(...)antibiotics" for the key antibiotics (parameter
col_keyantibiotics), when this parameter was left blank - Removed parameter
output_logical, the function will now always return a logical value - Renamed parameter
filter_specimentospecimen_group, although usingfilter_specimenwill still work
- Will now use a column named like "patid" for the patient ID (parameter
- A note to the manual pages of the
portionfunctions, that low counts can influence the outcome and that theportionfunctions may camouflage this, since they only return the portion (albeit being dependent on theminimumparameter) - Function
mo_taxonomynow contains the kingdom too - Function
first_isolatewill now use a column named like "patid" for the patient ID, when this parameter was left blank - Reduce false positives for
is.rsi.eligible - Summaries of class
mowill now return the top 3 and the unique count, e.g. usingsummary(mo) - Small text updates to summaries of class
rsiandmic - Frequency tables (
freqfunction):- Added header info for class
moto show unique count of families, genera and species - Now honours the
decimal.marksetting, which just likeformatdefaults togetOption("OutDec") - The new
big.markparameter will at default be","whendecimal.mark = "."and"."otherwise - Fix for header text where all observations are
NA - New parameter
droplevelsto exclude empty factor levels when input is a factor - Factor levels will be in header when present
- Added header info for class
- Function
scale_y_percentnow has thelimitsparameter - Automatic parameter filling for
mdro,key_antibioticsandeucast_rules - Updated examples for resistance prediction (
resistance_predictfunction)
Other
- Updated licence text to emphasise GPL 2.0 and that this is an R package.
0.5.0 (latest stable release)
Published on CRAN: 2018-11-30
New
- Repository moved to GitLab: https://gitlab.com/msberends/AMR
- Function
count_allto get all available isolates (that like allportion_*andcount_*functions also supportssummariseandgroup_by), the oldn_rsiis now an alias ofcount_all - Function
get_localeto determine language for language-dependent output for somemo_*functions. This is now the default value for theirlanguageparameter, by which the system language will be used at default. - Data sets
microorganismsDT,microorganisms.prevDT,microorganisms.unprevDTandmicroorganisms.oldDTto improve the speed ofas.mo. They are for reference only, since they are primarily for internal use ofas.mo. - Function
read.4Dto read from the 4D database of the MMB department of the UMCG - Functions
mo_authorsandmo_yearto get specific values about the scientific reference of a taxonomic entry
Changed
- Functions
MDRO,BRMO,MRGNandEUCAST_exceptional_phenotypeswere renamed tomdro,brmo,mrgnandeucast_exceptional_phenotypes EUCAST_ruleswas renamed toeucast_rules, the old function still exists as a deprecated function- Big changes to the
eucast_rulesfunction:- Now also applies rules from the EUCAST 'Breakpoint tables for bacteria', version 8.1, 2018, http://www.eucast.org/clinical_breakpoints/ (see Source of the function)
- New parameter
rulesto specify which rules should be applied (expert rules, breakpoints, others or all) - New parameter
verbosewhich can be set toTRUEto get very specific messages about which columns and rows were affected - Better error handling when rules cannot be applied (i.e. new values could not be inserted)
- The number of affected values will now only be measured once per row/column combination
- Data set
septic_patientsnow reflects these changes - Added parameter
pipefor piperacillin (J01CA12), also to themdrofunction - Small fixes to EUCAST clinical breakpoint rules
- Added column
kingdomto the microorganisms data set, and functionmo_kingdomto look up values - Tremendous speed improvement for
as.mo(and subsequently allmo_*functions), as empty values wil be ignored a priori - Fewer than 3 characters as input for
as.mowill return NA - Function
as.mo(and allmo_*wrappers) now supports genus abbreviations with "species" attachedas.mo("E. species") # B_ESCHR mo_fullname("E. spp.") # "Escherichia species" as.mo("S. spp") # B_STPHY mo_fullname("S. species") # "Staphylococcus species" - Added parameter
combine_IR(TRUE/FALSE) to functionsportion_dfandcount_df, to indicate that all values of I and R must be merged into one, so the output only consists of S vs. IR (susceptible vs. non-susceptible) - Fix for
portion_*(..., as_percent = TRUE)when minimal number of isolates would not be met - Added parameter
also_single_testedforportion_*andcount_*functions to also include cases where not all antibiotics were tested but at least one of the tested antibiotics includes the target antimicribial interpretation, see?portion - Using
portion_*functions now throws a warning when total available isolate is below parameterminimum - Functions
as.mo,as.rsi,as.mic,as.atcandfreqwill not set package name as attribute anymore - Frequency tables -
freq():- Support for grouping variables, test with:
septic_patients %>% group_by(hospital_id) %>% freq(gender) - Support for (un)selecting columns:
septic_patients %>% freq(hospital_id) %>% select(-count, -cum_count) # only get item, percent, cum_percent - Check for
hms::is.hms - Now prints in markdown at default in non-interactive sessions
- No longer adds the factor level column and sorts factors on count again
- Support for class
difftime - New parameter
na, to choose which character to print for empty values - New parameter
headerto turn the header info off (default whenmarkdown = TRUE) - New parameter
titleto manually setbthe title of the frequency table
- Support for grouping variables, test with:
first_isolatenow tries to find columns to use as input when parameters are left blank- Improvements for MDRO algorithm (function
mdro) - Data set
septic_patientsis now adata.frame, not a tibble anymore - Removed diacritics from all authors (columns
microorganisms$refandmicroorganisms.old$ref) to comply with CRAN policy to only allow ASCII characters - Fix for
mo_propertynot working properly - Fix for
eucast_ruleswhere some Streptococci would become ceftazidime R in EUCAST rule 4.5 - Support for named vectors of class
mo, useful fortop_freq() ggplot_rsiandscale_y_percenthavebreaksparameter- AI improvements for
as.mo:"CRS"-> Stenotrophomonas maltophilia"CRSM"-> Stenotrophomonas maltophilia"MSSA"-> Staphylococcus aureus"MSSE"-> Staphylococcus epidermidis
- Fix for
joinfunctions - Speed improvement for
is.rsi.eligible, now 15-20 times faster - In
g.test, whensum(x)is below 1000 or any of the expected values is below 5, Fisher's Exact Test will be suggested ab_namewill try to fall back onas.atcwhen no results are found- Removed the addin to view data sets
- Percentages will now will rounded more logically (e.g. in
freqfunction)
Other
- New dependency on package
crayon, to support formatted text in the console - Dependency
tidyris now mandatory (went toImportfield) sinceportion_dfandcount_dfrely on it - Updated vignettes to comply with README
0.4.0
Published on CRAN: 2018-10-01
New
-
The data set
microorganismsnow contains all microbial taxonomic data from ITIS (kingdoms Bacteria, Fungi and Protozoa), the Integrated Taxonomy Information System, available via https://itis.gov. The data set now contains more than 18,000 microorganisms with all known bacteria, fungi and protozoa according ITIS with genus, species, subspecies, family, order, class, phylum and subkingdom. The new data setmicroorganisms.oldcontains all previously known taxonomic names from those kingdoms. -
New functions based on the existing function
mo_property:- Taxonomic names:
mo_phylum,mo_class,mo_order,mo_family,mo_genus,mo_species,mo_subspecies - Semantic names:
mo_fullname,mo_shortname - Microbial properties:
mo_type,mo_gramstain - Author and year:
mo_ref
They also come with support for German, Dutch, French, Italian, Spanish and Portuguese:
mo_gramstain("E. coli") # [1] "Gram negative" mo_gramstain("E. coli", language = "de") # German # [1] "Gramnegativ" mo_gramstain("E. coli", language = "es") # Spanish # [1] "Gram negativo" mo_fullname("S. group A", language = "pt") # Portuguese # [1] "Streptococcus grupo A"Furthermore, former taxonomic names will give a note about the current taxonomic name:
mo_gramstain("Esc blattae") # Note: 'Escherichia blattae' (Burgess et al., 1973) was renamed 'Shimwellia blattae' (Priest and Barker, 2010) # [1] "Gram negative" - Taxonomic names:
-
Functions
count_R,count_IR,count_I,count_SIandcount_Sto selectively count resistant or susceptible isolates- Extra function
count_df(which works likeportion_df) to get all counts of S, I and R of a data set with antibiotic columns, with support for grouped variables
- Extra function
-
Function
is.rsi.eligibleto check for columns that have valid antimicrobial results, but do not have thersiclass yet. Transform the columns of your raw data with:data %>% mutate_if(is.rsi.eligible, as.rsi) -
Functions
as.moandis.moas replacements foras.bactidandis.bactid(since themicrooganismsdata set not only contains bacteria). These last two functions are deprecated and will be removed in a future release. Theas.mofunction determines microbial IDs using Artificial Intelligence (AI):as.mo("E. coli") # [1] B_ESCHR_COL as.mo("MRSA") # [1] B_STPHY_AUR as.mo("S group A") # [1] B_STRPTC_GRAAnd with great speed too - on a quite regular Linux server from 2007 it takes us less than 0.02 seconds to transform 25,000 items:
thousands_of_E_colis <- rep("E. coli", 25000) microbenchmark::microbenchmark(as.mo(thousands_of_E_colis), unit = "s") # Unit: seconds # min median max neval # 0.01817717 0.01843957 0.03878077 100 -
Added parameter
reference_dfforas.mo, so users can supply their own microbial IDs, name or codes as a reference table -
Renamed all previous references to
bactidtomo, like:- Column names inputs of
EUCAST_rules,first_isolateandkey_antibiotics - Column names of datasets
microorganismsandseptic_patients - All old syntaxes will still work with this version, but will throw warnings
- Column names inputs of
-
Function
labels_rsi_countto print datalabels on a RSIggplot2model -
Functions
as.atcandis.atcto transform/look up antibiotic ATC codes as defined by the WHO. The existing functionguess_atcis now an alias ofas.atc. -
Function
ab_propertyand its aliases:ab_name,ab_tradenames,ab_certe,ab_umcgandab_trivial_nl -
Introduction to AMR as a vignette
-
Removed clipboard functions as it violated the CRAN policy
-
Renamed
septic_patients$sextoseptic_patients$gender
Changed
- Added three antimicrobial agents to the
antibioticsdata set: Terbinafine (D01BA02), Rifaximin (A07AA11) and Isoconazole (D01AC05) - Added 163 trade names to the
antibioticsdata set, it now contains 298 different trade names in total, e.g.:ab_official("Bactroban") # [1] "Mupirocin" ab_name(c("Bactroban", "Amoxil", "Zithromax", "Floxapen")) # [1] "Mupirocin" "Amoxicillin" "Azithromycin" "Flucloxacillin" ab_atc(c("Bactroban", "Amoxil", "Zithromax", "Floxapen")) # [1] "R01AX06" "J01CA04" "J01FA10" "J01CF05" - For
first_isolate, rows will be ignored when there's no species available - Function
ratiois now deprecated and will be removed in a future release, as it is not really the scope of this package - Fix for
as.micfor values ending in zeroes after a real number - Small fix where B. fragilis would not be found in the
microorganisms.umcgdata set - Added
prevalencecolumn to themicroorganismsdata set - Added parameters
minimumandas_percenttoportion_df - Support for quasiquotation in the functions series
count_*andportions_*, andn_rsi. This allows to check for more than 2 vectors or columns.septic_patients %>% select(amox, cipr) %>% count_IR() # which is the same as: septic_patients %>% count_IR(amox, cipr) septic_patients %>% portion_S(amcl) septic_patients %>% portion_S(amcl, gent) septic_patients %>% portion_S(amcl, gent, pita) - Edited
ggplot_rsiandgeom_rsiso they can cope withcount_df. The newfunparameter has valueportion_dfat default, but can be set tocount_df. - Fix for
ggplot_rsiwhen theggplot2package was not loaded - Added datalabels function
labels_rsi_counttoggplot_rsi - Added possibility to set any parameter to
geom_rsi(andggplot_rsi) so you can set your own preferences - Fix for joins, where predefined suffices would not be honoured
- Added parameter
quoteto thefreqfunction - Added generic function
difffor frequency tables - Added longest en shortest character length in the frequency table (
freq) header of classcharacter - Support for types (classes) list and matrix for
freqFor lists, subsetting is possible:my_matrix = with(septic_patients, matrix(c(age, gender), ncol = 2)) freq(my_matrix)my_list = list(age = septic_patients$age, gender = septic_patients$gender) my_list %>% freq(age) my_list %>% freq(gender)
Other
- More unit tests to ensure better integrity of functions
0.3.0
Published on CRAN: 2018-08-14
New
- BREAKING:
rsi_dfwas removed in favour of new functionsportion_R,portion_IR,portion_I,portion_SIandportion_Sto selectively calculate resistance or susceptibility. These functions are 20 to 30 times faster than the oldrsifunction. The old function still works, but is deprecated.- New function
portion_dfto get all portions of S, I and R of a data set with antibiotic columns, with support for grouped variables
- New function
- BREAKING: the methodology for determining first weighted isolates was changed. The antibiotics that are compared between isolates (call key antibiotics) to include more first isolates (afterwards called first weighted isolates) are now as follows:
- Universal: amoxicillin, amoxicillin/clavlanic acid, cefuroxime, piperacillin/tazobactam, ciprofloxacin, trimethoprim/sulfamethoxazole
- Gram-positive: vancomycin, teicoplanin, tetracycline, erythromycin, oxacillin, rifampicin
- Gram-negative: gentamicin, tobramycin, colistin, cefotaxime, ceftazidime, meropenem
- Support for
ggplot2- New functions
geom_rsi,facet_rsi,scale_y_percent,scale_rsi_coloursandtheme_rsi - New wrapper function
ggplot_rsito apply all above functions on a data set:septic_patients %>% select(tobr, gent) %>% ggplot_rsiwill show portions of S, I and R immediately in a pretty plot- Support for grouped variables, see
?ggplot_rsi
- New functions
- Determining bacterial ID:
- New functions
as.bactidandis.bactidto transform/ look up microbial ID's. - The existing function
guess_bactidis now an alias ofas.bactid - New Becker classification for Staphylococcus to categorise them into Coagulase Negative Staphylococci (CoNS) and Coagulase Positve Staphylococci (CoPS)
- New Lancefield classification for Streptococcus to categorise them into Lancefield groups
- New functions
- For convience, new descriptive statistical functions
kurtosisandskewnessthat are lacking in base R - they are generic functions and have support for vectors, data.frames and matrices - Function
g.testto perform the Χ2 distributed G-test, which use is the same aschisq.test Functionratioto transform a vector of values to a preset ratioFor example:ratio(c(10, 500, 10), ratio = "1:2:1")would return130, 260, 130
- Support for Addins menu in RStudio to quickly insert
%in%or%like%(and give them keyboard shortcuts), or to view the datasets that come with this package - Function
p.symbolto transform p values to their related symbols:0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 - Functions
clipboard_importandclipboard_exportas helper functions to quickly copy and paste from/to software like Excel and SPSS. These functions use thecliprpackage, but are a little altered to also support headless Linux servers (so you can use it in RStudio Server) - New for frequency tables (function
freq):- A vignette to explain its usage
- Support for
rsi(antimicrobial resistance) to use as input - Support for
tableto use as input:freq(table(x, y)) - Support for existing functions
histandplotto use a frequency table as input:hist(freq(df$age)) - Support for
as.vector,as.data.frame,as_tibbleandformat - Support for quasiquotation:
freq(mydata, mycolumn)is the same asmydata %>% freq(mycolumn) - Function
top_freqfunction to return the top/below n items as vector - Header of frequency tables now also show Mean Absolute Deviaton (MAD) and Interquartile Range (IQR)
- Possibility to globally set the default for the amount of items to print, with
options(max.print.freq = n)where n is your preset value
Changed
- Improvements for forecasting with
resistance_predictand added more examples - More antibiotics added as parameters for EUCAST rules
- Updated version of the
septic_patientsdata set to better reflect the reality - Pretty printing for tibbles removed as it is not really the scope of this package
- Printing of
micandrsiclasses now returns all values - usefreqto check distributions - Improved speed of key antibiotics comparison for determining first isolates
- Column names for the
key_antibioticsfunction are now generic: 6 for broadspectrum ABs, 6 for Gram-positive specific and 6 for Gram-negative specific ABs - Speed improvement for the
abnamefunction %like%now supports multiple patterns- Frequency tables are now actual
data.frames with altered console printing to make it look like a frequency table. Because of this, the parametertoConsoleis not longer needed. - Fix for
freqwhere the class of an item would be lost - Small translational improvements to the
septic_patientsdataset and the columnbactidnow has the new class"bactid" - Small improvements to the
microorganismsdataset (especially for Salmonella) and the columnbactidnow has the new class"bactid" - Combined MIC/RSI values will now be coerced by the
rsiandmicfunctions:as.rsi("<=0.002; S")will returnSas.mic("<=0.002; S")will return<=0.002
- Now possible to coerce MIC values with a space between operator and value, i.e.
as.mic("<= 0.002")now works - Classes
rsiandmicdo not add the attributepackage.versionanymore - Added
"groups"option foratc_property(..., property). It will return a vector of the ATC hierarchy as defined by the WHO. The new functionatc_groupsis a convenient wrapper around this. - Build-in host check for
atc_propertyas it requires the host set byurlto be responsive - Improved
first_isolatealgorithm to exclude isolates where bacteria ID or genus is unavailable - Fix for warning hybrid evaluation forced for row_number (
924b62) from thedplyrpackage v0.7.5 and above - Support for empty values and for 1 or 2 columns as input for
guess_bactid(now calledas.bactid)- So
yourdata %>% select(genus, species) %>% as.bactid()now also works
- So
- Other small fixes
Other
- Added integration tests (check if everything works as expected) for all releases of R 3.1 and higher
- Linux and macOS: https://travis-ci.org/msberends/AMR
- Windows: https://ci.appveyor.com/project/msberends/amr
- Added thesis advisors to DESCRIPTION file
0.2.0
Published on CRAN: 2018-05-03
New
- Full support for Windows, Linux and macOS
- Full support for old R versions, only R-3.0.0 (April 2013) or later is needed (needed packages may have other dependencies)
- Function
n_rsito count cases where antibiotic test results were available, to be used in conjunction withdplyr::summarise, see ?rsi - Function
guess_bactidto determine the ID of a microorganism based on genus/species or known abbreviations like MRSA - Function
guess_atcto determine the ATC of an antibiotic based on name, trade name, or known abbreviations - Function
freqto create frequency tables, with additional info in a header - Function
MDROto determine Multi Drug Resistant Organisms (MDRO) with support for country-specific guidelines.- Suggest your own via https://github.com/msberends/AMR/issues/new
- Exceptional resistances defined by EUCAST are also supported instead of countries alone
- Functions
BRMOandMRGNare wrappers for Dutch and German guidelines, respectively
- New algorithm to determine weighted isolates, can now be
"points"or"keyantibiotics", see?first_isolate - New print format for
tibbles anddata.tables
Changed
- Fixed
rsiclass for vectors that contain only invalid antimicrobial interpretations - Renamed dataset
ablisttoantibiotics - Renamed dataset
bactlisttomicroorganisms - Added common abbreviations and trade names to the
antibioticsdataset - Added more microorganisms to the
microorganismsdataset - Added analysis examples on help page of dataset
septic_patients - Added support for character vector in
joinfunctions - Added warnings when a join results in more rows after than before the join
- Altered
%like%to make it case insensitive - For parameters of functions
first_isolateandEUCAST_rulescolumn names are now case-insensitive - Functions
as.rsiandas.micnow add the package name and version as attributes
Other
- Expanded
README.mdwith more examples - Added ORCID of authors to DESCRIPTION file
- Added unit testing with the
testthatpackage - Added build tests for Linux and macOS using Travis CI (https://travis-ci.org/msberends/AMR)
- Added line coverage checking using CodeCov (https://codecov.io/gh/msberends/AMR/tree/master/R)
0.1.1
Published on CRAN: 2018-03-14
EUCAST_rulesapplies for amoxicillin even if ampicillin is missing- Edited column names to comply with GLIMS, the laboratory information system
- Added more valid MIC values
- Renamed 'Daily Defined Dose' to 'Defined Daily Dose'
- Added barplots for
rsiandmicclasses
0.1.0
Published on CRAN: 2018-02-22
- First submission to CRAN.