AMR/R/mo_source.R

# ==================================================================== #
# TITLE                                                                #
# Antimicrobial Resistance (AMR) Analysis for R                        #
#                                                                      #
# SOURCE                                                               #
# https://github.com/msberends/AMR                                     #
#                                                                      #
# LICENCE                                                              #
# (c) 2018-2020 Berends MS, Luz CF et al.                              #
# Developed at the University of Groningen, the Netherlands, in        #
# collaboration with non-profit organisations Certe Medical            #
# Diagnostics & Advice, and University Medical Center Groningen.       # 
#                                                                      #
# This R package is free software; you can freely use and distribute   #
# it for both personal and commercial purposes under the terms of the  #
# GNU General Public License version 2.0 (GNU GPL-2), as published by  #
# the Free Software Foundation.                                        #
# We created this package for both routine data analysis and academic  #
# research and it was publicly released in the hope that it will be    #
# useful, but it comes WITHOUT ANY WARRANTY OR LIABILITY.              #
#                                                                      #
# Visit our website for the full manual and a complete tutorial about  #
# how to conduct AMR analysis: https://msberends.github.io/AMR/        #
# ==================================================================== #

#' User-defined reference data set for microorganisms
#'
#' @description These functions can be used to predefine your own reference to be used in [as.mo()] and consequently all `mo_*` functions like [mo_genus()] and [mo_gramstain()].
#'
#' This is **the fastest way** to have your organisation (or analysis) specific codes picked up and translated by this package.
#' @inheritSection lifecycle Stable lifecycle
#' @param path location of your reference file, see Details. Can be `""`, `NULL` or `FALSE` to delete the reference file.
#' @rdname mo_source
#' @name mo_source
#' @aliases set_mo_source get_mo_source
#' @details The reference file can be a text file separated with commas (CSV) or tabs or pipes, an Excel file (either 'xls' or 'xlsx' format) or an R object file (extension '.rds'). To use an Excel file, you will need to have the `readxl` package installed.
#'
#' [set_mo_source()] will check the file for validity: it must be a [data.frame], must have a column named `"mo"` which contains values from [`microorganisms$mo`][microorganisms] and must have a reference column with your own defined values. If all tests pass, [set_mo_source()] will read the file into R and will ask to export it to `"~/.mo_source.rds"`. The CRAN policy disallows packages to write to the file system, although '*exceptions may be allowed in interactive sessions if the package obtains confirmation from the user*'. For this reason, this function only works in interactive sessions so that the user can **specifically confirm and allow** that this file will be created. 
#' 
#' The created compressed data file `"~/.mo_source.rds"` will be used at default for MO determination (function [as.mo()] and consequently all `mo_*` functions like [mo_genus()] and [mo_gramstain()]). The location of the original file will be saved as an R option with `options(mo_source = path)`. Its timestamp will be saved with `options(mo_source_datetime = ...)`. 
#' 
#' The function [get_mo_source()] will return the data set by reading `"~/.mo_source.rds"` with [readRDS()]. If the original file has changed (by checking the aforementioned options `mo_source` and `mo_source_datetime`), it will call [set_mo_source()] to update the data file automatically if used in an interactive session.
#'
#' Reading an Excel file (`.xlsx`) with only one row has a size of 8-9 kB. The compressed file created with [set_mo_source()] will then have a size of 0.1 kB and can be read by [get_mo_source()] in only a couple of microseconds (millionths of a second).
#' 
#' @section How to setup:
#' 
#' Imagine this data on a sheet of an Excel file (mo codes were looked up in the [microorganisms] data set). The first column contains the organisation specific codes, the second column contains an MO code from this package:
#' 
#' ```
#'   |         A          |       B      |
#' --|--------------------|--------------|
#' 1 | Organisation XYZ   | mo           |
#' 2 | lab_mo_ecoli       | B_ESCHR_COLI |
#' 3 | lab_mo_kpneumoniae | B_KLBSL_PNMN |
#' 4 |                    |              |
#' ```
#'
#' We save it as `"home/me/ourcodes.xlsx"`. Now we have to set it as a source:
#' 
#' ```
#' set_mo_source("home/me/ourcodes.xlsx")
#' #> NOTE: Created mo_source file '~/.mo_source.rds' from 'home/me/ourcodes.xlsx'
#' #>       (columns "Organisation XYZ" and "mo")
#' ```
#'
#' It has now created a file `"~/.mo_source.rds"` with the contents of our Excel file. Only the first column with foreign values and the 'mo' column will be kept when creating the RDS file.
#'
#' And now we can use it in our functions:
#' 
#' ```
#' as.mo("lab_mo_ecoli")
#' #> [1] B_ESCHR_COLI
#'
#' mo_genus("lab_mo_kpneumoniae")
#' #> [1] "Klebsiella"
#'
#' # other input values still work too
#' as.mo(c("Escherichia coli", "E. coli", "lab_mo_ecoli"))
#' #> [1] B_ESCHR_COLI B_ESCHR_COLI B_ESCHR_COLI
#' ```
#'
#' If we edit the Excel file by, let's say, adding row 4 like this:
#' 
#' ```
#'   |         A          |       B      |
#' --|--------------------|--------------|
#' 1 | Organisation XYZ   | mo           |
#' 2 | lab_mo_ecoli       | B_ESCHR_COLI |
#' 3 | lab_mo_kpneumoniae | B_KLBSL_PNMN |
#' 4 | lab_Staph_aureus   | B_STPHY_AURS |
#' 5 |                    |              |
#' ```
#'
#' ...any new usage of an MO function in this package will update your data file:
#' 
#' ```
#' as.mo("lab_mo_ecoli")
#' #> NOTE: Updated mo_source file '~/.mo_source.rds' from 'home/me/ourcodes.xlsx'
#' #>       (columns "Organisation XYZ" and "mo")
#' #> [1] B_ESCHR_COLI
#'
#' mo_genus("lab_Staph_aureus")
#' #> [1] "Staphylococcus"
#' ```
#'
#' To delete the reference data file, just use `""`, `NULL` or `FALSE` as input for [set_mo_source()]:
#' 
#' ```
#' set_mo_source(NULL)
#' # Removed mo_source file '~/.mo_source.rds'.
#' ```
#' 
#' If the original Excel file is moved or deleted, the mo_source file will be removed upon the next use of [as.mo()]. If the mo_source file is manually deleted (i.e. without using [set_mo_source()]), the references to the mo_source file will be removed upon the next use of [as.mo()].
#' @export
#' @inheritSection AMR Read more on our website!
set_mo_source <- function(path) {
  meet_criteria(path, allow_class = "character", has_length = 1)
  
  file_location <- path.expand("~/mo_source.rds")
  
  stop_ifnot(interactive(), "This function can only be used in interactive mode, since it must ask for the user's permission to write a file to their home folder.")

  if (is.null(path) || path %in% c(FALSE, "")) {
    options(mo_source = NULL)
    options(mo_source_timestamp = NULL)
    if (file.exists(file_location)) {
      unlink(file_location)
      message_("Removed mo_source file '", font_bold(file_location), "'",
               add_fn = font_red,
               as_note = FALSE)
    }
    return(invisible())
  }
  
  stop_ifnot(file.exists(path), "file not found: ", path)
  
  if (path %like% "[.]rds$") {
    df <- readRDS(path)
    
  } else if (path %like% "[.]xlsx?$") {
    # is Excel file (old or new)
    read_excel <- import_fn("read_excel", "readxl")
    df <- read_excel(path)
    
  } else if (path %like% "[.]tsv$") {
    df <- utils::read.table(header = TRUE, sep = "\t", stringsAsFactors = FALSE)
    
  } else {
    # try comma first
    try(
      df <- utils::read.table(header = TRUE, sep = ",", stringsAsFactors = FALSE),
      silent = TRUE)
    if (!mo_source_isvalid(df, stop_on_error = FALSE)) {
      # try tab
      try(
        df <- utils::read.table(header = TRUE, sep = "\t", stringsAsFactors = FALSE),
        silent = TRUE)
    }
    if (!mo_source_isvalid(df, stop_on_error = FALSE)) {
      # try pipe
      try(
        df <- utils::read.table(header = TRUE, sep = "|", stringsAsFactors = FALSE),
        silent = TRUE)
    }
  }
  
  # check integrity
  mo_source_isvalid(df)
  
  df <- subset(df, !is.na(mo))
  
  # keep only first two columns, second must be mo
  if (colnames(df)[1] == "mo") {
    df <- df[, c(colnames(df)[2], "mo")]
  } else {
    df <- df[, c(colnames(df)[1], "mo")]
  }
  
  df <- as.data.frame(df, stringAsFactors = FALSE)
  
  # success
  if (file.exists(file_location)) {
    action <- "Updated"
  } else {
    action <- "Created"
    # only ask when file is created, not when it is updated
    txt <- paste0("This will write create the new file '", 
                  file_location, 
                  "', for which your permission is needed.\n\nDo you agree that this file will be created? ")
    if ("rsasdtudioapi" %in% rownames(utils::installed.packages())) {
      showQuestion <- import_fn("showQuestion", "rstudioapi")
      q_continue <- showQuestion("Create new file in home directory", txt)
    } else {
      q_continue <- utils::menu(choices = c("OK", "Cancel"), graphics = FALSE, title = txt)
    }
    if (q_continue %in% c(FALSE, 2)) {
      return(invisible())
    }
  }
  saveRDS(df, file_location)
  options(mo_source = path)
  options(mo_source_timestamp = as.character(file.info(path)$mtime))
  message_(action, " mo_source file '", font_bold(file_location), "'",
           " from '", font_bold(path), "'",
           '(columns "', colnames(df)[1], '" and "', colnames(df)[2], '")')
}

#' @rdname mo_source
#' @export
get_mo_source <- function() {
  if (is.null(getOption("mo_source", NULL))) {
    return(NULL)
  }
  
  if (!file.exists(path.expand("~/mo_source.rds"))) {
    options(mo_source = NULL)
    options(mo_source_timestamp = NULL)
    message_("Removed references to deleted mo_source file (see ?mo_source)")
    return(NULL)
  }
  
  old_time <- as.POSIXct(getOption("mo_source_timestamp"))
  new_time <- as.POSIXct(as.character(file.info(getOption("mo_source", ""))$mtime))
  
  if (is.na(new_time)) {
    # source file was deleted, remove reference too
    set_mo_source("")
    return(NULL)
  }
  if (interactive() && new_time != old_time) {
    # set updated source
    set_mo_source(getOption("mo_source"))
  }
  file_location <- path.expand("~/mo_source.rds")
  readRDS(file_location)
}

mo_source_isvalid <- function(x, refer_to_name = "`reference_df`", stop_on_error = TRUE) {
  check_dataset_integrity()
  
  if (paste(deparse(substitute(x)), collapse = "") == "get_mo_source()") {
    return(TRUE)
  }
  if (identical(x, get_mo_source())) {
    return(TRUE)
  }
  if (is.null(x)) {
    if (stop_on_error == TRUE) {
      stop_(refer_to_name, " cannot be NULL", call = FALSE)
    } else {
      return(FALSE)
    }
  }
  if (!is.data.frame(x)) {
    if (stop_on_error == TRUE) {
      stop_(refer_to_name, " must be a data.frame", call = FALSE)
    } else {
      return(FALSE)
    }
  }
  if (!"mo" %in% colnames(x)) {
    if (stop_on_error == TRUE) {
      stop_(refer_to_name, " must contain a column 'mo'", call = FALSE)
    } else {
      return(FALSE)
    }
  }
  if (!all(x$mo %in% c("", microorganisms$mo, microorganisms.translation$mo_old), na.rm = TRUE)) {
    if (stop_on_error == TRUE) {
      invalid <- x[which(!x$mo %in% c("", microorganisms$mo, microorganisms.translation$mo_old)), , drop = FALSE]
      if (nrow(invalid) > 1) {
        plural <- "s"
      } else {
        plural <- ""
      }
      stop_("Value", plural, " ", paste0("'", invalid[, 1, drop = TRUE], "'", collapse = ", "), 
           " found in ", tolower(refer_to_name), 
           ", but with invalid microorganism code", plural, " ", paste0("'", invalid$mo, "'", collapse = ", "),
           call = FALSE)
    } else {
      return(FALSE)
    }
  }
  if (colnames(x)[1] != "mo" & nrow(x) > length(unique(x[, 1, drop = TRUE]))) {
    if (stop_on_error == TRUE) {
      stop_(refer_to_name, " contains duplicate values in column '", colnames(x)[1], "'", call = FALSE)
    } else {
      return(FALSE)
    }
  }
  if (colnames(x)[2] != "mo" & nrow(x) > length(unique(x[, 2, drop = TRUE]))) {
    if (stop_on_error == TRUE) {
      stop_(refer_to_name, " contains duplicate values in column '", colnames(x)[2], "'", call = FALSE)
    } else {
      return(FALSE)
    }
  }
  return(TRUE)
}
set_mo_source 2019-01-21 15:53:01 +01:00			`# ==================================================================== #`
			`# TITLE #`
(v1.4.0) matching score update 2020-10-08 11:16:03 +02:00			`# Antimicrobial Resistance (AMR) Analysis for R #`
set_mo_source 2019-01-21 15:53:01 +01:00			`# #`
			`# SOURCE #`
(v1.2.0.9026) move to github 2020-07-08 14:48:06 +02:00			`# https://github.com/msberends/AMR #`
set_mo_source 2019-01-21 15:53:01 +01:00			`# #`
			`# LICENCE #`
(v0.9.0.9008) Happy new year! Add lifecycles 2020-01-05 17:22:09 +01:00			`# (c) 2018-2020 Berends MS, Luz CF et al. #`
(v1.4.0) matching score update 2020-10-08 11:16:03 +02:00			`# Developed at the University of Groningen, the Netherlands, in #`
			`# collaboration with non-profit organisations Certe Medical #`
			`# Diagnostics & Advice, and University Medical Center Groningen. #`
set_mo_source 2019-01-21 15:53:01 +01:00			`# #`
			`# This R package is free software; you can freely use and distribute #`
			`# it for both personal and commercial purposes under the terms of the #`
			`# GNU General Public License version 2.0 (GNU GPL-2), as published by #`
			`# the Free Software Foundation. #`
(v0.9.0.9008) Happy new year! Add lifecycles 2020-01-05 17:22:09 +01:00			`# We created this package for both routine data analysis and academic #`
			`# research and it was publicly released in the hope that it will be #`
			`# useful, but it comes WITHOUT ANY WARRANTY OR LIABILITY. #`
(v1.4.0) matching score update 2020-10-08 11:16:03 +02:00			`# #`
			`# Visit our website for the full manual and a complete tutorial about #`
			`# how to conduct AMR analysis: https://msberends.github.io/AMR/ #`
set_mo_source 2019-01-21 15:53:01 +01:00			`# ==================================================================== #`

(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' User-defined reference data set for microorganisms`
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' @description These functions can be used to predefine your own reference to be used in [as.mo()] and consequently all `mo_*` functions like [mo_genus()] and [mo_gramstain()].
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			`#' This is the fastest way to have your organisation (or analysis) specific codes picked up and translated by this package.`
(v0.9.0.9008) Happy new year! Add lifecycles 2020-01-05 17:22:09 +01:00			`#' @inheritSection lifecycle Stable lifecycle`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			#' @param path location of your reference file, see Details. Can be `""`, `NULL` or `FALSE` to delete the reference file.
set_mo_source 2019-01-21 15:53:01 +01:00			`#' @rdname mo_source`
			`#' @name mo_source`
			`#' @aliases set_mo_source get_mo_source`
(v1.3.0.9022) mo_matching_score(), poorman update, as.rsi() fix 2020-09-18 16:05:53 +02:00			#' @details The reference file can be a text file separated with commas (CSV) or tabs or pipes, an Excel file (either 'xls' or 'xlsx' format) or an R object file (extension '.rds'). To use an Excel file, you will need to have the `readxl` package installed.
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v1.3.0.9038) prefinal 1.4.0 2020-10-04 19:26:43 +02:00			#' [set_mo_source()] will check the file for validity: it must be a [data.frame], must have a column named `"mo"` which contains values from [`microorganisms$mo`][microorganisms] and must have a reference column with your own defined values. If all tests pass, [set_mo_source()] will read the file into R and will ask to export it to `"~/.mo_source.rds"`. The CRAN policy disallows packages to write to the file system, although 'exceptions may be allowed in interactive sessions if the package obtains confirmation from the user'. For this reason, this function only works in interactive sessions so that the user can specifically confirm and allow that this file will be created.
(v1.3.0.9022) mo_matching_score(), poorman update, as.rsi() fix 2020-09-18 16:05:53 +02:00			`#'`
			#' The created compressed data file `"~/.mo_source.rds"` will be used at default for MO determination (function [as.mo()] and consequently all `mo_*` functions like [mo_genus()] and [mo_gramstain()]). The location of the original file will be saved as an R option with `options(mo_source = path)`. Its timestamp will be saved with `options(mo_source_datetime = ...)`.
			`#'`
(v1.3.0.9026) eucast expert rules 3.2 2020-09-24 00:30:11 +02:00			#' The function [get_mo_source()] will return the data set by reading `"~/.mo_source.rds"` with [readRDS()]. If the original file has changed (by checking the aforementioned options `mo_source` and `mo_source_datetime`), it will call [set_mo_source()] to update the data file automatically if used in an interactive session.
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			#' Reading an Excel file (`.xlsx`) with only one row has a size of 8-9 kB. The compressed file created with [set_mo_source()] will then have a size of 0.1 kB and can be read by [get_mo_source()] in only a couple of microseconds (millionths of a second).
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			`#'`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' @section How to setup:`
			`#'`
			`#' Imagine this data on a sheet of an Excel file (mo codes were looked up in the [microorganisms] data set). The first column contains the organisation specific codes, the second column contains an MO code from this package:`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			`#'`
			#' ```
(v0.9.0) website fixes 2019-11-30 12:01:50 +01:00			`#' \| A \| B \|`
			`#' --\|--------------------\|--------------\|`
			`#' 1 \| Organisation XYZ \| mo \|`
			`#' 2 \| lab_mo_ecoli \| B_ESCHR_COLI \|`
			`#' 3 \| lab_mo_kpneumoniae \| B_KLBSL_PNMN \|`
			`#' 4 \| \| \|`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' We save it as `"home/me/ourcodes.xlsx"`. Now we have to set it as a source:
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
age_groups fix 2019-02-27 11:36:12 +01:00			`#' set_mo_source("home/me/ourcodes.xlsx")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> NOTE: Created mo_source file '~/.mo_source.rds' from 'home/me/ourcodes.xlsx'`
			`#' #> (columns "Organisation XYZ" and "mo")`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			#' It has now created a file `"~/.mo_source.rds"` with the contents of our Excel file. Only the first column with foreign values and the 'mo' column will be kept when creating the RDS file.
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#'`
			`#' And now we can use it in our functions:`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
set_mo_source 2019-01-21 15:53:01 +01:00			`#' as.mo("lab_mo_ecoli")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> [1] B_ESCHR_COLI`
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
			`#' mo_genus("lab_mo_kpneumoniae")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> [1] "Klebsiella"`
mo_source improvement 2019-03-01 09:34:04 +01:00			`#'`
			`#' # other input values still work too`
			`#' as.mo(c("Escherichia coli", "E. coli", "lab_mo_ecoli"))`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> [1] B_ESCHR_COLI B_ESCHR_COLI B_ESCHR_COLI`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' If we edit the Excel file by, let's say, adding row 4 like this:`
			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
			`#' \| A \| B \|`
			`#' --\|--------------------\|--------------\|`
			`#' 1 \| Organisation XYZ \| mo \|`
			`#' 2 \| lab_mo_ecoli \| B_ESCHR_COLI \|`
			`#' 3 \| lab_mo_kpneumoniae \| B_KLBSL_PNMN \|`
			`#' 4 \| lab_Staph_aureus \| B_STPHY_AURS \|`
			`#' 5 \| \| \|`
			#' ```
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v0.9.0) website fixes 2019-11-30 12:01:50 +01:00			`#' ...any new usage of an MO function in this package will update your data file:`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#' as.mo("lab_mo_ecoli")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> NOTE: Updated mo_source file '~/.mo_source.rds' from 'home/me/ourcodes.xlsx'`
			`#' #> (columns "Organisation XYZ" and "mo")`
			`#' #> [1] B_ESCHR_COLI`
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#'`
			`#' mo_genus("lab_Staph_aureus")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> [1] "Staphylococcus"`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#'`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			#' To delete the reference data file, just use `""`, `NULL` or `FALSE` as input for [set_mo_source()]:
			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#' set_mo_source(NULL)`
			`#' # Removed mo_source file '~/.mo_source.rds'.`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#'`
			`#' If the original Excel file is moved or deleted, the mo_source file will be removed upon the next use of [as.mo()]. If the mo_source file is manually deleted (i.e. without using [set_mo_source()]), the references to the mo_source file will be removed upon the next use of [as.mo()].`
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#' @export`
			`#' @inheritSection AMR Read more on our website!`
set_mo_source 2019-01-21 15:53:01 +01:00			`set_mo_source <- function(path) {`
(v1.4.0.9001) is_gram_positive(), is_gram_negative(), parameter hardening 2020-10-19 17:09:19 +02:00			`meet_criteria(path, allow_class = "character", has_length = 1)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v0.7.1.9102) lintr 2019-10-11 17:21:02 +02:00			`file_location <- path.expand("~/mo_source.rds")`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v1.3.0.9014) as.mo() speed improvement 2020-09-03 12:31:48 +02:00			`stop_ifnot(interactive(), "This function can only be used in interactive mode, since it must ask for the user's permission to write a file to their home folder.")`
(v1.4.0.9001) is_gram_positive(), is_gram_negative(), parameter hardening 2020-10-19 17:09:19 +02:00
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`if (is.null(path) \|\| path %in% c(FALSE, "")) {`
set_mo_source 2019-01-21 15:53:01 +01:00			`options(mo_source = NULL)`
			`options(mo_source_timestamp = NULL)`
memory for as.mo() 2019-03-15 13:57:25 +01:00			`if (file.exists(file_location)) {`
			`unlink(file_location)`
(v1.4.0.9011) message formatting 2020-10-27 15:56:51 +01:00			`message_("Removed mo_source file '", font_bold(file_location), "'",`
			`add_fn = font_red,`
			`as_note = FALSE)`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`
			`return(invisible())`
			`}`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v1.4.0.9001) is_gram_positive(), is_gram_negative(), parameter hardening 2020-10-19 17:09:19 +02:00			`stop_ifnot(file.exists(path), "file not found: ", path)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v0.7.1.9102) lintr 2019-10-11 17:21:02 +02:00			`if (path %like% "[.]rds$") {`
set_mo_source 2019-01-21 15:53:01 +01:00			`df <- readRDS(path)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v0.7.1.9102) lintr 2019-10-11 17:21:02 +02:00			`} else if (path %like% "[.]xlsx?$") {`
set_mo_source 2019-01-21 15:53:01 +01:00			`# is Excel file (old or new)`
(v1.2.0.9008) ab_class improvement 2020-06-17 15:14:37 +02:00			`read_excel <- import_fn("read_excel", "readxl")`
(v1.1.0.9007) lose dependencies 2020-05-16 21:40:50 +02:00			`df <- read_excel(path)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v0.7.1.9102) lintr 2019-10-11 17:21:02 +02:00			`} else if (path %like% "[.]tsv$") {`
age_groups fix 2019-02-27 11:36:12 +01:00			`df <- utils::read.table(header = TRUE, sep = "\t", stringsAsFactors = FALSE)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
set_mo_source 2019-01-21 15:53:01 +01:00			`} else {`
			`# try comma first`
			`try(`
			`df <- utils::read.table(header = TRUE, sep = ",", stringsAsFactors = FALSE),`
			`silent = TRUE)`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`if (!mo_source_isvalid(df, stop_on_error = FALSE)) {`
age_groups fix 2019-02-27 11:36:12 +01:00			`# try tab`
			`try(`
			`df <- utils::read.table(header = TRUE, sep = "\t", stringsAsFactors = FALSE),`
			`silent = TRUE)`
			`}`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`if (!mo_source_isvalid(df, stop_on_error = FALSE)) {`
set_mo_source 2019-01-21 15:53:01 +01:00			`# try pipe`
			`try(`
			`df <- utils::read.table(header = TRUE, sep = "\|", stringsAsFactors = FALSE),`
			`silent = TRUE)`
			`}`
			`}`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`# check integrity`
			`mo_source_isvalid(df)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`df <- subset(df, !is.na(mo))`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
mo_source improvement 2019-03-01 09:34:04 +01:00			`# keep only first two columns, second must be mo`
set_mo_source 2019-01-21 15:53:01 +01:00			`if (colnames(df)[1] == "mo") {`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`df <- df[, c(colnames(df)[2], "mo")]`
mo_source improvement 2019-03-01 09:34:04 +01:00			`} else {`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`df <- df[, c(colnames(df)[1], "mo")]`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
set_mo_source 2019-01-21 15:53:01 +01:00			`df <- as.data.frame(df, stringAsFactors = FALSE)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
set_mo_source 2019-01-21 15:53:01 +01:00			`# success`
memory for as.mo() 2019-03-15 13:57:25 +01:00			`if (file.exists(file_location)) {`
set_mo_source 2019-01-21 15:53:01 +01:00			`action <- "Updated"`
			`} else {`
			`action <- "Created"`
(v1.3.0.9014) as.mo() speed improvement 2020-09-03 12:31:48 +02:00			`# only ask when file is created, not when it is updated`
			`txt <- paste0("This will write create the new file '",`
			`file_location,`
			`"', for which your permission is needed.\n\nDo you agree that this file will be created? ")`
			`if ("rsasdtudioapi" %in% rownames(utils::installed.packages())) {`
			`showQuestion <- import_fn("showQuestion", "rstudioapi")`
			`q_continue <- showQuestion("Create new file in home directory", txt)`
			`} else {`
			`q_continue <- utils::menu(choices = c("OK", "Cancel"), graphics = FALSE, title = txt)`
			`}`
			`if (q_continue %in% c(FALSE, 2)) {`
			`return(invisible())`
			`}`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`
memory for as.mo() 2019-03-15 13:57:25 +01:00			`saveRDS(df, file_location)`
set_mo_source 2019-01-21 15:53:01 +01:00			`options(mo_source = path)`
			`options(mo_source_timestamp = as.character(file.info(path)$mtime))`
(v1.4.0.9011) message formatting 2020-10-27 15:56:51 +01:00			`message_(action, " mo_source file '", font_bold(file_location), "'",`
			`" from '", font_bold(path), "'",`
			`'(columns "', colnames(df)[1], '" and "', colnames(df)[2], '")')`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`

			`#' @rdname mo_source`
			`#' @export`
			`get_mo_source <- function() {`
			`if (is.null(getOption("mo_source", NULL))) {`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`return(NULL)`
			`}`

			`if (!file.exists(path.expand("~/mo_source.rds"))) {`
			`options(mo_source = NULL)`
			`options(mo_source_timestamp = NULL)`
(v1.4.0.9011) message formatting 2020-10-27 15:56:51 +01:00			`message_("Removed references to deleted mo_source file (see ?mo_source)")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`return(NULL)`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00
			`old_time <- as.POSIXct(getOption("mo_source_timestamp"))`
			`new_time <- as.POSIXct(as.character(file.info(getOption("mo_source", ""))$mtime))`

			`if (is.na(new_time)) {`
			`# source file was deleted, remove reference too`
			`set_mo_source("")`
			`return(NULL)`
			`}`
(v1.3.0.9026) eucast expert rules 3.2 2020-09-24 00:30:11 +02:00			`if (interactive() && new_time != old_time) {`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`# set updated source`
			`set_mo_source(getOption("mo_source"))`
			`}`
			`file_location <- path.expand("~/mo_source.rds")`
			`readRDS(file_location)`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			mo_source_isvalid <- function(x, refer_to_name = "`reference_df`", stop_on_error = TRUE) {
(v0.9.0.9023) EUCAST 2020 guidelines 2020-02-14 19:54:13 +01:00			`check_dataset_integrity()`

(v1.4.0.9015) bugfix 2020-11-10 16:35:56 +01:00			`if (paste(deparse(substitute(x)), collapse = "") == "get_mo_source()") {`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00			`return(TRUE)`
			`}`
			`if (identical(x, get_mo_source())) {`
			`return(TRUE)`
			`}`
			`if (is.null(x)) {`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`if (stop_on_error == TRUE) {`
(v1.4.0.9015) bugfix 2020-11-10 16:35:56 +01:00			`stop_(refer_to_name, " cannot be NULL", call = FALSE)`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`} else {`
			`return(FALSE)`
			`}`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00			`}`
			`if (!is.data.frame(x)) {`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`if (stop_on_error == TRUE) {`
(v1.4.0.9015) bugfix 2020-11-10 16:35:56 +01:00			`stop_(refer_to_name, " must be a data.frame", call = FALSE)`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`} else {`
			`return(FALSE)`
			`}`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00			`}`
			`if (!"mo" %in% colnames(x)) {`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`if (stop_on_error == TRUE) {`
(v1.4.0.9015) bugfix 2020-11-10 16:35:56 +01:00			`stop_(refer_to_name, " must contain a column 'mo'", call = FALSE)`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`} else {`
			`return(FALSE)`
			`}`
			`}`
			`if (!all(x$mo %in% c("", microorganisms$mo, microorganisms.translation$mo_old), na.rm = TRUE)) {`
			`if (stop_on_error == TRUE) {`
			`invalid <- x[which(!x$mo %in% c("", microorganisms$mo, microorganisms.translation$mo_old)), , drop = FALSE]`
			`if (nrow(invalid) > 1) {`
			`plural <- "s"`
			`} else {`
			`plural <- ""`
			`}`
(v1.4.0.9015) bugfix 2020-11-10 16:35:56 +01:00			`stop_("Value", plural, " ", paste0("'", invalid[, 1, drop = TRUE], "'", collapse = ", "),`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`" found in ", tolower(refer_to_name),`
(v1.2.0.9011) mo_domain(), improved error handling 2020-06-22 11:18:40 +02:00			`", but with invalid microorganism code", plural, " ", paste0("'", invalid$mo, "'", collapse = ", "),`
(v1.4.0.9015) bugfix 2020-11-10 16:35:56 +01:00			`call = FALSE)`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`} else {`
			`return(FALSE)`
			`}`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00			`}`
(v1.4.0.9015) bugfix 2020-11-10 16:35:56 +01:00			`if (colnames(x)[1] != "mo" & nrow(x) > length(unique(x[, 1, drop = TRUE]))) {`
			`if (stop_on_error == TRUE) {`
			`stop_(refer_to_name, " contains duplicate values in column '", colnames(x)[1], "'", call = FALSE)`
			`} else {`
			`return(FALSE)`
			`}`
			`}`
			`if (colnames(x)[2] != "mo" & nrow(x) > length(unique(x[, 2, drop = TRUE]))) {`
			`if (stop_on_error == TRUE) {`
			`stop_(refer_to_name, " contains duplicate values in column '", colnames(x)[2], "'", call = FALSE)`
			`} else {`
			`return(FALSE)`
			`}`
			`}`
			`return(TRUE)`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00			`}`