AMR/R/mo_source.R

# ==================================================================== #
# TITLE                                                                #
# Antimicrobial Resistance (AMR) Analysis                              #
#                                                                      #
# SOURCE                                                               #
# https://github.com/msberends/AMR                                     #
#                                                                      #
# LICENCE                                                              #
# (c) 2018-2020 Berends MS, Luz CF et al.                              #
#                                                                      #
# This R package is free software; you can freely use and distribute   #
# it for both personal and commercial purposes under the terms of the  #
# GNU General Public License version 2.0 (GNU GPL-2), as published by  #
# the Free Software Foundation.                                        #
#                                                                      #
# We created this package for both routine data analysis and academic  #
# research and it was publicly released in the hope that it will be    #
# useful, but it comes WITHOUT ANY WARRANTY OR LIABILITY.              #
# Visit our website for more info: https://msberends.github.io/AMR.    #
# ==================================================================== #

#' User-defined reference data set for microorganisms
#'
#' @description These functions can be used to predefine your own reference to be used in [as.mo()] and consequently all `mo_*` functions like [mo_genus()] and [mo_gramstain()].
#'
#' This is **the fastest way** to have your organisation (or analysis) specific codes picked up and translated by this package.
#' @inheritSection lifecycle Stable lifecycle
#' @param path location of your reference file, see Details. Can be `""`, `NULL` or `FALSE` to delete the reference file.
#' @rdname mo_source
#' @name mo_source
#' @aliases set_mo_source get_mo_source
#' @details The reference file can be a text file seperated with commas (CSV) or tabs or pipes, an Excel file (either 'xls' or 'xlsx' format) or an R object file (extension '.rds'). To use an Excel file, you need to have the `readxl` package installed.
#'
#' [set_mo_source()] will check the file for validity: it must be a [`data.frame`], must have a column named `"mo"` which contains values from [`microorganisms$mo`][microorganisms] and must have a reference column with your own defined values. If all tests pass, [set_mo_source()] will read the file into R and export it to `"~/.mo_source.rds"`. This compressed data file will then be used at default for MO determination (function [as.mo()] and consequently all `mo_*` functions like [mo_genus()] and [mo_gramstain()]). The location of the original file will be saved as option with `options(mo_source = path)`. Its timestamp will be saved with `options(mo_source_datetime = ...)`.
#'
#' [get_mo_source()] will return the data set by reading `"~/.mo_source.rds"` with [readRDS()]. If the original file has changed (the file defined with `path`), it will call [set_mo_source()] to update the data file automatically.
#'
#' Reading an Excel file (`.xlsx`) with only one row has a size of 8-9 kB. The compressed file created with [set_mo_source()] will then have a size of 0.1 kB and can be read by [get_mo_source()] in only a couple of microseconds (millionths of a second).
#' 
#' @section How to setup:
#' 
#' Imagine this data on a sheet of an Excel file (mo codes were looked up in the [microorganisms] data set). The first column contains the organisation specific codes, the second column contains an MO code from this package:
#' 
#' ```
#'   |         A          |       B      |
#' --|--------------------|--------------|
#' 1 | Organisation XYZ   | mo           |
#' 2 | lab_mo_ecoli       | B_ESCHR_COLI |
#' 3 | lab_mo_kpneumoniae | B_KLBSL_PNMN |
#' 4 |                    |              |
#' ```
#'
#' We save it as `"home/me/ourcodes.xlsx"`. Now we have to set it as a source:
#' 
#' ```
#' set_mo_source("home/me/ourcodes.xlsx")
#' #> NOTE: Created mo_source file '~/.mo_source.rds' from 'home/me/ourcodes.xlsx'
#' #>       (columns "Organisation XYZ" and "mo")
#' ```
#'
#' It has now created a file `"~/.mo_source.rds"` with the contents of our Excel file. Only the first column with foreign values and the 'mo' column will be kept when creating the RDS file.
#'
#' And now we can use it in our functions:
#' 
#' ```
#' as.mo("lab_mo_ecoli")
#' #> [1] B_ESCHR_COLI
#'
#' mo_genus("lab_mo_kpneumoniae")
#' #> [1] "Klebsiella"
#'
#' # other input values still work too
#' as.mo(c("Escherichia coli", "E. coli", "lab_mo_ecoli"))
#' #> [1] B_ESCHR_COLI B_ESCHR_COLI B_ESCHR_COLI
#' ```
#'
#' If we edit the Excel file by, let's say, adding row 4 like this:
#' 
#' ```
#'   |         A          |       B      |
#' --|--------------------|--------------|
#' 1 | Organisation XYZ   | mo           |
#' 2 | lab_mo_ecoli       | B_ESCHR_COLI |
#' 3 | lab_mo_kpneumoniae | B_KLBSL_PNMN |
#' 4 | lab_Staph_aureus   | B_STPHY_AURS |
#' 5 |                    |              |
#' ```
#'
#' ...any new usage of an MO function in this package will update your data file:
#' 
#' ```
#' as.mo("lab_mo_ecoli")
#' #> NOTE: Updated mo_source file '~/.mo_source.rds' from 'home/me/ourcodes.xlsx'
#' #>       (columns "Organisation XYZ" and "mo")
#' #> [1] B_ESCHR_COLI
#'
#' mo_genus("lab_Staph_aureus")
#' #> [1] "Staphylococcus"
#' ```
#'
#' To delete the reference data file, just use `""`, `NULL` or `FALSE` as input for [set_mo_source()]:
#' 
#' ```
#' set_mo_source(NULL)
#' # Removed mo_source file '~/.mo_source.rds'.
#' ```
#' 
#' If the original Excel file is moved or deleted, the mo_source file will be removed upon the next use of [as.mo()]. If the mo_source file is manually deleted (i.e. without using [set_mo_source()]), the references to the mo_source file will be removed upon the next use of [as.mo()].
#' @export
#' @inheritSection AMR Read more on our website!
set_mo_source <- function(path) {
  
  file_location <- path.expand("~/mo_source.rds")
  
  stop_ifnot(length(path) == 1, "`path` must be of length 1")
  
  if (is.null(path) || path %in% c(FALSE, "")) {
    options(mo_source = NULL)
    options(mo_source_timestamp = NULL)
    if (file.exists(file_location)) {
      unlink(file_location)
      message(font_red(paste0("Removed mo_source file '", font_bold(file_location), "'")))
    }
    return(invisible())
  }
  
  stop_ifnot(file.exists(path),
             "file not found: ", path)
  
  if (path %like% "[.]rds$") {
    df <- readRDS(path)
    
  } else if (path %like% "[.]xlsx?$") {
    # is Excel file (old or new)
    read_excel <- import_fn("read_excel", "readxl")
    df <- read_excel(path)
    
  } else if (path %like% "[.]tsv$") {
    df <- utils::read.table(header = TRUE, sep = "\t", stringsAsFactors = FALSE)
    
  } else {
    # try comma first
    try(
      df <- utils::read.table(header = TRUE, sep = ",", stringsAsFactors = FALSE),
      silent = TRUE)
    if (!mo_source_isvalid(df, stop_on_error = FALSE)) {
      # try tab
      try(
        df <- utils::read.table(header = TRUE, sep = "\t", stringsAsFactors = FALSE),
        silent = TRUE)
    }
    if (!mo_source_isvalid(df, stop_on_error = FALSE)) {
      # try pipe
      try(
        df <- utils::read.table(header = TRUE, sep = "|", stringsAsFactors = FALSE),
        silent = TRUE)
    }
  }
  
  # check integrity
  mo_source_isvalid(df)
  
  df <- subset(df, !is.na(mo))
  
  # keep only first two columns, second must be mo
  if (colnames(df)[1] == "mo") {
    df <- df[, c(colnames(df)[2], "mo")]
  } else {
    df <- df[, c(colnames(df)[1], "mo")]
  }
  
  df <- as.data.frame(df, stringAsFactors = FALSE)
  
  # success
  if (file.exists(file_location)) {
    action <- "Updated"
  } else {
    action <- "Created"
  }
  saveRDS(df, file_location)
  options(mo_source = path)
  options(mo_source_timestamp = as.character(file.info(path)$mtime))
  message(font_blue(paste0("NOTE: ",
                           action, " mo_source file '", font_bold(file_location), "'",
                           " from '", font_bold(path), "'",
                           '\n      (columns "', colnames(df)[1], '" and "', colnames(df)[2], '")')))
}

#' @rdname mo_source
#' @export
get_mo_source <- function() {
  if (is.null(getOption("mo_source", NULL))) {
    return(NULL)
  }
  
  if (!file.exists(path.expand("~/mo_source.rds"))) {
    options(mo_source = NULL)
    options(mo_source_timestamp = NULL)
    message(font_blue("NOTE: Removed references to deleted mo_source file (see ?mo_source)"))
    return(NULL)
  }
  
  old_time <- as.POSIXct(getOption("mo_source_timestamp"))
  new_time <- as.POSIXct(as.character(file.info(getOption("mo_source", ""))$mtime))
  
  if (is.na(new_time)) {
    # source file was deleted, remove reference too
    set_mo_source("")
    return(NULL)
  }
  if (new_time != old_time) {
    # set updated source
    set_mo_source(getOption("mo_source"))
  }
  file_location <- path.expand("~/mo_source.rds")
  readRDS(file_location)
}

mo_source_isvalid <- function(x, refer_to_name = "`reference_df`", stop_on_error = TRUE) {
  
  check_dataset_integrity()
  
  if (deparse(substitute(x)) == "get_mo_source()") {
    return(TRUE)
  }
  if (identical(x, get_mo_source())) {
    return(TRUE)
  }
  if (is.null(x)) {
    if (stop_on_error == TRUE) {
      stop(refer_to_name, " cannot be NULL", call. = FALSE)
    } else {
      return(FALSE)
    }
  }
  if (!is.data.frame(x)) {
    if (stop_on_error == TRUE) {
      stop(refer_to_name, " must be a data.frame", call. = FALSE)
    } else {
      return(FALSE)
    }
  }
  if (!"mo" %in% colnames(x)) {
    if (stop_on_error == TRUE) {
      stop(refer_to_name, " must contain a column 'mo'", call. = FALSE)
    } else {
      return(FALSE)
    }
  }
  if (!all(x$mo %in% c("", microorganisms$mo, microorganisms.translation$mo_old), na.rm = TRUE)) {
    if (stop_on_error == TRUE) {
      invalid <- x[which(!x$mo %in% c("", microorganisms$mo, microorganisms.translation$mo_old)), , drop = FALSE]
      if (nrow(invalid) > 1) {
        plural <- "s"
      } else {
        plural <- ""
      }
      stop("Value", plural, " ", paste0("'", invalid[, 1, drop = TRUE], "'", collapse = ", "), 
           " found in ", tolower(refer_to_name), 
           ", but with invalid microorganism code", plural, " ", paste0("'", invalid$mo, "'", collapse = ", "),
           call. = FALSE)
    } else {
      return(FALSE)
    }
  }
  TRUE
}
set_mo_source 2019-01-21 15:53:01 +01:00			`# ==================================================================== #`
			`# TITLE #`
			`# Antimicrobial Resistance (AMR) Analysis #`
			`# #`
			`# SOURCE #`
(v1.2.0.9026) move to github 2020-07-08 14:48:06 +02:00			`# https://github.com/msberends/AMR #`
set_mo_source 2019-01-21 15:53:01 +01:00			`# #`
			`# LICENCE #`
(v0.9.0.9008) Happy new year! Add lifecycles 2020-01-05 17:22:09 +01:00			`# (c) 2018-2020 Berends MS, Luz CF et al. #`
set_mo_source 2019-01-21 15:53:01 +01:00			`# #`
			`# This R package is free software; you can freely use and distribute #`
			`# it for both personal and commercial purposes under the terms of the #`
			`# GNU General Public License version 2.0 (GNU GPL-2), as published by #`
			`# the Free Software Foundation. #`
			`# #`
(v0.9.0.9008) Happy new year! Add lifecycles 2020-01-05 17:22:09 +01:00			`# We created this package for both routine data analysis and academic #`
			`# research and it was publicly released in the hope that it will be #`
			`# useful, but it comes WITHOUT ANY WARRANTY OR LIABILITY. #`
(v1.2.0.9026) move to github 2020-07-08 14:48:06 +02:00			`# Visit our website for more info: https://msberends.github.io/AMR. #`
set_mo_source 2019-01-21 15:53:01 +01:00			`# ==================================================================== #`

(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' User-defined reference data set for microorganisms`
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' @description These functions can be used to predefine your own reference to be used in [as.mo()] and consequently all `mo_*` functions like [mo_genus()] and [mo_gramstain()].
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			`#' This is the fastest way to have your organisation (or analysis) specific codes picked up and translated by this package.`
(v0.9.0.9008) Happy new year! Add lifecycles 2020-01-05 17:22:09 +01:00			`#' @inheritSection lifecycle Stable lifecycle`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			#' @param path location of your reference file, see Details. Can be `""`, `NULL` or `FALSE` to delete the reference file.
set_mo_source 2019-01-21 15:53:01 +01:00			`#' @rdname mo_source`
			`#' @name mo_source`
			`#' @aliases set_mo_source get_mo_source`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' @details The reference file can be a text file seperated with commas (CSV) or tabs or pipes, an Excel file (either 'xls' or 'xlsx' format) or an R object file (extension '.rds'). To use an Excel file, you need to have the `readxl` package installed.
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' [set_mo_source()] will check the file for validity: it must be a [`data.frame`], must have a column named `"mo"` which contains values from [`microorganisms$mo`][microorganisms] and must have a reference column with your own defined values. If all tests pass, [set_mo_source()] will read the file into R and export it to `"~/.mo_source.rds"`. This compressed data file will then be used at default for MO determination (function [as.mo()] and consequently all `mo_*` functions like [mo_genus()] and [mo_gramstain()]). The location of the original file will be saved as option with `options(mo_source = path)`. Its timestamp will be saved with `options(mo_source_datetime = ...)`.
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' [get_mo_source()] will return the data set by reading `"~/.mo_source.rds"` with [readRDS()]. If the original file has changed (the file defined with `path`), it will call [set_mo_source()] to update the data file automatically.
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			#' Reading an Excel file (`.xlsx`) with only one row has a size of 8-9 kB. The compressed file created with [set_mo_source()] will then have a size of 0.1 kB and can be read by [get_mo_source()] in only a couple of microseconds (millionths of a second).
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			`#'`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' @section How to setup:`
			`#'`
			`#' Imagine this data on a sheet of an Excel file (mo codes were looked up in the [microorganisms] data set). The first column contains the organisation specific codes, the second column contains an MO code from this package:`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			`#'`
			#' ```
(v0.9.0) website fixes 2019-11-30 12:01:50 +01:00			`#' \| A \| B \|`
			`#' --\|--------------------\|--------------\|`
			`#' 1 \| Organisation XYZ \| mo \|`
			`#' 2 \| lab_mo_ecoli \| B_ESCHR_COLI \|`
			`#' 3 \| lab_mo_kpneumoniae \| B_KLBSL_PNMN \|`
			`#' 4 \| \| \|`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' We save it as `"home/me/ourcodes.xlsx"`. Now we have to set it as a source:
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
age_groups fix 2019-02-27 11:36:12 +01:00			`#' set_mo_source("home/me/ourcodes.xlsx")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> NOTE: Created mo_source file '~/.mo_source.rds' from 'home/me/ourcodes.xlsx'`
			`#' #> (columns "Organisation XYZ" and "mo")`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			#' It has now created a file `"~/.mo_source.rds"` with the contents of our Excel file. Only the first column with foreign values and the 'mo' column will be kept when creating the RDS file.
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#'`
			`#' And now we can use it in our functions:`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
set_mo_source 2019-01-21 15:53:01 +01:00			`#' as.mo("lab_mo_ecoli")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> [1] B_ESCHR_COLI`
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
			`#' mo_genus("lab_mo_kpneumoniae")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> [1] "Klebsiella"`
mo_source improvement 2019-03-01 09:34:04 +01:00			`#'`
			`#' # other input values still work too`
			`#' as.mo(c("Escherichia coli", "E. coli", "lab_mo_ecoli"))`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> [1] B_ESCHR_COLI B_ESCHR_COLI B_ESCHR_COLI`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' If we edit the Excel file by, let's say, adding row 4 like this:`
			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
			`#' \| A \| B \|`
			`#' --\|--------------------\|--------------\|`
			`#' 1 \| Organisation XYZ \| mo \|`
			`#' 2 \| lab_mo_ecoli \| B_ESCHR_COLI \|`
			`#' 3 \| lab_mo_kpneumoniae \| B_KLBSL_PNMN \|`
			`#' 4 \| lab_Staph_aureus \| B_STPHY_AURS \|`
			`#' 5 \| \| \|`
			#' ```
set_mo_source 2019-01-21 15:53:01 +01:00			`#'`
(v0.9.0) website fixes 2019-11-30 12:01:50 +01:00			`#' ...any new usage of an MO function in this package will update your data file:`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#' as.mo("lab_mo_ecoli")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> NOTE: Updated mo_source file '~/.mo_source.rds' from 'home/me/ourcodes.xlsx'`
			`#' #> (columns "Organisation XYZ" and "mo")`
			`#' #> [1] B_ESCHR_COLI`
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#'`
			`#' mo_genus("lab_Staph_aureus")`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#' #> [1] "Staphylococcus"`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#'`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			#' To delete the reference data file, just use `""`, `NULL` or `FALSE` as input for [set_mo_source()]:
			`#'`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#' set_mo_source(NULL)`
			`#' # Removed mo_source file '~/.mo_source.rds'.`
(v0.8.0.9036) complete documentation rewrite 2019-11-28 22:32:17 +01:00			#' ```
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`#'`
			`#' If the original Excel file is moved or deleted, the mo_source file will be removed upon the next use of [as.mo()]. If the mo_source file is manually deleted (i.e. without using [set_mo_source()]), the references to the mo_source file will be removed upon the next use of [as.mo()].`
rlang dependency, new fungi 2019-02-28 13:56:28 +01:00			`#' @export`
			`#' @inheritSection AMR Read more on our website!`
set_mo_source 2019-01-21 15:53:01 +01:00			`set_mo_source <- function(path) {`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v0.7.1.9102) lintr 2019-10-11 17:21:02 +02:00			`file_location <- path.expand("~/mo_source.rds")`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v1.2.0.9011) mo_domain(), improved error handling 2020-06-22 11:18:40 +02:00			stop_ifnot(length(path) == 1, "`path` must be of length 1")
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`if (is.null(path) \|\| path %in% c(FALSE, "")) {`
set_mo_source 2019-01-21 15:53:01 +01:00			`options(mo_source = NULL)`
			`options(mo_source_timestamp = NULL)`
memory for as.mo() 2019-03-15 13:57:25 +01:00			`if (file.exists(file_location)) {`
			`unlink(file_location)`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`message(font_red(paste0("Removed mo_source file '", font_bold(file_location), "'")))`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`
			`return(invisible())`
			`}`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v1.2.0.9011) mo_domain(), improved error handling 2020-06-22 11:18:40 +02:00			`stop_ifnot(file.exists(path),`
			`"file not found: ", path)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v0.7.1.9102) lintr 2019-10-11 17:21:02 +02:00			`if (path %like% "[.]rds$") {`
set_mo_source 2019-01-21 15:53:01 +01:00			`df <- readRDS(path)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v0.7.1.9102) lintr 2019-10-11 17:21:02 +02:00			`} else if (path %like% "[.]xlsx?$") {`
set_mo_source 2019-01-21 15:53:01 +01:00			`# is Excel file (old or new)`
(v1.2.0.9008) ab_class improvement 2020-06-17 15:14:37 +02:00			`read_excel <- import_fn("read_excel", "readxl")`
(v1.1.0.9007) lose dependencies 2020-05-16 21:40:50 +02:00			`df <- read_excel(path)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v0.7.1.9102) lintr 2019-10-11 17:21:02 +02:00			`} else if (path %like% "[.]tsv$") {`
age_groups fix 2019-02-27 11:36:12 +01:00			`df <- utils::read.table(header = TRUE, sep = "\t", stringsAsFactors = FALSE)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
set_mo_source 2019-01-21 15:53:01 +01:00			`} else {`
			`# try comma first`
			`try(`
			`df <- utils::read.table(header = TRUE, sep = ",", stringsAsFactors = FALSE),`
			`silent = TRUE)`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`if (!mo_source_isvalid(df, stop_on_error = FALSE)) {`
age_groups fix 2019-02-27 11:36:12 +01:00			`# try tab`
			`try(`
			`df <- utils::read.table(header = TRUE, sep = "\t", stringsAsFactors = FALSE),`
			`silent = TRUE)`
			`}`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`if (!mo_source_isvalid(df, stop_on_error = FALSE)) {`
set_mo_source 2019-01-21 15:53:01 +01:00			`# try pipe`
			`try(`
			`df <- utils::read.table(header = TRUE, sep = "\|", stringsAsFactors = FALSE),`
			`silent = TRUE)`
			`}`
			`}`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`# check integrity`
			`mo_source_isvalid(df)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`df <- subset(df, !is.na(mo))`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
mo_source improvement 2019-03-01 09:34:04 +01:00			`# keep only first two columns, second must be mo`
set_mo_source 2019-01-21 15:53:01 +01:00			`if (colnames(df)[1] == "mo") {`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`df <- df[, c(colnames(df)[2], "mo")]`
mo_source improvement 2019-03-01 09:34:04 +01:00			`} else {`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`df <- df[, c(colnames(df)[1], "mo")]`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
set_mo_source 2019-01-21 15:53:01 +01:00			`df <- as.data.frame(df, stringAsFactors = FALSE)`
(v1.2.0.9034) code cleaning 2020-07-13 09:17:24 +02:00
set_mo_source 2019-01-21 15:53:01 +01:00			`# success`
memory for as.mo() 2019-03-15 13:57:25 +01:00			`if (file.exists(file_location)) {`
set_mo_source 2019-01-21 15:53:01 +01:00			`action <- "Updated"`
			`} else {`
			`action <- "Created"`
			`}`
memory for as.mo() 2019-03-15 13:57:25 +01:00			`saveRDS(df, file_location)`
set_mo_source 2019-01-21 15:53:01 +01:00			`options(mo_source = path)`
			`options(mo_source_timestamp = as.character(file.info(path)$mtime))`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`message(font_blue(paste0("NOTE: ",`
			`action, " mo_source file '", font_bold(file_location), "'",`
			`" from '", font_bold(path), "'",`
			`'\n (columns "', colnames(df)[1], '" and "', colnames(df)[2], '")')))`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`

			`#' @rdname mo_source`
			`#' @export`
			`get_mo_source <- function() {`
			`if (is.null(getOption("mo_source", NULL))) {`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00			`return(NULL)`
			`}`

			`if (!file.exists(path.expand("~/mo_source.rds"))) {`
			`options(mo_source = NULL)`
			`options(mo_source_timestamp = NULL)`
			`message(font_blue("NOTE: Removed references to deleted mo_source file (see ?mo_source)"))`
			`return(NULL)`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`
(v1.1.0.9019) mo_source fix 2020-05-25 01:01:14 +02:00
			`old_time <- as.POSIXct(getOption("mo_source_timestamp"))`
			`new_time <- as.POSIXct(as.character(file.info(getOption("mo_source", ""))$mtime))`

			`if (is.na(new_time)) {`
			`# source file was deleted, remove reference too`
			`set_mo_source("")`
			`return(NULL)`
			`}`
			`if (new_time != old_time) {`
			`# set updated source`
			`set_mo_source(getOption("mo_source"))`
			`}`
			`file_location <- path.expand("~/mo_source.rds")`
			`readRDS(file_location)`
set_mo_source 2019-01-21 15:53:01 +01:00			`}`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			mo_source_isvalid <- function(x, refer_to_name = "`reference_df`", stop_on_error = TRUE) {
(v0.9.0.9023) EUCAST 2020 guidelines 2020-02-14 19:54:13 +01:00
			`check_dataset_integrity()`

con WHONET, filter ab class 2019-03-05 22:47:42 +01:00			`if (deparse(substitute(x)) == "get_mo_source()") {`
			`return(TRUE)`
			`}`
			`if (identical(x, get_mo_source())) {`
			`return(TRUE)`
			`}`
			`if (is.null(x)) {`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`if (stop_on_error == TRUE) {`
(v1.2.0.9011) mo_domain(), improved error handling 2020-06-22 11:18:40 +02:00			`stop(refer_to_name, " cannot be NULL", call. = FALSE)`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`} else {`
			`return(FALSE)`
			`}`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00			`}`
			`if (!is.data.frame(x)) {`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`if (stop_on_error == TRUE) {`
(v1.2.0.9011) mo_domain(), improved error handling 2020-06-22 11:18:40 +02:00			`stop(refer_to_name, " must be a data.frame", call. = FALSE)`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`} else {`
			`return(FALSE)`
			`}`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00			`}`
			`if (!"mo" %in% colnames(x)) {`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`if (stop_on_error == TRUE) {`
(v1.2.0.9011) mo_domain(), improved error handling 2020-06-22 11:18:40 +02:00			`stop(refer_to_name, " must contain a column 'mo'", call. = FALSE)`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`} else {`
			`return(FALSE)`
			`}`
			`}`
			`if (!all(x$mo %in% c("", microorganisms$mo, microorganisms.translation$mo_old), na.rm = TRUE)) {`
			`if (stop_on_error == TRUE) {`
			`invalid <- x[which(!x$mo %in% c("", microorganisms$mo, microorganisms.translation$mo_old)), , drop = FALSE]`
			`if (nrow(invalid) > 1) {`
			`plural <- "s"`
			`} else {`
			`plural <- ""`
			`}`
			`stop("Value", plural, " ", paste0("'", invalid[, 1, drop = TRUE], "'", collapse = ", "),`
			`" found in ", tolower(refer_to_name),`
(v1.2.0.9011) mo_domain(), improved error handling 2020-06-22 11:18:40 +02:00			`", but with invalid microorganism code", plural, " ", paste0("'", invalid$mo, "'", collapse = ", "),`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`call. = FALSE)`
			`} else {`
			`return(FALSE)`
			`}`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00			`}`
(v1.1.0.9004) lose dependencies 2020-05-16 13:05:47 +02:00			`TRUE`
con WHONET, filter ab class 2019-03-05 22:47:42 +01:00			`}`