(v2.1.1.9095) Python support

2025-08-24 19:42:15 +02:00 · 2024-10-15 17:12:55 +02:00
parent 94501371cd
commit 5c4d8fcd2a
18 changed files with 513 additions and 435 deletions
--- a/vignettes/AMR_for_Python.Rmd
+++ b/vignettes/AMR_for_Python.Rmd
@@ -24,161 +24,180 @@ knitr::opts_chunk$set(

 # Introduction

-The `AMR` package for R is an incredible tool for antimicrobial resistance (AMR) data analysis, providing extensive functionality for working with microbial and antimicrobial properties. But what if you're working in Python and still want to benefit from the robust features of `AMR`?
+The `AMR` package for R is a powerful tool for antimicrobial resistance (AMR) analysis. It provides extensive features for handling microbial and antimicrobial data. However, for those who work primarily in Python, we now have a more intuitive option available: the `AMR` Python package, which uses `rpy2` internally. This package allows Python users to access all the functions from the R `AMR` package without the need to set up `rpy2` themselves. Since this Python package is not a true 'port' (which would require all R functions to be rewritten into Python), R and the AMR R package are still required to be installed. Yet, Python users can now easily work with AMR data directly through Python code.

-The best way is to access R directly from Python with the help of `rpy2`, a simple yet powerful Python package. You can easily call functions from the `AMR` package to process your own data in your own Python environment. This post will guide you through setting up `rpy2` and show you how to use R functions from `AMR` in Python to supercharge your antimicrobial resistance analysis.
+In this document, we explain how this works and provide simple examples of using the `AMR` Python package.

-# What is `rpy2`?
+## How It Works

-`rpy2` is a Python library that allows Python users to run R code within their Python scripts. Essentially, it acts as a bridge between the two languages, allowing you to tap into the rich ecosystem of R libraries (like `AMR`) while maintaining the flexibility of Python.
+The `AMR` Python package acts as a wrapper around the functions in the `AMR` R package. The package simplifies the process of calling R functions in Python, eliminating the need to manually manage the `rpy2` setup, which Python uses internally to be able to work with the R package. By just using `import AMR`, Python users can directly use the functions from the `AMR` R package as if they were native Python functions.

-## Key Features of `rpy2`:
- Seamlessly call R functions from Python.
- Convert R data structures into Python data structures like pandas DataFrames.
- Leverage the full power of R libraries without leaving your Python environment.
+Internally, `rpy2` is still being used, but all complexity is hidden from the user. This approach keeps the Python code clean and Pythonic, while still leveraging the full power of the R `AMR` package.

-# Setting Up `rpy2`
+## Example of Usage

-Before diving into the examples, you’ll need to install both R and `rpy2`. Here's a step-by-step guide on setting things up.
-
-## Step 1: Install R
-
-Ensure that R is installed on your system. R has minimal dependencies and is very simple to install:
-
-* **Linux**
-  * Ubuntu / Debian:  
-    `sudo apt install r-base`
-  * Fedora:  
-    `sudo dnf install R`
-  * CentOS/RHEL:  
-    `sudo yum install R`
-  * Arch Linux:  
-    `sudo pacman -S r`
-* **macOS** (with Homebrew):  
-  `brew install r`
-* **Other Systems:**  
-  Visit the [CRAN download page](https://cran.r-project.org).
-
-## Step 2: Install the `AMR` package in R
-
-On Linux and macOS, open Terminal and run:
-
-```bash
-Rscript -e 'install.packages("AMR")'
-```
-
-For other systems, open your R console and install the `AMR` package by running:
-
-```r
-install.packages("AMR")
-```
-
-On any system, you can also install the latest development version of the `AMR` package by setting `repos` to our beta channel:
-
-```r
-install.packages("AMR", repos = "https://msberends.r-universe.dev")
-```
-
-## Step 3: Install `rpy2` in Python
-
-To install `rpy2`, simply run the following command in your terminal:
-
-```bash
-pip install rpy2
-```
-
-## Step 4: Test `rpy2` Installation
-
-To ensure everything is set up correctly, you can test your installation by running the following Python script, which essentially runs R in the background:
-
-```python
-import rpy2.robjects as ro
-
-# Test a simple R function from Python
-ro.r('1 + 1')
-```
-
-If this returns `2`, you're good to go!
-
-# Working with `AMR` in Python
-
-Now that we have `rpy2` set up, let’s walk through some practical examples of using the `AMR` package within Python.
-
-## Example 1: Converting Taxonomic Data
-
-Let’s start by converting taxonomic user input to valid taxonomy using the `AMR` package, from within Python:
+Here’s an example that demonstrates how to clean microorganism and drug names using the `AMR` Python package:

 ```python
 import pandas as pd
-import rpy2.robjects as ro
-from rpy2.robjects.packages import importr
-from rpy2.robjects import pandas2ri
+import AMR

-# Load the AMR package from R
-amr = importr('AMR')
+# Sample data
+data = {
+    "MOs": ['E. coli', 'ESCCOL', 'esco', 'Esche coli'],
+    "Drug": ['Cipro', 'CIP', 'J01MA02', 'Ciproxin']
+}
+df = pd.DataFrame(data)

-# Example user dataset in Python
-data = pd.DataFrame({
-    'microorganism': ['E. coli', 'S. aureus', 'P. aeruginosa', 'K. pneumoniae']
-})
+# Use AMR functions to clean microorganism and drug names
+df['MO_clean'] = AMR.mo_name(df['MOs'])
+df['Drug_clean'] = AMR.ab_name(df['Drug'])

-# Apply mo_name() from the AMR package to the 'microorganism' column
-ro.globalenv['r_data'] = data
-ro.r('r_data$mo_name <- mo_name(r_data$microorganism)')
+# Display the results
+print(df)
+```

-# Retrieve and print the modified R DataFrame in Python
-result = ro.r('r_data')
-result = pandas2ri.rpy2py(result)
+| MOs         | Drug      | MO_clean           | Drug_clean    |
+|-------------|-----------|--------------------|---------------|
+| E. coli     | Cipro     | Escherichia coli   | Ciprofloxacin |
+| ESCCOL      | CIP       | Escherichia coli   | Ciprofloxacin |
+| esco        | J01MA02   | Escherichia coli   | Ciprofloxacin |
+| Esche coli  | Ciproxin  | Escherichia coli   | Ciprofloxacin |
+
+### Explanation
+
+* **mo_name:** This function standardises microorganism names. Here, different variations of *Escherichia coli* (such as "E. coli", "ESCCOL", "esco", and "Esche coli") are all converted into the correct, standardised form, "Escherichia coli".
+
+* **ab_name**: Similarly, this function standardises antimicrobial names. The different representations of ciprofloxacin (e.g., "Cipro", "CIP", "J01MA02", and "Ciproxin") are all converted to the standard name, "Ciprofloxacin".
+
+### Taxonomic Data Sets Now in Python!
+
+As a Python user, you might like that the most important data sets of the `AMR` R package, `microorganisms`, `antibiotics`, `clinical_breakpoints`, and `example_isolates`, are now available as regular Python data frames:
+
+```python
+AMR.microorganisms
+```
+
+| mo           | fullname                           | status   | kingdom  | gbif      | gbif_parent | gbif_renamed_to | prevalence |
+|--------------|------------------------------------|----------|----------|-----------|-------------|-----------------|------------|
+| B_GRAMN      | (unknown Gram-negatives)           | unknown  | Bacteria | None      | None        | None            | 2.0        |
+| B_GRAMP      | (unknown Gram-positives)           | unknown  | Bacteria | None      | None        | None            | 2.0        |
+| B_ANAER-NEG  | (unknown anaerobic Gram-negatives) | unknown  | Bacteria | None      | None        | None            | 2.0        |
+| B_ANAER-POS  | (unknown anaerobic Gram-positives) | unknown  | Bacteria | None      | None        | None            | 2.0        |
+| B_ANAER      | (unknown anaerobic bacteria)       | unknown  | Bacteria | None      | None        | None            | 2.0        |
+| ...          | ...                                | ...      | ...      | ...       | ...         | ...             | ...        |
+| B_ZYMMN_POMC | Zymomonas pomaceae                 | accepted | Bacteria | 10744418  | 3221412     | None            | 2.0        |
+| B_ZYMPH      | Zymophilus                         | synonym  | Bacteria | None      | 9475166     | None            | 2.0        |
+| B_ZYMPH_PCVR | Zymophilus paucivorans             | synonym  | Bacteria | None      | None        | None            | 2.0        |
+| B_ZYMPH_RFFN | Zymophilus raffinosivorans         | synonym  | Bacteria | None      | None        | None            | 2.0        |
+| F_ZYZYG      | Zyzygomyces                        | unknown  | Fungi    | None      | 7581        | None            | 2.0        |
+
+```python
+AMR.antibiotics
+```
+
+| ab  | cid         | name                 | group                      | oral_ddd | oral_units | iv_ddd | iv_units |
+|-----|-------------|----------------------|----------------------------|----------|------------|--------|----------|
+| AMA | 4649.0      | 4-aminosalicylic acid| Antimycobacterials         | 12.00    | g          | NaN    | None     |
+| ACM | 6450012.0   | Acetylmidecamycin    | Macrolides/lincosamides    | NaN      | None       | NaN    | None     |
+| ASP | 49787020.0  | Acetylspiramycin     | Macrolides/lincosamides    | NaN      | None       | NaN    | None     |
+| ALS | 8954.0      | Aldesulfone sodium   | Other antibacterials       | 0.33     | g          | NaN    | None     |
+| AMK | 37768.0     | Amikacin             | Aminoglycosides            | NaN      | None       | 1.0    | g        |
+| ... | ...         | ...                  | ...                        | ...      | ...        | ...    | ...      |
+| VIR | 11979535.0  | Virginiamycine       | Other antibacterials       | NaN      | None       | NaN    | None     |
+| VOR | 71616.0     | Voriconazole         | Antifungals/antimycotics   | 0.40     | g          | 0.4    | g        |
+| XBR | 72144.0     | Xibornol             | Other antibacterials       | NaN      | None       | NaN    | None     |
+| ZID | 77846445.0  | Zidebactam           | Other antibacterials       | NaN      | None       | NaN    | None     |
+| ZFD | NaN         | Zoliflodacin         | None                       | NaN      | None       | NaN    | None     |
+
+
+# Installation
+
+To be able to use the `AMR` Python package, it is required to install both R and the `AMR` R package.
+
+### Preparation: Install R and `AMR` R package
+
+For Linux and macOS, this is just:
+
+```bash
+# Ubuntu / Debian
+sudo apt install r-base && Rscript -e 'install.packages("AMR")'
+# Fedora:
+sudo dnf install R && Rscript -e 'install.packages("AMR")'
+# CentOS/RHEL
+sudo yum install R && Rscript -e 'install.packages("AMR")'
+# Arch Linux
+sudo pacman -S r && Rscript -e 'install.packages("AMR")'
+# macOS
+brew install r && Rscript -e 'install.packages("AMR")'
+```
+
+For Windows, visit the [CRAN download page](https://cran.r-project.org) in install R, then afterwards install the 'AMR' package manually.
+
+### Install `AMR` Python Package
+
+Since the Python package is available on the official [Python Package Index](https://pypi.org/project/AMR/), you can just run:
+
+```bash
+pip install AMR
+```
+
+# Working with `AMR` in Python
+
+Now that we have everything set up, let’s walk through some practical examples of using the `AMR` package within Python.
+
+## Example 1: Calculating AMR
+
+```python
+import AMR
+import pandas as pd
+
+df = AMR.example_isolates
+result = AMR.resistance(df["AMX"])
 print(result)
 ```

-In this example, a Python dataset with microorganism names like *E. coli* and *S. aureus* is passed to the R function `mo_name()`. The result is an updated `DataFrame` that includes the standardised microorganism names based on the `mo_name()` function from the `AMR` package.
+```
+[0.59555556]
+```

-## Example 2: Generating an Antibiogram
+## Example 2: Generating Antibiograms

 One of the core functions of the `AMR` package is generating an antibiogram, a table that summarises the antimicrobial susceptibility of bacterial isolates. Here’s how you can generate an antibiogram from Python:

 ```python
-# Run an antibiogram in R from Python
-ro.r('result <- antibiogram(example_isolates, antibiotics = c(aminoglycosides(), carbapenems()))')
-
-# Retrieve the result in Python
-result = ro.r('as.data.frame(result)')
-print(result)
+result2a = AMR.antibiogram(df[["mo", "AMX", "CIP", "TZP"]])
+print(result2a)
 ```

-In this example, we generate an antibiogram by selecting aminoglycosides and carbapenems, two classes of antibiotics, and then convert the resulting R data frame into a Python-readable format.
+| Pathogen        | Amoxicillin     | Ciprofloxacin   | Piperacillin/tazobactam  |
+|-----------------|-----------------|-----------------|--------------------------|
+| CoNS            | 7% (10/142)     | 73% (183/252)   | 30% (10/33)              |
+| E. coli         | 50% (196/392)   | 88% (399/456)   | 94% (393/416)            |
+| K. pneumoniae   | 0% (0/58)       | 96% (53/55)     | 89% (47/53)              |
+| P. aeruginosa   | 0% (0/30)       | 100% (30/30)    | None                     |
+| P. mirabilis    | None            | 94% (34/36)     | None                     |
+| S. aureus       | 6% (8/131)      | 90% (171/191)   | None                     |
+| S. epidermidis  | 1% (1/91)       | 64% (87/136)    | None                     |
+| S. hominis      | None            | 80% (56/70)     | None                     |
+| S. pneumoniae   | 100% (112/112)  | None            | 100% (112/112)           |

-## Example 3: Filtering Data Based on Gram-Negative Bacteria
-
-Let’s say you want to filter the dataset for Gram-negative bacteria and display their resistance to certain antibiotics:

 ```python
-# Filter for Gram-negative bacteria with intrinsic resistance to cefotaxime
-ro.r('result <- example_isolates[which(mo_is_gram_negative() & mo_is_intrinsic_resistant(ab = "cefotax")), c("bacteria", aminoglycosides(), carbapenems())]')
-
-# Retrieve the filtered result in Python
-result = ro.r('as.data.frame(result)')
-print(result)
+result2b = AMR.antibiogram(df[["mo", "AMX", "CIP", "TZP"]], mo_transform = "gramstain")
+print(result2b)
 ```

-This example uses the AMR functions `mo_is_gram_negative()` and `mo_is_intrinsic_resistant()` to filter the dataset and returns a subset of bacteria with resistance data.
+| Pathogen       | Amoxicillin     | Ciprofloxacin    | Piperacillin/tazobactam  |
+|----------------|-----------------|------------------|--------------------------|
+| Gram-negative  | 36% (226/631)   | 91% (621/684)    | 88% (565/641)            |
+| Gram-positive  | 43% (305/703)   | 77% (560/724)    | 86% (296/345)            |

-## Example 4: Customising the Antibiogram

-You can easily customise the antibiogram by passing different antibiotics or microorganism transformations, as shown below:
-
-```python
-# Customise the antibiogram with different settings
-ro.r('result <- antibiogram(example_isolates, antibiotics = c("TZP", "TZP+TOB", "TZP+GEN"), mo_transform = "gramstain")')
-
-# Retrieve and print the result
-result = ro.r('as.data.frame(result)')
-print(result)
-```
-
-Here, we use piperacillin/tazobactam (TZP) in combination with tobramycin (TOB) and gentamicin (GEN) to see how they perform against various Gram-negative bacteria.
+In this example, we generate an antibiogram by selecting various antibiotics.

 # Conclusion

-Using `rpy2`, you can easily integrate the power of R's `AMR` package into your Python workflows. Whether you are generating antibiograms, analyzing resistance data, or performing complex filtering, `rpy2` gives you the flexibility to run R code without leaving the Python environment. This makes it a perfect solution for teams working across both R and Python.
+With the `AMR` Python package, Python users can now effortlessly call R functions from the `AMR` R package. This eliminates the need for complex `rpy2` configurations and provides a clean, easy-to-use interface for antimicrobial resistance analysis. The examples provided above demonstrate how this can be applied to typical workflows, such as standardising microorganism and antimicrobial names or calculating resistance.
+
+By using `import AMR`, you can seamlessly integrate the robust features of the R `AMR` package into your Python workflows. Whether you're cleaning data or analysing resistance patterns, the `AMR` Python package makes it easy to work with AMR data in Python.
--- a/vignettes/other_pkg.Rmd
+++ b/vignettes/other_pkg.Rmd
@@ -1,25 +0,0 @@
---
-title: "Using AMR with other packages: AMR & dplyr/tidyverse"
-output: 
-  rmarkdown::html_vignette:
-    toc: true
-    toc_depth: 3
-vignette: >
-  %\VignetteIndexEntry{How to conduct AMR data analysis}
-  %\VignetteEncoding{UTF-8}
-  %\VignetteEngine{knitr::rmarkdown}
-editor_options: 
-  chunk_output_type: console
---
-
-```{r setup, include = FALSE, results = 'markup'}
-knitr::opts_chunk$set(
-  warning = FALSE,
-  collapse = TRUE,
-  comment = "#>",
-  fig.width = 7.5,
-  fig.height = 5
-)
-```
-
-This page will be updated shortly, to give explicit examples of how to work ideally with the `AMR` package, for those who are used to working in `dplyr` or other tidyverse packages.