TB-ML-a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis.

Julian Libiseller-Egger ORCID logo ; Linfeng Wang ; Wouter Deelder ORCID logo ; Susana Campino ORCID logo ; Taane G Clark ORCID logo ; Jody E Phelan ORCID logo ; (2023) TB-ML-a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis. Bioinformatics advances, 3 (1). vbad040-. ISSN 2635-0041 DOI: 10.1093/bioadv/vbad040
Copy

MOTIVATION: Machine learning (ML) has shown impressive performance in predicting antimicrobial resistance (AMR) from sequence data, including for Mycobacterium tuberculosis, the causative agent of tuberculosis. However, current ML development and publication practices make it difficult for researchers and clinicians to use, test or reproduce published models. RESULTS: We packaged a number of published and unpublished ML models for predicting AMR of M.tuberculosis into Docker containers. Similarly, the pipelines required for pre-processing genomic data into the formats required by the models were also packaged into separate containers. By following a minimal container I/O standard, we ensured as much interoperability as possible. We also created a command-line application, TB-ML, which can be used to easily combine pre-processing and prediction containers into complete pipelines ready for predicting resistance from novel, raw data with a single command. As long as there is adherence to this minimal standard for the container interface, containers produced by researchers holding new models can likewise be included in these pipelines, making benchmark comparisons of different models simple and facilitating faster uptake in the clinic. AVAILABILITY AND IMPLEMENTATION: TB-ML contains a simple Docker API written in Python and is available at https://github.com/jodyphelan/tb-ml. Example Docker containers for resistance prediction and corresponding data pre-processing as well as a tutorial on how to create new containers for TB-ML are available at https://tb-ml.github.io/tb-ml-containers/. CONTACT: jody.phelan@lshtm.ac.uk.


picture_as_pdf
Egger-etal-2023-TB-ml-a-framework-for.pdf
subject
Published Version
Available under Creative Commons: Attribution 4.0

View Download

Atom BibTeX OpenURL ContextObject in Span Multiline CSV OpenURL ContextObject Dublin Core Dublin Core MPEG-21 DIDL Data Cite XML EndNote HTML Citation JSON MARC (ASCII) MARC (ISO 2709) METS MODS RDF+N3 RDF+N-Triples RDF+XML RIOXX2 XML Reference Manager Refer Simple Metadata ASCII Citation EP3 XML
Export

Downloads