Coll I Cerezo, F; (2015) Bioinformatic analysis of Mycobacterium tuberculosis whole genome data. PhD thesis, London School of Hygiene & Tropical Medicine. DOI: https://doi.org/10.17037/PUBS.02124343
Permanent Identifier
Use this Digital Object Identifier when citing or linking to this resource.
Abstract
Tuberculosis (TB) caused by bacteria of the Mycobacterium- tuberculosis complex (MTBC) is the second major cause of death from an infectious disease worldwide. Recent advances in DNA sequencing are leading to the ability to generate whole genome information of clinical isolates of MTBC. The objectives of this work include developing bioinformatic tools for processing and making accessible MTBC genomic data, as well as the identification of informative genetic markers, both strainOspecific and associated with drug resistance (DR), to barcode MTBC isolates in research and clinical settings. SpolPred software was developed to accurately predict the spoligotype from raw sequence reads, and used to bridge the gap between classical genotyping and highO throughput sequencing. A genome variation discovery pipeline was implemented to derive genomic polymorphisms from MTBC raw sequence data. This pipeline was applied to >1,500 publicly available isolates and the characterised genomic variation hosted in PolyTB, a webObased tool where genetic variants can be investigated using a genome browser, a world map showing their global allele distribution, and an additional phylogenetic view. An extensive repertoire of strainOspecific mutations was identified, of which a subset was proposed to accurately discriminate known MTBC circulating strains. A curated list of DR associated mutations was compiled from the literature and their diagnostic accuracy for predicting phenotypic resistance assessed. In addition, potentially novel genes involved in DR were discovered by applying genomeOwide association approaches to a global population of more than 2,500 MTBC strains. Whole genome sequencing (WGS) promises to be transformative for the practice of clinical microbiology, and the rapidly falling cost and turnaround time mean that this will become a viable technology in clinical settings. In this new paradigm, the presented work will facilitate the transition to and applications of WGS in clinical settings as an important tool for TB control.
Item Type | Thesis |
---|---|
Thesis Type | Doctoral |
Thesis Name | PhD |
Contributors | Clark, Taane |
Faculty and Department | Faculty of Infectious and Tropical Diseases > Department of Infection Biology > Dept of Pathogen Molecular Biology (-2019) |
Funder Name | Bloomsbury Colleges PhD Studentships |
Copyright Holders | Francesc Coll I Cerezo |
Download
Filename: 2015_ITD_PhD_CollICerezo_F.pdf
Licence: Creative Commons: Attribution-Noncommercial-No Derivative Works 3.0
Download