Interpreting Viral Deep Sequencing Data with GLUE.

Joshua B Singer ORCID logo ; Emma C Thomson ORCID logo ; Joseph Hughes ORCID logo ; Elihu Aranday-Cortes ORCID logo ; John McLauchlan ORCID logo ; Ana da Silva Filipe ; Lily Tong ; Carmen F Manso ORCID logo ; Robert J Gifford ORCID logo ; David L Robertson ORCID logo ; +6 more... Eleanor Barnes ORCID logo ; M Azim Ansari ORCID logo ; Jean L Mbisa ORCID logo ; David F Bibby ORCID logo ; Daniel Bradshaw ; David Smith ORCID logo ; (2019) Interpreting Viral Deep Sequencing Data with GLUE. Viruses, 11 (4). p. 323. ISSN 1999-4915 DOI: 10.3390/v11040323
Copy

Using deep sequencing technologies such as Illumina's platform, it is possible to obtain reads from the viral RNA population revealing the viral genome diversity within a single host. A range of software tools and pipelines can transform raw deep sequencing reads into Sequence Alignment Mapping (SAM) files. We propose that interpretation tools should process these SAM files, directly translating individual reads to amino acids in order to extract statistics of interest such as the proportion of different amino acid residues at specific sites. This preserves per-read linkage between nucleotide variants at different positions within a codon location. The samReporter is a subsystem of the GLUE software toolkit which follows this direct read translation approach in its processing of SAM files. We test samReporter on a deep sequencing dataset obtained from a cohort of 241 UK HCV patients for whom prior treatment with direct-acting antivirals has failed; deep sequencing and resistance testing have been suggested to be of clinical use in this context. We compared the polymorphism interpretation results of the samReporter against an approach that does not preserve per-read linkage. We found that the samReporter was able to properly interpret the sequence data at resistance-associated locations in nine patients where the alternative approach was equivocal. In three cases, the samReporter confirmed that resistance or an atypical substitution was present at NS5A position 30. In three further cases, it confirmed that the sofosbuvir-resistant NS5B substitution S282T was absent. This suggests the direct read translation approach implemented is of value for interpreting viral deep sequencing data.

picture_as_pdf

picture_as_pdf
Singer-etal-2019-Interpreting-Viral-Deep-Sequencing-Data-with-GLUE.pdf
subject
Published Version
Available under Creative Commons: Attribution 4.0

View Download

Atom BibTeX OpenURL ContextObject in Span Multiline CSV OpenURL ContextObject Dublin Core Dublin Core MPEG-21 DIDL Data Cite XML EndNote HTML Citation JSON MARC (ASCII) MARC (ISO 2709) METS MODS RDF+N3 RDF+N-Triples RDF+XML RIOXX2 XML Reference Manager Refer Simple Metadata ASCII Citation EP3 XML
Export

Downloads