Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Using oligonucleotide signatures to build a system for effective detection of pathogenic bacteria in metagenomic samples

Dissertation (MSc)--University of Pretoria, 2009.

Saved in:
Bibliographic Details
Other Authors: Reva, Oleg N.
Format: Thesis
Published: University of Pretoria 2013
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613682314248192
access_status_str Open Access
author2 Reva, Oleg N.
author_browse Reva, Oleg N.
author_facet Reva, Oleg N.
collection Thesis
dc_rights_str_mv © 2008, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
description Dissertation (MSc)--University of Pretoria, 2009.
format Thesis
id oai:repository.up.ac.za:2263/27147
institution University of Pretoria (South Africa)
last_indexed 2026-06-10T12:40:01.664Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from UPSpace — University of Pretoria Institutional Repository
publishDate 2013
publishDateRange 2013
publishDateSort 2013
publisher University of Pretoria
publisherStr University of Pretoria
record_format dspace
source_str UPSpace — University of Pretoria Institutional Repository
spelling oai:repository.up.ac.za:2263/27147 Using oligonucleotide signatures to build a system for effective detection of pathogenic bacteria in metagenomic samples Reva, Oleg N. warren.emmett@gmail.com Emmett, Warren Anthony Pathogenic bacteria Metagenomic samples UCTD Dissertation (MSc)--University of Pretoria, 2009. Pathogenic bacteria are responsible for millions of deaths every year with an estimated mortality of 70 million people by 2010 for Mycobacterium tuberculosis alone. Novel methods for identification of bacterial species in hosts, urban environments, water sources and food stuffs are required to advance diagnosis and preventative medicine. Detection of bacterial species in environmental samples is a complex task since large numbers of bacteria are present and are resistant to culturing. Therefore, the genetic content of the entire sample has to be analysed simultaneously and this constitutes a metagenomic sample. Commonly-used methods of bacterial identification focus on detection of specific genomic regions to determine species. Currently only one percent of a metagenomic sample can be used for identification employing phylogenetic markers. This method is highly inefficient. The search for more widespread markers within each genome is essential to improve detection methods. Also, modern sequencing technologies used in these environments have short read lengths which prove difficult to assemble e.g. repeats can lead to incorrect assembly. The use of overrepresented oligonucleotides provides a solution to both of these difficulties. Overrepresented oligonucleotides (8-14bp in length) are utilised to differentiate between species based on observed frequency of occurrence rather than presence or absence. They occur throughout the genome thereby increasing genomic coverage. Furthermore, overrepresented oligonucleotides can be easily identified in a raw metagenomic sample, bypassing the need for sequence assembly. Raw oligonucleotide data was filtered, analysed and imported into a structured database. A program, Oligosignatures, allowed for creation of species and phylogenetic lineage specific oligonucleotide markers dependent on the selection of species specified by the user. For the purposes of this study, the context of bacterial identification in an unknown environment was selected. A similarity trial was then executed to determine if strains of the same species can be separated from each other using overrepresented oligonucleotides. Outcomes of this test provided a guideline for the creation of species and lineage specific oligonucleotide markers. Each species and lineage was therefore described by a marker profile which consisted of representative oligonucleotide markers. These marker profiles were then tested against artificial and experimental data to determine their effectivity. Two approaches were used for testing, namely Oligonucleotide frequency analysis and Sequence read analysis. Oligonucleotide frequency analysis focused on the identification of species dependent on the global frequencies of marker oligonucleotides within each marker profile. Sequence read analysis attempted to assign metagenomic reads to a specific species dependent on the number of marker oligonucleotides present within the read. The final database contained 439 bacterial genomes from 22 different phylogenetic lineages. Interpretation of the results obtained after strain similarity testing showed that strains of the same species had highly similar markers and were not separable using this approach. All strains of a species that conformed to this premise were reduced to a single representative member. Similarly, species marker profiles demonstrated that closely related species remained difficult to separate. Twenty-one of the 22 lineages showed sufficient lineage specific markers for use in testing. This provides support for the abundance of overrepresented oligonucleotides and their potential for use as a detection method. In general, metagenomic testing of marker profiles showed that species specific determination was prone to interference, specifically, in closely related species. However, more distantly related species could be separated using both methods. Lineage discrimination generated more reliable results proving that lineage determination was possible in both artificial and experimental datasets. Oligonucleotide frequency analysis, the most sensitive approach, showed the best results for lineage determination but poorer results for species identification. Sequence read analysis provided a more effective method of determining confidence using different thresholds for read classification. In conclusion, the use of overrepresented oligonucleotides holds promise as a novel method for bacterial identification in a metagenomic context. Although several obstacles still prevent optimal utilization of these oligonucleotides, with further research the classification and identification of species and phylogenetic lineages from metagenomic samples can become a reality. Copyright Biochemistry unrestricted 2013-09-07T10:46:44Z 2009-09-16 2013-09-07T10:46:44Z 2008-04-15 2009-09-16 2009-08-11 Dissertation Emmett , WA 2008, Using oligonucleotide signatures to build a system for effective detection of pathogenic bacteria in metagenomic samples, MSc dissertation, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/27147 > E1362/gm http://hdl.handle.net/2263/27147 http://upetd.up.ac.za/thesis/available/etd-08112009-151918/ © 2008, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. application/pdf University of Pretoria
spellingShingle Pathogenic bacteria
Metagenomic samples
UCTD
Using oligonucleotide signatures to build a system for effective detection of pathogenic bacteria in metagenomic samples
title Using oligonucleotide signatures to build a system for effective detection of pathogenic bacteria in metagenomic samples
title_full Using oligonucleotide signatures to build a system for effective detection of pathogenic bacteria in metagenomic samples
title_fullStr Using oligonucleotide signatures to build a system for effective detection of pathogenic bacteria in metagenomic samples
title_full_unstemmed Using oligonucleotide signatures to build a system for effective detection of pathogenic bacteria in metagenomic samples
title_short Using oligonucleotide signatures to build a system for effective detection of pathogenic bacteria in metagenomic samples
title_sort using oligonucleotide signatures to build a system for effective detection of pathogenic bacteria in metagenomic samples
topic Pathogenic bacteria
Metagenomic samples
UCTD
url http://hdl.handle.net/2263/27147
http://upetd.up.ac.za/thesis/available/etd-08112009-151918/