Full Text Available

Access Repository

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Identification of evolutionarily conserved nucleic acid secondary structures within viral genomes

Large single-stranded nucleotide sequences such as certain DNA and RNA virus genomes form complex secondary structures through Watson-Crick base-pairing interactions between bases on the same DNA or RNA strand. Both computational and experimental methods are available for determining the secondary s...

Full description

Saved in:

Bibliographic Details
Main Author:	Golden, Michael Thomas
Other Authors:	Martin, Darren
Format:	Thesis
Language:	English
Published:	Department of Molecular and Cell Biology 2014
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1867613224399011840
access_status_str	Open Access
author	Golden, Michael Thomas
author2	Martin, Darren
author_browse	Golden, Michael Thomas Martin, Darren
author_facet	Martin, Darren Golden, Michael Thomas
author_sort	Golden, Michael Thomas
collection	Thesis
description	Large single-stranded nucleotide sequences such as certain DNA and RNA virus genomes form complex secondary structures through Watson-Crick base-pairing interactions between bases on the same DNA or RNA strand. Both computational and experimental methods are available for determining the secondary structure conformations of nucleotide sequences and have been used to identify many thousands of structural elements within these sequences. However, to date only a small subset of these elements have been shown to have any biological functionality. In this study, a software tool enabling the graphical visualisation of structural elements overlaid with additional biological information was developed in order to assist researchers in identifying elements with likely biological functionality within larger secondary structures. Additionally, a tool was developed for identifying conserved structural elements amongst a set of related sequences. Using these tools a number of conserved elements were identified within computationally predicted secondary structures of HIV and five flavivirus species. Further evidence of selection having acted to preserve the identified elements, in support of their likely biological importance, was provided by evolutionary analysis of related sequences. In addition to the identification of individual structural elements, a large-scale analysis of HIV-1 and seven flavivirus species revealed statistically significant associations between base-paired nucleotide positions and various patterns of mutation. While the results of the various analyses carried out were consistent with the presence of between one and five highly-conserved functionally important secondary structures within the various genomes analysed, they were also consistent with the occurrence of many less-conserved ( or only transiently conserved) structures that potentially have no specific biological functions. Although many of these less-conserved structures may, as has previously been suggested, defend these genomes against intra-cellular anti-viral siRNAs, a strong association was found between sites that are base-paired and codon sites that are apparently evolving under purifying selection: a finding which suggests that an additional or alternative role of these structures may simply be to reduce basal mutation frequencies, and hence amino acid substitution rates at codon sites that encode functionally important amino acids. An investigation of HIV-1 natural inter- and intra-subtype recombinant sequences revealed that recombination breakpoints had a significant tendency to occur more frequently at base-paired nucleotide positions within the computationally predicted secondary structures of these sequences. Furthermore, it was found that these natural HIV-1 recombinants were less disruptive of protein tertiary structure and RNA secondary structure conformations than expected by chance: suggesting that natural selection favouring the maintenance of both protein and nucleic acid folding has likely had a major influence on the patterns of recombination that are detectable within HIV genomes. Finally, a general method for predicting the secondary structures of a set of related sequences sharing similar secondary structure conformations was developed. By taking alignment uncertainty into account through the analysis of large numbers of similarly plausible multiple sequence alignments that are sampled using a statistical alignment procedure, the method was found to have better predictive accuracy than existing related RNA secondary structure prediction methods.
format	Thesis
id	oai:open.uct.ac.za:11427/5972
institution	University of Cape Town (South Africa)
language	eng
last_indexed	2026-06-10T12:32:44.899Z
license_str	Not specified — see source repository
provenance_str_mv	Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate	2014
publishDateRange	2014
publishDateSort	2014
publisher	Department of Molecular and Cell Biology
publisherStr	Department of Molecular and Cell Biology
record_format	dspace
source_str	UCTD — University of Cape Town Open Access Repository
spelling	oai:open.uct.ac.za:11427/5972 Identification of evolutionarily conserved nucleic acid secondary structures within viral genomes Golden, Michael Thomas Martin, Darren Large single-stranded nucleotide sequences such as certain DNA and RNA virus genomes form complex secondary structures through Watson-Crick base-pairing interactions between bases on the same DNA or RNA strand. Both computational and experimental methods are available for determining the secondary structure conformations of nucleotide sequences and have been used to identify many thousands of structural elements within these sequences. However, to date only a small subset of these elements have been shown to have any biological functionality. In this study, a software tool enabling the graphical visualisation of structural elements overlaid with additional biological information was developed in order to assist researchers in identifying elements with likely biological functionality within larger secondary structures. Additionally, a tool was developed for identifying conserved structural elements amongst a set of related sequences. Using these tools a number of conserved elements were identified within computationally predicted secondary structures of HIV and five flavivirus species. Further evidence of selection having acted to preserve the identified elements, in support of their likely biological importance, was provided by evolutionary analysis of related sequences. In addition to the identification of individual structural elements, a large-scale analysis of HIV-1 and seven flavivirus species revealed statistically significant associations between base-paired nucleotide positions and various patterns of mutation. While the results of the various analyses carried out were consistent with the presence of between one and five highly-conserved functionally important secondary structures within the various genomes analysed, they were also consistent with the occurrence of many less-conserved ( or only transiently conserved) structures that potentially have no specific biological functions. Although many of these less-conserved structures may, as has previously been suggested, defend these genomes against intra-cellular anti-viral siRNAs, a strong association was found between sites that are base-paired and codon sites that are apparently evolving under purifying selection: a finding which suggests that an additional or alternative role of these structures may simply be to reduce basal mutation frequencies, and hence amino acid substitution rates at codon sites that encode functionally important amino acids. An investigation of HIV-1 natural inter- and intra-subtype recombinant sequences revealed that recombination breakpoints had a significant tendency to occur more frequently at base-paired nucleotide positions within the computationally predicted secondary structures of these sequences. Furthermore, it was found that these natural HIV-1 recombinants were less disruptive of protein tertiary structure and RNA secondary structure conformations than expected by chance: suggesting that natural selection favouring the maintenance of both protein and nucleic acid folding has likely had a major influence on the patterns of recombination that are detectable within HIV genomes. Finally, a general method for predicting the secondary structures of a set of related sequences sharing similar secondary structure conformations was developed. By taking alignment uncertainty into account through the analysis of large numbers of similarly plausible multiple sequence alignments that are sampled using a statistical alignment procedure, the method was found to have better predictive accuracy than existing related RNA secondary structure prediction methods. 2014-08-02T14:56:46Z 2014-08-02T14:56:46Z 2013 Master Thesis Masters MSc (Med) http://hdl.handle.net/11427/5972 eng application/pdf Department of Molecular and Cell Biology Faculty of Science University of Cape Town
spellingShingle	Golden, Michael Thomas Identification of evolutionarily conserved nucleic acid secondary structures within viral genomes
thesis_degree_str	Master's
title	Identification of evolutionarily conserved nucleic acid secondary structures within viral genomes
title_full	Identification of evolutionarily conserved nucleic acid secondary structures within viral genomes
title_fullStr	Identification of evolutionarily conserved nucleic acid secondary structures within viral genomes
title_full_unstemmed	Identification of evolutionarily conserved nucleic acid secondary structures within viral genomes
title_short	Identification of evolutionarily conserved nucleic acid secondary structures within viral genomes
title_sort	identification of evolutionarily conserved nucleic acid secondary structures within viral genomes
url	http://hdl.handle.net/11427/5972
work_keys_str_mv	AT goldenmichaelthomas identificationofevolutionarilyconservednucleicacidsecondarystructureswithinviralgenomes

Full Text Available

Identification of evolutionarily conserved nucleic acid secondary structures within viral genomes

Similar Items