Full Text Available
Note: Clicking the button above will open the full text document at the original institutional repository in a new window.
Immunological research often compares subgroups defined by exposure variables known (or hypothesised) to influence continuous immune responses. Many immune outcomes are measured over time, often in a small number of patients. Effective outcome selection ensures that research focuses on immune outcom...
| Main Author: | |
|---|---|
| Other Authors: | |
| Format: | Thesis |
| Language: | English |
| Published: |
Department of Statistical Sciences
2025
|
| Subjects: | |
| Tags: |
No Tags, Be the first to tag this record!
|
| _version_ | 1867613144176656384 |
|---|---|
| access_status_str | Open Access |
| author | Holcroft, Shannon |
| author2 | Little, Francesca |
| author_browse | Holcroft, Shannon Little, Francesca |
| author_facet | Little, Francesca Holcroft, Shannon |
| author_sort | Holcroft, Shannon |
| collection | Thesis |
| description | Immunological research often compares subgroups defined by exposure variables known (or hypothesised) to influence continuous immune responses. Many immune outcomes are measured over time, often in a small number of patients. Effective outcome selection ensures that research focuses on immune outcomes with the strongest signals for subgroup differences. This dissertation explores an outcome selection technique for longitudinal immunological data, addressing current methodological limitations and proposing improvements. The approach integrates statistical modelling with dimension reduction to identify immune outcomes with the most evidence for subgroup differences. By focusing on these subsets, fewer statistical hypotheses are tested simultaneously, preserving power when stricter significance thresholds are applied to reduce type-I error inflation. The dissertation examines the suitability of different longitudinal modelling frameworks. Generalised linear mixed-effects models are better suited to the characteristics of immunological data and research than linear mixed-effects models. Two dimension reduction techniques are compared: principal component analysis (PCA) and hierarchical cluster analysis (HCA) followed by PCA. PCA identifies the largest sources of variance across all outcomes, while HCA followed by PCA identifies variance within groups of similar outcomes. These techniques influence the definition of families of tests for false discovery rate (FDR) corrections. When outcomes are selected via PCA-only dimension reduction, more tests are performed simultaneously and require correction. It was hypothesised that HCA followed by PCA would yield more significant discoveries after FDR control. However, fewer simultaneous comparisons did not reliably correspond with more statistically significant discoveries. The methodology was applied to a dataset from the South African Tuberculosis Vaccine Initiative (SATVI), focusing on 33 immune outcomes and three exposures: MVA85A priming, maternal Mycobacterium tuberculosis sensitisation (measured by a positive QuantiFERONTB Gold test), and combinations of feeding practices and cotrimoxazole treatment. The analysis shows that different dimension reduction techniques lead to different outcome selections and families of tests, emphasising the need to align analysis objectives with outcome selection techniques. This dissertation contributes to outcome selection methodology in high-dimensional, longitudinal settings, with broader applications in biomedical research. |
| format | Thesis |
| id | oai:open.uct.ac.za:11427/42279 |
| institution | University of Cape Town (South Africa) |
| language | eng |
| last_indexed | 2026-06-10T12:31:28.055Z |
| license_str | Not specified — see source repository |
| provenance_str_mv | Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository |
| publishDate | 2025 |
| publishDateRange | 2025 |
| publishDateSort | 2025 |
| publisher | Department of Statistical Sciences |
| publisherStr | Department of Statistical Sciences |
| record_format | dspace |
| source_str | UCTD — University of Cape Town Open Access Repository |
| spelling | oai:open.uct.ac.za:11427/42279 Outcome selection in longitudinal analysis of immunological data Holcroft, Shannon Little, Francesca Nemes, Elisa principal component analysis PCA Immunological research often compares subgroups defined by exposure variables known (or hypothesised) to influence continuous immune responses. Many immune outcomes are measured over time, often in a small number of patients. Effective outcome selection ensures that research focuses on immune outcomes with the strongest signals for subgroup differences. This dissertation explores an outcome selection technique for longitudinal immunological data, addressing current methodological limitations and proposing improvements. The approach integrates statistical modelling with dimension reduction to identify immune outcomes with the most evidence for subgroup differences. By focusing on these subsets, fewer statistical hypotheses are tested simultaneously, preserving power when stricter significance thresholds are applied to reduce type-I error inflation. The dissertation examines the suitability of different longitudinal modelling frameworks. Generalised linear mixed-effects models are better suited to the characteristics of immunological data and research than linear mixed-effects models. Two dimension reduction techniques are compared: principal component analysis (PCA) and hierarchical cluster analysis (HCA) followed by PCA. PCA identifies the largest sources of variance across all outcomes, while HCA followed by PCA identifies variance within groups of similar outcomes. These techniques influence the definition of families of tests for false discovery rate (FDR) corrections. When outcomes are selected via PCA-only dimension reduction, more tests are performed simultaneously and require correction. It was hypothesised that HCA followed by PCA would yield more significant discoveries after FDR control. However, fewer simultaneous comparisons did not reliably correspond with more statistically significant discoveries. The methodology was applied to a dataset from the South African Tuberculosis Vaccine Initiative (SATVI), focusing on 33 immune outcomes and three exposures: MVA85A priming, maternal Mycobacterium tuberculosis sensitisation (measured by a positive QuantiFERONTB Gold test), and combinations of feeding practices and cotrimoxazole treatment. The analysis shows that different dimension reduction techniques lead to different outcome selections and families of tests, emphasising the need to align analysis objectives with outcome selection techniques. This dissertation contributes to outcome selection methodology in high-dimensional, longitudinal settings, with broader applications in biomedical research. 2025-11-20T11:12:19Z 2025-11-20T11:12:19Z 2025 2025-11-20T11:06:22Z Thesis / Dissertation Masters MSc http://hdl.handle.net/11427/42279 eng application/pdf Department of Statistical Sciences Faculty of Science University of Cape Town |
| spellingShingle | principal component analysis PCA Holcroft, Shannon Outcome selection in longitudinal analysis of immunological data |
| thesis_degree_str | Master's |
| title | Outcome selection in longitudinal analysis of immunological data |
| title_full | Outcome selection in longitudinal analysis of immunological data |
| title_fullStr | Outcome selection in longitudinal analysis of immunological data |
| title_full_unstemmed | Outcome selection in longitudinal analysis of immunological data |
| title_short | Outcome selection in longitudinal analysis of immunological data |
| title_sort | outcome selection in longitudinal analysis of immunological data |
| topic | principal component analysis PCA |
| url | http://hdl.handle.net/11427/42279 |
| work_keys_str_mv | AT holcroftshannon outcomeselectioninlongitudinalanalysisofimmunologicaldata |