Full Text Available

Access Repository

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Outcome selection in longitudinal analysis of immunological data

Immunological research often compares subgroups defined by exposure variables known (or hypothesised) to influence continuous immune responses. Many immune outcomes are measured over time, often in a small number of patients. Effective outcome selection ensures that research focuses on immune outcom...

Full description

Saved in:

Bibliographic Details
Main Author:	Holcroft, Shannon
Other Authors:	Little, Francesca
Format:	Thesis
Language:	English
Published:	Department of Statistical Sciences 2025
Subjects:	principal component analysis PCA
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1867613144176656384
access_status_str	Open Access
author	Holcroft, Shannon
author2	Little, Francesca
author_browse	Holcroft, Shannon Little, Francesca
author_facet	Little, Francesca Holcroft, Shannon
author_sort	Holcroft, Shannon
collection	Thesis
description	Immunological research often compares subgroups defined by exposure variables known (or hypothesised) to influence continuous immune responses. Many immune outcomes are measured over time, often in a small number of patients. Effective outcome selection ensures that research focuses on immune outcomes with the strongest signals for subgroup differences. This dissertation explores an outcome selection technique for longitudinal immunological data, addressing current methodological limitations and proposing improvements. The approach integrates statistical modelling with dimension reduction to identify immune outcomes with the most evidence for subgroup differences. By focusing on these subsets, fewer statistical hypotheses are tested simultaneously, preserving power when stricter significance thresholds are applied to reduce type-I error inflation. The dissertation examines the suitability of different longitudinal modelling frameworks. Generalised linear mixed-effects models are better suited to the characteristics of immunological data and research than linear mixed-effects models. Two dimension reduction techniques are compared: principal component analysis (PCA) and hierarchical cluster analysis (HCA) followed by PCA. PCA identifies the largest sources of variance across all outcomes, while HCA followed by PCA identifies variance within groups of similar outcomes. These techniques influence the definition of families of tests for false discovery rate (FDR) corrections. When outcomes are selected via PCA-only dimension reduction, more tests are performed simultaneously and require correction. It was hypothesised that HCA followed by PCA would yield more significant discoveries after FDR control. However, fewer simultaneous comparisons did not reliably correspond with more statistically significant discoveries. The methodology was applied to a dataset from the South African Tuberculosis Vaccine Initiative (SATVI), focusing on 33 immune outcomes and three exposures: MVA85A priming, maternal Mycobacterium tuberculosis sensitisation (measured by a positive QuantiFERONTB Gold test), and combinations of feeding practices and cotrimoxazole treatment. The analysis shows that different dimension reduction techniques lead to different outcome selections and families of tests, emphasising the need to align analysis objectives with outcome selection techniques. This dissertation contributes to outcome selection methodology in high-dimensional, longitudinal settings, with broader applications in biomedical research.
format	Thesis
id	oai:open.uct.ac.za:11427/42279
institution	University of Cape Town (South Africa)
language	eng
last_indexed	2026-06-10T12:31:28.055Z
license_str	Not specified — see source repository
provenance_str_mv	Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate	2025
publishDateRange	2025
publishDateSort	2025
publisher	Department of Statistical Sciences
publisherStr	Department of Statistical Sciences
record_format	dspace
source_str	UCTD — University of Cape Town Open Access Repository
spelling	oai:open.uct.ac.za:11427/42279 Outcome selection in longitudinal analysis of immunological data Holcroft, Shannon Little, Francesca Nemes, Elisa principal component analysis PCA Immunological research often compares subgroups defined by exposure variables known (or hypothesised) to influence continuous immune responses. Many immune outcomes are measured over time, often in a small number of patients. Effective outcome selection ensures that research focuses on immune outcomes with the strongest signals for subgroup differences. This dissertation explores an outcome selection technique for longitudinal immunological data, addressing current methodological limitations and proposing improvements. The approach integrates statistical modelling with dimension reduction to identify immune outcomes with the most evidence for subgroup differences. By focusing on these subsets, fewer statistical hypotheses are tested simultaneously, preserving power when stricter significance thresholds are applied to reduce type-I error inflation. The dissertation examines the suitability of different longitudinal modelling frameworks. Generalised linear mixed-effects models are better suited to the characteristics of immunological data and research than linear mixed-effects models. Two dimension reduction techniques are compared: principal component analysis (PCA) and hierarchical cluster analysis (HCA) followed by PCA. PCA identifies the largest sources of variance across all outcomes, while HCA followed by PCA identifies variance within groups of similar outcomes. These techniques influence the definition of families of tests for false discovery rate (FDR) corrections. When outcomes are selected via PCA-only dimension reduction, more tests are performed simultaneously and require correction. It was hypothesised that HCA followed by PCA would yield more significant discoveries after FDR control. However, fewer simultaneous comparisons did not reliably correspond with more statistically significant discoveries. The methodology was applied to a dataset from the South African Tuberculosis Vaccine Initiative (SATVI), focusing on 33 immune outcomes and three exposures: MVA85A priming, maternal Mycobacterium tuberculosis sensitisation (measured by a positive QuantiFERONTB Gold test), and combinations of feeding practices and cotrimoxazole treatment. The analysis shows that different dimension reduction techniques lead to different outcome selections and families of tests, emphasising the need to align analysis objectives with outcome selection techniques. This dissertation contributes to outcome selection methodology in high-dimensional, longitudinal settings, with broader applications in biomedical research. 2025-11-20T11:12:19Z 2025-11-20T11:12:19Z 2025 2025-11-20T11:06:22Z Thesis / Dissertation Masters MSc http://hdl.handle.net/11427/42279 eng application/pdf Department of Statistical Sciences Faculty of Science University of Cape Town
spellingShingle	principal component analysis PCA Holcroft, Shannon Outcome selection in longitudinal analysis of immunological data
thesis_degree_str	Master's
title	Outcome selection in longitudinal analysis of immunological data
title_full	Outcome selection in longitudinal analysis of immunological data
title_fullStr	Outcome selection in longitudinal analysis of immunological data
title_full_unstemmed	Outcome selection in longitudinal analysis of immunological data
title_short	Outcome selection in longitudinal analysis of immunological data
title_sort	outcome selection in longitudinal analysis of immunological data
topic	principal component analysis PCA
url	http://hdl.handle.net/11427/42279
work_keys_str_mv	AT holcroftshannon outcomeselectioninlongitudinalanalysisofimmunologicaldata

Full Text Available

Outcome selection in longitudinal analysis of immunological data

Similar Items