Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Assessing and designing heterogeneous data management solutions for health-related research applications

Thesis (MSc)--Stellenbosch University, 2026.

Saved in:
Bibliographic Details
Main Author: De Castro Silva, Danilo
Other Authors: Dunaiski, Marcel
Format: Thesis
Language:English
Published: Stellenbosch : Stellenbosch University 2026
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867614099378012160
access_status_str Open Access
author De Castro Silva, Danilo
author2 Dunaiski, Marcel
author_browse De Castro Silva, Danilo
Dunaiski, Marcel
author_facet Dunaiski, Marcel
De Castro Silva, Danilo
author_sort De Castro Silva, Danilo
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (MSc)--Stellenbosch University, 2026.
format Thesis
id oai:scholar.sun.ac.za:10019.1/135725
institution Stellenbosch University (South Africa)
language English
last_indexed 2026-06-10T12:46:39.009Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2026
publishDateRange 2026
publishDateSort 2026
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/135725 Assessing and designing heterogeneous data management solutions for health-related research applications De Castro Silva, Danilo Dunaiski, Marcel Moir, Monika Xavier, Joicymara Stellenbosch University. Faculty of Science. Dept. of Computer Science. Thesis (MSc)--Stellenbosch University, 2026. De Castro Silva, D. 2026. Assessing and designing heterogeneous data management solutions for health-related research applications. Unpublished masters thesis. Stellenbosch: Stellenbosch University [online]. Available: https://scholar.sun.ac.za/items/dfe32078-8a8d-4af8-a314-2158e1fba5c6 Data management may be a complex challenge in fields such as bioinformatics and health sciences, which continuously generate extensive heterogeneous datasets. In the context of collaborative global health initiatives, secure storage and sharing of data are crucial to support impactful research. However, the absence of a unified data management platform complicates efficient data exchange and governance within these initiatives. In this paper, we introduce the design process of a data management prototype platform based on a data lakehouse architecture, data federation, and the FAIR principles. The platform is designed using open-source tools, guided by system requirements identified in previously published studies and complemented by insights from the existing literature. The current prototype platform comprises a user-friendly website, an open API, Python and R packages, allowing users to interact with the platform in multiple ways. Through a user study that included participants with varying technical backgrounds, we showed that our proposed data management prototype is both usable and useful. Our prototype design showcases the adaptability, scalability, and reproducibility of a lakehouse system that can be used by any organisation. It is designed as a flexible and complementary approach that allows organisations to customise data management systems to their specific requirements and resources, including cloud-based or self-hosted storage choices. Masters 2026-04-09T05:28:14Z 2026-04-09T05:28:14Z 2026-03 Thesis https://scholar.sun.ac.za/handle/10019.1/135725 en Stellenbosch University 114 pages : ill. application/pdf Stellenbosch : Stellenbosch University
spellingShingle De Castro Silva, Danilo
Assessing and designing heterogeneous data management solutions for health-related research applications
title Assessing and designing heterogeneous data management solutions for health-related research applications
title_full Assessing and designing heterogeneous data management solutions for health-related research applications
title_fullStr Assessing and designing heterogeneous data management solutions for health-related research applications
title_full_unstemmed Assessing and designing heterogeneous data management solutions for health-related research applications
title_short Assessing and designing heterogeneous data management solutions for health-related research applications
title_sort assessing and designing heterogeneous data management solutions for health related research applications
url https://scholar.sun.ac.za/handle/10019.1/135725
work_keys_str_mv AT decastrosilvadanilo assessinganddesigningheterogeneousdatamanagementsolutionsforhealthrelatedresearchapplications