Full Text Available
Note: Clicking the button above will open the full text document at the original institutional repository in a new window.
Thesis (MSc)--Stellenbosch University, 2026.
| Main Author: | |
|---|---|
| Other Authors: | |
| Format: | Thesis |
| Language: | English |
| Published: |
Stellenbosch : Stellenbosch University
2026
|
| Tags: |
No Tags, Be the first to tag this record!
|
| _version_ | 1867614099378012160 |
|---|---|
| access_status_str | Open Access |
| author | De Castro Silva, Danilo |
| author2 | Dunaiski, Marcel |
| author_browse | De Castro Silva, Danilo Dunaiski, Marcel |
| author_facet | Dunaiski, Marcel De Castro Silva, Danilo |
| author_sort | De Castro Silva, Danilo |
| collection | Thesis |
| dc_rights_str_mv | Stellenbosch University |
| description | Thesis (MSc)--Stellenbosch University, 2026. |
| format | Thesis |
| id | oai:scholar.sun.ac.za:10019.1/135725 |
| institution | Stellenbosch University (South Africa) |
| language | English |
| last_indexed | 2026-06-10T12:46:39.009Z |
| license_str | Other — see source repository |
| provenance_str_mv | Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository |
| publishDate | 2026 |
| publishDateRange | 2026 |
| publishDateSort | 2026 |
| publisher | Stellenbosch : Stellenbosch University |
| publisherStr | Stellenbosch : Stellenbosch University |
| record_format | dspace |
| source_str | SUNScholar — Stellenbosch University Repository |
| spelling | oai:scholar.sun.ac.za:10019.1/135725 Assessing and designing heterogeneous data management solutions for health-related research applications De Castro Silva, Danilo Dunaiski, Marcel Moir, Monika Xavier, Joicymara Stellenbosch University. Faculty of Science. Dept. of Computer Science. Thesis (MSc)--Stellenbosch University, 2026. De Castro Silva, D. 2026. Assessing and designing heterogeneous data management solutions for health-related research applications. Unpublished masters thesis. Stellenbosch: Stellenbosch University [online]. Available: https://scholar.sun.ac.za/items/dfe32078-8a8d-4af8-a314-2158e1fba5c6 Data management may be a complex challenge in fields such as bioinformatics and health sciences, which continuously generate extensive heterogeneous datasets. In the context of collaborative global health initiatives, secure storage and sharing of data are crucial to support impactful research. However, the absence of a unified data management platform complicates efficient data exchange and governance within these initiatives. In this paper, we introduce the design process of a data management prototype platform based on a data lakehouse architecture, data federation, and the FAIR principles. The platform is designed using open-source tools, guided by system requirements identified in previously published studies and complemented by insights from the existing literature. The current prototype platform comprises a user-friendly website, an open API, Python and R packages, allowing users to interact with the platform in multiple ways. Through a user study that included participants with varying technical backgrounds, we showed that our proposed data management prototype is both usable and useful. Our prototype design showcases the adaptability, scalability, and reproducibility of a lakehouse system that can be used by any organisation. It is designed as a flexible and complementary approach that allows organisations to customise data management systems to their specific requirements and resources, including cloud-based or self-hosted storage choices. Masters 2026-04-09T05:28:14Z 2026-04-09T05:28:14Z 2026-03 Thesis https://scholar.sun.ac.za/handle/10019.1/135725 en Stellenbosch University 114 pages : ill. application/pdf Stellenbosch : Stellenbosch University |
| spellingShingle | De Castro Silva, Danilo Assessing and designing heterogeneous data management solutions for health-related research applications |
| title | Assessing and designing heterogeneous data management solutions for health-related research applications |
| title_full | Assessing and designing heterogeneous data management solutions for health-related research applications |
| title_fullStr | Assessing and designing heterogeneous data management solutions for health-related research applications |
| title_full_unstemmed | Assessing and designing heterogeneous data management solutions for health-related research applications |
| title_short | Assessing and designing heterogeneous data management solutions for health-related research applications |
| title_sort | assessing and designing heterogeneous data management solutions for health related research applications |
| url | https://scholar.sun.ac.za/handle/10019.1/135725 |
| work_keys_str_mv | AT decastrosilvadanilo assessinganddesigningheterogeneousdatamanagementsolutionsforhealthrelatedresearchapplications |