Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

A scalable database model of RFI data for the MeerKAT/SKA radio telescope

In radio astronomy, radio frequency interference (RFI) refers to any signal captured by a radio telescope that did not originate from the observed target in the sky. RFI from terrestrial and other sources is a recognized problem that contaminates the desired signal and must be tracked and ultimately...

Full description

Saved in:
Bibliographic Details
Main Author: Balekaki, Gerald Nathan
Other Authors: Kuttel, Michelle
Format: Thesis
Language:English
Published: Department of Computer Science 2024
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613172668563456
access_status_str Open Access
author Balekaki, Gerald Nathan
author2 Kuttel, Michelle
author_browse Balekaki, Gerald Nathan
Kuttel, Michelle
author_facet Kuttel, Michelle
Balekaki, Gerald Nathan
author_sort Balekaki, Gerald Nathan
collection Thesis
description In radio astronomy, radio frequency interference (RFI) refers to any signal captured by a radio telescope that did not originate from the observed target in the sky. RFI from terrestrial and other sources is a recognized problem that contaminates the desired signal and must be tracked and ultimately removed. RFI corrupts observed data and may even damage radio telescope equipment. Astronomers, therefore, seek to store data on RFI to mitigate or prevent future interference events. At the MeerKAT radio telescope (a precursor to the Square Kilometre Array, and one of the largest and most sensitive radio telescopes in the world to date), RFI is captured in different formats using a variety of devices including telescopes, sensors and scanners; however, the combination of data from these multiple sources does not only yield storage problems but also data integration challenges. In this work, we present two designs for the scalable database model. In the first design, RFI data is stored in multiple databases (PSQL, SciDB, and Accumulo). Our findings indicate that PSQL outperforms both SciDB and Accumulo. Consequently, in the second design (alternative), all RFI data is stored exclusively in PSQL. However, we observed that the performance of the alternative model is impacted by the transformation of SciDB (array data) and Accumulo (key-value data) into PSQL (relational data). Our results recommend storing RFI data in its appropriate database rather than transforming it into another format, as this approach boosts the model's performance. Our model demonstrates a response time of less than 12 seconds for 1 MB RFI data (bulk request), with latency below 0.14 seconds—well within the acceptable maximum latency of 1 second in scalable databases. We found that the native database API is slightly faster (5%) than a third-party API, with no significant impact on the model's performance. In addition, this work indicates the direction for improvement in join queries involving disparate databases, which remains a limitation in heterogeneous environments. Our model facilitates fast queries across various databases, underscoring the importance of storing each data type in the appropriate database system. Lastly, our RFI database model offers good performance and scales effectively with increasing data volumes, multiple users, and varying workloads, making it suitable for the MeerKAT and SKA radio telescopes.
format Thesis
id oai:open.uct.ac.za:11427/40788
institution University of Cape Town (South Africa)
language eng
last_indexed 2026-06-10T12:31:54.917Z
license_str Not specified — see source repository
provenance_str_mv Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate 2024
publishDateRange 2024
publishDateSort 2024
publisher Department of Computer Science
publisherStr Department of Computer Science
record_format dspace
source_str UCTD — University of Cape Town Open Access Repository
spelling oai:open.uct.ac.za:11427/40788 A scalable database model of RFI data for the MeerKAT/SKA radio telescope Balekaki, Gerald Nathan Kuttel, Michelle Blyth, Sarah Schroeder, Anja Computer Science In radio astronomy, radio frequency interference (RFI) refers to any signal captured by a radio telescope that did not originate from the observed target in the sky. RFI from terrestrial and other sources is a recognized problem that contaminates the desired signal and must be tracked and ultimately removed. RFI corrupts observed data and may even damage radio telescope equipment. Astronomers, therefore, seek to store data on RFI to mitigate or prevent future interference events. At the MeerKAT radio telescope (a precursor to the Square Kilometre Array, and one of the largest and most sensitive radio telescopes in the world to date), RFI is captured in different formats using a variety of devices including telescopes, sensors and scanners; however, the combination of data from these multiple sources does not only yield storage problems but also data integration challenges. In this work, we present two designs for the scalable database model. In the first design, RFI data is stored in multiple databases (PSQL, SciDB, and Accumulo). Our findings indicate that PSQL outperforms both SciDB and Accumulo. Consequently, in the second design (alternative), all RFI data is stored exclusively in PSQL. However, we observed that the performance of the alternative model is impacted by the transformation of SciDB (array data) and Accumulo (key-value data) into PSQL (relational data). Our results recommend storing RFI data in its appropriate database rather than transforming it into another format, as this approach boosts the model's performance. Our model demonstrates a response time of less than 12 seconds for 1 MB RFI data (bulk request), with latency below 0.14 seconds—well within the acceptable maximum latency of 1 second in scalable databases. We found that the native database API is slightly faster (5%) than a third-party API, with no significant impact on the model's performance. In addition, this work indicates the direction for improvement in join queries involving disparate databases, which remains a limitation in heterogeneous environments. Our model facilitates fast queries across various databases, underscoring the importance of storing each data type in the appropriate database system. Lastly, our RFI database model offers good performance and scales effectively with increasing data volumes, multiple users, and varying workloads, making it suitable for the MeerKAT and SKA radio telescopes. 2024-12-20T06:25:55Z 2024-12-20T06:25:55Z 2024 2024-12-20T06:24:56Z Thesis / Dissertation Doctoral PhD http://hdl.handle.net/11427/40788 eng application/pdf Department of Computer Science Faculty of Science University of Cape Town
spellingShingle Computer Science
Balekaki, Gerald Nathan
A scalable database model of RFI data for the MeerKAT/SKA radio telescope
thesis_degree_str Doctoral
title A scalable database model of RFI data for the MeerKAT/SKA radio telescope
title_full A scalable database model of RFI data for the MeerKAT/SKA radio telescope
title_fullStr A scalable database model of RFI data for the MeerKAT/SKA radio telescope
title_full_unstemmed A scalable database model of RFI data for the MeerKAT/SKA radio telescope
title_short A scalable database model of RFI data for the MeerKAT/SKA radio telescope
title_sort scalable database model of rfi data for the meerkat ska radio telescope
topic Computer Science
url http://hdl.handle.net/11427/40788
work_keys_str_mv AT balekakigeraldnathan ascalabledatabasemodelofrfidataforthemeerkatskaradiotelescope
AT balekakigeraldnathan scalabledatabasemodelofrfidataforthemeerkatskaradiotelescope