Full Text Available
Note: Clicking the button above will open the full text document at the original institutional repository in a new window.
In this era of information overload, machine learning and artificial intelligence have been increasingly popular in various fields, including the field of astronomy. These approaches attempt to extract meaningful information from the data through automated means. In this work, we develop generic mac...
| Main Author: | |
|---|---|
| Other Authors: | |
| Format: | Thesis |
| Language: | English |
| Published: |
Department of Statistical Sciences
2023
|
| Subjects: | |
| Tags: |
No Tags, Be the first to tag this record!
|
| _version_ | 1867613164593479680 |
|---|---|
| access_status_str | Open Access |
| author | Bangiso, Aphiwe |
| author2 | Groot, Paul |
| author_browse | Bangiso, Aphiwe Groot, Paul |
| author_facet | Groot, Paul Bangiso, Aphiwe |
| author_sort | Bangiso, Aphiwe |
| collection | Thesis |
| description | In this era of information overload, machine learning and artificial intelligence have been increasingly popular in various fields, including the field of astronomy. These approaches attempt to extract meaningful information from the data through automated means. In this work, we develop generic machine learning models that classify a given transient object from the observed light curve. We train random forest (sect 4.1.1) and multilayer perceptron neural network (sect 4.1.3) models on simulated LSST PLAsTiCC data and real data from the MeerLICHT survey. We found that the random forest model outperforms the neural network model in both data sets, achieving test accuracy of 66.0% and 98.0% in the PLAsTiCC and MeerLICHT data respectively. On the other hand, the neural network model achieved test accuracy of 65.7% and 86.6 % in the PLAsTiCC and MeerLICHT data respectively. For PLAsTiCC simulated data, we also show that grouping all types of supernovae into one aggregate class and discarding distance information improves the performance of both models to 96.5% and 96.0% for random forest and neural networks respectively. As additional work, we attempt to find sub-classes within the M-type class in MeerLiCHT data using k-means and hierarchical clustering algorithms. We find two distinct sub-classes in this data. Namely variable and non-variable M-type stars. |
| format | Thesis |
| id | oai:open.uct.ac.za:11427/37099 |
| institution | University of Cape Town (South Africa) |
| language | eng |
| last_indexed | 2026-06-10T12:31:47.142Z |
| license_str | Not specified — see source repository |
| provenance_str_mv | Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository |
| publishDate | 2023 |
| publishDateRange | 2023 |
| publishDateSort | 2023 |
| publisher | Department of Statistical Sciences |
| publisherStr | Department of Statistical Sciences |
| record_format | dspace |
| source_str | UCTD — University of Cape Town Open Access Repository |
| spelling | oai:open.uct.ac.za:11427/37099 Machine Learning techniques to discover and understand the population of flare stars in MeerLICHT data Bangiso, Aphiwe Groot, Paul Buckley, David Johnston, Cole Astronomy In this era of information overload, machine learning and artificial intelligence have been increasingly popular in various fields, including the field of astronomy. These approaches attempt to extract meaningful information from the data through automated means. In this work, we develop generic machine learning models that classify a given transient object from the observed light curve. We train random forest (sect 4.1.1) and multilayer perceptron neural network (sect 4.1.3) models on simulated LSST PLAsTiCC data and real data from the MeerLICHT survey. We found that the random forest model outperforms the neural network model in both data sets, achieving test accuracy of 66.0% and 98.0% in the PLAsTiCC and MeerLICHT data respectively. On the other hand, the neural network model achieved test accuracy of 65.7% and 86.6 % in the PLAsTiCC and MeerLICHT data respectively. For PLAsTiCC simulated data, we also show that grouping all types of supernovae into one aggregate class and discarding distance information improves the performance of both models to 96.5% and 96.0% for random forest and neural networks respectively. As additional work, we attempt to find sub-classes within the M-type class in MeerLiCHT data using k-means and hierarchical clustering algorithms. We find two distinct sub-classes in this data. Namely variable and non-variable M-type stars. 2023-03-02T07:46:01Z 2023-03-02T07:46:01Z 2022 2023-02-20T12:15:13Z Master Thesis Masters MSc http://hdl.handle.net/11427/37099 eng application/pdf Department of Statistical Sciences Faculty of Science |
| spellingShingle | Astronomy Bangiso, Aphiwe Machine Learning techniques to discover and understand the population of flare stars in MeerLICHT data |
| thesis_degree_str | Master's |
| title | Machine Learning techniques to discover and understand the population of flare stars in MeerLICHT data |
| title_full | Machine Learning techniques to discover and understand the population of flare stars in MeerLICHT data |
| title_fullStr | Machine Learning techniques to discover and understand the population of flare stars in MeerLICHT data |
| title_full_unstemmed | Machine Learning techniques to discover and understand the population of flare stars in MeerLICHT data |
| title_short | Machine Learning techniques to discover and understand the population of flare stars in MeerLICHT data |
| title_sort | machine learning techniques to discover and understand the population of flare stars in meerlicht data |
| topic | Astronomy |
| url | http://hdl.handle.net/11427/37099 |
| work_keys_str_mv | AT bangisoaphiwe machinelearningtechniquestodiscoverandunderstandthepopulationofflarestarsinmeerlichtdata |