Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

An investigation of the effectiveness of using Twitter data for predicting South African protests with Graph Neural Networks

Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2024.

Saved in:
Bibliographic Details
Other Authors: Marivate, Vukosi
Format: Thesis
Language:English
Published: University of Pretoria 2024
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613567668191232
access_status_str Open Access
author2 Marivate, Vukosi
author_browse Marivate, Vukosi
author_facet Marivate, Vukosi
collection Thesis
dc_rights_str_mv © 2021 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
description Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2024.
format Thesis
id oai:repository.up.ac.za:2263/98149
institution University of Pretoria (South Africa)
language English
last_indexed 2026-06-10T12:38:12.302Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from UPSpace — University of Pretoria Institutional Repository
publishDate 2024
publishDateRange 2024
publishDateSort 2024
publisher University of Pretoria
publisherStr University of Pretoria
record_format dspace
source_str UPSpace — University of Pretoria Institutional Repository
spelling oai:repository.up.ac.za:2263/98149 An investigation of the effectiveness of using Twitter data for predicting South African protests with Graph Neural Networks Marivate, Vukosi Ahmed, Maxamed Ngomane, Derwin UCTD Twitter data Graph Neural Networks South African Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2024. Social media creates an echo chamber effect that is closely related to social movement theory, which aims to mobilise people to change society. In South Africa, there has been an increase in protests that appear to have started on social media. For example, consider the riots that occurred in July 2021 following the arrest of former President Jacob Zuma. Protests in South Africa, on the other hand, have culminated in violent incidents, such as the July 2021 protest. In that situation, the South African Human Rights Commission found that social media sites such as WhatsApp, Facebook, and Twitter aided the violence by sharing protest information. This study investigates whether social media can be utilised to signal upcoming South African protests. This research investigates the effectiveness of nose reduction techniques on Twitter data for predicting protest-related events in South Africa using Graph Neural Networks. It addresses research gaps by addressing the need for graph-based methodologies in the South African context, addressing the lack of noise reduction research for Twitter data, and using an automated method to extract relevant keywords in the word networks. The work aims to provide a new avenue for noise reduction in real-world scenarios where future events have not occurred. This study examines a three-year data window between 2019 and 2021 using the Global Dataset of Events, Location, and Tone (GDELT) and Twitter data. GDELT focuses on CAMEO codes related to protests and conflict, while Twitter extracts social media text related to protest-related posts. A sliding window approach is used to combine the data, with noise-reduction filtration techniques guiding the filtration. This work explores the potential of processing Twitter data to reveal signals for improved predictive capability. Derivative metrics, from hashtags, links, and mentions, are used to reveal such signals. The study compares different machine learning methods, including Logistic Regression, Graph Convolutional Networks, and Graph Isomorphism Networks, to model the data. It is discovered that the geometric deep learning methods struggle with overfitting in hold-out testing data but are stable and have better cross-validation scores. The GIN model exhibits higher accuracy and isomorphism detection, making it suitable for the task. However, graph neural networks struggle with limited data and hence overfit the training data, as well as isomorphism and isolated nodes due to message-passing paradigm. The intricacy of Twitter interactions and conversations is highlighted in this work, empha- sising the need for future research in data processing and model building. The study excluded other data features to add more information about the data space’s complexity, such as user interactions. Keyword selection was done independently, but node eigenvector centrality could be used for informed decision-making. The graph neural network paradigm of message passing has limited capability in the existence of isolated nodes, and isomorphism is crucial for network performance. Further research should investigate dynamic capabilities and edge weights in GIN networks. Computer Science MIT (Big Data Science) Unrestricted Faculty of Engineering, Built Environment and Information Technology SDG-09: Industry, innovation and infrastructure 2024-09-12T09:08:11Z 2024-09-12T09:08:11Z 2024-04 2024-04 Mini Dissertation * A2024 http://hdl.handle.net/2263/98149 en © 2021 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. application/pdf University of Pretoria
spellingShingle UCTD
Twitter data
Graph Neural Networks
South African
An investigation of the effectiveness of using Twitter data for predicting South African protests with Graph Neural Networks
title An investigation of the effectiveness of using Twitter data for predicting South African protests with Graph Neural Networks
title_full An investigation of the effectiveness of using Twitter data for predicting South African protests with Graph Neural Networks
title_fullStr An investigation of the effectiveness of using Twitter data for predicting South African protests with Graph Neural Networks
title_full_unstemmed An investigation of the effectiveness of using Twitter data for predicting South African protests with Graph Neural Networks
title_short An investigation of the effectiveness of using Twitter data for predicting South African protests with Graph Neural Networks
title_sort investigation of the effectiveness of using twitter data for predicting south african protests with graph neural networks
topic UCTD
Twitter data
Graph Neural Networks
South African
url http://hdl.handle.net/2263/98149