Full Text Available

Access Repository

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

SMS spam detection and classification to combat abuse in telephone networks using natural language processing

In the modern era, mobile phones have become ubiquitous, and Short Message Service (SMS) has grown to become a multi-million-dollar service due to the widespread adoption of mobile devices and the millions of people who use SMS daily. However, SMS spam has also become a pervasive problem that endang...

Full description

Saved in:

Bibliographic Details
Format:	Article
Published:	2023
Subjects:	SMS spam Logistic regression Natural language processing Naive bayes Machine learning SVM classifier BERT model Gradient boosting Random forest classifier
Tags:	Add Tag No Tags, Be the first to tag this record!

MARC


LEADER	00000njm a2000000a 4500
001	oai:repository.ui.edu.ng:123456789/11383
042			\|a dc
720			\|a Oyeyemi, D. A. \|e author
720			\|a Ojo, A. K. \|e author
260			\|c 2023
520			\|a In the modern era, mobile phones have become ubiquitous, and Short Message Service (SMS) has grown to become a multi-million-dollar service due to the widespread adoption of mobile devices and the millions of people who use SMS daily. However, SMS spam has also become a pervasive problem that endangers users' privacy and security through phishing and fraud. Despite numerous spam filtering techniques, there is still a need for a more effective solution to address this problem [1]. This research addresses the pervasive issue of SMS spam, which poses threats to users' privacy and security. Despite existing spam filtering techniques, the high false-positive rate persists as a challenge. The study introduces a novel approach utilizing Natural Language Processing (NLP) and machine learning models, particularly BERT (Bidirectional Encoder Representations from Transformers), for SMS spam detection and classification. Data preprocessing techniques, such as stop word removal and tokenization, are applied, along with feature extraction using BERT. Machine learning models, including SVM, Logistic Regression, Naive Bayes, Gradient Boosting, and Random Forest, are integrated with BERT for differentiating spam from ham messages. Evaluation results revealed that the Naïve Bayes classifier + BERT model achieves the highest accuracy at 97.31% with the fastest execution time of 0.3 seconds on the test dataset. This approach demonstrates a notable enhancement in spam detection efficiency and a low false-positive rate. The developed model presents a valuable solution to combat SMS spam, ensuring faster and more accurate detection. This model not only safeguards users' privacy but also assists network providers in effectively identifying and blocking SMS spam messages.
024	8		\|a 2456-9968
024	8		\|a ui_art_ojo_sms_2023
024	8		\|a Journal of Advances in Mathematics and Computer Science 38(10), pp. 144-156
024	8		\|a https://repository.ui.edu.ng/handle/123456789/11383
653			\|a SMS spam
653			\|a Logistic regression
653			\|a Natural language processing
653			\|a Naive bayes
653			\|a Machine learning
653			\|a SVM classifier
653			\|a BERT model
653			\|a Gradient boosting
653			\|a Random forest classifier
245	0	0	\|a SMS spam detection and classification to combat abuse in telephone networks using natural language processing

Full Text Available

SMS spam detection and classification to combat abuse in telephone networks using natural language processing

MARC

Similar Items