Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Arabic sentence-level sentiment analysis

Sentiment analysis has recently become one of the growing areas of research related to text mining and natural language processing. The increasing availability of online resources and popularity of rich and fast resources for opinion sharing like news, online review sites and personal blogs, caused...

Full description

Saved in:
Bibliographic Details
Main Author: Shoukry, Amira Magdy
Format: Thesis
Published: AUC Knowledge Fountain 2013
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613416500232192
access_status_str Open Access
author Shoukry, Amira Magdy
author_browse Shoukry, Amira Magdy
author_facet Shoukry, Amira Magdy
author_sort Shoukry, Amira Magdy
collection Thesis
dc_rights_str_mv The author retains all rights with regard to copyright. The author certifies that written permission from the owner(s) of third-party copyrighted matter included in the thesis, dissertation, paper, or record of study has been obtained. The author further certifies that IRB approval has been obtained for this thesis, or that IRB approval is not necessary for this thesis. Insofar as this thesis, dissertation, paper, or record of study is an educational record as defined in the Family Educational Rights and Privacy Act (FERPA) (20 USC 1232g), the author has granted consent to disclosure of it to anyone who requests a copy.
description Sentiment analysis has recently become one of the growing areas of research related to text mining and natural language processing. The increasing availability of online resources and popularity of rich and fast resources for opinion sharing like news, online review sites and personal blogs, caused several parties such as customers, companies, and governments to start analyzing and exploring these opinions. The main task of sentiment classification is to classify a sentence (i.e. review, blog, comment, news, etc.) as holding an overall positive, negative or neutral sentiment. Most of the current studies related to this topic focus mainly on English texts with very limited resources available for other languages like Arabic, especially for the Egyptian dialect. In this research work, we would like to improve the performance measures of Egyptian dialect sentence-level sentiment analysis by proposing a hybrid approach which combines both the machine learning approach using support vector machines and the semantic orientation approach. Two methodologies were proposed, one for each approach, which were then joined, creating the hybrid proposed approach. The corpus used contains more than 20,000 Egyptian dialect tweets collected from Twitter, from which 4800 manually annotated tweets will be used (1600 positive tweets, 1600 negative tweets and 1600 neutral tweets). We performed several experiments to: 1) compare the results of each approach individually with regards to our case which is dealing with the Egyptian dialect before and after preprocessing; 2) compare the performance of merging both approaches together generating the hybrid approach against the performance of each approach separately; and 3) evaluate the effectiveness of considering negation on the performance of the hybrid approach. The results obtained show significant improvements in terms of the accuracy, precision, recall and F-measure, indicating that our proposed hybrid approach is effective in sentence-level sentiment classification. Also, the results are very promising which encourages continuing in this line of research.
format Thesis
id oai:fount.aucegypt.edu:etds-2214
institution American University in Cairo (Egypt)
last_indexed 2026-06-10T12:35:47.730Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from AUC Knowledge Fountain — bepress
publishDate 2013
publishDateRange 2013
publishDateSort 2013
publisher AUC Knowledge Fountain
publisherStr AUC Knowledge Fountain
record_format dspace
source_str AUC Knowledge Fountain — bepress
spelling oai:fount.aucegypt.edu:etds-2214 Arabic sentence-level sentiment analysis Shoukry, Amira Magdy Sentiment analysis has recently become one of the growing areas of research related to text mining and natural language processing. The increasing availability of online resources and popularity of rich and fast resources for opinion sharing like news, online review sites and personal blogs, caused several parties such as customers, companies, and governments to start analyzing and exploring these opinions. The main task of sentiment classification is to classify a sentence (i.e. review, blog, comment, news, etc.) as holding an overall positive, negative or neutral sentiment. Most of the current studies related to this topic focus mainly on English texts with very limited resources available for other languages like Arabic, especially for the Egyptian dialect. In this research work, we would like to improve the performance measures of Egyptian dialect sentence-level sentiment analysis by proposing a hybrid approach which combines both the machine learning approach using support vector machines and the semantic orientation approach. Two methodologies were proposed, one for each approach, which were then joined, creating the hybrid proposed approach. The corpus used contains more than 20,000 Egyptian dialect tweets collected from Twitter, from which 4800 manually annotated tweets will be used (1600 positive tweets, 1600 negative tweets and 1600 neutral tweets). We performed several experiments to: 1) compare the results of each approach individually with regards to our case which is dealing with the Egyptian dialect before and after preprocessing; 2) compare the performance of merging both approaches together generating the hybrid approach against the performance of each approach separately; and 3) evaluate the effectiveness of considering negation on the performance of the hybrid approach. The results obtained show significant improvements in terms of the accuracy, precision, recall and F-measure, indicating that our proposed hybrid approach is effective in sentence-level sentiment classification. Also, the results are very promising which encourages continuing in this line of research. 2013-06-01T07:00:00Z thesis application/pdf https://fount.aucegypt.edu/etds/1215 https://fount.aucegypt.edu/context/etds/article/2214/viewcontent/Arabic_Sentence_Level_Sentiment_Analysis.pdf The author retains all rights with regard to copyright. The author certifies that written permission from the owner(s) of third-party copyrighted matter included in the thesis, dissertation, paper, or record of study has been obtained. The author further certifies that IRB approval has been obtained for this thesis, or that IRB approval is not necessary for this thesis. Insofar as this thesis, dissertation, paper, or record of study is an educational record as defined in the Family Educational Rights and Privacy Act (FERPA) (20 USC 1232g), the author has granted consent to disclosure of it to anyone who requests a copy. Theses and Dissertations AUC Knowledge Fountain tural language processing (Computer science) Arabic language
spellingShingle tural language processing (Computer science)
Arabic language
Shoukry, Amira Magdy
Arabic sentence-level sentiment analysis
title Arabic sentence-level sentiment analysis
title_full Arabic sentence-level sentiment analysis
title_fullStr Arabic sentence-level sentiment analysis
title_full_unstemmed Arabic sentence-level sentiment analysis
title_short Arabic sentence-level sentiment analysis
title_sort arabic sentence level sentiment analysis
topic tural language processing (Computer science)
Arabic language
url https://fount.aucegypt.edu/etds/1215
https://fount.aucegypt.edu/context/etds/article/2214/viewcontent/Arabic_Sentence_Level_Sentiment_Analysis.pdf
work_keys_str_mv AT shoukryamiramagdy arabicsentencelevelsentimentanalysis