Techniques for Improving the Labelling Process of Sentiment Analysis in the Saudi Stock Market

Abstract

Sentiment analysis is utilised to assess users’ feedback and comments. Recently, researchers have shown an increased interest in this topic due to the spread and expansion of social networks. Users’ feedback and comments are written in unstructured formats, usually with informal language, which presents challenges for sentiment analysis. For the Arabic language, further challenges exist due to the complexity of the language and no sentiment lexicon is available. Therefore, labelling carried out by hand can lead to mislabelling and misclassification. Consequently, inaccurate classification creates the need to construct a relabelling process for Arabic documents to remove noise in labelling. The aim of this study is to improve the labelling process of the sentiment analysis. Two approaches were utilised. First, a neutral class was added to create a framework of reliable Twitter tweets with positive, negative, or neutral sentiments. The second approach was improving the labelling process by relabelling. In this study, the relabelling process applied to only seven random features (positive or negative): “earnings” (ارباح), “losses” (خسائر), “green colour” (باللون_الاخضر), “growing” (زياده), “distribution” (توزيع), “decrease” (انخفاض), “financial penalty” (غرامة), and “delay” (تاجيل). Of the 48 tweets documented and examined, 20 tweets were relabelled and the classification error was reduced by 1.34%.

Authors and Affiliations

Hamed AL-Rubaiee, Renxi Qiu, Khalid Alomar, Dayou Li

Keywords

Related Articles

A Sleep Monitoring System with Sleep-Promoting Functions in Noise Detection and Sound Generation

Recently, there has been a growing demand and interest in developing sleep-promoting systems for improving sleep condition. Because sleep environments are various, and sensitivity to noise differs individually, it is dif...

Improvement of Persian Spam Filtering by Game Theory

There are different methods for dealing with spams; however, since spammers continuously use tricks to defeat the proposed methods, hence, filters should be constantly updated. In this study, Stackelberg game was used to...

  Enhancement of Passive MAC Spoofing Detection Techniques

 Failure of addressing all IEEE 802.11i Robust Security Networks (RSNs) vulnerabilities enforces many researchers to revise robust and reliable Wireless Intrusion Detection Techniques (WIDTs). In this paper we propo...

CluSandra: A Framework and Algorithm for Data Stream Cluster Analysis

The clustering or partitioning of a dataset’s records into groups of similar records is an important aspect of knowledge discovery from datasets. A considerable amount of research has been applied to the identification o...

Most Valuable Player Algorithm for Solving Minimum Vertex Cover Problem

Minimum Vertex Cover Problem (MVCP) is a combinatorial optimization problem that is utilized to formulate multiple real-life applications. Owing to this fact, abundant research has been undertaken to discover valuable MV...

Download PDF file
  • EP ID EP277952
  • DOI 10.14569/IJACSA.2018.090307
  • Views 109
  • Downloads 0

How To Cite

Hamed AL-Rubaiee, Renxi Qiu, Khalid Alomar, Dayou Li (2018). Techniques for Improving the Labelling Process of Sentiment Analysis in the Saudi Stock Market. International Journal of Advanced Computer Science & Applications, 9(3), 34-43. https://www.europub.co.uk/articles/-A-277952