Emotion Recognition from Speech using Prosodic and Linguistic Features

Abstract

Emotions can be extracted from the speech signal, but variability in the signal makes this a challenging task. A number of factors indicate the presence of emotion. Prosodic and temporal features have previously been used to identify emotions, and emotions can also be inferred from linguistic features when the spoken content can be identified. Used separately, however, prosodic/temporal and linguistic features do not give adequately accurate results. We therefore consider prosodic and temporal features together with linguistic features, which helps increase the accuracy of emotion recognition; this is the first contribution reported in this paper. We propose a two-step model for emotion recognition: in the first step we extract emotions based on prosodic features, and in the second step we extract emotions from word segmentation combined with linguistic features. Our experiments show that classifiers trained without considering the age factor do not help improve accuracy. We argue that the classifier should be trained for the age group for which emotion extraction is actually required; this is the second contribution of this paper.
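The two-step idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the features, the classifier, or how the two steps are combined. The sketch assumes summary statistics of pitch and energy as prosodic features, a nearest-centroid classifier kept separately per age group, a toy keyword lexicon for the linguistic step, and a weighted sum for fusion. All names (`prosodic_features`, `LEXICON`, `recognise`, the `weight` parameter) and all numbers are hypothetical.

```python
from statistics import mean, pstdev

def prosodic_features(pitch_hz, energy):
    """Summarise frame-level pitch and energy contours as a 4-value vector."""
    return [mean(pitch_hz), pstdev(pitch_hz), mean(energy), pstdev(energy)]

def train_centroids(samples):
    """samples: list of (feature_vector, emotion). Returns emotion -> mean vector."""
    grouped = {}
    for features, label in samples:
        grouped.setdefault(label, []).append(features)
    return {label: [mean(col) for col in zip(*rows)]
            for label, rows in grouped.items()}

def prosodic_scores(features, centroids):
    """Step 1 (assumed): inverse Euclidean distance to each centroid, normalised to sum to 1."""
    inv = {}
    for label, centre in centroids.items():
        dist = sum((a - b) ** 2 for a, b in zip(features, centre)) ** 0.5
        inv[label] = 1.0 / (dist + 1e-9)
    total = sum(inv.values())
    return {label: value / total for label, value in inv.items()}

# Hypothetical toy lexicon mapping words to emotions (illustration only).
LEXICON = {"great": "happy", "love": "happy", "hate": "angry", "stop": "angry"}

def linguistic_scores(words, labels):
    """Step 2 (assumed): share of lexicon hits per emotion; uniform when no word matches."""
    counts = {label: 0 for label in labels}
    for word in words:
        label = LEXICON.get(word.lower())
        if label in counts:
            counts[label] += 1
    total = sum(counts.values())
    if total == 0:
        return {label: 1.0 / len(labels) for label in labels}
    return {label: count / total for label, count in counts.items()}

def recognise(pitch_hz, energy, words, models, age_group, weight=0.5):
    """Fuse both steps using the centroids trained for the speaker's age group."""
    centroids = models[age_group]
    step1 = prosodic_scores(prosodic_features(pitch_hz, energy), centroids)
    step2 = linguistic_scores(words, list(centroids))
    fused = {label: weight * step1[label] + (1 - weight) * step2[label]
             for label in centroids}
    return max(fused, key=fused.get)

# Made-up training vectors for one age group.
models = {"adult": train_centroids([
    ([220.0, 40.0, 0.8, 0.2], "happy"),
    ([180.0, 15.0, 0.9, 0.1], "angry"),
])}
print(recognise([210, 230, 250, 190], [0.7, 0.9, 0.8, 0.8],
                ["I", "love", "this"], models, "adult"))  # prints: happy
```

Keeping one set of centroids per age group mirrors the abstract's argument that the classifier should match the age group of the target speakers; the fusion weight is a free parameter in this sketch.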

Authors and Affiliations

Mahwish Pervaiz, Tamim Khan

  • EP ID EP96329
  • DOI 10.14569/IJACSA.2016.070813

How To Cite

Mahwish Pervaiz, Tamim Khan (2016). Emotion Recognition from Speech using Prosodic and Linguistic Features. International Journal of Advanced Computer Science & Applications, 7(8), 84-90. https://www.europub.co.uk/articles/-A-96329