Emotional Speech Recognition Using Rhythm Metrics and a New Arabic Corpus

Ali H. Meftah; Mustafa Qamhan; Yousef Alotaibi; Sid-Ahmed Selouani; IEEE

Back

Conference proceeding

Emotional Speech Recognition Using Rhythm Metrics and a New Arabic Corpus

Ali H. Meftah, Mustafa Qamhan, Yousef Alotaibi, Sid-Ahmed Selouani and IEEE

2020 16TH IEEE INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2020), pp.57-62

01/01/2020

Abstract

Computer Science

Computer Science, Interdisciplinary Applications

Engineering

Engineering, Electrical & Electronic

Science & Technology

Technology

This study aims to investigate the possible use of speech rhythm metrics as a new feature for speech emotion recognition, gender identification, and regional accent identification. Further, it aims to evaluate a new Arabic speech emotion corpus. The King Saud University Emotions (KSUEmotions) speech corpus contains five emotions: neutral, sadness, happiness, surprise, and anger. For this study, speech acoustic features are extracted and used to classify the speakers' emotions. All classification results were obtained using the multilayer perceptron (MLP) neural networks and support vector machine (SVM) classifiers. Results demonstrate that the rhythm metrics are not sufficient for speech emotion classification. Nevertheless, they can improve the classifier accuracy when combined with other speech acoustic features. These results also demonstrate that the average performance accuracy of the KSUEmotions Phase 1 is 54.07% and 84.14% for Phase 2 and that the emotion of sadness achieves the best emotions' classification accuracy.

Metrics

1 Record Views

Details

Title: Emotional Speech Recognition Using Rhythm Metrics and a New Arabic Corpus
Creators - without role: Ali H. Meftah - King Saud University
Mustafa Qamhan - King Saud University
Yousef Alotaibi - King Saud University
Sid-Ahmed Selouani - Univ Moncton, 218 Bvd JD Gauthier, Shippegan, NB E8S 1P6, Canada
IEEE
Publication Details: 2020 16TH IEEE INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2020), pp.57-62
Publisher: IEEE
Number of pages: 6
Identifiers: 9952576308331
Academic Unit: King Saud University
Language: English
Resource Type: Conference proceeding