Speaker Emotion Recognition: From Classical Classifiers To Deep Neural Networks

Eya Mezghani; Maha Charfeddine; Henri Nicolas; Chokri Ben Amar

doi:10.1117/12.2309476

Conference proceeding

TENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2017), Vol.10696, pp.106962M-106962M-7

Proceedings of SPIE

01/01/2018

DOI: https://doi.org/10.1117/12.2309476

Abstract

Computer Science

Computer Science, Artificial Intelligence

Optics

Physical Sciences

Science & Technology

Technology

Speaker emotion recognition has been considered among the most challenging tasks in recent years. Automatic systems for security, medicine or education can be improved by taking the speaker's affective state into account. In this paper, a twofold approach to speech emotion classification is proposed. First, a relevant set of features is adopted; second, numerous supervised training techniques, involving classic methods as well as deep learning, are evaluated. Experimental results indicate that a deep architecture can improve classification performance on two affective databases, the Berlin Database of Emotional Speech and the Surrey Audio-Visual Expressed Emotion (SAVEE) database.

Details

Title: Speaker Emotion Recognition: From Classical Classifiers To Deep Neural Networks
Creators - without role: Eya Mezghani - University of Sfax
Maha Charfeddine - University of Sfax
Henri Nicolas - University of Bordeaux
Chokri Ben Amar - University of Sfax
Contributors - without role: A Verikas
P Radeva
D Nikolaev
J Zhou
Publication Details: TENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2017), Vol.10696, pp.106962M-106962M-7
Series: Proceedings of SPIE
Publisher: Spie-Int Soc Optical Engineering
Number of pages: 7
Identifiers: 9910516008331
Academic Unit: Taif University
Language: English
Resource Type: Conference proceeding