Spectro-temporal directional derivative based automatic speech recognition for a serious game scenario

Ghulam Muhammad; Mehedi Masud; Abdulhameed Alelaiwi; Md. Abdur Rahman; Ali Karime; Atif Alamri; M. Shamim Hossain

doi:10.1007/s11042-014-1973-7

Journal article

Peer reviewed

Multimedia tools and applications, Vol.74(14), pp.5313-5327

01/07/2015

DOI: https://doi.org/10.1007/s11042-014-1973-7

Abstract

Computer Science

Computer Science, Information Systems

Computer Science, Software Engineering

Computer Science, Theory & Methods

Engineering

Engineering, Electrical & Electronic

Science & Technology

Technology

Speech is one of the important modalities in a serious game platform. Serious games can be very useful for the rehabilitation of individuals with voice disorders. Therefore, we need an efficient and high-performance automatic speech recognition (ASR) system. In this paper, we propose a spectro-temporal directional derivative (STDD) feature that requires fewer computations in the modeling and yet gives high recognition accuracy in the ASR system. The proposed STDD feature is obtained by applying different directional derivative filters in the spectro-temporal domain. The feature dimension is then compressed by a discrete cosine transform. The experiments are performed with voice samples of Arabic numerals spoken by persons with and without voice pathology. The experimental results show that the STDD feature outperforms conventional mel-frequency cepstral coefficients in both clean and noisy environments.
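
The two steps named in the abstract (directional derivative filtering in the spectro-temporal domain, then discrete cosine transform compression) can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not specify the filters, so the 3x3 Sobel-style kernels, the `spec[frequency][time]` layout, the unnormalized DCT-II, and the names `KERNELS`, `filter_valid`, `dct_ii`, and `stdd_features` are all assumptions made for illustration.

```python
import math

# ASSUMPTION: Sobel-style 3x3 kernels stand in for the paper's directional
# derivative filters, which the abstract does not specify.
# Rows index frequency bands, columns index time frames.
KERNELS = {
    "time_0deg": [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]],
    "freq_90deg": [[-1, -2, -1], [0, 0, 0], [1, 2, 1]],
    "diag_45deg": [[0, 1, 2], [-1, 0, 1], [-2, -1, 0]],
    "diag_135deg": [[-2, -1, 0], [-1, 0, 1], [0, 1, 2]],
}


def filter_valid(spec, kernel):
    """Cross-correlate spec[freq][time] with a 3x3 kernel, keeping only
    positions where the kernel fits entirely inside the spectrogram."""
    n_freq, n_time = len(spec), len(spec[0])
    out = []
    for f in range(1, n_freq - 1):
        row = []
        for t in range(1, n_time - 1):
            acc = 0.0
            for df in range(3):
                for dt in range(3):
                    acc += kernel[df][dt] * spec[f + df - 1][t + dt - 1]
            row.append(acc)
        out.append(row)
    return out


def dct_ii(vec, n_coeffs):
    """Unnormalized DCT-II of vec, truncated to the first n_coeffs terms."""
    n = len(vec)
    return [
        sum(x * math.cos(math.pi / n * (i + 0.5) * k) for i, x in enumerate(vec))
        for k in range(n_coeffs)
    ]


def stdd_features(spec, n_coeffs):
    """Per time frame: filter with every kernel, compress each filtered
    frequency column with the DCT, and concatenate the results."""
    maps = [filter_valid(spec, kernel) for kernel in KERNELS.values()]
    n_time = len(maps[0][0])
    frames = []
    for t in range(n_time):
        frame = []
        for filtered in maps:
            column = [row[t] for row in filtered]
            frame.extend(dct_ii(column, n_coeffs))
        frames.append(frame)
    return frames


# Toy input: 6 frequency bands x 5 time frames, energy rising linearly in time.
toy_spec = [[float(t) for t in range(5)] for _ in range(6)]
toy_features = stdd_features(toy_spec, n_coeffs=3)
```

In this sketch each frame yields `len(KERNELS) * n_coeffs` values, so truncating the DCT is what keeps the feature dimension small. A real front end would supply a mel or similar spectrogram in place of `toy_spec`.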

Details

Title: Spectro-temporal directional derivative based automatic speech recognition for a serious game scenario
Creators: Ghulam Muhammad - King Saud University
Mehedi Masud - Taif University
Abdulhameed Alelaiwi - King Saud University
Md. Abdur Rahman - Umm al-Qura University
Ali Karime - University of Ottawa
Atif Alamri - King Saud University
M. Shamim Hossain - King Saud University
Publication Details: Multimedia tools and applications, Vol.74(14), pp.5313-5327
Publisher: Springer Nature
Number of pages: 15
Grant note: RGP-VPP-228 / Deanship of Scientific Research at King Saud University, Riyadh, Saudi Arabia
Identifiers: 9911114008331
Academic Unit: Taif University; King Saud University
Language: English
Resource Type: Journal article