Multidirectional Regression (MDR)-Based Features for Automatic Voice Disorder Detection

Ghulam Muhammad; Tamer A. Mesallam; Khalid H. Malki; Mohamed Farahat; Awais Mahmood; Mansour Alsulaiman

doi:10.1016/j.jvoice.2012.05.002

Journal article

Peer reviewed

Journal of Voice, Vol. 26(6), pp. 817.e19-817.e27

1 November 2012

DOI: https://doi.org/10.1016/j.jvoice.2012.05.002

PMID: 23177748

Abstract

Keywords: Arabic digits; Automatic speech recognition; Multidirectional regression; Voice disorders detection

Objective assessment of voice pathology is attracting growing interest. Automatic speech/speaker recognition (ASR) systems are commonly used for voice pathology detection. The aim of this work was to develop a novel feature extraction method for ASR that incorporates the distributions of voiced and unvoiced parts, together with voice onset and offset characteristics, in the time-frequency domain to detect voice pathology. Speech samples from 70 dysphonic patients with six different types of voice disorders and from 50 normal subjects were analyzed. The Arabic spoken digits (1–10) were used as input. The proposed feature extraction method was embedded in an ASR system with a Gaussian mixture model (GMM) classifier to detect voice disorders. An accuracy of 97.48% was obtained in the text-independent case (training on all digits), and over 99% accuracy was obtained in the text-dependent case (training on each digit separately). The proposed method outperformed conventional Mel-frequency cepstral coefficient (MFCC) features. The results indicate that incorporating voice onset and offset information leads to efficient automatic voice disorder detection.

Details

Title: Multidirectional Regression (MDR)-Based Features for Automatic Voice Disorder Detection
Creators: Ghulam Muhammad - King Saud University
Tamer A. Mesallam - King Saud University
Khalid H. Malki - King Saud University
Mohamed Farahat - King Saud University
Awais Mahmood - King Saud University
Mansour Alsulaiman - King Saud University
Publication Details: Journal of Voice, Vol. 26(6), pp. 817.e19-817.e27
Publisher: Mosby, Inc
Identifiers: 9946177308331
Academic Unit: King Saud University
Language: English
Resource Type: Journal article