On the use of Deep Learning and Scattering Transform for Pathological voices recognition

S. Souli; R. Amami; A. Soltani; S. Ben Yahia; IEEE

doi:10.1109/CoDIT55151.2022.9803962

Conference proceeding

2022 8th International Conference on Control, Decision and Information Technologies (CoDIT), Vol.1, pp.1055-1058

17/05/2022

Abstract

Deep learning

Feature extraction

Neural networks

Pathology

Scattering

Speech recognition

Wavelet transforms

In the last few decades, Deep Neural Networks (DNNs) have shown outstanding performance in speech recognition applications. We demonstrate that the improved accuracy obtained by Deep Convolutional Neural Networks (DCNNs) arises from their capacity to extract discriminative representations that are robust to various sources of variability in speech signals. In this study, we propose a new algorithm, named Scattering Transform-Deep Convolutional Neural Network (ST-DCNN), to identify normal and pathological voices. Advances in speech features have proven to be the basis for efficient pathological voice classification. The proposed algorithm involves two stages: first, scattering wavelet features are extracted; then, a DCNN is used to classify the voice samples. We evaluated the robustness of the proposed system in silent environments. The experimental results indicate that combining scattering wavelet features with a DCNN achieves better performance on clean data, with a recognition rate of 99.62%.
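
The first stage described in the abstract (scattering wavelet feature extraction) can be illustrated with a minimal sketch. This is not the authors' implementation: the record gives no filter bank, scale count, or network details, so the function names, the Gabor filter shape, the dyadic scale spacing, and the use of a global time average in place of a low-pass filter are all assumptions made for illustration. Only first-order coefficients are computed, and the DCNN classification stage is not sketched.

```python
import cmath
import math


def gabor_wavelet(center_freq, sigma, half_width):
    """Complex Gabor (Morlet-like) filter sampled on [-half_width, half_width].

    Hypothetical helper: the record does not specify the wavelet family.
    """
    taps = []
    for n in range(-half_width, half_width + 1):
        envelope = math.exp(-0.5 * (n / sigma) ** 2)
        taps.append(envelope * cmath.exp(2j * math.pi * center_freq * n))
    norm = sum(abs(t) for t in taps)  # unit gain at the center frequency
    return [t / norm for t in taps]


def modulus_of_convolution(signal, taps):
    """Return |signal * taps| using 'valid' boundaries (no padding)."""
    width = len(taps)
    if len(signal) < width:
        raise ValueError("signal is shorter than the filter")
    out = []
    for start in range(len(signal) - width + 1):
        acc = 0j
        for k in range(width):
            acc += signal[start + k] * taps[width - 1 - k]
        out.append(abs(acc))
    return out


def first_order_scattering(signal, num_scales=4, top_freq=0.25):
    """One time-averaged first-order coefficient per dyadic scale.

    Assumptions: center frequencies halve at each scale (in cycles per
    sample), sigma = 1 / frequency keeps the relative bandwidth constant,
    and the averaging step is a plain mean over the whole signal.
    """
    features = []
    for j in range(num_scales):
        freq = top_freq / (2 ** j)
        sigma = 1.0 / freq
        half_width = int(3 * sigma)  # truncate the Gaussian at 3 sigma
        taps = gabor_wavelet(freq, sigma, half_width)
        envelope = modulus_of_convolution(signal, taps)
        features.append(sum(envelope) / len(envelope))
    return features


# Usage: a synthetic tone stands in for a voice recording (illustrative only).
tone = [math.cos(2 * math.pi * 0.125 * t) for t in range(512)]
features = first_order_scattering(tone)
print(features)
```

For the synthetic tone at 0.125 cycles per sample, the coefficient at the scale centered on 0.125 dominates the others. In a pipeline like the one the abstract describes, feature vectors of this kind, computed per recording or per frame, would then be passed to the classifier.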

Details

Title: On the use of Deep Learning and Scattering Transform for Pathological voices recognition
Creators: S. Souli (Tunis El Manar University); R. Amami (Imam Abdulrahman Bin Faisal University); A. Soltani (National Engineering School of Tunis); S. Ben Yahia (Faculty); IEEE
Publication Details: 2022 8th International Conference on Control, Decision and Information Technologies (CoDIT), Vol.1, pp.1055-1058
Publisher: IEEE
Identifiers: 9915565708331
Academic Unit: Imam Abdulrahman Bin Faisal University
Language: English
Resource Type: Conference proceeding