Quality of Arabic Utterances Transformed Using Different Residual Prediction Techniques

Rania Elmanfaloty; N. Korany; El-Sayed A. Youssef

doi:10.1117/12.913264

Back

Conference proceeding

Quality of Arabic Utterances Transformed Using Different Residual Prediction Techniques

Rania Elmanfaloty, N. Korany and El-Sayed A. Youssef

INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), Vol.8285(1), pp.82854C-82854C-7

Proceedings of SPIE

01/01/2011

DOI: https://doi.org/10.1117/12.913264

Abstract

Engineering

Engineering, Electrical & Electronic

Optics

Physical Sciences

Science & Technology

Technology

Voice conversion (VC) is a process which modifies the speech signal produced by one source speaker so that it sounds like another target speaker. In this paper the transformation is determined by using equal Arabic utterances from source and target speakers; these utterances are time-aligned using dynamic time warping algorithm. A conversion function based on Gaussian mixture model (GMM) is used for transforming the spectral envelope described by line spectral frequencies (LSF) and the residuals are converted using three residual prediction techniques. We also compare between these techniques in the conversion of some Arabic utterances. The quality of the transformed utterances is measured using subjective and objective evaluations.

Metrics

1 Record Views

Details

Title: Quality of Arabic Utterances Transformed Using Different Residual Prediction Techniques
Creators - without role: Rania Elmanfaloty - Alexandria University
N. Korany - Alexandria University
El-Sayed A. Youssef - Alexandria University
Contributors - without role: Y Xie
Y Zheng
Publication Details: INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), Vol.8285(1), pp.82854C-82854C-7
Series: Proceedings of SPIE
Publisher: Spie-Int Soc Optical Engineering
Number of pages: 7
Identifiers: 9939518408331
Academic Unit: King Abdulaziz University
Language: English
Resource Type: Conference proceeding