Pitch-synchronous ZCPA (PS-ZCPA)-based feature extraction with auditory masking

M. Ghulam; T. Fukuda; J. Horikawa; T. Nitta

doi:10.1109/ICASSP.2005.1415164

Back

Conference proceeding

Pitch-synchronous ZCPA (PS-ZCPA)-based feature extraction with auditory masking

M. Ghulam, T. Fukuda, J. Horikawa and T. Nitta

Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005, Vol.1, pp.I/517-I/520 Vol. 1

2005

DOI: https://doi.org/10.1109/ICASSP.2005.1415164

Abstract

Automatic speech recognition

Band pass filters

Detectors

Feature extraction

Frequency

Histograms

Nervous system

Noise robustness

Periodic structures

Sensor arrays

A pitch-synchronous (PS) auditory feature extraction method, based on ZCPA (zero-crossings peak-amplitudes), has been proposed (Ghulam, M. et al., Proc. ICSLP04, 2004) and was shown to be more robust than the conventional ZCPA (Kim, D.S. et al., IEEE Trans. Speech Audio Process., vol.7, no.1, p.55-69, 1999). We examine the effect of auditory masking, both simultaneous and temporal, in the PS-ZCPA method. We also observe the effect of varying the number of histogram bins on the way to find out the optimum parameters of the proposed method. Experimental results demonstrate the improved performance of the PS-ZCPA method achieved by embedding auditory masking into it; for example, with both the masking methods embedded, the performance increases to 73.71% from the 69.92% obtained without masking for PS-ZCPA, while it showed little improvement with an increased number of histogram bins.

Metrics

1 Record Views

Details

Title: Pitch-synchronous ZCPA (PS-ZCPA)-based feature extraction with auditory masking
Creators - without role: M. Ghulam - Toyohashi University of Technology
T. Fukuda - Toyohashi University of Technology
J. Horikawa - Toyohashi University of Technology
T. Nitta - Toyohashi University of Technology
Publication Details: Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005, Vol.1, pp.I/517-I/520 Vol. 1
Publisher: IEEE
Identifiers: 9951314708331
Academic Unit: King Saud University
Language: English
Resource Type: Conference proceeding