Analysis of an MFCC-based audio indexing system for efficient coding of multimedia sources

O M Mubarak; E Ambikairajah; J Epps; IEEE

doi:10.1109/ISSPA.2005.1581014

Conference proceeding

ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, Vol.2, pp.619-622

01/01/2005

Subjects: Computer Science; Computer Science, Artificial Intelligence; Engineering; Engineering, Electrical & Electronic; Imaging Science & Photographic Technology; Science & Technology; Technology

Abstract

Discriminating between speech and music signals is an important problem in efficient digital radio broadcasting, particularly for variable-bit-rate applications such as Internet radio. This paper presents a speech/music discrimination system based on a Mel-frequency cepstral coefficient (MFCC) front end and a Gaussian mixture model (GMM) classifier. The system can select the optimum coding scheme for the current frame of an input signal without knowing a priori whether the frame contains speech-like or music-like characteristics. Speech and music error rates are analysed for different numbers of MFCCs (from 8 to 28). On the 46-minute evaluation database used in this experiment, accuracies of up to 97.14% for music and 93.87% for speech are attained.
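
As a rough illustration of the frame-level decision the abstract describes, the sketch below scores one feature vector under two diagonal-covariance GMMs and picks the class with the higher log-likelihood, then reports accuracy separately for speech and music. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the diagonal-covariance choice, the maximum-likelihood decision rule, and the toy two-dimensional model parameters are all hypothetical, and real input would be MFCC vectors (8 to 28 coefficients in the paper's experiments) with models trained on labelled data.

```python
import math


def diag_gmm_loglik(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM."""
    component_logs = []
    for w, mu, var in zip(weights, means, variances):
        log_pdf = 0.0
        for xi, mi, vi in zip(x, mu, var):
            log_pdf += -0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
        component_logs.append(math.log(w) + log_pdf)
    # Log-sum-exp over components for numerical stability.
    peak = max(component_logs)
    return peak + math.log(sum(math.exp(c - peak) for c in component_logs))


def classify_frame(x, speech_gmm, music_gmm):
    """Return 'speech' or 'music', whichever model scores the frame higher."""
    speech_score = diag_gmm_loglik(x, *speech_gmm)
    music_score = diag_gmm_loglik(x, *music_gmm)
    return "speech" if speech_score > music_score else "music"


def per_class_accuracy(predicted, actual):
    """Fraction of correctly labelled frames, computed separately per class."""
    result = {}
    for label in set(actual):
        indices = [i for i, a in enumerate(actual) if a == label]
        correct = sum(1 for i in indices if predicted[i] == label)
        result[label] = correct / len(indices)
    return result


# Hypothetical toy models over 2-dimensional feature vectors, given as
# (weights, means, variances). These are NOT parameters from the paper.
SPEECH_GMM = ([0.5, 0.5], [[0.0, 0.0], [1.0, 1.0]], [[1.0, 1.0], [1.0, 1.0]])
MUSIC_GMM = ([1.0], [[5.0, 5.0]], [[1.0, 1.0]])

if __name__ == "__main__":
    frames = [[0.2, 0.1], [5.1, 4.8]]
    labels = [classify_frame(f, SPEECH_GMM, MUSIC_GMM) for f in frames]
    print(labels)
    print(per_class_accuracy(labels, ["speech", "music"]))
```

In a codec-selection setting, the label returned by `classify_frame` would drive the choice between a speech-oriented and a music-oriented coding scheme for that frame; how the paper maps labels to coders is not stated in this record.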

Details

Title: Analysis of an MFCC-based audio indexing system for efficient coding of multimedia sources
Creators: O M Mubarak; E Ambikairajah; J Epps; IEEE
Publication Details: ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, Vol.2, pp.619-622
Publisher: IEEE
Number of pages: 4
Identifiers: 9913051008331
Academic Unit: Al Jouf University
Language: English
Resource Type: Conference proceeding