Self-Organizing Feature Maps for HMM Based Lip-Reading

Naoyuki Tsuruta; Hirotaka Iuchi; Alaa El Sagheer; Tarek El Tobely; Alaa Sagheer

doi:10.1007/978-3-540-45226-3_23

Back

Self-Organizing Feature Maps for HMM Based Lip-Reading

Book chapter

Peer reviewed

Self-Organizing Feature Maps for HMM Based Lip-Reading

Naoyuki Tsuruta, Hirotaka Iuchi, Alaa El Sagheer, Tarek El Tobely and Alaa Sagheer

Knowledge-Based Intelligent Information and Engineering Systems, pp.162-168

Lecture Notes in Computer Science, Springer Berlin Heidelberg

2003

DOI: https://doi.org/10.1007/978-3-540-45226-3_23

Abstract

Feature Extraction Time

Hide Markov Model

Invariant Recognition

Randomization Technique

Target Sentence

Audio-visual dialogue is an appealing tool for natural interface with computers. Lip-reading is one of important part for audio-visual dialogue. In this paper, it is proposed to use a self-organizing feature map (SOM) and a hierarchical SOM: Hypercolumn model (HCM), as a module of phoneme feature space construction for HMM base lip-reading system. Those SOMs allow alleviating many difficulties associated with feature space construction. It is, however, required for on-line systems to reduce the feature extraction time to the range of normal video camera rates. To achieve this, a randomization technique is introduced. The experimental results show performances of the SOMs for Japanese lip-reading.

Metrics

1 Record Views

Details

Title: Self-Organizing Feature Maps for HMM Based Lip-Reading
Creators - without role: Naoyuki Tsuruta - Fukuoka University
Hirotaka Iuchi - Fukuoka University
Alaa El Sagheer
Tarek El Tobely
Alaa Sagheer - King Faisal University
Publication Details: Knowledge-Based Intelligent Information and Engineering Systems, pp.162-168
Series: Lecture Notes in Computer Science
Publisher: Springer Berlin Heidelberg; Berlin, Heidelberg
Identifiers: 9920044208331
Academic Unit: King Faisal University
Language: English
Resource Type: Book chapter