Complex-Valued Representation for RGB-D Object Recognition

Rim Trabelsi; Issam Jabri; Farid Melgani; Fethi Smach; Nicola Conci; Ammar Bouallegue

doi:10.1007/978-3-319-75786-5_2

Conference proceeding

Peer reviewed

IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017), Vol.10749, pp.17-27

Lecture Notes in Computer Science

01/01/2018

DOI: https://doi.org/10.1007/978-3-319-75786-5_2

Abstract

Computer Science

Computer Science, Artificial Intelligence

Computer Science, Theory & Methods

Engineering

Engineering, Electrical & Electronic

Imaging Science & Photographic Technology

Science & Technology

Technology

Object recognition methods tend to focus on single cues from traditional vision-based systems and neglect multi-modal data. The advent of RGB-D sensors, which provide synchronized multi-modal data of good quality, has opened new opportunities. In this paper, we use RGB and depth images to propose a new object recognition approach. Using a pixel-wise scheme, we propose a novel method to describe RGB-D images with a complex-valued representation. On the neural network side, we introduce a new Complex-Valued Neural Network (CVNN) with RBF neurons. Unlike many RGB-D features, the proposed approach can use RGB and depth data jointly within a unified end-to-end learning framework. Category and instance recognition tasks are evaluated in experiments on a large-scale RGB-D object dataset. Results show that our method recognizes objects in RGB-D images efficiently and outperforms state-of-the-art approaches.
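The abstract does not spell out how the pixel-wise complex-valued representation is built, so the following is only an illustrative sketch and not the authors' formulation. It assumes that a grayscale intensity map supplies the real part and a depth map the imaginary part of each pixel, and that an RBF neuron applies a Gaussian kernel to the complex distance from its center. The function names, the [0, 1] scaling, and the parameter sigma are hypothetical.

```python
import numpy as np


def to_complex_image(gray, depth):
    """Fuse intensity and depth into one complex-valued array, pixel by pixel.

    Assumption (not from the paper): intensity becomes the real part and
    depth the imaginary part, with both maps already scaled to [0, 1].
    """
    gray = np.asarray(gray, dtype=np.float64)
    depth = np.asarray(depth, dtype=np.float64)
    if gray.shape != depth.shape:
        raise ValueError("gray and depth must have the same shape")
    return gray + 1j * depth


def rbf_response(z, center, sigma=1.0):
    """Gaussian RBF activation on complex inputs.

    Computes exp(-||z - c||^2 / (2 * sigma^2)), where the squared norm is
    the sum of squared moduli of the complex differences.
    """
    diff = np.ravel(z) - np.ravel(center)
    # np.vdot conjugates its first argument, so this is sum(|diff|^2).
    sq_dist = np.real(np.vdot(diff, diff))
    return float(np.exp(-sq_dist / (2.0 * sigma ** 2)))
```

Under these assumptions, a single complex array carries both modalities, so one network input replaces two separate feature streams; an RBF neuron whose center equals the input responds with 1.0, and the response decays as either intensity or depth moves away from the center.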

Details

Title: Complex-Valued Representation for RGB-D Object Recognition
Creators - without role: Rim Trabelsi - University of Gabès
Issam Jabri - Al Yamamah University
Farid Melgani - University of Trento
Fethi Smach - Profil Technology
Nicola Conci - University of Trento
Ammar Bouallegue - Tunis El Manar University
Contributors - without role: M Paul
C Hitoshi
Q Huang
Publication Details: IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017), Vol.10749, pp.17-27
Series: Lecture Notes in Computer Science
Publisher: Springer Nature
Number of pages: 11
Grant note: Singapore Agency for Science, Technology and Research (A*STAR) through the ARAP program; European Union through the ALYSSA program (ERASMUS-MUNDUS Action 2, Lot 6)
Identifiers: 9914034808331
Academic Unit: Al-Yamamah University
Language: English
Resource Type: Conference proceeding