Multiple Object Scene Description for the Visually Impaired Using Pre-trained Convolutional Neural Networks

Haikel Alhichri; Bilel Bin Jdira; Yacoub Bazi; Naif Alajlan

doi:10.1007/978-3-319-41501-7_33

Conference proceeding

Peer reviewed

IMAGE ANALYSIS AND RECOGNITION (ICIAR 2016), Vol.9730, pp.290-295

Lecture Notes in Computer Science

01/01/2016

DOI: https://doi.org/10.1007/978-3-319-41501-7_33

Subjects

Computer Science

Computer Science, Artificial Intelligence

Computer Science, Theory & Methods

Imaging Science & Photographic Technology

Science & Technology

Technology

Abstract

This paper introduces a new method for multiple-object scene description as part of a system to guide visually impaired people in indoor environments. We are interested in a coarse scene description, in which only the presence of certain objects is indicated, regardless of their positions in the scene. The proposed method extracts powerful features using a pre-trained convolutional neural network (CNN), then trains a neural network regression model to predict the content of any unknown scene from its CNN features. We found the CNN features to be highly descriptive, even though the network was trained on auxiliary data from a completely different domain. The proposed method was assessed on four datasets representing different indoor environments. It achieves better results, in both accuracy and processing time, than the state of the art.
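
The two-stage idea in the abstract (a fixed feature vector per image, followed by a small network that scores the presence of each object) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the CNN, the network size, or the loss, so everything below is an assumption. In particular, the "CNN features" are synthetic random vectors standing in for real pre-trained features, the object names are hypothetical, and the predictor is a one-hidden-layer network with sigmoid outputs trained with cross-entropy, chosen only because it is a common way to score several objects independently.

```python
import math
import random

def sigmoid(z):
    z = max(-60.0, min(60.0, z))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))

def make_toy_data(n, dim, n_objects, rng):
    # ASSUMPTION: placeholder for real CNN features. Each object adds a fixed
    # random "signature" vector to the feature when it is present in the scene.
    sigs = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n_objects)]
    X, Y = [], []
    for _ in range(n):
        y = [rng.randint(0, 1) for _ in range(n_objects)]
        x = [rng.gauss(0, 0.1) for _ in range(dim)]
        for k in range(n_objects):
            if y[k]:
                x = [a + b for a, b in zip(x, sigs[k])]
        X.append(x)
        Y.append(y)
    return X, Y

class PresenceNet:
    """Hypothetical predictor: one tanh hidden layer, one sigmoid per object."""

    def __init__(self, dim, hidden, n_objects, rng):
        self.W1 = [[rng.gauss(0, 0.1) for _ in range(dim)] for _ in range(hidden)]
        self.b1 = [0.0] * hidden
        self.W2 = [[rng.gauss(0, 0.1) for _ in range(hidden)] for _ in range(n_objects)]
        self.b2 = [0.0] * n_objects

    def forward(self, x):
        h = [math.tanh(sum(w * a for w, a in zip(row, x)) + b)
             for row, b in zip(self.W1, self.b1)]
        p = [sigmoid(sum(w * a for w, a in zip(row, h)) + b)
             for row, b in zip(self.W2, self.b2)]
        return h, p

    def train_step(self, x, y, lr):
        h, p = self.forward(x)
        # Gradient of the cross-entropy loss w.r.t. the output pre-activations.
        d_out = [pk - yk for pk, yk in zip(p, y)]
        # Back-propagate through W2 and the tanh non-linearity (before updating W2).
        d_hid = [(1 - h[j] ** 2) * sum(d_out[k] * self.W2[k][j] for k in range(len(d_out)))
                 for j in range(len(h))]
        for k, dk in enumerate(d_out):
            self.W2[k] = [w - lr * dk * hj for w, hj in zip(self.W2[k], h)]
            self.b2[k] -= lr * dk
        for j, dj in enumerate(d_hid):
            self.W1[j] = [w - lr * dj * xi for w, xi in zip(self.W1[j], x)]
            self.b1[j] -= lr * dj

    def describe(self, x, names, threshold=0.5):
        # Coarse description: list the objects judged present, with no positions.
        _, p = self.forward(x)
        return [name for name, pk in zip(names, p) if pk >= threshold]

def mean_loss(net, X, Y):
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for x, y in zip(X, Y):
        _, p = net.forward(x)
        total -= sum(yk * math.log(pk + eps) + (1 - yk) * math.log(1 - pk + eps)
                     for pk, yk in zip(p, y))
    return total / len(X)

rng = random.Random(0)
OBJECTS = ["chair", "door", "table"]  # hypothetical labels, not from the paper
X, Y = make_toy_data(60, 8, len(OBJECTS), rng)
net = PresenceNet(8, 6, len(OBJECTS), rng)
loss_before = mean_loss(net, X, Y)
for _ in range(200):  # plain per-sample gradient descent
    for x, y in zip(X, Y):
        net.train_step(x, y, lr=0.05)
loss_after = mean_loss(net, X, Y)
print(net.describe(X[0], OBJECTS), "loss:", round(loss_before, 3), "->", round(loss_after, 3))
```

In a real system, the rows of `X` would presumably come from a pre-trained CNN applied to each camera frame, and `describe` would feed a speech or haptic interface. The threshold of 0.5 is an arbitrary choice for this sketch.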

Details

Title: Multiple Object Scene Description for the Visually Impaired Using Pre-trained Convolutional Neural Networks
Creators: Haikel Alhichri; Bilel Bin Jdira; Yacoub Bazi; Naif Alajlan (all King Saud University)
Contributors: A Campilho; F Karray
Publication Details: IMAGE ANALYSIS AND RECOGNITION (ICIAR 2016), Vol.9730, pp.290-295
Series: Lecture Notes in Computer Science
Publisher: Springer Nature
Number of pages: 6
Identifiers: 9927037108331
Academic Unit: Prince Sultan University; King Saud University
Language: English
Resource Type: Conference proceeding