Story Generation from Images Using Deep Learning

Abrar Alnami; Miada Almasre; Norah Al-Malki

doi:10.1007/978-3-030-88378-2_16

Back

Story Generation from Images Using Deep Learning

Conference proceeding

Peer reviewed

Story Generation from Images Using Deep Learning

Abrar Alnami, Miada Almasre and Norah Al-Malki

INFORMATION, COMMUNICATION AND COMPUTING TECHNOLOGY (ICICCT 2021), Vol.1417, pp.198-208

Communications in Computer and Information Science

01/01/2021

DOI: https://doi.org/10.1007/978-3-030-88378-2_16

Abstract

Computer Science

Computer Science, Artificial Intelligence

Science & Technology

Technology

Telecommunications

Recently, the problem of creating descriptive captions for images became a significant one. However, human languages' expressivity had been among the challenges that hindered researchers from widely experimenting with creating linguistically rich captions for images. That motivated us to utilize advanced deep learning algorithms to generate captions for images. The researchers proposed an AI model utilizing deep learning and natural language processing algorithms, which has two main components, an image-feature extractor, and a story generator. The researchers trained the first component (image-feature extractor) of the model to predict object names in images. The second component (story-generator) was trained on a custom short descriptive sentence which considered short stories. So, the output from the first component (list of words) will be entered into the second component to generate stories on input images. Thus, when testing the model's performance, a list of names will be entered from the first component so that the second generator arranges them and generates a short story from them. The proposed model developed could generate a short story expressive of an input image as shown by the results of a logical value used on the BLEU scale of 0.59, which further research is planned to improve.

Metrics

1 Record Views

Details

Title: Story Generation from Images Using Deep Learning
Creators - without role: Abrar Alnami - King Abdulaziz University
Miada Almasre - King Abdulaziz University
Norah Al-Malki - King Abdulaziz University
Contributors - without role: M Bhattacharya
L Kharb
D Chahal
Publication Details: INFORMATION, COMMUNICATION AND COMPUTING TECHNOLOGY (ICICCT 2021), Vol.1417, pp.198-208
Series: Communications in Computer and Information Science
Publisher: Springer Nature
Number of pages: 11
Identifiers: 9939528708331
Academic Unit: King Abdulaziz University
Language: English
Resource Type: Conference proceeding