OPEN-ENDED VISUAL QUESTION ANSWERING MODEL FOR REMOTE SENSING IMAGES

Sara O. Alsaleh; Yakoub Bazi; Mohamad M. Al Rahhal; Mansour Al Zuair; IEEE

doi:10.1109/IGARSS46834.2022.9884295

Conference proceeding

2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), pp.2848-2851

IEEE International Symposium on Geoscience and Remote Sensing IGARSS

01/01/2022

DOI: https://doi.org/10.1109/IGARSS46834.2022.9884295

Geology

Geosciences, Multidisciplinary

Physical Sciences

Remote Sensing

Science & Technology

Technology

Abstract

In this paper, we present an open-ended visual question answering (VQA) model for remote sensing images. Unlike closed-ended VQA, the answers can be given in the form of short sentences. The model uses vision and natural language transformers to embed the image and its related question. The feature representations obtained at their outputs are concatenated and fed to a light transformer decoder that generates the answer autoregressively. The complete architecture is trained end-to-end via backpropagation. In the experiments, we evaluate the model on a manually labeled open-ended VQA dataset, termed TextRS, composed of 6245 image-question pairs.
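The data flow described in the abstract (two encoders, concatenation, autoregressive decoding) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the names `concat_features`, `greedy_decode`, and `toy_decoder` are hypothetical, the decoder is a scripted stand-in rather than a trained transformer, greedy selection is an assumed decoding strategy, and the feature values and answer tokens are made up for illustration.

```python
from typing import Callable, Dict, List

Vector = List[float]
BOS, EOS = "<bos>", "<eos>"  # assumed start/end markers, not from the paper


def concat_features(image_tokens: List[Vector],
                    question_tokens: List[Vector]) -> List[Vector]:
    """Join the two encoder outputs along the sequence axis."""
    if image_tokens and question_tokens:
        if len(image_tokens[0]) != len(question_tokens[0]):
            raise ValueError("both encoders must emit vectors of the same width")
    return list(image_tokens) + list(question_tokens)


def greedy_decode(memory: List[Vector],
                  next_token_scores: Callable[[List[Vector], List[str]], Dict[str, float]],
                  max_len: int = 10) -> List[str]:
    """Generate an answer one token at a time.

    Each step conditions on the fused features (memory) and on the tokens
    produced so far, then keeps the highest-scoring next token.
    """
    prefix = [BOS]
    for _ in range(max_len):
        scores = next_token_scores(memory, prefix)
        token = max(scores, key=scores.get)
        if token == EOS:
            break
        prefix.append(token)
    return prefix[1:]  # drop the start marker


def toy_decoder(memory: List[Vector], prefix: List[str]) -> Dict[str, float]:
    """Scripted stand-in for a trained decoder (ignores memory)."""
    script = ["a", "river", "crosses", "the", "field", EOS]
    target = script[min(len(prefix) - 1, len(script) - 1)]
    return {tok: (1.0 if tok == target else 0.0) for tok in script}


# Made-up feature vectors standing in for the encoder outputs.
image_features = [[0.1, 0.2], [0.3, 0.4]]
question_features = [[0.5, 0.6]]
memory = concat_features(image_features, question_features)
print(" ".join(greedy_decode(memory, toy_decoder)))
```

In a trained model, the scoring function would be a transformer decoder attending over `memory`, and its weights would be learned jointly with both encoders.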

Details

Title: OPEN-ENDED VISUAL QUESTION ANSWERING MODEL FOR REMOTE SENSING IMAGES
Creators: Sara O. Alsaleh (King Saud University); Yakoub Bazi (King Saud University); Mohamad M. Al Rahhal (King Saud University); Mansour Al Zuair (King Saud University); IEEE
Publication Details: 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), pp.2848-2851
Series: IEEE International Symposium on Geoscience and Remote Sensing IGARSS
Publisher: IEEE
Number of pages: 4
Identifiers: 9948820308331
Academic Unit: King Saud University
Language: English
Resource Type: Conference proceeding