MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
Conference proceeding

Mattia Soldan, Alejandro Pardo, Juan Leon Alcazar, Fabian Caba Heilbron, Chen Zhao, Silvio Giancola, Bernard Ghanem, and IEEE Computer Society
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), Vol. 2022, pp. 5016-5025
IEEE Conference on Computer Vision and Pattern Recognition
01/01/2022

Abstract

Computer Science; Computer Science, Artificial Intelligence; Imaging Science & Photographic Technology; Science & Technology; Technology
