Closed captioning
Related papers: 20
Top Researchers
Top Cited Papers
A Critical Review of Recurrent Neural Networks for Sequence Learning
Zachary C. Lipton, John Berkowitz, Charles Elkan
Citations: 2093 • 2015
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng, Krzysztof Choromański, Adrian Wong, Stefan Welker, Federico Tombari, Aveek Purohit, Michael S. Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence
Citations: 171 • 2022
The Long-Short Story of Movie Description
Anna Rohrbach, Marcus Rohrbach, Bernt Schiele
Citations: 124 • 2015
Deep Learning for Image-to-Text Generation: A Technical Overview
Xiaodong He, Li Deng
Citations: 118 • 2017
Actor-Critic Sequence Training for Image Captioning
Li Zhang, Flood Sung, Feng Liu, Tao Xiang, Shaogang Gong, Yongxin Yang, Timothy M. Hospedales
Citations: 99 • 2017
Automatic Image and Video Caption Generation With Deep Learning: A Concise Review and Algorithmic Overlap
Soheyla Amirian, Khaled Rasheed, Thiab R. Taha, Hamid R. Arabnia
Citations: 94 • 2020
Automatic medical image interpretation: State of the art and future directions
Hareem Ayesha, Sajid Iqbal, Mehreen Tariq, Muhammad Abrar, Muhammad Sanaullah, Ishaq Abbas, Amjad Rehman, Muhammad Farooq Khan Niazi, Shafiq Hussain
Citations: 79 • 2021
Video scene analysis: an overview and challenges on deep learning algorithms
Qaisar Abbas, Mostafa E. A. Ibrahim, M. Arfan Jaffar
Citations: 73 • 2017
Computer Vision and Natural Language Processing
Peratham Wiriyathammabhum, Douglas Summers-Stay, Cornelia Fermüller, Yiannis Aloimonos
Citations: 63 • 2016
A comprehensive survey on image captioning: from handcrafted to deep learning-based techniques, a taxonomy and open research issues
Himanshu Sharma, Devanand Padha
Citations: 48 • 2023
Teaching Machines to Describe Images with Natural Language Feedback
Huan Ling, Sanja Fidler
Citations: 38 • 2017
Visual Image Caption Generation for Service Robotics and Industrial Applications
Ren C. Luo, Yu‐Ting Hsu, Yu-Cheng Wen, Huan-Jun Ye
Citations: 38 • 2019
Evolution of visual data captioning Methods, Datasets, and evaluation Metrics: A comprehensive survey
Dhruv Sharma, Chhavi Dhiman, Dinesh Kumar
Citations: 32 • 2023
Learning Actions from Human Demonstration Video for Robotic Manipulation
Shuo Yang, Wei Zhang, Weizhi Lu, Hesheng Wang, Yibin Li
Citations: 29 • 2019
Video and Audio Deepfake Datasets and Open Issues in Deepfake Technology: Being Ahead of the Curve
Zahid Akhtar, Thanvi Lahari Pendyala, Virinchi Sai Athmakuri
Citations: 29 • 2024
3D-Aware Scene Change Captioning From Multiview Images
Yue Qiu, Yutaka Satoh, Ryota Suzuki, Kenji Iwata, Hirokatsu Kataoka
Citations: 26 • 2020
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning
Soheyla Amirian, Khaled Rasheed, Thiab R. Taha, Hamid R. Arabnia
Citations: 24 • 2021
Indoor Scene Change Captioning Based on Multimodality Data
Yue Qiu, Yutaka Satoh, Ryota Suzuki, Kenji Iwata, Hirokatsu Kataoka
Citations: 23 • 2020
Video Captioning Based on Both Egocentric and Exocentric Views of Robot Vision for Human-Robot Interaction
Soo-Han Kang, Ji-Hyeong Han
Citations: 22 • 2021
Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining
Qihang Zhang, Zhenghao Peng, Bolei Zhou
Citations: 18 • 2022