Closed captioning

Related papers: 20

Top Cited Papers

A Critical Review of Recurrent Neural Networks for Sequence Learning

Zachary C. Lipton, John Berkowitz, Charles Elkan

Citations: 2093 • 2015

Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language

Andy Zeng, Krzysztof Choromański, Adrian Wong, Stefan Welker, Federico Tombari, Aveek Purohit, Michael S. Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence

Citations: 171 • 2022

The Long-Short Story of Movie Description

Anna Rohrbach, Marcus Rohrbach, Bernt Schiele

Citations: 124 • 2015

Deep Learning for Image-to-Text Generation: A Technical Overview

Xiaodong He, Li Deng

Citations: 118 • 2017

Actor-Critic Sequence Training for Image Captioning

Li Zhang, Flood Sung, Feng Liu, Tao Xiang, Shaogang Gong, Yongxin Yang, Timothy M. Hospedales

Citations: 99 • 2017

Automatic Image and Video Caption Generation With Deep Learning: A Concise Review and Algorithmic Overlap

Soheyla Amirian, Khaled Rasheed, Thiab R. Taha, Hamid R. Arabnia

Citations: 94 • 2020

Automatic medical image interpretation: State of the art and future directions

Hareem Ayesha, Sajid Iqbal, Mehreen Tariq, Muhammad Abrar, Muhammad Sanaullah, Ishaq Abbas, Amjad Rehman, Muhammad Farooq Khan Niazi, Shafiq Hussain

Citations: 79 • 2021

Video scene analysis: an overview and challenges on deep learning algorithms

Qaisar Abbas, Mostafa E. A. Ibrahim, M. Arfan Jaffar

Citations: 73 • 2017

Computer Vision and Natural Language Processing

Peratham Wiriyathammabhum, Douglas Summers-Stay, Cornelia Fermüller, Yiannis Aloimonos

Citations: 63 • 2016

A comprehensive survey on image captioning: from handcrafted to deep learning-based techniques, a taxonomy and open research issues

Himanshu Sharma, Devanand Padha

Citations: 48 • 2023

Teaching Machines to Describe Images with Natural Language Feedback

Huan Ling, Sanja Fidler

Citations: 38 • 2017

Visual Image Caption Generation for Service Robotics and Industrial Applications

Ren C. Luo, Yu‐Ting Hsu, Yu-Cheng Wen, Huan-Jun Ye

Citations: 38 • 2019

Evolution of visual data captioning Methods, Datasets, and evaluation Metrics: A comprehensive survey

Dhruv Sharma, Chhavi Dhiman, Dinesh Kumar

Citations: 32 • 2023

Learning Actions from Human Demonstration Video for Robotic Manipulation

Shuo Yang, Wei Zhang, Weizhi Lu, Hesheng Wang, Yibin Li

Citations: 29 • 2019

Video and Audio Deepfake Datasets and Open Issues in Deepfake Technology: Being Ahead of the Curve

Zahid Akhtar, Thanvi Lahari Pendyala, Virinchi Sai Athmakuri

Citations: 29 • 2024

3D-Aware Scene Change Captioning From Multiview Images

Yue Qiu, Yutaka Satoh, Ryota Suzuki, Kenji Iwata, Hirokatsu Kataoka

Citations: 26 • 2020

Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning

Soheyla Amirian, Khaled Rasheed, Thiab R. Taha, Hamid R. Arabnia

Citations: 24 • 2021

Indoor Scene Change Captioning Based on Multimodality Data

Yue Qiu, Yutaka Satoh, Ryota Suzuki, Kenji Iwata, Hirokatsu Kataoka

Citations: 23 • 2020

Video Captioning Based on Both Egocentric and Exocentric Views of Robot Vision for Human-Robot Interaction

Soo-Han Kang, Ji-Hyeong Han

Citations: 22 • 2021

Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining

Qihang Zhang, Zhenghao Peng, Bolei Zhou

Citations: 18 • 2022