Publications

Detailed Information

Guest Editorial Introduction to the Special Section on Video and Language

Cited 1 time in Web of Science Cited 1 time in Scopus
Authors

Mei, Tao; Corso, Jason J.; Kim, Gun Hee; Luo, Jiebo; Shen, Chunhua; Zhang, Hanwang

Issue Date
2022-01
Publisher
Institute of Electrical and Electronics Engineers
Citation
IEEE Transactions on Circuits and Systems for Video Technology, Vol.32 No.1, pp.1-4
Abstract
Computer Vision (CV) and Natural Language Processing (NLP) are two most fundamental disciplines under a broad area of artificial intelligence (AI). CV is regarded as a field of research that explores the techniques to teach computers to see and understand digital content such as images and videos. NLP is a branch of linguistics that enables computers to process, interpret, and even generate human language. With the rise and development of deep learning over the past decade, there has been a steady momentum of innovation and breakthroughs that convincingly push the limits and improve the state-of-the-art of both vision and language modeling. An interesting observation is that the research in the two areas starts to interact, with a significant growth in both the volume of publications and extensive applications. Meanwhile, many previous experiences have shown that this can naturally build up the circle of human intelligence.
ISSN
1051-8215
URI
https://hdl.handle.net/10371/184005
DOI
https://doi.org/10.1109/TCSVT.2021.3137430
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share