Computational Methods for Integrating Vision and Language

by Kenichi Kanatani

★★★★☆
4.3 (554)

US$27.50

15% OFF CODE: SAVE15

Description

Modeling data from visual and linguistic modalities together creates opportunities for better understanding of both, and supports many useful applications. Examples of dual visual-linguistic data includes images with keywords, video with narrative, and figures in documents. We consider two key task-driven themes: translating from one modality to another (e.g., inferring annotations for images) and understanding the data using all modalities, where one modality can help disambiguate information i