Vision and Language

Research Thrust: Vision and Language

Shimon Ullman

Vision can be combined with aspects of language and social cognition to obtain and communicate complex knowledge about the surrounding environment, for example, to answer a large and flexible set of queries about objects and agents in an image or video in a human-like manner, as captured in the CBMM Challenge. These lectures survey current approaches to achieving this understanding from visual input, the START natural language system, and recent efforts to bridge these capabilities. The last lecture of this series addresses a cognitive ability that distinguishes human intelligence from that of other primates: the ability to tell, understand, and recombine stories.

Presentations

Shimon Ullman: Visual Understanding: State of the World, Future Directions

Topics: Overview of visual understanding; object categorization and variability in appearance within categories; recognizing individuals; identifying object parts; learning categories from examples by combining different features (simple to complex) and classifiers; visual classes as similar configurations of image components; finding optimal features that maximize mutual information for the class vs. non-class distinction (Ullman et al., Nature Neuroscience 2002); SIFT and HOG features; the HMAX model; state-of-the-art systems from the PASCAL challenge vs. human performance; deep learning and convolutional neural nets (e.g. ImageNet); unsupervised learning methods; fMRI and EEG studies indicating high correlation between the informativeness of image patches and activity in higher-level visual “object” areas (e.g. LOC); recognition of object parts with hierarchies of sub-fragments at multiple scales (Ullman et al., PNAS 2008); object segmentation (e.g. Malik et al.; Brandt, Sharon, Basri, Nature 2006), using top-down semantic information to enhance segmentation; future challenges, including recognizing what people are doing, interactions between agents, task-dependent image analysis (e.g. answering queries), visual routines, and using vision to learn conceptual knowledge in a new domain
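
The mutual-information feature selection mentioned above can be made concrete in a few lines. This is a minimal sketch, assuming binary fragment detections (in the actual method a fragment fires when its normalized correlation with some image region exceeds a threshold); the toy detection data are invented for illustration:

```python
import numpy as np

def mutual_information(detected, is_class):
    """Estimate I(F; C) in bits between a binary fragment-detection
    variable F and a binary class label C from paired samples."""
    mi = 0.0
    for f in (0, 1):
        for c in (0, 1):
            p_fc = np.mean((detected == f) & (is_class == c))
            p_f = np.mean(detected == f)
            p_c = np.mean(is_class == c)
            if p_fc > 0:  # zero-probability cells contribute nothing
                mi += p_fc * np.log2(p_fc / (p_f * p_c))
    return mi

# Toy data: one candidate fragment's detections on 8 images,
# 4 of which contain the target class.
detections = np.array([1, 1, 1, 0, 0, 1, 0, 0])
labels     = np.array([1, 1, 1, 1, 0, 0, 0, 0])
print(mutual_information(detections, labels))  # higher = more informative fragment
```

Ranking all candidate fragments by this score and greedily keeping those that add information yields the "intermediate complexity" fragments the lecture describes.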

Boris Katz: Telling Machines about the World, and Daniel Harari: Innate Mechanisms and Learning: Developing Complex Visual Concepts from Unlabeled Natural Dynamic Scenes

Topics: (Boris Katz) Limitations of recent AI successes (Goggles, Kinect, Watson, Siri); brief history of computer vision system performance; scene understanding tasks: object detection, verification, identification, categorization, recognition of activities or events, spatial and temporal relationships between objects, explanation (e.g. what past events caused the scene to look as it does?), prediction (e.g. what will happen next?), filling in gaps in objects and events; enhancing computer vision systems by combining vision and language processing (e.g. creating a knowledge base about objects for the scene recognition system and testing performance with natural language questions); overview of the START system: syntactic analysis producing parse trees, semantic representation using ternary expressions, language generation, matching of ternary expressions and transformational rules, replying to questions, the object-property-value data model, decomposition of complex questions into simpler ones; recent progress on understanding and describing simple activities in video; (Daniel Harari) supervised vs. unsupervised learning; the infeasibility of obtaining labeled training data for all visual concepts; toward social understanding: hand recognition and following gaze direction; toward scene understanding: object segmentation and containment; detecting “mover” events as a pattern of interaction between a moving hand and an object (co-training on appearance and context); using mover events to generate training data for a kNN classifier that determines the direction of gaze; a model for object segmentation using common motion and motion discontinuity; learning the concept of containment
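
START's ternary-expression representation is compact enough to sketch. The toy knowledge base and matching helper below are hypothetical; the actual system layers parsing, language generation, and transformational rules on top of this kind of matching:

```python
# Minimal sketch of START-style ternary expressions (subject, relation, object)
# and pattern matching with variables. Facts and helper names are illustrative.

knowledge = [
    ("hand", "touch", "cup"),
    ("cup", "contain", "coffee"),
]

def match(pattern, fact, bindings):
    """Unify a pattern whose '?'-prefixed terms are variables
    against a ground ternary expression; return bindings or None."""
    bindings = dict(bindings)
    for p, f in zip(pattern, fact):
        if p.startswith("?"):
            if bindings.get(p, f) != f:  # variable already bound differently
                return None
            bindings[p] = f
        elif p != f:
            return None
    return bindings

def query(pattern):
    """Answer a question by matching its pattern against stored facts."""
    return [b for fact in knowledge
            if (b := match(pattern, fact, {})) is not None]

print(query(("?x", "contain", "coffee")))  # -> [{'?x': 'cup'}]
```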

Andrei Barbu: From Language to Vision and Back Again

Topics: Importance of bridging low-level perception with high-level cognition; a model system for a limited domain that can (1) recognize how well a sentence describes a video, (2) retrieve sample videos for which a sentence is true, (3) generate language descriptions and answer questions about videos, (4) acquire language concepts, (5) use video to resolve language ambiguity, (6) translate between languages, and (7) guide planning; determining whether a sentence describes a video involves recognizing participants, movements, directions, and relationships; overview of a system that starts with many unreliable detections, uses HMMs to track coherently moving objects and recognize words from tracks, and obtains information about participants and relations from a dependency parser (e.g. START) that encodes sentence structure; a similar approach is used to generate sentences and answer questions about videos (combining trackers and words); examples involving simple objects and agents performing actions such as approach, pick up, and put down; translation between languages via imagination of videos depicting sentences
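
To picture how a word model can score a pair of tracks, here is a minimal sketch in the spirit of the HMM approach described above. The two-state "approach" model, its parameters, and the single distance-decreasing feature are illustrative stand-ins for the richer detector-based features the system uses:

```python
import numpy as np

def observations(track_a, track_b):
    """Per-frame feature: 1 if the distance between the two tracked
    objects decreased since the previous frame, else 0. (Illustrative.)"""
    d = np.linalg.norm(np.asarray(track_a, float) - np.asarray(track_b, float), axis=1)
    return (np.diff(d) < 0).astype(int)

# Toy HMM for the word "approach": state 0 = far apart, state 1 = near.
start = np.array([0.9, 0.1])
trans = np.array([[0.7, 0.3],    # far tends to stay far or become near
                  [0.1, 0.9]])   # near tends to stay near
emit  = np.array([[0.4, 0.6],    # P(obs | far): distance likely shrinking
                  [0.7, 0.3]])   # P(obs | near): less motion toward

def log_likelihood(obs):
    """Forward algorithm: log P(observations | word model)."""
    alpha = start * emit[:, obs[0]]
    for o in obs[1:]:
        alpha = (alpha @ trans) * emit[:, o]
    return np.log(alpha.sum())

agent  = [(0, 0), (1, 0), (2, 0), (3, 0)]   # track moving right
object = [(5, 0), (5, 0), (5, 0), (5, 0)]   # stationary track
print(log_likelihood(observations(agent, object)))  # higher = better fit for "approach"
```

Sentence-level scoring then combines one such word model per lexical item, with the parser dictating which tracks fill each word's participant roles.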

Patrick Winston: The Story Understanding Story

Topics: Brief history of AI and arguments against the possibility of artificial intelligence; emergence of symbolic processing capability through evolution; the strong story hypothesis: the ability to tell, understand, and recombine stories distinguishes human intelligence from that of other primates; understanding the story of Macbeth: how to answer questions about information that is not explicit, such as whether Duncan is dead at the end; use of inference rules, explanation rules, and concept patterns; the Genesis system for story understanding, which can find connections between events, integrate the cultural background of the reader, answer questions about motives, assess similarity between stories, and interpret stories from different domains such as politics and conflict, e.g. understanding analogies between the US-Viet Cong and Arab-Israeli conflicts; the social animal hypothesis; the directed perception hypothesis
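
A minimal forward-chaining sketch shows how an inference rule can make an implicit fact, such as Duncan's death, explicit. The rule syntax and facts below are hypothetical and not the actual Genesis representation:

```python
# Story facts as (subject, verb, object) triples; rules derive new facts.
story = {("Macbeth", "murders", "Duncan")}

# Each rule: (antecedent pattern, consequent pattern). '?'-prefixed
# terms are variables; this sketch assumes subject/object slots in
# antecedents are always variables.
rules = [
    (("?x", "murders", "?y"), ("?y", "is", "dead")),
    (("?x", "murders", "?y"), ("?x", "harms", "?y")),
]

def apply_rules(facts, rules):
    """Apply rules repeatedly until no new facts are derived."""
    changed = True
    while changed:
        changed = False
        for (sv, vb, ov), (cs, cv, co) in rules:
            for (s, v, o) in list(facts):
                if v == vb:
                    bind = {sv: s, ov: o}
                    new = (bind.get(cs, cs), cv, bind.get(co, co))
                    if new not in facts:
                        facts.add(new)
                        changed = True
    return facts

apply_rules(story, rules)
print(("Duncan", "is", "dead") in story)  # True: the implicit fact is now explicit
```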