Multilingual representations for low-resource speech processing

Date Posted: February 10, 2017

Date Recorded: February 3, 2017

Speaker(s): Brian Kingsbury

Associated CBMM Pages:

CBMM Workshop on Speech Representation, Perception and Recognition

Description:

Brian Kingsbury, IBM

Abstract: A key to achieving good automatic speech recognition performance has been the availability of vast amounts of labeled and unlabeled speech and text data for training speech models. However, there are thousands of languages in the world that we would like to process automatically, and it is impractical to count on having thousands of hours of speech and billions of words of text in every one of them. In this talk I will describe how multilingual speech representations learned from a variety of languages can reduce the amount of data needed to train a speech recognition system in a new language.

Associated Research Thrust:

Theoretical Frameworks for Intelligence
Exploring Future Directions

The Center for Brains, Minds & Machines
