Acoustic word embeddings

Acoustic word embeddings
  • Speech Representation, Perception and Recognition

Associated CBMM Pages: 

Date Posted: 

February 8, 2017


Karen Livescu, TTI

Abstract: For a number of speech tasks, it can be useful to represent speech segments of arbitrary length by fixed-dimensional vectors, or embeddings.  In particular, vectors representing word segments -- acoustic word embeddings -- can be used in query-by-example tasks, example-based speech recognition, or spoken term discovery.  *Textual* word embeddings have been common in natural language processing for a number of years now; the acoustic analogue is only recently starting to be explored.  This talk will present our work on acoustic word embeddings, including a variety of models in unsupervised, weakly supervised, and supervised settings.