A data explanatory account of speech perception (and its limits)
- Speech Representation, Perception and Recognition
Bob McMurray, U. Iowa
Abstract: One of the most challenging aspects of speech perception is the rampant variability in the signal. One consequence of this variability is that purely bottom up approaches to categorizing phonemes have not consistently been successful. Top down accounts--analogous to predictive coding in vision--may offer more leverage. Drawing on phonetic analyses, computational models and behavioral experiments I suggest that if listeners recode incoming acoustic information relative to expectations, this variability can largely be explained away, allowing for much more robust categorization. Predicts of this account are tested with ERP studies and studies of anticipatory processing. But, this account may also have limits. Eye-tracking studies on the timecourse of cue integration, and behavioral studies of categorization suggest phenomena that do not fall neatly under this rubric.
have an interactive transcript feature enabled, which appears below the video when playing. Viewers can search for keywords in the video or click on any word in the transcript to jump to that point in the video. When searching, a dark bar with white vertical lines appears below the video frame. Each white line is an occurrence of the searched term and can be clicked on to jump to that spot in the video.
LEARNING