Some Thoughts on "Normalization" in Speech Perception

 

Author: David B. Pisoni

Abstract:
One of the central problems in speech perception concerns stimulus variability, specifically, the mapping relations between acoustic attributes of the signal and linguistic categories resulting from perceptual analysis. Previous accounts of speech perception have treated variability as an undesirable source of noise in the signal that must be reduced or eliminated through a process of perceptual normalization. In this paper, I address what people mean by the term normalization in speech perception and examine the definitions and some of the implicit assumptions that have been made about it in the past. Then I summarize the results of several recent experiments of talker variability and perceptual learning which show that listeners encode fine stimulus details and use indexical attributes of speech in word recognition and sentence perception. Finally, I describe an alternative view of speech perception that is based on ideas from nonanalytic cognition. According to this approach, stimulus variability is "lawful" and "informative" for perceptual analysis. Perceptual normalization, as discussed in past theoretical accounts, may not involve a true "loss" of information but rather may entail the encoding of specific instances and the details of perceptual analysis. The generation of "equivalent forms" may occur at the time of retrieval from memory as a result of computational processes rather than early in perceptual analysis and encoding as previously assumed. This approach to speech perception and word recognition provides a new way of dealing with a number of long-standing problems in the field.