摘要

This study used an active multiple-deviant oddball design to investigate the time-course of normalization processes that help listeners deal with between-speaker variability. Electroencephalograms were recorded while Dutch listeners heard sequences of non-words (standards and occasional deviants). Deviants were [ipapu] or [epsilon papu], and the standard was [(i) (epsilon)papu], where [I (epsilon)] was a vowel that was ambiguous between [epsilon] and [I]. These sequences were presented in two conditions, which differed with respect to the vocal-tract characteristics (i.e., the average 1st formant frequency) of the [papu] part, but not of the initial vowels [I], [epsilon] or [I epsilon] (these vowels were thus identical across conditions). Listeners more often detected a shift from [I(epsilon)papu] to [epsilon papu] than from [I(epsilon)papu] to [Ipapu] in the high F(1) context condition; the reverse was true in the low F(1) context condition. This shows that listeners' perception of vowels differs depending on the speaker's vocal-tract characteristics, as revealed in the speech surrounding those vowels. Cortical electrophysiological responses reflected this normalization process as early as about 120 ms after vowel onset, which suggests that shifts in perception precede influences due to conscious biases or decision strategies. Listeners' abilities to normalize for speaker-vocal-tract properties are for an important part the result of a process that influences representations of speech sounds early in the speech processing stream.

  • 出版日期2011-12