Original Articles

Auditory sensitivity to formant ratios: Toward an account of vowel normalisation

Pages 808-839 | Received 11 Jun 2009, Published online: 17 Jun 2010

References

  • Adank , P. , Smits , R. and van Hout , R. 2004 . A comparison of vowel normalization procedures for language variation research . Journal of the Acoustical Society of America , 116 : 3099 – 3107 .
  • Baayen , R. H. 2008 . Analyzing linguistic data: A practical introduction to statistics using R , Cambridge : Cambridge University Press .
  • Bonte , M. , Valente , G. and Formisano , E. 2009 . Dynamic and task-dependent encoding of speech and voice by phase reorganization of cortical oscillations . Journal of Neuroscience , 29 : 1699 – 1706 .
  • Broad , D. J. and Wakita , H. 1977 . Piecewise-planar representation of vowel formant frequencies . Journal of the Acoustical Society of America , 62 : 1467 – 1473 .
  • Bybee , J. 2001 . Phonology and language use , Cambridge : Cambridge University Press .
  • Claes , T. , Dologlou , I. , ten Bosch , L. and van Compernolle , D. 1998 . A novel feature transformation for vocal tract length normalization in automatic speech recognition . IEEE Transactions on Speech and Audio Processing , 6 : 549 – 557 .
  • de Cheveigné , A. and Simon , J. Z. 2007 . Denoising based on time-shift PCA . Journal of Neuroscience Methods , 165 : 297 – 305 .
  • Delattre , P. , Liberman , A. M. , Cooper , F. S. and Gerstman , L. J. 1952 . An experimental study of the acoustic determinants of vowel color: Observations on one- and two-formant vowels synthesized from spectrographic patterns . Word , 8 : 195 – 210 .
  • Delgutte , B. and Kang , N. Y. S. 1984 . Speech coding in the auditory nerve: I. Vowel-like sounds . Journal of the Acoustical Society of America , 75 : 866 – 878 .
  • Deng , L. and O'Shaughnessy , D. 2003 . Speech processing: A dynamic and optimization-oriented approach , New York : Marcel Dekker .
  • Diesch , E. , Eulitz , C. , Hampson , S. and Ross , B. 1996 . The neurotopography of vowels as mirrored by evoked magnetic field measurements . Brain and Language , 53 : 143 – 168 .
  • Diesch , E. and Luce , T. 1997 . Magnetic fields elicited by tones and vowel formants reveal tonotopy and nonlinear summation of cortical activation . Psychophysiology , 34 : 501 – 510 .
  • Disner , S. F. 1980 . Evaluation of vowel normalization procedures . Journal of the Acoustical Society of America , 67 : 253 – 261 .
  • Eulitz , C. , Diesch , E. , Pantev , C. , Hampson , S. and Elbert , T. 1995 . Magnetic and electric brain activity evoked by the processing of tone and vowel stimuli . Journal of Neuroscience , 15 : 2748 – 2755 .
  • Fant , G. 1960 . Acoustic theory of speech production , The Hague : Mouton .
  • Fitch , R. H. , Miller , S. and Tallal , P. 1997 . Neurobiology of speech perception . Annual Review of Neuroscience , 20 : 331 – 353 .
  • Fitch , W. T. and Giedd , J. 1999 . Morphology and development of the human vocal tract: A study using magnetic resonance imaging . Journal of the Acoustical Society of America , 106 : 1511 – 1522 .
  • Formisano , E. , de Martino , F. , Bonte , M. and Goebel , R. 2008 . “Who” is saying “What”? Brain-based decoding of human voice and speech . Science , 322 : 970 – 973 .
  • Fox , R. A. , Jacewicz , E. and Feth , L. L. 2008 . Spectral integration of dynamic cues in the perception of syllable initial stops . Phonetica , 65 : 19 – 44 .
  • Frye , R. E. , Rezaie , R. and Papanicolaou , A. C. 2009 . Functional neuroimaging of language using magnetoencephalography . Physics of Life Reviews , 6 : 1 – 10 .
  • Fujioka , T. , Ross , B. , Okamoto , H. , Takeshima , Y. , Kakigi , R. and Pantev , C. 2003 . Tonotopic representation of missing fundamental complex sounds in the human auditory cortex . European Journal of Neuroscience , 18 : 432 – 440 .
  • Fujisaki , H. and Kawashima , T. 1968 . The role of pitch and higher formants in the perception of vowels . IEEE Transactions on Audio and Electroacoustics , AU-16 : 73 – 77 .
  • Gage , N. , Roberts , T. P. L. and Hickok , G. 2006 . Temporal resolution properties of human auditory cortex: Reflections in the neuromagnetic auditory evoked M100 component . Brain Research , 1069 : 166 – 171 .
  • Goldinger , S. D. 1996 . Words and voices: Episodic traces in spoken word identification and recognition memory . Journal of Experimental Psychology: Learning, Memory and Cognition , 22 : 1166 – 1183 .
  • Govindarajan , K. K. , Phillips , C. , Poeppel , D. , Roberts , T. P. L. and Marantz , A. 1998 . Latency of MEG M100 response indexes first formant frequency . Journal of the Acoustical Society of America , 103 : 2982 – 2983 .
  • Halberstam , B. and Raphael , L. J. 2004 . Vowel normalization: The role of fundamental frequency and upper formants . Journal of Phonetics , 32 : 423 – 434 .
  • Hari , R. , Aittoniemi , K. , Järvinen , M. L. , Katila , T. and Varpula , T. 1980 . Auditory evoked transient and sustained magnetic fields of the human brain: Localization of neural generators . Experimental Brain Research , 40 : 237 – 240 .
  • Hari , R. , Levänen , S. and Raij , T. 2000 . Timing of human cortical functions during cognition . Trends in Cognitive Sciences , 4 : 455 – 462 .
  • Hickok , G. and Poeppel , D. 2007 . The cortical organization of speech processing . Nature Reviews Neuroscience , 8 : 393 – 402 .
  • Hillenbrand , J. M. , Getty , L. A. , Clark , M. J. and Wheeler , K. 1995 . Acoustic characteristics of American English vowels . Journal of the Acoustical Society of America , 97 : 3099 – 3111 .
  • Huber , J. E. , Stathopoulos , E. T. , Curione , G. M. , Ash , T. A. and Johnson , K. 1999 . Formants of children, women, and men: The effects of vocal intensity variation . Journal of the Acoustical Society of America , 106 : 1532 – 1542 .
  • Irino , T. and Patterson , R. D. 2002 . Segregating information about the size and shape of the vocal tract using a time-domain auditory model: The stabilised wavelet-Mellin transform . Speech Communication , 36 : 181 – 203 .
  • Ives , D. T. , Smith , D. R. R. and Patterson , R. D. 2005 . Discrimination of speaker size from syllable phrases . Journal of the Acoustical Society of America , 118 : 3816 – 3822 .
  • Johnson , K. 1997 . “ Speech perception without speaker normalization ” . In Talker variability in speech processing , Edited by: Johnson , K. and Mullennix , J. W. 145 – 165 . San Diego, CA : Academic Press .
  • Johnson , K. 2005 . “ Speaker normalization in speech perception ” . In The handbook of speech perception , Edited by: Pisoni , D. B. and Remez , R. E. 363 – 389 . Oxford : Blackwell .
  • Kuhl , P. K. 1979 . Speech perception in early infancy: Perceptual constancy for spectrally dissimilar vowel categories . Journal of the Acoustical Society of America , 66 : 1668 – 1679 .
  • Kuhl , P. K. 1983 . Perception of auditory equivalence classes for speech in early infancy . Infant Behavior & Development , 6 : 263 – 285 .
  • Ladefoged , P. and Broadbent , D. E. 1957 . Information conveyed by vowels . Journal of the Acoustical Society of America , 29 : 98 – 104 .
  • Ladefoged , P. and Maddieson , I. 1996 . The sounds of the world's languages , Oxford : Blackwell .
  • Lloyd , R. J. 1890 . Speech sounds: Their nature and causation . Phonetische Studien , 3 : 251 – 278 .
  • Lounasmaa , O. V. , Hämäläinen , M. , Hari , R. and Salmelin , R. 1996 . Information processing in the human brain: Magnetoencephalographic approach . Proceedings of the National Academy of Sciences , 93 : 8809 – 8815 .
  • Mäkelä , A. M. , Alku , P. and Tiitinen , H. 2003 . The auditory N1m reveals the left-hemispheric representation of vowel identity in humans . Neuroscience Letters , 353 : 111 – 114 .
  • McQueen , J. M. , Cutler , A. and Norris , D. 2006 . Phonological abstraction in the mental lexicon . Cognitive Science , 30 : 1113 – 1126 .
  • Miller , J. D. 1989 . Auditory-perceptual interpretation of the vowel . Journal of the Acoustical Society of America , 85 : 2114 – 2134 .
  • Miyawaki , K. , Strange , W. , Verbrugge , R. , Liberman , A. M. , Jenkins , J. J. and Fujimura , O. 1975 . An effect of linguistic experience: The discrimination of [r] and [l] by native speakers of Japanese and English . Perception & Psychophysics , 18 : 331 – 340 .
  • Monahan , P. J. , de Souza , K. and Idsardi , W. J. 2008 . Neuromagnetic evidence for early auditory restoration of fundamental pitch . PLoS ONE , 3 : e2900 .
  • Näätänen , R. and Picton , T. 1987 . The N1 wave of the human electric and magnetic response to sound: A review and an analysis of the component structure . Psychophysiology , 24 : 375 – 425 .
  • Nearey , T. M. 1989 . Static, dynamic, and relational properties in vowel perception . Journal of the Acoustical Society of America , 85 : 2088 – 2113 .
  • Norris , D. , McQueen , J. M. and Cutler , A. 2003 . Perceptual learning in speech . Cognitive Psychology , 47 : 204 – 238 .
  • Obleser , J. and Eisner , F. 2009 . Pre-lexical abstraction of speech in the auditory cortex . Trends in Cognitive Sciences , 13 : 14 – 19 .
  • Obleser , J. , Lahiri , A. and Eulitz , C. 2004 . Magnetic brain response mirrors extraction of phonological features from spoken vowels . Journal of Cognitive Neuroscience , 16 : 31 – 39 .
  • Ohl , F. W. and Scheich , H. 1997 . Orderly cortical representation of vowels based on formant interaction . Proceedings of the National Academy of Sciences , 94 : 9440 – 9444 .
  • Oldfield , R. C. 1971 . Assessment and analysis of handedness: Edinburgh inventory . Neuropsychologia , 9 : 97 – 113 .
  • Peterson , G. E. 1951 . The phonetic value of vowels . Language , 27 : 541 – 553 .
  • Peterson , G. E. 1961 . Parameters of vowel quality . Journal of Speech and Hearing Research , 4 : 10 – 29 .
  • Peterson , G. E. and Barney , H. L. 1952 . Control methods used in a study of the vowels . Journal of the Acoustical Society of America , 24 : 175 – 184 .
  • Phillips , C. 2001 . Levels of representation in the electrophysiology of speech perception . Cognitive Science , 25 : 711 – 731 .
  • Picton , W. , Woods , D. L. , Baribeau-Braun , J. and Healey , T. M. 1976 . Evoked potential audiometry . Journal of Otolaryngology , 6 : 90 – 119 .
  • Pierrehumbert , J. B. 2002 . “ Word-specific phonetics ” . In Laboratory phonology 7 , Edited by: Gussenhoven , C. and Warner , N. 101 – 139 . Berlin : Mouton de Gruyter .
  • Pisoni , D. B. 1997 . “ Some thoughts on “normalization” in speech perception ” . In Talker variability in speech processing , Edited by: Johnson , K. and Mullennix , J. W. 9 – 31 . San Diego, CA : Academic Press .
  • Poeppel , D. , Phillips , C. , Yellin , E. , Rowley , H. A. , Roberts , T. P. L. and Marantz , A. 1997 . Processing of vowels in supratemporal auditory cortex . Neuroscience Letters , 221 : 145 – 148 .
  • Potter , R. K. and Steinberg , J. C. 1950 . Toward the specification of speech . Journal of the Acoustical Society of America , 22 : 807 – 820 .
  • Purnell , T. , Idsardi , W. and Baugh , J. 1999 . Perceptual and phonetic experiments on American English dialect identification . Journal of Language and Social Psychology , 18 : 10 – 30 .
  • R Development Core Team 2006 . R: A language and environment for statistical computing , Vienna : R Foundation for Statistical Computing . Retrieved from http://www.r-project.org
  • Roberts , T. P. L. , Ferrari , P. , Stufflebeam , S. M. and Poeppel , D. 2000 . Latency of the auditory evoked neuromagnetic field components: Stimulus dependence and insights toward perception . Journal of Clinical Neurophysiology , 17 : 114 – 129 .
  • Roberts , T. P. L. , Flagg , E. J. and Gage , N. M. 2004 . Vowel categorization induces departure of M100 latency from acoustic prediction . Neuroreport , 15 : 1679 – 1682 .
  • Roberts , T. P. L. and Poeppel , D. 1996 . Latency of auditory evoked M100 as a function of tone frequency . Neuroreport , 7 : 1138 – 1140 .
  • Rosner , B. S. and Pickering , J. B. 1994 . Vowel perception and production , Oxford : Oxford University Press .
  • Sachs , M. B. and Young , E. D. 1979 . Encoding of steady-state vowels in the auditory nerve: Representation in terms of discharge rate . Journal of the Acoustical Society of America , 66 : 470 – 479 .
  • Sarvas , J. 1987 . Basic mathematical and electromagnetic concepts of the biomagnetic inverse problem . Physics in Medicine and Biology , 32 : 11 – 22 .
  • Scott , S. K. and Johnsrude , I. S. 2003 . The neuroanatomical and functional organization of speech perception . Trends in Neurosciences , 26 : 100 – 107 .
  • Slawson , A. W. 1968 . Vowel quality and musical timbre as functions of spectrum envelopes and fundamental frequency . Journal of the Acoustical Society of America , 43 : 87 – 101 .
  • Smith , D. R. R. and Patterson , R. D. 2005 . The interaction of glottal-pulse rate and vocal-tract length in judgements of speaker size, sex, and age . Journal of the Acoustical Society of America , 118 : 3177 – 3186 .
  • Smith , D. R. R. , Patterson , R. D. , Turner , R. , Kawahara , H. and Irino , T. 2005 . The processing and perception of size information in speech sounds . Journal of the Acoustical Society of America , 117 : 305 – 318 .
  • Stevens , K. N. 1998 . Acoustic phonetics , Cambridge, MA : MIT Press .
  • Stevens , K. N. and Bickley , C. 1991 . Constraints among parameters simplify control of Klatt formant synthesizer . Journal of Phonetics , 19 : 161 – 174 .
  • Stevens , S. S. and Volkmann , J. 1940 . The relation of pitch to frequency: A revised scale . The American Journal of Psychology , 53 : 329 – 353 .
  • Strange , W. 1989 . Evolving theories of vowel perception . Journal of the Acoustical Society of America , 85 : 2081 – 2087 .
  • Strange , W. , Jenkins , J. J. and Johnson , T. L. 1983 . Dynamic specification of coarticulated vowels . Journal of the Acoustical Society of America , 74 : 695 – 705 .
  • Sussman , H. M. 2000 . Phonemic representation: A twenty-first century challenge . Brain and Language , 71 : 237 – 240 .
  • Syrdal , A. K. and Gopal , H. S. 1986 . A perceptual model of vowel recognition based on the auditory representation of American English vowels . Journal of the Acoustical Society of America , 79 : 1086 – 1100 .
  • Tiitinen , H. , Mäkelä , A. M. , Mäkinen , V. , May , P. J. and Alku , P. 2005 . Disentangling the effects of phonation and articulation: Hemispheric asymmetries in the auditory N1m response of the human brain . BMC Neuroscience , 6 : 62 .
  • Tiitinen , H. , Sivonen , P. , Alku , P. , Virtanen , J. and Näätänen , R. 1999 . Electromagnetic recordings reveal latency differences in speech and tone processing in humans . Cognitive Brain Research , 8 : 355 – 363 .
  • Virtanen , J. , Ahveninen , J. , Ilmoniemi , R. J. , Näätänen , R. and Pekkonen , E. 1998 . Replicability of MEG and EEG measures of the auditory N1/N1m-response . Electroencephalography and Clinical Neurophysiology , 108 : 291 – 298 .
  • Young , E. D. and Sachs , M. B. 1979 . Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers . Journal of the Acoustical Society of America , 66 : 1381 – 1403 .
  • Zahorian , S. A. and Jagharghi , J. 1993 . Spectral-shape features versus formants as acoustic correlates for vowels . Journal of the Acoustical Society of America , 94 : 1966 – 1982 .
  • Zwicker , E. 1961 . Subdivision of the audible frequency range into critical bands . The Journal of the Acoustical Society of America , 33 : 248 .
