501
Views
16
CrossRef citations to date
0
Altmetric
Speech Recognition in Adverse Conditions

Speech-in-speech recognition: A training study

Pages 1089-1107 | Received 22 Nov 2010, Accepted 21 Dec 2011, Published online: 30 Apr 2012

REFERENCES

  • Audacity Team . 2006 . Audacity [Software] . Available from http://www.audacity.sourceforge.net .
  • Bamford , J. and Wilson , I. 1979 . “ Methodological considerations and practical aspects of the BKB sentence lists ” . In Speech-hearing tests and the spoken language of hearing-impaired children , Edited by: Bench , J. and Bamford , J. 148 – 187 . London : Academic Press .
  • Bamiou , D.-E. , Musiek , F. E. and Luxon , L. M. 2001 . Aetiology and clinical presentations of auditory processing disorder: A review . Archives of Disease in Childhood , 85 : 1 – 9 .
  • Bent , T. , Buchwald , A. and Pisoni , D. B. 2009 . Perceptual adaptation and intelligibility of multiple talkers for two types of degraded speech . Journal of the Acoustical Society of America , 126 ( 5 ) : 2660 – 2669 .
  • Bradlow , A. R. and Bent , T. 2008 . Perceptual adaptation to non-native speech . Cognition , 106 ( 2 ) : 707 – 729 .
  • Bradlow , A. R. and Pisoni , D. B. 1999 . Recognition of spoken words by native and non-native listeners: Talker-, listener-, and item-related factors . Journal of the Acoustical Society of America , 106 ( 4 ) : 2074 – 2085 .
  • Broersma , P ., & Weenink , D . 2009 . Praat: Doing phonetics by computer [Software] . Available from http://www.fon.hum.uva.nl/praat/.
  • Brouwer , S ., Van Engen , K. J ., Calandruccio , L ., & Bradlow , A. R . in press . Linguistic contributions to speech-on-speech masking for native and non-native listeners: Language familiarity and semantic content . Journal of the Acoustical Society of America .
  • Brungart , D. S. , Simpson , B. D. , Ericson , M. A. and Scott , K. R. 2001 . Informational and energetic masking effects in the perception of multiple simultaneous talkers . Journal of the Acoustical Society of America , 110 ( 5 ) : 2527 – 2538 .
  • Calandruccio , L. , Van Engen , K. J. , Dhar , S. and Bradlow , A. R. 2010 . The effects of clear speech as a masker . Journal of Speech, Language, and Hearing Research , 53 : 1458 – 1471 .
  • Clarke , C. M. and Garrett , M. F. 2004 . Rapid adaptation to foreign-accented English . Journal of the Acoustical Society of America , 101 ( 4 ) : 2299 – 2310 .
  • Clopper , C. G. and Pisoni , D. B. 2007 . Free classification of regional dialects of American English . Journal of Phonetics , 35 : 421 – 438 .
  • Cooke , M. , Garcia Lecumberri , M. L. and Barker , J. 2008 . The foreign language cocktail party problem: Energetic and informational masking effects on non-native speech perception . Journal of the Acoustical Society of America , 123 ( 1 ) : 414 – 427 .
  • Cycling' 74 . 2005 . Max/MSP 4.5 [Software] .
  • Davis , M. H. , Johnsrude , I. S. , Hervais-Adelman , A. , Taylor , K. and Carolyn , M. 2005 . Lexical information drives perceptual learning of distorted speech: Evidence from the comprehension of noise-vocoded sentences . Journal of Experimental Psychology: General , 134 ( 2 ) : 222 – 241 .
  • Dupoux , E. and Green , K. P. 1997 . Perceptual adjustment to highly compressed speech: Effects of talker and rate changes . Journal of Experimental Psychology: Human Perception and Performance , 23 : 914 – 927 .
  • Francis , A. L. , Nusbaum , H. C. and Fenn , K. 2007 . Effects of training on the acoustic phonetic representation of synthetic speech . Journal of Speech Language and Hearing Research , 50 : 1445 – 1465 .
  • Garcia Lecumberri , M. L. and Cooke , M. 2006 . Effect of masker type on native and non-native consonant perception in noise . Journal of the Acoustical Society of America , 119 ( 4 ) : 2445 – 2454 .
  • Greenspan , S. L. , Nusbaum , H. C. and Pisoni , D. B. 1988 . Perceptual learning of synthetic speech produced by rule . Journal of Experimental Psychology: Learning, Memory, and Cognition , 14 : 421 – 433 .
  • Hazan , V. and Simpson , A. 2000 . The effect of cue-enhancement on consonant intelligibility in noise: Speaker and listener effects . Language and Speech , 43 ( 3 ) : 273 – 294 .
  • Helfer , K. S. and Freyman , R. L. 2009 . Lexical and indexical cues in masking by competing speech . Journal of the Acoustical Society of America , 125 ( 1 ) : 447 – 456 .
  • Hervais-Adelman , A. , Davis , M. H. , Johnsrude , I. S. and Carlyon , R. P. 2008 . Perceptual learning of noise vocoded words: Effects of feedback and lexicality . Journal of Experimental Psychology: Human Perception and Performance , 34 ( 2 ) : 460 – 474 .
  • Hugdahl , K. , Heiervang , E. , Nordby , H. , Smievoll , A. I. , Steinmetz , H. , Stevenson , J. and Lund , A. 1998 . Central auditory processing, MRI morphometry, and brain laterality: Applications to dyslexia . Scandinavian Audiology , 27 ( Suppl ) : 26 – 34 .
  • Jaeger , T. F. 2008 . Categorical data analysis: Away from ANOVAs (transformation or not) and towards logit mixed models . Journal of Memory and Language , 59 : 434 – 446 .
  • Kidd , G. , Mason , C. R. , Richards , V. M. , Gallun , F. J. and Durlach , N. I. 2007 . “ Informational masking ” . In Auditory perception of sound sources , Edited by: Yost , W. A. , Popper , A. N. and Fay , R. R. 143 – 189 . US : Springer .
  • Killion , M. C. and Niquette , P. A. 2000 . What can the pure-tone audiogram tell us about a patient's SNR loss? . The Hearing Journal , 53 : 46 – 53 .
  • King , W. M. , Lombardino , L. J. , Crandell , C. C. and Leonard , C. M. 2003 . Comorbid auditory processing disorder in developmental dyslexia . Ear and Hearing , 24 : 448 – 456 .
  • Kraus , N. , McGee , T. , Carrell , T. D. , King , C. , Tremblay , K. and Nicol , T. 1995 . Central auditory system plasticity associated with speech discrimination training . Journal of Cognitive Neuroscience , 7 : 25 – 32 .
  • Liu , E. H. , Mercado , E. I. , Church , B. A. and Orduña , I. 2008 . The easy-to-hard effect in human (homo sapiens) and rat (rattus norvegicus) auditory identification . Journal of Comparative Psychology , 122 ( 2 ) : 132 – 145 .
  • Lively , S. E. , Logan , J. S. and Pisoni , D. B. 1993 . Training Japanese listeners to identify English /r/ and /l/. II: The role of phonetic environment and talker variability in learning new perceptual categories . Journal of the Acoustical Society of America , 94 ( 3 ) : 1242 – 1255 .
  • Loebach , J. L. and Pisoni , D. B. 2008 . Perceptual learning of spectrally degraded speech and environmental sounds . Journal of the Acoustical Society of America , 123 ( 2 ) : 1126 – 1139 .
  • Logan , J. S. , Lively , S. E. and Pisoni , D. B. 1991 . Training Japanese listeners to identify English /r/ and /l/: A first report . Journal of the Acoustical Society of America , 89 ( 2 ) : 874 – 886 .
  • Lunner , T. 2003 . Cognitive function in relation to hearing aid use . International Journal of Audiology , 42 : S49 – S58 .
  • Mayo , L. H. , Florentine , M. and Buus , S. 1997 . Age of second-language acquisition and perception of speech in noise . Journal of Speech, Language and Hearing Research , 40 : 686 – 693 .
  • McGarr , N. S. 1983 . The intelligibility of deaf speech to experienced and inexperienced listeners . Journal of Speech and Hearing Research , 26 : 451 – 458 .
  • Mullenix , J. , Pisoni , D. B. and Martin , C. S. 1989 . Some effects of talker variability on spoken word recognition . Journal of the Acoustical Society of America , 85 : 365 – 378 .
  • Nilsson , M. , Soli , S. D. and Sullivan , J. A. 1994 . Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise . Journal of the Acoustical Society of America , 95 ( 2 ) : 1085 – 1099 .
  • Nygaard , L. C. and Pisoni , D. B. 1998 . Talker-specific learning in speech perception . Perception and Psychophysics , 60 : 355 – 376 .
  • Nygaard , L. C. , Sommers , M. S. and Pisoni , D. B. 1994 . Speech perception as a talker contingent process . Psychological Science , 5 : 42 – 46 .
  • Pallier , C. , Sebastian Galles , N. , Dupoux , E. , Christophe , A. and Mehler , J. 1998 . Perceptual adjustment to time-compressed speech: A cross-linguistic study . Memory and Cognition , 26 ( 4 ) : 844 – 851 .
  • Parbery-Clark , A. , Skoe , E. , Lam , C. and Kraus , N. 2009 . Musician enhancement for speech-in-noise . Ear and Hearing , 30 ( 6 ) : 653 – 661 .
  • Pearson Education Inc . 2008 . Wechsler adult intelligence scale , 4th ed . San Antonio , TX : Pearson Education, Inc .
  • Peters , R. W. , Moore , B. C. and Baer , T. 1998 . Speech reception thresholds in noise with and without spectral and temporal dips for hearing-imparied and normally-hearing people . Journal of the Acoustical Society of America , 103 : 577 – 587 .
  • Plomp , R. and Mimpen , A. M. 1979 . Speech-reception threshold for sentences as a function of age and noise level . Journal of the Acoustical Society of America , 66 ( 5 ) : 1333 – 1342 .
  • R Development Core Team . 2005 . R: A language and environment for statistical computing . Vienna : R Foundation for Statistical Computing .
  • Rogers , C. L. , Lister , J. J. , Febo , D. M. , Besing , J. M. and Abrams , H. B. 2006 . Effects of bilingualism, noise, and reverberation on speech perception by listeners with normal hearing . Applied Psycholinguistics , 27 : 465 – 485 .
  • Schwab , E. C. , Nusbaum , H. C. and Pisoni , D. B. 1985 . Effects of training on the perception of synthetic speech . Human Factors , 27 : 395 – 408 .
  • Sidaras , S. K. , Alexander , J. E. D. and Nygaard , L. C. 2009 . Perceptual learning of systematic variation in Spanish-accented speech . Journal of the Acoustical Society of America , 125 ( 5 ) : 3306 – 3316 .
  • Smiljanic , R. and Bradlow , A. R. 2005 . Production and perception of clear speech in Croatian and English . Journal of the Acoustical Society of America , 118 ( 3 ) : 1677 – 1688 .
  • Smoorenburg , G. F. 1992 . Speech reception in quiet and in noisy conditions by individuals with noise-induced hearing loss in relation to their tone audiogram . Journal of the Acoustical Society of America , 91 ( 1 ) : 421 – 437 .
  • Song , J. H. , Skoe , E. , Banai , K. and Kraus , N. 2011 . Training to improve speech in noise perception: Biological mechanisms . Cerebral Cortex , 122 : 1890 – 1898 .
  • Song , J. H. , Skoe , E. , Wong , P. C. M. and Kraus , N. 2008 . Plasiticity in the adult human auditory brainstem following short-term linguistic training . Journal of Cognitive Neuroscience , 20 ( 10 ) : 1892 – 1902 .
  • Tremblay , K. L. and Kraus , N. 2002 . Auditory training induces asymmetrical changes in cortical neural activity . Journal of Speech, Language and Hearing Research , 45 ( 3 ) : 564 – 572 .
  • Tremblay , K. , Kraus , N. , Carrell , T. D. and McGee , T. 1997 . Central auditory system plasticity: Generalization to novel stimuli following listening training . Journal of the Acoustical Society of America , 102 ( 6 ) : 3762 – 3773 .
  • Van Engen , K. J . 2010a . Similarity and familiarity: Second language sentence recognition in first- and second-language multi-talker babble . Speech Communication , 52 , 943 – 953 .
  • Van Engen , K. J . 2010b . Linguistic factors in speech-in-speech perception . Evanston , IL : Northwestern University .
  • Van Engen , K. J. and Bradlow , A. R. 2007 . Sentence recognition in native- and foreign-language multi-talker background noise . Journal of the Acoustical Society of America , 121 ( 1 ) : 519 – 526 .
  • Van Wijngaarden , S. , Steeneken , H. and Houtgast , T. 2002 . Quantifying the intelligibility of speech in noise for non-native listeners . Journal of the Acoustical Society of America , 111 : 1906 – 1916 .
  • Wang , Y. , Jongman , A. and Sereno , J. A. 2003 . Acoustic and perceptual evaluation of Mandarin tone productions before and after perceptual training . Journal of the Acoustical Society of America , 113 ( 2 ) : 1033 – 1043 .
  • Wang , Y. , Spence , M. M. , Jongman , A. and Sereno , J. A. 1999 . Training American listeners to perceive Mandarin tones . Journal of the Acoustical Society of America , 106 ( 6 ) : 3649 – 3658 .
  • Weil , S. A . 2001 . Foreign-accented speech: Adaptation and generalization . Columbus , OH : The Ohio State University .
  • Wong , P. C. M. and Perrachione , T. K. 2007 . Learning pitch patterns in lexical identification by native English-speaking adults . Applied Psycholinguistics , 28 : 565 – 585 .
  • Wright , B. A. , Lombardino , L. J. , King , W. M. , Puranik , C. S. , Leonard , C. M. and Merzenich , M. M. 1997 . Deficits in auditory temporal resolution and spectral resolution in language-impaired children . Nature , 387 : 176 – 178 .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.