109
Views
16
CrossRef citations to date
0
Altmetric
Original Articles

Encoding Sound Timbre in the Auditory System

Pages 145-156 | Published online: 26 Mar 2015

REFERENCES

  • Bregman, A S, Auditory scene analysis: The perceptual organization of sound, Cambridge, Massachusetts: MIT Press, 1991.
  • Moore B, An Introduction of the Psychology of Hearing, Academic Press, London, 1989.
  • Pickles J O, An Introduction to the Physiology of Hearing, Academic Press, 1988.
  • Auditory Computations, Edited by H Hawkins, E T Mc-Mullen, A Popper & R Fay, Springer Verlag, pp 221–270.
  • Center for Auditory and Acoustic Research, www.isr.umd.edu/CAAR
  • S Greenberg, The ear as a speech analyzer, Journal Phonetics, pp 139–150, 1988.
  • Clarey J et al, Physiology of thalamus and cortex. The Mammalian Auditory Pathway: Neurophysiology, Edited by D Webster, et al, Springer Verlag, pp 232–334,1992.
  • Shamma S, J Fleshman & P Wiser, Response Area Organization in the Ferret Primary Auditory Cortex, Journal Neurophys, 69(2), pp 367–383, 1993.
  • Evans, E & Whitfield I, Classification of unit responses in auditory cortex of the unanesthetized and unrestrained cat, Journal Physiol 171, pp 476–493, 1964.
  • Schreiner C & Urbas J, Representation of Amplitude Modulation in the Auditory Cortex of the Cat, I: The anterior field, Hear Res, 21, pp 227–241, 1988.
  • Middlebrooks J C et al, Binaural response-specific bands in primary auditory cortex of the cat: topographical organization orthogonal to isofrequency contours, Brain Res, 181, pp 31–48, 1980.
  • Nelken I, Versnel H, Responses to linear and logarithmic frequency-modulated sweeps in ferret primary auditory cortex, Eur Journal Neurosci, 12(2): pp 549–562, 2000.
  • Wang X, et al. Representation of Species-Specific Vocalizations in the Primary Auditory Cortex of Common Marmosets: Temporal and Spectral Characteristics, Journal Neurophysiol, 74, pp 2685–2706, 1995.
  • N Kowalski, et al. Analysis of dynamic spectra in ferret primary auditory cortex: 1 Characteristics of single unit responses to moving ripple spectra, Journal Neurophysiol, 76(5) pp 3503–3523, 1996.
  • Klein D J, et al, Robust spectro-temporal reverse correlation for the auditory system: Optimizing stimulus design, Journal Comput Neuroscience, 9, pp 85–111, 1999
  • R Lyon & S A Shamma (1996), Auditory Computation, volume 6 of Springer Handbook of Auditory Research, chapter: Auditory representations of timbre and pitch, Springer-Verlag, New York, Inc, pp 221–270, 1996.
  • K Wang & S A Shamma, Self-normalization and noise- robustness in early auditory representations, IEEE Trans on Speech and Audio Processing, 2(3), pp 421–435, 1994.
  • X Yang, K Wang & S A Shamma, Auditory representations of acoustic signals, IEEE Trans on Information Theory, 38(2), pp 824–839, 1992.
  • K Wang & S A Shamma, Spectral shape analysis in the central auditory system, IEEE Trans on Speech and Audio Processing, 3(5), pp 382–395, 1995.
  • T Chi, Y Gao, M C Guyton, P Ru, & SA Shamma, Spectro-temporal modulation transfer functions and speech intelligibility, The Journal of the Acoustical Society of America, 106(5), pp 2719–2732, 1999.
  • S A Shamma, Methods of Neuronal modeling, chapter: Spatial and temporal processing in the auditory system, MIT Press, second edition, pp 411–460, 1998.
  • S A Shamma, Speech Processing in the Auditory System: II. Lateral inhibition and the central processing of speech evoked activity in the auditory nerve, Journal Acoust Soc Am, 78, pp 1622–1632, 1985.
  • N Viemeister, Temporal modulation transfer functions based upon modulation thresholds, Journal Acoust Soc Am, 66(5), pp 1364–1380, 1979.
  • D Green, Frequency' and the detection of spectral shape change Auditory Frequency Selectivity, Edited by BCJ Moore and RD Patterson, Plenum Press, Cambridge, pp 351–359, 1986.
  • Goldstein J, An optimum processor theory for the central formation of pitch of complex tones, Journal Acoust Soc Am, 54, pp 1496–1516, 1973.
  • Terhardt E, Calculating Virtual Pitch, Hearing Res 1, pp 155–182, 1979.
  • F Wightman, A Pattern Transformation Model of Pitch, Journal Acoust Soc Am, 54, pp 397–406, 1973.
  • S A Shamma & Klein D, The case of the missing pitch templates: How harmonic templates emerge in the early auditory system, Journal Acoust Soc Am, 107, pp 2631–2644, 2000.
  • Licklieder J, A Duplex Theory of Pitch Perception, Experientia vol 7, pp 128–133, 1951.
  • Slaney M & Lyon R, On the importance of time—A temporal representation of sound. In M Cooke, S Beet & M Crawford (Eds) Visual Representations of Speech Signals, J Wiley and Sons, Sussex England, 1993.
  • Jeffress A, A place theory of sound localization, Journal Comp Physiol Psych, 61, pp 468–486, 1948.
  • Durlach N & Colburn S, Binaural phenomena in Handbook of Perception, Edited by E C Carterette and M P Friedman, pp 365–466, 1978.
  • J Blauert, Spatial Hearing: The Psychophysics of Human Sound Localization, Revised Edition (MIT Press, Cambridge, MA), 1997.
  • Rabiner L, Schafer R, Digital processing of speech signals (Prentice Hall, Englewood Cliffs, NJ), 1978.
  • R Drullman, J Festen & R Plomp, Effect of envelope smearing on speech perception, The Journal of the Acoustical Society of America, 95(2), pp 1053–1064, 1994.
  • T Dau, D Püschel & A Kohlrausch, A quantitative model of the “effective” signal processing in the auditory system, I Model structure, The Journal of the Acoustical Society of America, 99(6), pp 3615–3622, 1999.
  • H Hermansky & N Morgan, RASTA processing of speech. IEEE Trans on Speech and Audio Processing, 2(4), pp 578–589, 1994.
  • R Shannon, F-G Zeng, J Wygonski, V Kamath & M Ekelid, Speech Recognition with primarily temporal cues, Science, (270), pp 303–304, 1995.
  • S Greenberg, T Arai & R Silipo, Speech intelligibility derived from exceedingly sparse spectral information, Proceedings of the International Conference on Spoken Language Processing, Sydney, in press, 1998
  • T Arai, M Pavel, H Hermansky & C Avendano, Intelligibility of speech with filtered time trajectories of spectral envelopes, Proceedings of ICSLP, pp 2490–2492, 1996.
  • S Greenberg & T Arai, Speech intelligibility is highly tolerant of cross-channel spectral asynchrony, Proceedings of the Joint Meeting of the Acoustical Society of America and the International Congress on Acoustics, Seattle, pp 2677–2678, 1998.
  • K Saben & D R Perrott, Cognitive restoration of reversed speech, Nature, 398, p 760, 29 April 1999.
  • American National Standards Institute, New York. American national standard methods for calculation of the speech intelligibility index, ANSIS3.5, 1997.
  • T Houtgast & H J M Steeneken, Predicting speech intelligibility in rooms from the modulation transfer function, I General room acoustics, Acoustica, 46, pp 60–72, 1980.
  • K D Kryter, Methods for the calculation and use of the articulation index, The Journal of the Acoustical Society of America, 34(11), pp 1689–1697, 1962.
  • JS Bradley, Predictors of speech intelligibility in rooms, The Journal of the Acoustical Society of America, 80(3), pp 837–845, 1986.
  • T Houtgast & H J M Steeneken, A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria, The Journal of the Acoustical Society of America, 77(3), pp 1069–1077, 1985.
  • H J M Steeneken & T Houtgast, A physical method for measuring speech-transmission quality. The Journal of the Acoustical Society of America, 67(1), pp 318–326, 1979.
  • Sussman, E, Ceponiene, R, Shestakova, A, Naatanen, R & Winkler I, Auditory stream segregation processes operate similarly in school-aged children and adults, Hear Res, 153, pp 108–114, 2001.
  • Mounya Elhilali, Taishih Chi, & Shihab A Shamma, A Spectro-Temporal Modulation Index (STMI) for assessment of speech intelligibility, Speech Communication (in press), 2002.
  • J Baras & S Wolk, Efficient organization of large ship radar databases using wavelets and structured vector quantization, Proc of the 27th Asilomar Conference on Signals, Systems, and Computers, 1993.
  • Culling JF, Darwin CJ, Role of timbre in the segregation of simultaneous voices with intersecting F0 contours, Percept Psychophys, 54(3), pp 303–309, 1993.
  • Assmann, P F, Fundamental frequency and the intelligibility of competing voices, Proceedings of the 14th International Congress of Phonetic Sciences, pp 179–182, 1999.
  • Bashford J, Meyers M, Brubaker B & Warren R, Illusory continuity of interrupted speech: Speech rates determines durational limits, Journal Acoust Soc Am, 84(5), pp 1635–1638, 1988.
  • Brungart D S, Informational and energetic masking effects in the perception of two simultaneous talkers, Journal Acoust Soc Am, 109, pp 1101–1109, 2001.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.