960
Views
165
CrossRef citations to date
0
Altmetric
Research Article

Estimating dysphonia severity in continuous speech: Application of a multi-parameter spectral/cepstral model

, &
Pages 825-841 | Received 21 May 2009, Accepted 06 Aug 2009, Published online: 05 Nov 2009

References

  • Awan, S. N., & Roy, N. (2005). Acoustic prediction of voice type in adult females with functional dysphonia. Journal of Voice, 19, 268–282.
  • Awan, S. N., & Roy, N. (2006). Toward the development of an objective index of dysphonia severity: a four-factor model. Clinical Linguistics & Phonetics, 20, 35–49.
  • Awan, S. N., & Roy, N. (2009). Outcomes measurement in voice disorders: application of an acoustic index of dysphonia severity. Journal of Speech, Language, and Hearing Research, 52, 482–499.
  • Baken, R. J. (1987). Clinical Measurement of Speech and Voice. Boston, MA: Little, Brown and Co.
  • Bielamowicz, S., Kreiman, J., Gerratt, B. R., Dauer, M. S., & Berke, G. S. (1996). Comparison of voice analysis systems for perturbation measurement. Journal of Speech and Hearing Research, 39, 126–134.
  • Bridger, M. M., & Epstein, R. (1983). Functional voice disorders: a review of 109 patients. Journal of Laryngology and Otology, 97, 1145–1148.
  • Callan, D. E., Kent, R. D., Roy, N., & Tasko, S. M. (1999). Self-organizing maps for the classification of normal and disordered female voices. Journal of Speech and Hearing Research, 42, 355–366.
  • Coyle, S. M., Weinrich, B. D., & Stemple, J. C. (2001). Shifts in relative prevalence of laryngeal pathology in a treatment-seeking population. Journal of Voice, 15, 424–440.
  • CSL Computerized Speech Lab [Computer program]. (1994). Pine Brook, NJ: Kay Elemetrics.
  • de Krom, G. (1994). Consistency and reliability of voice quality ratings for different types of speech fragments. Journal of Speech and Hearing Research, 37, 985–1000.
  • de Krom, G. (1995). Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. Journal of Speech and Hearing Research, 38, 794–811.
  • Fairbanks, G. (1960). Voice and Articulation Drillbook, 2nd Ed. New York: Harper & Row.
  • Field, A. P. (2005). Discovering Statistics Using SPSS. London: SAGE Publication.
  • Halberstam, B. (2004). Acoustic and perceptual parameters relating to connected speech are more reliable measures of hoarseness than parameters relating to sustained vowels. ORL, 60, 70–73.
  • Hall, K., & Yairi, E. (1992). Fundamental frequency, jitter, and shimmer in preschoolers who stutter. Journal of Speech, Language, and Hearing Research, 35, 1002–1008.
  • Hammarberg, B., Fritzell, B., Gauffin, J., Sundberg, J., & Wedin, L. (1980). Perceptual and acoustic correlates of abnormal voice qualities. Acta Otolaryngologica, 90, 441–451.
  • Hartelius, L., Buder, E. H., & Strand, E. A. (1997). Long-term phonatory instability in individuals with multiple sclerosis. Journal of Speech, Language, and Hearing Research, 40, 1056–1072.
  • Hartl, D., Hans, S., Vaissiere, J., Riquet, M., & Brasnu, D. (2001). Objective voice quality analysis before and after onset of unilateral vocal fold paralysis. Journal of Voice, 15, 351–361.
  • Heman-Ackah, Y., Heuer, R. J., Michael, D. D., Ostrowski, R., Horman, M., Baroody, M. M., Hillenbrand, J., & Sataloff, R. T. (2003). Cepstral peak prominence: a more reliable measure of dysphonia. Annals of Otology, Rhinology, and Laryngology, 112, 324–333.
  • Heman-Ackah, Y. D., Michael, D. D., & Goding, G. S. (2002). The relationship between cepstral peak prominence and selected parameters of dysphonia. Journal of Voice, 16, 20–27.
  • Hillenbrand, J. (1987). A methodological study of perturbation and additive noise in synthetically generated voice signals. Journal of Speech and Hearing Research, 30, 448–461.
  • Hillenbrand, J., & Houde, R. A. (1996). Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. Journal of Speech, Language, and Hearing Research, 39, 298–310.
  • Hillenbrand, J., Cleveland, R. A., & Erickson, R. L. (1994). Acoustic correlates of breathy vocal quality. Journal of Speech and Hearing Research, 37, 769–778.
  • Klingholz, F. (1990) Acoustic recognition of voice disorders: a comparative study of running speech versus sustained vowels. Journal of the Acoustic Society of America, 87, 2218–2224.
  • Koufman, J. A., & Blalock, P. D. (1991). Functional voice disorders. Otolaryngology Clinics of North America, 4, 1059–1073.
  • Kreiman, J., Gerratt, B., Kempster, G. B., Erman, A., & Berke, G. S. (1993). Perceptual evaluation of voice quality: review, tutorial, and a framework for future research. Journal of Speech and Hearing Research, 36, 21–40.
  • Laflen, J. B., Lazarus, C. L., and Amin, M. R. (2008). Pitch deviation analysis of pathological voice in connected speech. Annals of Otology, Rhinology and Laryngology, 117, 90–97.
  • Martin, D., Fitch, J., & Wolfe, V. (1995). Pathologic voice type and the acoustic prediction of severity. Journal of Speech and Hearing Research, 38, 765–771.
  • Maryn, Y., Corthals, P., Van Cauwenberge, P., Roy, N., & De Bodt, M. ( in press). Toward improved ecological validity in the acoustic measurement of overall voice quality: Combining continuous speech and sustained vowels. Journal of Voice.
  • MATLAB [Computer Program]. (1994). Natic, MA: The Mathworks, Inc.
  • McGraw, K., & Wong, S. (1996). Forming inferences about some intraclass correlation coefficients. Psychological Methods, 1, 30–46.
  • Noll, A. M. (1964). Short-term spectrum and ‘cepstrum’ techniques for vocal pitch detection. Journal of the Acoustic Society of America, 41, 293–309.
  • Orlikoff, R. F., Dejonckere, P. H., Dembowski, J., Fitch, J., Gelfer, M. P., Gerratt, B. R., Haskell, J. A., Kreiman, J., Metz, D. E., Schiavetti, N., Watson, B. C., & Wolfe, V. (1999). The perceived role of voice perception in clinical practice. Phonoscope, 2, 89–104.
  • Parsa, V., & Jamieson, D. G. (2001). Acoustic discrimination of pathological voice: Sustained vowels versus continuous speech. Journal of Speech, Language, and Hearing Research, 44, 327–339.
  • Qi, Y., Hillman, R. E., and Milstein, C. (1999). The estimation of signal to noise ratio in continuous speech for disordered voices. Journal of the Acoustic Society of America, 105, 2532–2535.
  • Rabinov, C. R., Kreiman, J., Gerratt, B., & Bielamowicz, S. (1995). Comparing reliability of perceptual ratings of roughness and acoustic measures of jitter. Journal of Speech and Hearing Research, 38, 26–32.
  • Roy, N., & Bless, D. M. (1998). Manual circumlaryngeal techniques in the assessment and treatment of voice disorders. Current Opinion in Otolaryngology Head and Neck Surgery, 6, 151–155.
  • Roy, N., Gouse, M., Mauszycki, S. C., Merrill, R. M., & Smith, M. E. (2005). Task specificity in adductor spasmodic dysphonia versus muscle tension dysphonia. Laryngoscope, 115, 311–316.
  • Roy, N., Merrill, R. M., Gray, S. D., & Smith, E. M. (2005). Voice disorders in the general population: prevalence, risk factors, and occupational impact. Laryngoscope, 115, 1988–1995.
  • Sama, A., Carding, P. N., & Price, S. (2001). The clinical features of functional dysphonia. Laryngoscope, 111, 458–463.
  • Schalen, L., & Andersson, K. (1992). Differential diagnosis and treatment of psychogenic voice disorder. Clinical Otolaryngology, 17, 225–230.
  • Sheskin, D. (2004). Handbook of Parametric and Nonparametric Statistical Procedures. 3rd Ed. Boca Raton: CRC Press.
  • SigmaPlot 10.0 for Windows [Computer program]. (2006). San Jose, CA: Systat Software, Inc.
  • SPSS 15.0 for Windows [Computer program]. (2006). Chicago, IL: SPSS, Inc.
  • Stevens, J. P. (1992). Applied Multivariate Statistics for the Social Sciences. 2nd Ed. Hillsdale, NJ: Erlbaum.
  • Takahashi, H., & Koike, Y. (1975). Some perceptual dimensions and acoustical correlates of pathologic voices. Acta Otolaryngologica (Stockholm), 338, 1–24.
  • Wolfe, V., & Martin, D. (1997). Acoustic correlates of dysphonia: type and severity. Journal of Communication Disorders, 30, 403–416.
  • Wolfe, V., & Steinfatt, T. M. (1987). Prediction of vocal severity within and across voice types. Journal of Speech and Hearing Research, 30, 230–240.
  • Wolfe, V. I., Martin, D. P., & Palmer, C. I. (2000). Perception of dysphonic voice quality by naïve listeners. Journal of Speech and Hearing Research, 43, 697–705.
  • Yiu, E., Worrall, L., Longland, J., & Mitchell, C. (2000). Analysing vocal quality of connected speech using Kay’s computerized speech lab: a preliminary finding. Clinical Linguistics & Phonetics, 14, 295–305.
  • Yumoto, E., Gould, W. J., & Baer, T. (1982). Harmonics-to-noise ratio as an index of the degree of hoarseness. Journal of Acoustical Society of America, 71, 1544–1550.
  • Zhang, Y., & Jiang, J. J. (2008). Acoustic analyses of sustained and running voices from patients with laryngeal pathologies. Journal of Voice, 22, 1–9.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.