40
Views
6
CrossRef citations to date
0
Altmetric
Research Article

Digital speech synthesis: Tutorial

&
Pages 14-25 | Published online: 12 Jul 2009

References

  • Allen, J. (1976). Synthesis of speech from unrestricted text. Proceedings of the IEEE, 64,433–442.
  • Allen, J., Hunnicutt, S., & Klatt, D. (1987). From text to speech: The MITalk system. Cambridge: Cambridge University Press.
  • Atal, B. S., & Hanauer, S. L. (1971). Speech analysis and synthe-sis by linear prediction of the speech wave. Journal of the Acoustical Society of America, 50, 637–655.
  • Brandenburg, S. A., & Vanderheiden, G. (Eds.). (1987). Commu-nication, control, and computer access for disabled and elderly individuals: Resource book 3. Boston: College-Hill Press.
  • Bristow, G. (1984). Electronic speech synthesis. New York: McGraw-Hill.
  • Bruckert, E. (1984). A new text-to-speech product produces hu-man-quality voice. Speech Technology, 2, 114–119.
  • Chandra, S., & Lin, W. C. (1977). Linear prediction with a variable analysis frame size. IEEE Transactions in Acoustics, Speech, and Signal Processing, 25, 322–330.
  • Charpentier, F. J., & Steil, M. G. (1986). Diphone synthesis using overlap-add technique for speech waveforms concatenation. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, (pp. 2015–2018). New York: IEEE.
  • Charpentier, F. J., & Moulines, E. (1988). Text-to-speech algo-rithms based on FFT synthesis. Proceedings of the Interna-tional Conference on Acoustics, Speech, and Signal Proces-sing (pp. 667–670). New York: IEEE.
  • Deller, Jr., JR., Proakis, J.G., & Hansen, J.H.L. (1993). Discreet-time processing speech signals. New York: Macmillan.
  • Dunn, H. K., & White, S. D. (1940). Statistical measurements on conversational speech. Journal of the Acoustical Society of America, 11, 278–288.
  • Dutoit, T., & Leich, H. (1993). MBR-PSOLA: Text-to-speech syn-thesis based on an MBE re-synthesis of the segments data-base. Speech Communication, 13, 435–440.
  • Edwards, A. D. N. (1991). Speech synthesis: Technology for disabled people. London: Paul Chapman.
  • Elovitz, H. S., Johnson, R., McHugh, A., & Shore, J. E. (1976). Letter-to-sound rules for automatic translation of English text to phonetics. IEEE Transactions in Acoustics, Speech, and Signal Processing, 24, 446–459.
  • Fant, C. G. M. (1960). Acoustic theory of speech production. The Hague, The Netherlands: Mouton.
  • Flanagan, J. L. (1957). Note on the design of terminal ana-log speech synthesizers. Journal of the Acoustical Society of America, 29, 306–310.
  • French, N. R., & Steinberg, J. C. (1947). Factors governing the intelligibility of speech sounds. Journal of the Acoustical Soci-ety of America, 19, 90–119.
  • Green, B. G., Logan, J. S., & Pisoni, D. B. (1986). Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems. Behavior Research Methods, Instruments and Computers, 18, 100–107.
  • Hunnicutt, S. (1980). Grapheme-to-phoneme rules: A review. Speech Transmission Laboratory: Quarterly Progress and Status Report. 2–3, 38–60.
  • Hollingum, J., & Cassford, G.(1988). Speech technology at work. London: IFS Publications.
  • Jayant, N. S., & Noll, P. (1984). Digital coding of waveforms. Englewood Cliffs, NJ: Prentice-Hall.
  • Kent, R. D., & Read, C. (1992). The acoustic analysis of speech. San Diego: Singular Publishing.
  • Klatt, D. H. (1980). Software for a cascade/parallel formant syn-thesizer. Journal of the Acoustical Society of America, 67, 971–975.
  • Klatt, D. H. (1987). Review of text-to-speech conversion of English. Journal of the Acoustical Society of America, 82, 737–793.
  • Levinson, S. E., Olive, J. P., & Tschirgi, J. S. (1993). Speech synthesis in telecommunications. IEEE Communications Magazine, 31, 46–53.
  • Liberman, A., Cooper, F., Shankweiler, D., & Studdert-Kennedy, M. (1967). Perception of speech code. Psychological Review, 74,431–461.
  • Linggard, R. (1985). Electronic synthesis of speech. Cambridge: Cambridge University Press.
  • Meyer, P., Wilhelms, R., & Strube, H. W. (1989). A quasiarticula-tory speech synthesizer for German language running in real time. Journal of the Acoustical Society of America, 86, 523–539.
  • Mirenda, P., & Beukelman, D. R. (1990). A comparison of intelli-gibility among natural speech and seven speech synthesizers with listeners from three age groups. Augmentative and Alter-native Communication, 6,61–68.
  • Morgan, N. (1984). Talking chips. New York: McGraw-Hill. Moulines, E., & Charpentier, F. (1990). Pitch-synchronous wave- form processing techniques for text-to-speech synthesis using diphones. Speech Communication, 9,453–467.
  • O'Shaughnessy, D. (1987). Speech communication: Human and machine. Reading, MA: Addison-Wesley.
  • O'Shaughnessy, D., Barbeau, L., Bernardi, D., & Archambault, D. (1988). Diphone speech synthesis. Speech Communica-tion, 7,55–65.
  • Parsons, T. (1987). Voice and speech processing. New York: McGraw-Hill.
  • Rahim, M. G., Goodyear, C., Kleijn, W. B., Schroeter, J., & Sondhi, M. (1993). On the use of neural networks in articula-tory speech synthesis. Journal of the Acoustical Society of America, 93,1109–1121.
  • Schwartz, R., Klovstad, J., Makhoul, J., Klatt, D., & Zue, V. (1979). Diphone synthesis for phonetic vocoding. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (pp. 891–894). New York: IEEE.
  • Sondhi, M. M., & Schroeter, J. (1987). A hybrid time-frequency domain articulatory speech synthesizer. IEEE Transactions in Acoustics, Speech, and Signal Processing, 35,955–967.
  • Street Electronics Corporation. (1986). Echo Ilb user manual. Carpintaria, CA: Author.
  • Uvarov, E. B., Chapman, D. R., & Isaacs, A. (1964). A dictionary of science. London: Penguin Books.
  • Witten, I. H. (1982). Principles of computer speech. London: Academic Press.
  • Yannakoudakis, E. J., & Hutton, P. J., (1987). Speech synthesis and recognition systems. Chichester, England: Ellis Norwood.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.