61
Views
2
CrossRef citations to date
0
Altmetric
Articles

VEP Detection for Read, Extempore and Conversation Speech

ORCID Icon &

References

  • K. E. Manjunath, and K. S. Rao, “Source and system features for phone recognition,” Int. J. Speech Technol., Vol. 18, no. 2, pp. 257–70, 2015. doi: 10.1007/s10772-014-9266-0
  • R. Pradeep, and K. S. Rao, “Deep neural networks for Kannada phoneme recognition,” in Proceedings of Ninth International Conference on Contemporary Computing (IC3), Noida, India, 2016, pp. 1–6.
  • S. Scanzio, P. Laface, L. Fissore, R. Gemello, and F. Mana, “On the use of a multilingual neural network front-end,” in Proceedings of Interspeech, Brisbane, Australia, 2008, pp. 2711–14.
  • K. Tripathi, and K. S. Rao, “Improvement of phone recognition accuracy using speech mode classification,” Int. J. Speech Technol., Vol. 21, no. 3, pp. 489–500, 2018. doi: 10.1007/s10772-017-9483-4
  • K. Tripathi, and K. S. Rao, “Analysis of sparse representation based feature on speech mode classification,” in Proceedings of Interspeech, Hyderabad, India, 2018, pp. 731–5.
  • S. M. Prasanna, B. S. Reddy, and P. Krishnamoorthy, “Vowel onset point detection using source, spectral peaks, and modulation spectrum energies,” IEEE Tran Audio Speech Lang Process, Vol. 17, no. 4, pp. 556–65, 2009. doi: 10.1109/TASL.2008.2010884
  • A. K. Vuppala, J. Yadav, S. Chakrabarti, and K. S. Rao, “Vowel onset point detection for low bit rate coded speech,” IEEE Tran Audio Speech Lang Process, Vol. 20, no. 6, pp. 1894–903, 2012. doi: 10.1109/TASL.2012.2191284
  • G. Pradhan, and S. M. Prasanna, “Speaker verification by vowel and nonvowel like segmentation,” IEEE Tran Audio Speech Lang Process, Vol. 21, no. 4, pp. 854–67, 2013. doi: 10.1109/TASL.2013.2238529
  • A. Kumar, S. Shahnawazuddin, and G. Pradhan, “Non-local estimation of speech signal for vowel onset point detection in varied environments,” in Proceedings of Interspeech, Stockholm, Sweden, 2017, pp. 429–33.
  • D. Povey, et al., “The subspace Gaussian mixture modela structured model for speech recognition,” Comput. Speech. Lang., Vol. 25, no. 2, pp. 404–39, 2011. doi: 10.1016/j.csl.2010.06.003
  • A. Kumar, S. Shahnawazuddin, and G. Pradhan, “Exploring different acoustic modeling techniques for the detection of vowels in speech signal,” in Proceedings of Twenty Second National Conference on Communication (NCC), Guwahati, India, 2016, pp. 1–5.
  • R. Thirumuru, S. V. Gangashetty, and A. K. Vuppala, “Improvements in the detection of vowel onset and offset points in a speech sequence,” Circuits Syst. Signal Process., Vol. 36, no. 6, pp. 2315–40, 2017. doi: 10.1007/s00034-016-0409-1
  • S. V. Gangashetty, C. C. Sekhar, and B. Yegnanarayana, “Detection of vowel on set points in continuous speech using autoassociative neural network models,” in Proceedings of Interspeech, Jeju Island, Korea, 2004, pp. 1081–84.
  • S. V. Gangashetty, C. C. Sekhar, and B. Yegnanarayana, “Extraction of fixed dimension patterns from varying duration segments of consonant-vowel utterances,” in Proceedings of International Conference on Intelligent Sensing and Information Processing, Chennai, India, 2004, pp. 159–64.
  • J.-F. Wang, C.-H. Wu, S.-H. Chang, and J.-Y. L. Lee, “A hierarchical neural network model based on a c/v segmentation algorithm for isolated mandarin speech recognition,” IEEE Trans. Signal Process., Vol. 39, no. 9, pp. 2141–6, 1991. doi: 10.1109/78.134458
  • K. Tripathi, and K. Sreenivasa Rao. “VOP Detection for Read and Conversation Speech using CWT Coefficients and Phone Boundaries,” arXiv e-prints, p. arXiv:1908.08668, Aug 2019.
  • S. Furui, “On the role of spectral transition for speech perception,” J. Acoust. Soc. Am., Vol. 80, no. 4, pp. 1016–25, 1986. doi: 10.1121/1.393842
  • J. Yadav, and K. S. Rao, “Detection of vowel offset point from speech signal,” IEEE Signal Process Lett., Vol. 20, no. 4, pp. 299–302, 2013. doi: 10.1109/LSP.2013.2245647
  • R. Thirumuru, S. V. Gangashetty, and A. K. Vuppala, “Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points,” Multimed. Tools. Appl., Vol. 77, no. 4, pp. 4753–67, 2018. doi: 10.1007/s11042-017-5044-8
  • J. S. Garofalo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, “The DARPA TIMIT acoustic-phonetic continuous speech corpus cdrom,” Ling Data Consortium, 1993.
  • S. B. Sunil Kumar, K. S. Rao, and D. Pati, “Phonetic and prosodically rich transcribed speech corpus in Indian languages: Bengali and Odia,” in Proceedings of International Conference Oriental COCOSDA held jointly with Conference on Asian Spoken Language Research and Evaluation, Gurgaon, India, 2013, pp. 1–5.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.