Search in:

Advanced search

IETE Journal of Research Volume 68, 2022 - Issue 4

Submit an article Journal homepage

Views

CrossRef citations to date

Altmetric

Articles

VEP Detection for Read, Extempore and Conversation Speech

Kumud TripathiDepartment of Computer Science and Engineering, Indian Institute of Technology Kharagpur, IndiaCorrespondence[email protected]

https://orcid.org/0000-0003-4198-7430 View further author information

K. Sreenivasa RaoDepartment of Computer Science and Engineering, Indian Institute of Technology Kharagpur, IndiaView further author information

Pages 2652-2660 | Published online: 02 Mar 2020

Cite this article
https://doi.org/10.1080/03772063.2020.1724835
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions

References

K. E. Manjunath, and K. S. Rao, “Source and system features for phone recognition,” Int. J. Speech Technol., Vol. 18, no. 2, pp. 257–70, 2015. doi: 10.1007/s10772-014-9266-0
Web of Science ®Google Scholar
R. Pradeep, and K. S. Rao, “Deep neural networks for Kannada phoneme recognition,” in Proceedings of Ninth International Conference on Contemporary Computing (IC3), Noida, India, 2016, pp. 1–6.
Google Scholar
S. Scanzio, P. Laface, L. Fissore, R. Gemello, and F. Mana, “On the use of a multilingual neural network front-end,” in Proceedings of Interspeech, Brisbane, Australia, 2008, pp. 2711–14.
Google Scholar
K. Tripathi, and K. S. Rao, “Improvement of phone recognition accuracy using speech mode classification,” Int. J. Speech Technol., Vol. 21, no. 3, pp. 489–500, 2018. doi: 10.1007/s10772-017-9483-4
Web of Science ®Google Scholar
K. Tripathi, and K. S. Rao, “Analysis of sparse representation based feature on speech mode classification,” in Proceedings of Interspeech, Hyderabad, India, 2018, pp. 731–5.
Google Scholar
S. M. Prasanna, B. S. Reddy, and P. Krishnamoorthy, “Vowel onset point detection using source, spectral peaks, and modulation spectrum energies,” IEEE Tran Audio Speech Lang Process, Vol. 17, no. 4, pp. 556–65, 2009. doi: 10.1109/TASL.2008.2010884
Web of Science ®Google Scholar
A. K. Vuppala, J. Yadav, S. Chakrabarti, and K. S. Rao, “Vowel onset point detection for low bit rate coded speech,” IEEE Tran Audio Speech Lang Process, Vol. 20, no. 6, pp. 1894–903, 2012. doi: 10.1109/TASL.2012.2191284
Web of Science ®Google Scholar
G. Pradhan, and S. M. Prasanna, “Speaker verification by vowel and nonvowel like segmentation,” IEEE Tran Audio Speech Lang Process, Vol. 21, no. 4, pp. 854–67, 2013. doi: 10.1109/TASL.2013.2238529
Web of Science ®Google Scholar
A. Kumar, S. Shahnawazuddin, and G. Pradhan, “Non-local estimation of speech signal for vowel onset point detection in varied environments,” in Proceedings of Interspeech, Stockholm, Sweden, 2017, pp. 429–33.
Google Scholar
D. Povey, et al., “The subspace Gaussian mixture modela structured model for speech recognition,” Comput. Speech. Lang., Vol. 25, no. 2, pp. 404–39, 2011. doi: 10.1016/j.csl.2010.06.003
Web of Science ®Google Scholar
A. Kumar, S. Shahnawazuddin, and G. Pradhan, “Exploring different acoustic modeling techniques for the detection of vowels in speech signal,” in Proceedings of Twenty Second National Conference on Communication (NCC), Guwahati, India, 2016, pp. 1–5.
Google Scholar
R. Thirumuru, S. V. Gangashetty, and A. K. Vuppala, “Improvements in the detection of vowel onset and offset points in a speech sequence,” Circuits Syst. Signal Process., Vol. 36, no. 6, pp. 2315–40, 2017. doi: 10.1007/s00034-016-0409-1
Web of Science ®Google Scholar
S. V. Gangashetty, C. C. Sekhar, and B. Yegnanarayana, “Detection of vowel on set points in continuous speech using autoassociative neural network models,” in Proceedings of Interspeech, Jeju Island, Korea, 2004, pp. 1081–84.
Google Scholar
S. V. Gangashetty, C. C. Sekhar, and B. Yegnanarayana, “Extraction of fixed dimension patterns from varying duration segments of consonant-vowel utterances,” in Proceedings of International Conference on Intelligent Sensing and Information Processing, Chennai, India, 2004, pp. 159–64.
Google Scholar
J.-F. Wang, C.-H. Wu, S.-H. Chang, and J.-Y. L. Lee, “A hierarchical neural network model based on a c/v segmentation algorithm for isolated mandarin speech recognition,” IEEE Trans. Signal Process., Vol. 39, no. 9, pp. 2141–6, 1991. doi: 10.1109/78.134458
Web of Science ®Google Scholar
K. Tripathi, and K. Sreenivasa Rao. “VOP Detection for Read and Conversation Speech using CWT Coefficients and Phone Boundaries,” arXiv e-prints, p. arXiv:1908.08668, Aug 2019.
Google Scholar
S. Furui, “On the role of spectral transition for speech perception,” J. Acoust. Soc. Am., Vol. 80, no. 4, pp. 1016–25, 1986. doi: 10.1121/1.393842
PubMed Web of Science ®Google Scholar
J. Yadav, and K. S. Rao, “Detection of vowel offset point from speech signal,” IEEE Signal Process Lett., Vol. 20, no. 4, pp. 299–302, 2013. doi: 10.1109/LSP.2013.2245647
Web of Science ®Google Scholar
R. Thirumuru, S. V. Gangashetty, and A. K. Vuppala, “Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points,” Multimed. Tools. Appl., Vol. 77, no. 4, pp. 4753–67, 2018. doi: 10.1007/s11042-017-5044-8
Web of Science ®Google Scholar
J. S. Garofalo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, “The DARPA TIMIT acoustic-phonetic continuous speech corpus cdrom,” Ling Data Consortium, 1993.
Google Scholar
S. B. Sunil Kumar, K. S. Rao, and D. Pati, “Phonetic and prosodically rich transcribed speech corpus in Indian languages: Bengali and Odia,” in Proceedings of International Conference Oriental COCOSDA held jointly with Conference on Asian Spoken Language Research and Evaluation, Gurgaon, India, 2013, pp. 1–5.
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

VEP Detection for Read, Extempore and Conversation Speech

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

VEP Detection for Read, Extempore and Conversation Speech

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date