220
Views
4
CrossRef citations to date
0
Altmetric
Articles

Evaluation of Lexical-Based Approaches to the Semantic Similarity of Malay Sentences

, &

References

  • Abdullah, M. T., Ahmad, F., Mahmod, R. T., & Sembok, T. M. (2003). Evaluating the effectiveness of thesaurus and stemming methods in retrieving Malay translated al-Quran documents. In T. M. T. Sembok, H. B. Zaman, H. Chen, S. R. Urs, & S.-H. Myaeng (Eds), Proceedings of the 6th International Conference on Asian Digital Libraries, (ICADL) 2003 (pp. 663–665). Berlin: Springer-Verlag.
  • Ahmad, F., Yusoff, M., & Sembok, T. M. (1996). Experiments with a stemming algorithm for Malay words. Journal of the American Society of Information Science, 47(12), 909–918.10.1002/(ISSN)1097-4571
  • Aliguliyev, R. M. (2009). A new sentence similarity measure and sentence based extractive technique for automatic text summarization. Expert Systems with Applications, 36, 7764–7772.10.1016/j.eswa.2008.11.022
  • Bollegala, D., Matsuo, Y., & Ishizuka, M. (2011). A web search engine-based approach to measure semantic similarity between Words. IEEE Transactions on Knowledge and Data Engineering, 23(7), 977–990.10.1109/TKDE.2010.172
  • Castillo, J. J., & Cardenas, M. E. (2010). Using sentence semantic similarity based on wordnet in recognizing textual entailment. In A. Kuri-Morales & G. R. Simari (Eds), Advances in Artificial Intelligence – 12th Ibero-American Conference on AI (IBERAMIA) (pp. 366V375). Berlin: Springer-Verlag.
  • Chang, M. S. (1980). The morphological analysis of Bahasa Malaysia. Proceedings of the 8th Conference on Computational Linguistics, Penang, Malaysia (pp. 578–585).
  • Cilibrasi, R., & Vitanyi, P. M. B. (2006). Similarity of objects and the meaning of words. In J.-Y. Chai, S. B. Cooper, & A. Li (Eds), Proceedings of the 3rd Conference on Theory and Applications of Models of Computation (TAMC) (pp. 21–45). Berlin: Springer-Verlag.10.1007/11750321
  • Egozi, O., Markovitch, S., & Gabrilovich, E. (2011). Concept-based information retrieval using explicit semantic analysis. ACM Transactions on Information Systems, 29(2), 1–34.10.1145/1961209
  • Kamus Dewan. (2005). Dewan Bahasa dan Pustaka. Malaysia: Kuala Lumpur.
  • Karim, N. S. (1995). Malay Grammar for Academics and Professionals. Kuala Lumpur: Dewan Bahasa dan Pustaka.
  • Karov, Y., & Edelmen, S. (1998). Similarity-based word sense disambiguation. Computational Linguistics, 24(1), 41–59.
  • Ko, Y., Park, J., & Seo, J. (2004). Improving text categorization using the importance of sentences. Information Processing and Management, 40(1), 65–79.10.1016/S0306-4573(02)00056-0
  • Kong, T. E., & Yusoff, Z. (1995). Natural language analysis in machine translation (MT) based on the string-tree correspondence grammar (STCG). Paper presented at the 10th Pacific Asia Conference on Language, Information and Computation (PACLIC10).
  • Leacock, C., & Chodorow, M. (1998). Combining local context and WordNet sense similarity for word sense identification. In C. Fellbaum (Ed.), WordNet, an Electronic Lexical Database (pp. 305–332). Boston, MA: The MIT Press.
  • Lee, M. C. (2011). A novel sentence similarity measure for semantic-based expert systems. Expert Systems with Applications, 38(5), 6392–6399.10.1016/j.eswa.2010.10.043
  • Lemaire, B., & Denhière, G. (2006). Effects of high-order co-occurrences on word semantic similarity. Current Psychology Letters, 18(1). http://cpl.revues.org/document471.html.
  • Lesk, M. E. (1986). Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. In V. DeBuys (Ed.), Proceedings of the 5th Annual International Conference On Systems Documentation (pp. 24–26). Canada: University of Toronto.10.1145/318723
  • Li, Y., McLean, D., Bandar, Z. A., O’Shea, J. D., & Crockett, K. (2006). Sentence similarity based on semantic nets and corpus statistics. IEEE Transactions on Knowledge and Data Engineering, 18(8), 1138–1150.10.1109/TKDE.2006.130
  • Liu, S., Liu, F., Yu, C., & Meng, W. (2004). An effective approach to document retrieval via utilizing WordNet and recognizing phrases. In K. Jarvelin, J. Allan, P. Bruza, & M. Sanderson (Eds), Proceedings of the 27th Annual International ACM SIGIR Conference (pp. 266–272). New York: ACM.
  • Liu, H., & Wang, P. (2013). Assessing sentence similarity using wordnet based word similarity. Journal of Software, 8(6), 1451–1458.
  • Metzler, D., Bernstein, Y., Croft, W. B., Moffat, A., & Zobel, J. (2005). Similarity measures for tracking information flow. In O. Herzog, H.-J. Schek, N. Fuhr, A. Chowdhury, & W. Teiken (Eds), Proceedings of the CIKM’05 (pp. 517–524). New York: ACM.
  • Mihalcea, R., Corley, C., & Strapparave, C. (2006). Corpus based and knowledge based measures of text semantic similarity. In A. Cohn (Ed.), Proceedings of the American Association for Artificial Intelligence (AAAI 2006) (pp. 775–780). Boston, MA: Massachusetts.
  • Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38(11), 39–41.10.1145/219717.219748
  • Moidin, A. H. (2008). Sinonim A-Z untuk Pelajar. Kuala Lumpur: IBS.
  • Noah, S. A., Amruddin, A. Y., & Omar, N. (2007). Semantic similarity measures for Malay sentences. In D. H.-L. Goh, T. H. Cao, I. Sølvberg, & E. M. Rasmussen (Eds), Proceedings of the ICADL 2007 (pp. 117–126). Berlin: Springer-Verlag.
  • Noor, N. K. M., Noah, S. A., Aziz, M. J. A., & Hamzah, M. P. (2012). Malay anaphor and antecedent candidate identification: A proposed solution. In J.-S. Pan, S.-M. Chen, & N. T. Nguyen (Eds), Proceedings of the Asia Conference on Intelligent Information and Database Systems (ACIIDS) (3) (pp. 141–151). Berlin: Springer-Verlag.10.1007/978-3-642-28493-9
  • O’Shea, K. (2012). An approach to conversational agent design using semantic sentence similarity. Applied Intelligence, 37(4), 558–568.10.1007/s10489-012-0349-9
  • Othman, A. (1993). Pengakar perkataan melayu untuk sistem capaian dokumen [Malay words stemmer for document retrieval system]. MSc Thesis. National University of Malaysia, Bangi, Malaysia.
  • Qiu, G., Bu, J., Chen, C., Huang, P., & Cai, K. (2007). Syntactic impact on sentence similarity measure in archive-based QA system. In J. Pei, V. S. Tseng, L. Cao, H. Motoda, & G. Xu (Eds), Proceedings of 11th Asia Pacific Conference on Advances in Knowledge Discovery and Data Mining (pp. 769–776). Berlin: Springer-Verlag.10.1007/978-3-540-71701-0
  • Resnik, P. (1995). Using information content to evaluate the semantic similarity. In C. S. Mellish (Ed.), Proceedings of the 14th International Joint Conference on Artificial Intelligence (pp. 448–453). San Francisco: Morgan Kaufmann.
  • Rocchio Jr., J. J. (1971). Relevance feedback in information retrieval. In G. Salton (Ed.), The Smart Retrieval Systems – Experiments in Automatic Document Processing (pp. 313–323). New Jersey: Prentice-Hall.
  • Salton, G., & Lesk, M. (1971). Computer evaluation of indexing and text processing. Journal of the ACM, 15(1), 8–36.
  • Schutze, H. (1998). Automatic word sense discrimination. Computational Linguistics, 24(1), 97–124.
  • Sparck-Jones, K. (1972). A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28(1), 11–21.10.1108/eb026526
  • Turney, P. (2001). Mining the web for synonyms: PMI-IR versus LSA on TOEFL. In L. D. Raedt & P. A. Flach (Eds), Proceedings of the 12th European Conference on Machine Learning (pp. 491–502). Berlin: Springer-Verlag.
  • Verberne, S. (2007). Paragraph retrieval for why-question answering. In W. Kraaij, A. P. de Vries, C. L. A. Clarke, N. Fuhr, & N. Kando (Eds), Proceedings of the 30th Annual International ACM SIGIR Conference (pp. 922–922). New York: ACM.
  • Verberne, S., Boves, L., Oostdijk, N., & Coppen, P.-A. (2008). Evaluating paragraph retrieval for why–QA. In C. Macdonald, I. Ounis, V. Plachouras, I. Ruthven, & R. W. White (Eds), Proceedings of the 30th European Conference on IR Research, ECIR 2008 (pp. 669–673). Berlin: Springer-Verlag.
  • Wiemer-Hastings, P. (2000). Adding syntactic information to LSA. In L. R. Gleitman & A. K. Joshi (Eds), Proceedings of the 22nd Annual Conference on Cognitive Science (pp. 989–993). San Francisco: Morgan Kaufmann.
  • Zhang, Z. Q., Gentile, A. N., & Ciravegna, F. (2012). Recent advances in methods of lexical semantic relatedness – A survey. Natural Language Engineering, 19(4), 411–479.
  • Zhao, Z., Yan, J., Fang, L., & Wang, P. (2009). Measuring semantic similarity based on WordNet. In Proceedings of the 6th Web Information Systems and Applications Conference (pp. 88–92). New Jersey: IEEE.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.