139
Views
2
CrossRef citations to date
0
Altmetric
Articles

An ontology learning based approach for focused web crawling using combined normalized pointwise mutual information and Resnik algorithm

ORCID Icon & ORCID Icon
Pages 1123-1129 | Received 25 Jun 2019, Accepted 18 Oct 2019, Published online: 30 Oct 2019

References

  • Chakrabarti S, van den Berg M, Dom B. Focused crawling: a new approach to top-specific web source discovery. Comput Networks. 1999;31(11–16):1623–1640. doi: 10.1016/S1389-1286(99)00052-3
  • Salton G, Wong A, Yang C. Information retrieval and language processing: a vector space model for automatic indexing. Commun ACM. 1975;18(11):613–620. doi: 10.1145/361219.361220
  • Liu Z, Du Y, Zhao Y. Focused crawler based on domain ontology and FCA. J Inf Comput Sci. 2011;8(10):1909–1917.
  • Hliaoutakis A, Varelas G, Voutsakis E, et al. Information retrieval by semantic similarity. Int J Semant Web Inf Syst. 2011;2(3):55–73. doi: 10.4018/jswis.2006070104
  • Voutsakis E. Semantic similarity methods in WordNet and their. WIDM '05 Proceedings of the 7th annual ACM international workshop on Web information and data management. 2005;1:10–16.
  • Geng Z, Shang D, Zhu Q, et al. Research on improved focused crawler and its application in food safety public opinion analysis. Chinese Autom Congr. 2017:2847–2852.
  • Hassan T, Cruz C, Bertaux A. Predictive and evolutive cross-referencing for web textual sources. Proc Comput Conf. 2018;2018-Janua(July):1114–1122.
  • Menczer F, Menczer F, Pant G, et al. Topical web crawlers: evaluating adaptive algorithms. ACM Trans Internet Technol 2003;V(February):38.
  • Park JR, Yang C, Tosaka Y, et al. Developing an automatic crawling system for populating a digital repository of professional development resources: a pilot study. J Electron Resour Librariansh. 2016;28(2):63–72. doi: 10.1080/1941126X.2016.1164549
  • Agre GH, Mahajan NV. Keyword focused web crawler. 2nd Int Conf Electron Commun Syst ICECS. 2015:1089–1092.
  • Liu H, Janssen J, Milios E. Using HMM to learn user browsing patterns for focused web crawling. Data Knowl Eng 2006;59(2):270–291. doi: 10.1016/j.datak.2006.01.012
  • Zheng HT, Kang BY, Kim HG. An ontology-based approach to learnable focused crawling. Inf Sci (Ny). 2008;178(23):4512–4522. doi: 10.1016/j.ins.2008.07.030
  • Chen Z, Ma J, Lei J, et al. A cross-language focused crawling algorithm based on multiple relevance prediction strategies. Comput Math with Appl. 2009;57(6):1057–1072. doi: 10.1016/j.camwa.2008.09.021
  • Diligenti M, Coetzee F, Lawrence S, et al. Focused crawling using context graphs. Proc 26th … . 2000:527–534.
  • Pant G, Srinivasan P. Link contexts in classifier-guided topical crawlers. IEEE Trans Knowl Data Eng 2006;18(1):107–122. doi: 10.1109/TKDE.2006.12
  • Du Y, Pen Q, Gao Z. Data & knowledge engineering a topic-specific crawling strategy based on semantics similarity. Datak. 2013;88:75–93.
  • Bouma G. Normalized (pointwise) mutual information in collocation extraction. Proc Bienn GSCL Conf. 2009;31–40.
  • Resnik P. August 20 - 25, 1995. Using information content to evaluate semantic similarity in a taxonomy, IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.