Special Issue Papers

A segmentation method for web page analysis using shrinking and dividing

Pages 93-104 | Received 01 Aug 2008, Accepted 25 Aug 2008, Published online: 19 Mar 2010

References

  • Bar-Yossef, Z. and Rajagopalan, S. 2002. “Template detection via data mining and its applications”. In Proceedings of the 11th International Conference on World Wide Web (WWW).
  • Cai, D., He, X., Ma, W.-Y., Wen, J.-R. and Zhang, H. 2004. “Organizing WWW images based on the analysis of page layout and web link structure”. In IEEE International Conference on Multimedia and Expo (ICME).
  • Cai, D., Yu, S., Wen, J. and Ma, W.-Y. 2003. VIPS: A vision-based page segmentation algorithm. Tech. Rep. MSR-TR-2003-79.
  • Canny, J. 1986. A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell., 8(6): 679–698.
  • Feng, H.M., Liu, B., Liu, Y.M., Fang, Y. and Song, G. 2005. Framework of web page analysis and content extraction with coordinate trees. J Tsinghua Univ (Sci & Tech), 45(S1): 1767–1771.
  • Finn, A., Kushmerick, N. and Smyth, B. 2001. “Fact or fiction: content classification for digital libraries”. In Joint DELOS-NSF Workshop on Personalisation and Recommender Systems in Digital Libraries, Dublin.
  • Fu, A.Y., Wenyin, L. and Deng, X. 2006. Detecting phishing web pages with visual similarity assessment based on earth mover's distance (EMD). IEEE Trans. Dependable Security Comput., 3(4): 301–311.
  • Gupta, S., Kaiser, G., Neistadt, D. and Grimm, P. 2003. “DOM based content extraction of HTML documents”. In Proceedings of the 12th World Wide Web Conference (WWW), Budapest, Hungary.
  • Jain, A.K. and Bhattacharjee, S. 1992. Text segmentation using Gabor filters for automatic document processing. Mach. Vision Appl., 5(3): 169–184.
  • Keith, V.N. and Carsten, F. 2002. “Applying gestalt principles to animated visualizations of network data”. In Proceedings of the Sixth International Conference on Information Visualisation (IV’02), IEEE Computer Society Press.
  • Lin, S.-H. and Ho, J.-M. 2002. “Discovering information content blocks from web documents”. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (SIGKDD’02).
  • Liu, W., Guanglin, H., Liu, X., Zhang, M. and Deng, X. 2005. “Phishing web page detection”. In Proceedings of the Eighth International Conference on Document Analysis and Recognition.
  • Liu, W., Deng, X., Huang, G. and Fu, A.Y. 2006. An anti-phishing strategy based on visual similarity assessment. IEEE Internet Comput., 10(2): 58–65.
  • Nagy, G., Seth, S. and Stoddard, S.D. 1986. “Document analysis with an expert system”. In Pattern Recognition Practice, North-Holland: Elsevier Science Publishers.
  • Wenyin, L., Huang, G., Xiaoyue, L., Min, Z. and Deng, X. 2005. “Detection of phishing web pages based on visual similarity”. In Proceedings of the 14th International World Wide Web Conference.
