Mining a web citation database for document clustering

Pages 283-302 | Published online: 30 Nov 2010

  • Aggarwal , C. , Gates , S. and Yu , P. On the merits of building categorization systems by supervised clustering . Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . August 15-18 1999 , San Diego, CA. pp. 352 – 356 .
  • Bollacker , K. , Lawrence , S. and Giles , C. CiteSeer: an autonomous Web agent for automatic retrieval and identification of interesting publications . Proceedings of the 3rd ACM Conference on Digital Libraries . June 23-26 1998 , Pittsburgh, PA. pp. 116 – 123 .
  • Bollacker , K. , Lawrence , S. and Giles , C. 2000 . Discovering relevant scientific literature on the Web . IEEE Intelligent Systems , 15 (2) : 42 – 49 .
  • Callon , M. , Courtial , J. P. and Laville , F. 1991 . Co-word analysis as a tool for describing the network of interactions between basic and technological research: the case of polymer chemistry . Scientometrics , 22 (1) : 153 – 203 .
  • Carpenter , G. , Grossberg , S. and Rosen , D. 1991 . Fuzzy ART: fast stable learning and categorization of analog patterns by an adaptive resonance system . Neural Networks , 4 : 759 – 771 .
  • Deerwester , S. , Dumais , S. , Furnas , G. and Landauer , T. K. 1990 . Indexing by latent semantic analysis . Journal of the American Society for Information Science , 41 : 391 – 407 .
  • Digital Equipment Corporation (DEC) . 2000 . “ Virtual Paper Project ” . http://www.research.digital.com/SRC/virtualpaper/home.html
  • Fayyad , U. M. , Piatetsky-Shapiro , G. and Smyth , P. 1996 . “ From data mining to knowledge discovery: an overview ” . In Advances in Knowledge Discovery and Data Mining , Edited by: Fayyad , U. M. , Piatetsky-Shapiro , G. , Smyth , P. and Uthurusamy , R. 1 – 34 . Menlo Park, CA : AAAI/MIT Press .
  • Garfield , E. 1979 . Citation Indexing: Its Theory and Application in Science, Technology, and Humanities , New York : John Wiley & Sons .
  • Harter , S. P. 1992 . Psychological relevance and information science . Journal of the American Society for Information Science , 43 : 602 – 615 .
  • Hartigan , J. A. 1975 . Clustering Algorithms , New York : John Wiley and Sons .
  • He , Y. 2000 . “ Mining a Web Citation Database for the Retrieval of Scientific Publications over the WWW ” . Singapore : School of Computer Engineering, Nanyang Technological University . M.A.Sc. Thesis
  • Honkela , T. , Kaski , S. , Kohonen , T. and Lagus , K. 1998 . “ Self-organizing maps of very large document collections: Justification for the WEBSOM method ” . In Classification, Data Analysis, and Data Highways , Edited by: Balderjahn , I. , Mathar , R. and Schader , M. 245 – 252 . Berlin : Springer .
  • Jain , A. K. , Murty , M. N. and Flynn , P. J. 1999 . Data clustering: a review . ACM Computing Surveys , 31 (3) : 264 – 323 .
  • Kaski , S. Dimensionality reduction by random mapping: fast similarity computation for clustering . Proceedings of International Joint Conference on Neural Networks (IJCNN'98) . May 5-9 1998 , Anchorage, AK. Vol. 1 , pp. 413 – 418 .
  • Kaski , S. , Lagus , K. , Honkela , T. and Kohonen , T. 1998 . Statistical aspects of the WEBSOM system in organizing document collections . Computing Science and Statistics , 29 : 281 – 290 .
  • Kaufman , L. and Rousseeuw , P. 1990 . Finding Groups in Data: An Introduction to Cluster Analysis , New York : John Wiley and Sons .
  • Kohonen , T. 1995 . Self-Organizing Maps , Springer .
  • Kohonen , T. Self-organization of very large document collections: state of the art . Proceedings of the 8th International Conference on Artificial Neural Networks . September 2-4 1998 , Skövde, Sweden. pp. 65 – 74 .
  • Kohonen , T. , Kaski , S. , Lagus , K. , Salojärvi , J. , Paatero , V. and Saarela , A. 2000 . Self organization of a massive document collection . IEEE Transactions on Neural Networks , Special Issue on Neural Networks for Data Mining and Knowledge Discovery , 11 (3) : 574 – 585 .
  • Lin , X. 1997 . Map displays for information retrieval . Journal of the American Society for Information Science , 48 : 40 – 54 .
  • Mitchell , T. 1999 . Machine learning and data mining . Communications of the ACM , 42 (11) : 31 – 36 .
  • Pao , M. L. 1993 . Term and citation retrieval: a field study . Information Processing & Management , 29 (1) : 95 – 112 .
  • Rauber , A. and Merkl , D. SOMLib: A digital library system based on neural networks . Proceedings of the 4th ACM Conference on Digital Libraries (DL'99) . August 11-14 1999 , Berkeley, CA. pp. 240 – 241 .
  • Rocchio , J. 1966 . “ Document Retrieval Systems-Optimization and Evaluation ” . Cambridge, MA, USA : Ph.D. Diss., Harvard University .
  • Salton , G. 1991 . Developments in automatic text retrieval . Science , 253 : 974 – 979 .
  • Salton , G. and McGill , M. J. 1983 . Introduction to Modern Information Retrieval , New York : McGraw-Hill Publishing Company .
  • Saracevic , T. 1996 . “ Relevance reconsidered ” . In Information Science: Integration in Perspective , Edited by: Ingwersen , P. and Pors , N. O. 210 – 218 . Copenhagen : Royal School of Librarianship .
  • Schatz , B. and Chen , H. 1996 . Building large-scale digital libraries . IEEE Computer , 29 (5) : 22 – 26 .
  • Slonim , N. and Tishby , N. Document clustering using word clusters via the information bottleneck method . Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval . July 24-28 2000 , Athens, Greece. pp. 208 – 215 .
  • Tishby , N. , Pereira , F. C. and Bialek , W. The information bottleneck method . Proceedings of 37th Allerton Conference on Communication and Computation . September 22-24 1999 , Urbana, IL. pp. 368 – 377 .
  • Turtle , H. and Croft , W.B. 1991 . Evaluation of an inference network-based retrieval model . ACM Transactions on Information Systems , 9 (3) : 187 – 222 .
  • Van Rijsbergen , C. 1979 . Information Retrieval , 2nd ed. , London, England : Butterworths .
  • White , H. D. and Griffith , B. C. 1981 . Author co-citation: a literature measure of intellectual structure . Journal of the American Society for Information Science , 32 : 163 – 171 .
  • WordNet . 2000 . “ WordNet-A Lexical Database for English ” . http://www.cogsci.princeton.edu/wn
  • Yang , Y. and Liu , X. A re-examination of text categorization methods . Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval . August 15-19 1999 , Berkeley, CA. pp. 42 – 49 .
  • Zamir , O. and Etzioni , O. Web document clustering: a feasibility demonstration . Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval . August 24-28 1998 , Melbourne, Australia. pp. 46 – 54 .