320
Views
11
CrossRef citations to date
0
Altmetric
Web based cooperation and collaboration

A text categorisation tool for open source communities based on semantic analysis

, , &
Pages 532-544 | Received 17 Sep 2009, Accepted 05 Sep 2011, Published online: 26 Oct 2011

References

  • Abedin , B. and Sohrabi , B. 2009 . Graph theory application and web page ranking for website link structure improvement . Behaviour & Information Technology , 28 ( 1 ) : 63 – 72 . (doi:10.1080/01449290701840948)
  • Ardichvili , A. , Page , V. and Wentling , T. 2003 . Motivation and barriers to participation in virtual knowledge-sharing communities of practice . Journal of Knowledge Management , 7 ( 1 ) : 64 – 77 . (doi:10.1108/13673270310463626)
  • Barrero , F. , Toral , S. L. and Gallardo , S. 2008 . EDSPLAB: remote laboratory for experiments on DSP applications . Internet Research , 18 ( 1 ) : 79 – 92 . (doi:10.1108/10662240810849603)
  • Blei , D. M. , Ng , A. Y. and Jordan , M. I. 2003 . Latent Dirichlet allocation . Journal of Machine Learning Research , 3 : 993 – 1022 .
  • Cai , D. , He , X. and Han , J. 2005 . Document clustering using locality preserving indexing . IEEE Transactions on Knowledge and Data Engineering , 17 ( 12 ) : 1624 – 1637 . (doi:10.1109/TKDE.2005.198)
  • Cho , H. 2005 . Development of computer-supported collaborative social networks in a distributed learning community . Behaviour & Information Technology , 24 ( 6 ) : 435 – 447 . (doi:10.1080/01449290500044049)
  • Deerwester , S. 1990 . Indexing by latent semantic analysis . Journal of the American Society for Information Science and Technology , 41 ( 6 ) : 391 – 407 . (doi:10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9)
  • Diesner , J. and Carley , K. M. 2008 . Conditional random fields for entity extraction and ontological text coding . Journal of Computational and Mathematical Organization Theory , 14 : 248 – 262 . (doi:10.1007/s10588-008-9029-z)
  • Griffiths , T. L. and Steyvers , M. Finding scientific topics . Proceedings of the National Academy of Sciences . Vol. 101 , pp. 5228 – 5235 .
  • Hair , J.F. Jr. 1995 . Multivariate data analysis with readings , London : Prentice Hall International .
  • Harrer , A. Combining social network analysis with semantic relations to support the evolution of a scientific community . Mice, minds, and society – The Computer Supported Collaborative Learning (CSCL) Conference 2007 . July 16–21 , New Brunswick . Edited by: Chinn , C. , Erkens , G. and Puntambekar , S. pp. 267 – 276 . NJ. Hong Kong : International Society of the Learning Sciences .
  • Hemetsberger , A. and Reinhardt , C. 2006 . Learning and knowledge-building in open-source communities: a social-experiential approach . Management Learning , 37 ( 2 ) : 187 – 214 . (doi:10.1177/1350507606063442)
  • Hertel , G. , Niedner , S. and Herrmann , S. 2003 . Motivation of software developers in Open Source projects: an Internet-based survey of contributors to the Linux kernel . Research Policy , 32 : 1159 – 1177 . (doi:10.1016/S0048-7333(03)00047-7)
  • Hew , K. F. 2009 . Determinants of success for online communities: an analysis of three communities in terms of members’ perceived professional development . Behavior and Information Technology , 28 ( 5 ) : 433 – 445 . (doi:10.1080/01449290802005995)
  • Hildreth , P. , Kimble , C. and Wright , P. 2000 . Communities of practice in the distributed international environment . Journal of Knowledge Management , 4 ( 1 ) : 27 – 38 . (doi:10.1108/13673270010315920)
  • Hildreth , P. M. and Kimble , C. 2002 . The duality of knowledge . Information Research , 8 ( 1 ) : 1 – 17 .
  • Hofmann , T. 2001 . Unsupervised learning by probabilistic latent semantic analysis . Machine Learning Journal , 42 ( 1 ) : 177 – 196 . (doi:10.1023/A:1007617005950)
  • Kankanhalli , A. 2003 . The role of IT in successful knowledge management initiatives . Communications of the ACM , 46 ( 9 ) : 69 – 73 . (doi:10.1145/903893.903896)
  • Klamma , R. Pattern-based cross media social network analysis for technology enhanced learning in Europe . Innovative approaches to learning and knowledge sharing, Proceedings of the 1st European Conference on Technology Enhanced Learning (EC-TEL 2006) . October 1–3 , Hersonissou , Greece. Edited by: Nejdl , W. and Tochtermann , K. pp. 242 – 256 . Berlin : Springer-Verlag . LNCS 4227
  • Kogut , B. and Metiu , A. 2001 . Open source software and distributed innovation . Oxford Review of Economic Policy , 17 ( 2 ) : 248 – 264 . (doi:10.1093/oxrep/17.2.248)
  • Koh , J. and Kim , Y. G. 2004 . Knowledge sharing in virtual communities: an e-business perspective . Expert Systems with Applications , 26 ( 2 ) : 155 – 166 . (doi:10.1016/S0957-4174(03)00116-7)
  • Kuk , G. 2006 . Strategic interaction and knowledge sharing in the KDE developer mailing list . Management Science , 52 ( 7 ) : 1031 – 1042 . (doi:10.1287/mnsc.1060.0551)
  • Lave , J. and Wenger , E. 1991 . Situated learning: legitimate peripheral participation , Cambridge , , UK : Cambridge University Press .
  • Lee , G. K. and Cole , R. E. 2003 . The Linux kernel development: an evolutionary model of knowledge creation . Organization Science , 14 ( 6 ) : 633 – 649 . (doi:10.1287/orsc.14.6.633.24866)
  • Li , W. Smoothing LDA model for text categorization . AIRS 2008, LNCS 4993 . January 15–18 , Harbin , China. Edited by: Li , H. pp. 83 – 94 . New York : Springer .
  • Lin , H.-F. and Lee , G.-G. 2006 . Determinants of success for online communities: an empirical study . Behaviour & Information Technology , 25 ( 6 ) : 479 – 488 . (doi:10.1080/01449290500330422)
  • Lu , X. 2006 . Enhancing text categorization with semantic-enriched representation and training data augmentation . Journal of the American Medical Informatics Association , 13 ( 5 ) : 526 – 535 . (doi:10.1197/jamia.M2051)
  • Maedche , A. and Staab , S. 2001 . Ontology learning for the semantic web . IEEE Intelligent Systems , 16 ( 2 ) : 72 – 79 . (doi:10.1109/5254.920602)
  • Manning , C. D. and Schutze , H. 1999 . Foundations of statistical natural language processing , Cambridge , MA : MIT Press .
  • Martínez-Torres , M. R. 2006 . A procedure to design a structural and measurement model of intellectual capital: an exploratory study . Information & Management , 43 ( 5 ) : 617 – 626 . (doi:10.1016/j.im.2006.03.002)
  • Martínez-Torres , M. R. 2010 . The role of Internet in the development of future software projects . Internet Research , 20 ( 1 ) : 72 – 86 . (doi:10.1108/10662241011020842)
  • Martínez-Torres , M. R. and Toral , S. L. 2010 . Strategic group identification using evolutionary computation . Experts Systems with Applications , 37 ( 7 ) : 4948 – 4954 . (doi:10.1016/j.eswa.2009.12.019)
  • Menon , R. On the effectiveness of latent semantic analysis for the categorization of call centre records . Proceedings of the IEEE International Engineering Management Conference . October 18–21 , Singapore . Vol. 2 , pp. 546 – 550 . Piscataway , NJ : IEEE .
  • Michlmayr , M. and Senyard , A. 2006 . “ A statistical analysis of defects in Debian and strategies for improving quality in free software projects ” . In The economics of open source software development , Edited by: Bitzer , J. , Philipp , J.H. and Schröder . 131 – 148 . Amsterdam, the Netherlands : Elsevier .
  • Mika , P. 2007 . Semantic web and beyond: computing for human experience , New York : Springer .
  • Moler , C. B. 2004 . Numerical computing with MATLAB , Philadelphia , PA : Mathworks Inc .
  • Newman , D. 2006 . Analyzing entities and topics in news articles using statistical topic models LNCS 3975 , New York : Intelligence and Security Informatics, Springer .
  • Ng , A. Y. , Jordan , M. and Weiss , Y. 2001 . “ On spectral clustering: analysis and an algorithm ” . In Advances in Neural Information Processing Systems 14 , 849 – 856 . Cambridge , MA : MIT Press .
  • Pan , S. L. and Leidner , D. E. 2003 . Bridging communities of practice with information technology in pursuit of global knowledge sharing . Journal of Strategic Information Systems , 12 : 71 – 88 . (doi:10.1016/S0963-8687(02)00023-9)
  • Rheingold , H. 1993 . The virtual community: homesteading on the electronic frontier , Reading , MA : Addison-Wesley .
  • Register , A. H. 2007 . A guide to MATLAB object-oriented programming , Boca Raton, FL: Chapman & Hall/CRC Press .
  • Rencher , A. C. 2002 . Methods of multivariate analysis. Wiley Series in Probability and Statistics , 2nd ed , New York : John Wiley & Sons .
  • Rigutini , L. and Maggini , M. A semi-supervised document clustering algorithm based on EM . Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence . September 19–22 , Compiègne , France. pp. 200 – 206 . Piscataway , NJ : IEEE .
  • Salto , G. and McGill , M. J. 1983 . An introduction to modern information retrieval , New York : McGraw-Hill .
  • Shi , J. and Malik , J. 2000 . Normalized cuts and image segmentation . IEEE Transaction on Pattern Analysis and Machine Intelligence , 22 ( 8 ) : 888 – 905 . (doi:10.1109/34.868688)
  • Sowe , S. , Stamelos , I. and Angelis , L. 2006 . Identifying knowledge brokers that yield software engineering knowledge in OSS projects . Information and Software Technology , 48 ( 11 ) : 1025 – 1033 . (doi:10.1016/j.infsof.2005.12.019)
  • Stevens , J. 1992 . Applied multivariate statistics for the social sciences , 2 , Mahwah , NJ, USA : Lawrence Erlbaum .
  • Steyvers , M. and Griffiths , T. 2007 . “ Probabilistic topic models ” . In Handbook of latent semantic analysis , Edited by: Landauer , T. Hillsdale , NJ : Erlbaum .
  • Toral , S. L. , Martínez-Torres , M. R. and Barrero , F. 2009a . Modelling mailing list behaviour in open source projects: the case of ARM embedded Linux . Journal of Universal Computer Science , 15 ( 3 ) : 648 – 664 .
  • Toral , S. L. , Martínez-Torres , M. R. and Barrero , F. 2009b . Virtual communities as a resource for the development of OSS projects: the case of Linux ports to embedded processors . Behavior and Information Technology , 28 ( 5 ) : 405 – 419 . (doi:10.1080/01449290903121394)
  • Toral , S. L. , Vargas , M. and Barrero , F. 2009c . Embedded multimedia processors for road-traffic parameter estimation . Computer , 42 ( 12 ) : 61 – 68 . (doi:10.1109/MC.2009.392)
  • Toral , S. L. , Martínez-Torres , M. R. and Barrero , F. 2009d . An empirical study of the driving forces behind online communities . Internet Research , 19 ( 4 ) : 378 – 392 . (doi:10.1108/10662240910981353)
  • Toral , S. L. , Martínez-Torres , M. R. and Barrero , F. 2010 . Analysis of virtual communities supporting OSS projects using social network analysis . Information and Software Technology , 52 ( 3 ) : 296 – 303 . (doi:10.1016/j.infsof.2009.10.007)
  • Wai , F. B. 2008 . Reuse of knowledge assets from repositories: a mixed methods study . Information & Management , 45 ( 6 ) : 365 – 375 . (doi:10.1016/j.im.2008.06.001)
  • Weal , M. J. 2007 . Ontologies as facilitators for repurposing web documents . International Journal of Human–Computer Studies , 65 : 537 – 562 . (doi:10.1016/j.ijhcs.2007.02.001)
  • Wellman , B. and Gulia , M. 1995 . Net surfers don’t ride alone: virtual communities as communities , Berkeley : University of California Press .
  • Wenger , E. C. and Snyder , W. M. 2000 . Communities of practice: the organizational frontier . Harvard Business Review , 78 ( 1 ) : 139 – 144 .
  • Xu , W. and Gong , Y. Document clustering by concept factorization . Proceedings of international conference research and development in information retrieval . pp. 202 – 209 . New York : ACM .
  • Xu , W. , Liu , X. and Gong , Y. Document clustering based on non-negative matrix factorization . Proceedings of international conference research and development in information retrieval . pp. 267 – 273 . New York : ACM .
  • Zha , H. 2001 . “ Spectral relaxation for k-means clustering ” . In Advances in neural information processing systems 14 , 1057 – 1064 . Cambridge , MA : MIT Press .
  • Zhou , S. , Li , K. and Liu , Y. Text categorization based on topic model . Proceedings of the 3rd international conference on rough sets and knowledge technology . May 17–19 , Chengdu , China. Edited by: Wang , G. , Li , T. , Grzymala-Busse , J. W. , Miao , D. , Skowron , A. and Yao , Y. pp. 572 – 579 . Berlin : Springer-Verlag . Lecture notes in computer science

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.