862
Views
7
CrossRef citations to date
0
Altmetric
Articles

Conceptually categorizing geographic features from text based on latent semantic analysis and ontologies

Pages 113-127 | Received 26 Feb 2015, Accepted 28 Dec 2015, Published online: 16 Feb 2016

References

  • Baharudin, B., L. Lee, and K. Khan. 2010. “A Review of Machine Learning Algorithms for Text-Documents Classification.” Journal of Advances in Information Technology 1 (1): 4–20. doi:10.4304/jait.1.1.4-20.
  • Bian, L., and S. Hu. 2007. “Identifying Components for Interoperable Process Models Using Concept Lattice and Semantic Reference System.” International Journal of Geographical Information Science 21 (9): 1009–1032. doi:10.1080/13658810601169907.
  • Blei, D., A. Ng, and M. Jordan. 2003. “Latent Dirichlet Allocation.” Journal of Machine Learning Research 3: 993–1022.
  • Chen, R., J. Liang, and R. Pan. 2008. “Using Recursive ART Network to Construction Domain Ontology Based on Term Frequency and Inverse Document Frequency.” Expert Systems with Applications 34: 488–501. doi:10.1016/j.eswa.2006.09.019.
  • Comber, A., A. Lear, and R. Wadsworth. 2010. “A Comparision of Different Semantic Methods for Integrating Thematic Geographical Information: The Example of Land Cover.” 13th AGILE International Conference on Geographic Information Science, Guimaraes, May 10–14. Edited by J. Carswell and T. Tekuza.
  • Deerwester, S., S. Dumais, G. Furnas, T. Landauer, and R. Harshman. 1990. “Indexing by Latent Semantic Analysis.” Journal of the American Society for Information Science 41 (6): 391–407. doi:10.1002/(ISSN)1097-4571.
  • Denhiere, G., B. Lemaire, C. Bellissens, and S. Jhean-Larose. 2006. “A Semantic Space for Modeling Children’s Semantic Memory.” In Handbook of Latent Semantic Analysis, edited by T. K. Landauer, D. S. McNamara, S. Dennis, and W. Kintsch, 143–165. Mahwah, NJ: Lawrence Erlbaum Associates, Publishers.
  • Dick-Peddie, W. 1992. New Mexico Vegetation: Past, Present, and Future. Albuquerque: University Of New Mexico Press.
  • Duckham, M., and M. Worboys. 2005. “An Algebraic Approach to Automated Geospatial Information Fusion.” International Journal of Geographical Information Science 19 (5): 537–557. doi:10.1080/13658810500032339.
  • Fabrikant, S., D. Montello, M. Ruocco, and R. Middleton. 2004. “The Distance-Similarity Metaphor in Network-Display Spatializations.” Cartography and Geographic Information Science 31 (4): 237–252. doi:10.1559/1523040042742402.
  • Fisher, D. 1987. “Knowledge Acquisition via Incremental Conceptual Clustering.” Machine Learning 2: 139–172. doi:10.1007/BF00114265.
  • Goldberg, D., J. Wilson, and C. Knoblock. 2009. “Extracting Geographic Features from the Internet to Automatically Build Detailed Regional Gazetteers.” International Journal of Geographical Information Science 23 (1): 93–128. doi:10.1080/13658810802577262.
  • Gruber, T. 1993. “A Translation Approach to Portable Ontology Specifications.” Knowledge Acquisition 5 (2): 199–220. doi:10.1006/knac.1993.1008.
  • Guarino, N. “Formal Ontology and Information System.” 1998. In Formal Ontology in Information Systems. Proceedings of FOIS’98, Trento, Italy June, edited by N. Guarino, 3–15. Amsterdam: IOS Press.
  • Hearst, M. 1999. “The Use of Categories and Clusters for Organizing Retrieval Results.” In Natural Language Information Retrieval, edited by T. Strzalkowski, 333–374. Dordrecht, The Netherland: Kluwer Academic.
  • Hill, L. L. 2000. “Core Elements of Digital Gazetteers: Placenames, Categories, and Footprints.” In Proceedings of Research and Advanced Technology for Digital Libraries, 4th European Conference (ECDL ‘00), edited by J. L. Borbinha, and T. Baker, 280–290. Vol. 1923. London: Springer.
  • Huang, Y. 2011. “A Latent Semantic Analysis-Based Approach to Geographic Feature Categorization from Text.” In Proceedings of the Fifth IEEE International Conference on Semantic Computing, edited by S. Guadarrama and N. Ludwig. Palo Alto, CA: Stanford University.
  • Julyan, R. 2006. The Mountains of New Mexico. Albuquerque: University of New Mexico Press.
  • Kavouras, M., and M. Kokla. 2002. “A Method for the Formalization and Integration of Geographical Categorizations.” International Journal of Geographical Information Science 16 (5): 439–453. doi:10.1080/13658810210129120.
  • Kobayashi, M., and M. Aono. 2006. “Exploring Overlapping Clusters Using Dynamic Re-Scaling and Sampling.” Knowledge Information Systems 10 (3): 295–313. doi:10.1007/s10115-006-0005-y.
  • Kontostathis, A., and W. Pottenger. 2006. “A Framework for Understanding Latent Semantic Indexing (LSI) Performance.” Information Processing & Management 42: 56–73. doi:10.1016/j.ipm.2004.11.007.
  • Kuhn, W. 2002. “Modelling the Semantics of Geographic Categories through Conceptual Integration.” In Proceedings of the Second International Conference on Geographic Information Science – GIScience, edited by M. Egenhofer and D. Mark, 108–118. London: Springer-Verlag.
  • Kumar, M. A., and M. Gopal. 2010. “A Comparison Study on Multiple Binary-Class SVM Methods for Unilabel Text Categorization.” Pattern Recognition Letters 31 (11): 1437–1444. doi:10.1016/j.patrec.2010.02.015.
  • Landauer, T., and S. Dumais. 1997. “A Solution to Plato’s Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge.” Psychological Review 104 (2): 211–240. doi:10.1037/0033-295X.104.2.211.
  • Landauer, T., P. Foltz, and D. Laham. 1998. “An Introduction to Latent Semantic Analysis.” Discourse Processes 25 (2–3): 259–284. doi:10.1080/01638539809545028.
  • Larson, R. 1996. “Geographic Information Retrieval and Spatial Browsing.” In Geographic Information Systems and Libraries: Patrons, Maps, and Spatial Information, edited by L. C. Smith and M. Gluck, 81–124. Champaign: University of Illinois at Urbana–Champaign.
  • Lieberman, M. S., H. Samet, J. Sankaranarayanna, and G. J. Sperlin. 2007. “STEWARD: Arthitecture of a Spatio-Textual Search Engine.” In Proceedings of the 15th ACM Int. Symp. on Advances in GIS (ACMGIS’07), edited by H. Samet, C. Shahabi, and M. Schneider, 186–193. New York: ACM.
  • Mark, D., B. Smith, and B. Tversky. 1999. “Ontology and Geographic Objects: An Empirical Study of Cognitive Categorization.” COSIT’99, LNCS 1661, Stade, August 25–29, 283–298. Edited by C. Freksa and D. M. Mark.
  • Michalski, R. S. 1980. “Knowledge Acquisition through Conceptual Clustering: A Theoretical Framework and an Algorithm for Partitioning Data into Conjunctive Concepts.” Journal of Policy Analysis and Information Systems 4 (3): 219–244.
  • Millis, K., J. Magliano, K. Wiemer-Hastings, S. Todaro, and D. MxNamara. 2006. “Assessing and Improving Comprehension with Latent Semantic Analysis.” In Handbook of Latent Semantic Analysis, edited by T. K. Landauer, D. S. McNamara, S. Dennis, and W. Kintsch, 207–225. Mahwah, NJ: Lawrence Erlbaum Associates, Publishers.
  • Noy, N. F., R. W. Fergerson, and M. A. Musen. 2000. “The Knowledge Model of Protégé 2000: Combining Interoperability and Flexibility.” Proceedings of the 12th International Conference on Knowledge Engineering and Knowledge Management (EKAW’2000), Juan-les-Pins, October 2–6. Edited by R. Dieng and O. Corby.
  • Nunzio, G. 2009. “Using Scatterplots to Understand and Improve Probabilistic Models for Text Categorization and Retrieval.” International Journal of Approximate Reasoning 50 (7): 945–956. doi:10.1016/j.ijar.2009.01.002.
  • Odgers, N., A. McBratney, and B. Minasny. 2011. “Bottom-Up Digital Soil Mapping. I. Soil Layer Classes.” Geoderma 163: 38–44. doi:10.1016/j.geoderma.2011.03.014.
  • Rauch, E., M. Bukatin, and K. Baker. 2003. “A Confidence-Based Framework for Disambiguating Geographic Terms.” In Proceedings of the HLT-NAACL 2003 Workshop on Analysis of Geographic References, edited by S. Kornai and B. Sundheim, 50–54. Morristown, NJ: Association for Computational Linguistics.
  • Robertson, S. 2004. “Understanding Inverse Document Frequency: On Theoretical Arguments for IDF.” Journal of Documentation 60 (5): 503–520. doi:10.1108/00220410410560582.
  • Salton, G. 1989. Automatic Text Processing - The Transformation, Analysis, and Retrieval of Information by Computer. Reading, MA: Addison-Wesley.
  • Sebastiani, F. 2002. “Machine Learning in Automated Text Categorization.” ACM Computing Surveys 34 (1): 1–47. doi:10.1145/505282.505283.
  • Smith, B., and D. Mark. 2001. “Geographical Categories: An Ontological Investigation.” International Journal of Geographical Information Science 15 (7): 591–612. doi:10.1080/13658810110061199.
  • Smith, B., and D. Mark. 2003. “Do Mountains Exist? Towards an Ontology of Landforms.” Environment and Planning B: Planning and Design 30 (3): 411–427. doi:10.1068/b12821.
  • Song, W., and S. Park. 2009. “Genetic Algorithm for Text Clustering Using Ontology and Evaluating the Validity of Various Semantic Similarity Measures.” Expert Systems with Applications 36: 9095–9104. doi:10.1016/j.eswa.2008.12.046.
  • Srinivasan, P. 2001. “Meshmap: A Text Mining Tool for Medline.” In Proc. AMIA Symp, edited by S. Bakken, November 3–7, 642–646. Washington, DC.
  • Uguz, H. 2011. “A Two-Stage Feature Selection Method for Text Categorization by Using Information Gain, Principal Component Analysis and Genetic Algorithm.” Knowledge-based Systems 24 (7): 1024–1032. doi:10.1016/j.knosys.2011.04.014.
  • Usery, L. 1993. “Category Theory and the Structure of Features in Geographic Information Systems.” Cartography and Geographic Information Science 20 (1): 5–12. doi:10.1559/152304093782616751.
  • Wei, C., C. Yang, and C. Lin. 2008. “A Latent Semantic Indexing-Based Approach to Multilingual Document Clustering.” Decision Support Systems 45: 606–620. doi:10.1016/j.dss.2007.07.008.
  • Wiemer-Hastings, P. 1999. “How Latent Is Latent Semantic Analysis?” In Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, edited by T. Dean, 932–937. San Francisco, CA: Morgan Kaufmann.
  • Wise, J. A., J. J. Thomas, K. Pennock, D. Lantrip, M. Pottier, A. Schur, and V. Crow. 1995. “Visualizing the Non-Visual: Spatial Analysis and Interaction with Information from Text Documents.” In Information Visualization, 1995. Proceedings, edited by N. Gershon and S. Eick, Atlanta, October 30, 51–58. IEEE.
  • Yu, B., Z. Xu, and C. Li. 2008. “Latent Semantic Analysis for Text Categorization Using Neural Network.” Knowledge-Based Systems 21: 900–904. doi:10.1016/j.knosys.2008.03.045.
  • Zha, H., and H. Simon. 1998. “A Subspace-Based Model for Latent Semantic Indexing in Information Retrieval.” In Proceedings of the Thirteenth Symposium on the Interface, edited by W. Eddy, 315–320. New York: Springer-Verlag.
  • Zheng, H., C. Borchert, and Y. Jiang. 2010. “A Knowledge-Driven Approach to Biomedical Document Conceptualization.” Artificial Intelligence in Medicine 49 (2): 67–78. doi:10.1016/j.artmed.2010.02.005.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.