Enabling maps/location searches on mobile devices: constructing a POI database via focused crawling and information extraction

Pages 1405-1425 | Received 22 Jul 2015, Accepted 04 Dec 2015, Published online: 12 Jan 2016
 

ABSTRACT

With the popularity of mobile devices and smartphones, we have witnessed rapid growth in mobile applications and services, especially in location-based services (LBS). According to a mobile marketing survey, maps/location searches are among the most utilized services on smartphones. Points of interest (POIs), such as stores, shops, gas stations, parking lots, and bus stops, are particularly important for maps/location searches. Existing map services such as Google Maps and Wikimapia are constructed manually, either by professionals or through crowdsourcing. However, manual annotation is costly and limits what current POI search services can cover. With the abundance of information on the Web, many store POIs can be extracted from the Web. In this paper, we focus on automatically constructing a POI database to enable store POI map searches. We propose the techniques required to construct a POI database, including focused crawling, information extraction, and information retrieval. We first crawl Yellow Page web sites to obtain vocabularies of store names. These vocabularies are then submitted as queries to search engines, and the sentences containing the store names in the returned search snippets are used to train a store name recognition model. To extract POIs scattered across the Web, we propose a query-based crawler that finds address-bearing pages from which addresses and store names can be extracted. We collected 1.25 million distinct POI pairs from across the Web and implemented a POI search service on Solr, the search platform built on Apache Lucene. The experimental results demonstrate that the proposed geographical information retrieval model outperforms Wikimapia and a commercial app called ‘What’s the Number?’
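
The abstract does not spell out the retrieval model, so the following is only a minimal sketch of what a store POI map search involves: keyword matching over store names and addresses, restricted to a radius around the user and ranked by distance. It is not the authors' method and does not use Solr. The `Poi` record, the `search` function, the `radius_km` parameter, and the sample entries are all hypothetical names and data introduced for illustration.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Poi:
    """Hypothetical POI record: a store name paired with its address and coordinates."""
    name: str
    address: str
    lat: float
    lon: float


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two latitude/longitude points."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


def search(pois, query, lat, lon, radius_km=2.0):
    """Return (distance_km, poi) pairs whose text contains every query term,
    limited to radius_km around (lat, lon) and sorted nearest first."""
    terms = query.lower().split()
    hits = []
    for poi in pois:
        text = f"{poi.name} {poi.address}".lower()
        if not all(term in text for term in terms):
            continue
        distance = haversine_km(lat, lon, poi.lat, poi.lon)
        if distance <= radius_km:
            hits.append((distance, poi))
    hits.sort(key=lambda hit: hit[0])
    return hits


# Made-up sample data, not taken from the paper's POI database.
SAMPLE_POIS = [
    Poi("Sunrise Coffee Annex", "9 Example Rd", 25.0400, 121.5654),
    Poi("Moon Bakery", "3 Example Rd", 25.0331, 121.5655),
    Poi("Sunrise Coffee", "1 Example Rd", 25.0330, 121.5654),
    Poi("Sunrise Coffee Far", "200 Example Rd", 25.2000, 121.5654),
]

if __name__ == "__main__":
    for distance, poi in search(SAMPLE_POIS, "sunrise coffee", 25.0330, 121.5654):
        print(f"{distance:.2f} km  {poi.name}, {poi.address}")
```

A production service would replace the linear scan with an inverted index and a spatial filter, which is the role a platform such as Solr plays in the system the abstract describes.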

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was partially supported by the Ministry of Science and Technology, Taiwan under [grant number MOST103-2221-E-008-094].
