159
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Algorithmic uncertainties in geolocating social media data for disaster management

ORCID Icon, , ORCID Icon, ORCID Icon, , & show all
Received 01 Feb 2023, Accepted 14 Nov 2023, Published online: 21 Dec 2023
 

ABSTRACT

The rapid development of information and communications technology has turned individuals into sensors, fostering the growth of human-generated geospatial big data. In disaster management, geospatial big data, mainly social media data, have opened new avenues for observing human responses to disasters in near real-time. Previous research relies on geographical information in geotags, content, and user profiles to locate social media messages. However, less than 1% of users geotag their messages, leaving geolocating users through user profiles or message content addresses increasingly crucial. This paper evaluates and visualizes the margin of error incurred when using user profiles or message-mentioned addresses to geolocate social media data for disaster research. Using Twitter data during the 2017 Hurricane Harvey as an example, this research assessed the inconsistencies in predicting users’ locations in various administrative units during each disaster phase using three geolocating strategies. The results reveal that the similarities between geotags, and user profile locations decrease from 94.07% to 64.56%, 43.9%, 31.82%, 27.05%, and 26.7% as the geographical scale changes from country to state, county, block group, 1-kilometer, and 30-meter levels. These similarities are overall higher than the agreements between locations derived from geotags and tweet content. The geolocation consistencies among the three methods remain stable across disaster phases. The impacts of uncertainties in geolocating Twitter data for disaster management applications were further unraveled. The findings offer valuable insights into the trade-off between spatial scale and geolocation accuracy and inform the selection of appropriate scales when applying different geolocating strategies in future social media-based investigations.

Acknowledgments

We extend our sincere appreciation to the anonymous reviewers whose constructive comments have greatly improved the quality of this manuscript.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

Data will be available upon request. It will be hosted in a private GitHub Repo: https://github.com/rohan-debayan/DeidentifiedHarveyTweets.git

Additional information

Funding

This work was supported by the NSF under Grant No. [1762600] (P.I.: Dr. Nina Lam) and [2318206].

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 78.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.