Publication Cover
Transportation Letters
The International Journal of Transportation Research
Latest Articles
0
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Mitigating biases in big mobility data: a case study of monitoring large-scale transit systems

ORCID Icon, , , &
Received 30 Sep 2023, Accepted 09 Jul 2024, Published online: 17 Jul 2024

References

  • Aktay, A., S. Bavadekar, G. Cossoul, J. Davis, D. Desfontaines, A. Fabrikant, E. Gabrilovich, et al. “Google COVID-19 Community Mobility Reports: Anonymization Process Description (Version 1.1).” Accessed July 26, 2022. http://arxiv.org/abs/2004.04145.
  • Apple. “Apple Makes Mobility Data Available to Aid COVID-19 Efforts.” Apple Newsroom. Accessed July 26, 2022. https://www.apple.com/newsroom/2020/04/apple-makes-mobility-data-available-to-aid-covid-19-efforts/.
  • Arai, A., Z. Fan, D. Matekenya, and R. Shibasaki. 2016. “Comparative Perspective of Human Behavior Patterns to Uncover Ownership Bias Among Mobile Phone Users.” ISPRS International Journal of Geo-Information 5 (6): 85. https://doi.org/10.3390/ijgi5060085.
  • Balayn, A., C. Lofi, and G.-J. Houben. 2021. “Managing Bias and Unfairness in Data for Decision Support: A Survey of Machine Learning and Data Engineering Approaches to Identify and Mitigate Bias and Unfairness within Data Management and Analytics Systems.” The VLDB Journal 30 (5): 739–768. https://doi.org/10.1007/s00778-021-00671-8.
  • X. Ban, C. Chen, F. Wang, J. Wang, and Y. Zhang. 2018. “Promises of Data from Emerging Technologies for Transportation Applications: Puget Sound Region Case Study.” Publication FHWA-HEP-19–026. Federal Highway Administration.
  • Berrebi, S. J., S. Joshi, and K. E. Watkins. 2022. “Cross-Checking Automated Passenger Counts for Ridership Analysis.” Journal of Public Transportation 24:100008. https://doi.org/10.5038/2375-0901.23.2.5.
  • Bian, Z., F. Zuo, J. Gao, Y. Chen, S. S. C. Pavuluri Venkata, S. Duran Bernardes, K. Ozbay, X. (Jeff) Ban, and J. Wang. 2021. “Time Lag Effects of COVID-19 Policies on Transportation Systems: A Comparative Study of New York City and Seattle.” Transportation Research Part A: Policy and Practice 145:269–283. https://doi.org/10.1016/j.tra.2021.01.019.
  • Brubaker, J., T. Kilic, P. Wollburg, and B. Van Campenhout. 2021. “Representativeness of Individual-Level Data in COVID-19 Phone Surveys: Findings from Sub-Saharan Africa.” PLOS ONE 16 (11): e0258877. https://doi.org/10.1371/journal.pone.0258877.
  • Celis, L. E., V. Keswani, and N. Vishnoi. 2020. “Data Preprocessing to Mitigate Bias: A Maximum Entropy Based Approach.” Presented at the International Conference on Machine Learning. https://proceedings.mlr.press/v119/celis20a.html.
  • Chen, C., X. (Jeff) Ban, F. Wang, J. Wang, C. Siddique, R. Fan, and J. Lee. 2017. Understanding GPS and Mobile Phone Data for Origin-Destination Analysis Publication FHWA-HEP-19-027. Federal Highway Administration. https://www.fhwa.dot.gov/planning/tmip/publications/other_reports/gps_mobile_data/index.cfm.
  • Chen, C., J. Ma, Y. Susilo, Y. Liu, and M. Wang. 2016. “The Promises of Big Data and Small Data for Travel Behavior (Aka Human Mobility) Analysis.” Transportation Research Part C: Emerging Technologies 68:285–299. https://doi.org/10.1016/j.trc.2016.04.005.
  • Cho, S.-H., and H.-C. Park. 2021. “Exploring the Behaviour Change of Crowding Impedance on Public Transit Due to COVID-19 Pandemic: Before and After Comparison.” Transportation Letters 13 (5–6): 367–374. https://doi.org/10.1080/19427867.2021.1897937.
  • Deville, P., C. Linard, S. Martin, M. Gilbert, F. R. Stevens, A. E. Gaughan, V. D. Blondel, and A. J. Tatem. 2014. “Dynamic Population Mapping Using Mobile Phone Data.” Proceedings of the National Academy of Sciences 111 (45): 15888–15893. https://doi.org/10.1073/pnas.1408439111.
  • Fekih, M., L. Bonnetain, A. Furno, P. Bonnel, Z. Smoreda, S. Galland, and T. Bellemans. 2022. “Potential of Cellular Signaling Data for Time-Of-Day Estimation and Spatial Classification of Travel Demand: A Large-Scale Comparative Study with Travel Survey and Land Use Data.” Transportation Letters 14 (7): 787–805. https://doi.org/10.1080/19427867.2021.1945854.
  • Gensheimer, J., J. Chen, A. J. Turner, A. Shekhar, A. Wenzel, and F. N. Keutsch. 2021. “What are the Different Measures of Mobility Telling Us About Surface Transportation CO2 Emissions During the COVID-19 Pandemic?” Journal of Geophysical Research Atmospheres 126 (11): e2021JD034664. https://doi.org/10.1029/2021JD034664.
  • Guangnian, X., L. Qiongwen, N. Anning, and C. Zhang. 2023. “Research on Carbon Emissions of Public Bikes Based on the Life Cycle Theory.” Transportation Letters 15 (4): 278–295. https://doi.org/10.1080/19427867.2022.2123142.
  • He, Z. 2021. “Portraying Ride-Hailing Mobility Using Multi-Day Trip Order Data: A Case Study of Beijing, China.” Transportation Research Part A: Policy and Practice 146:152–169. https://doi.org/10.1016/j.tra.2021.01.017.
  • Hecht, B., and M. Stephens. 2014. “A Tale of Cities: Urban Biases in Volunteered Geographic Information.” Proceedings of the Eighth International AAAI conference on Weblogs and Social Media, 9. https://doi.org/10.1609/icwsm.v8i1.14554.
  • Hossain, S., and K. N. Habib. 2022. “Inferring Origin and Destination Zones of Transit Trips Through Fusion of Smart Card Transactions, Travel Surveys, and Land-Use Data.” Transportation Research Part A: Policy and Practice 165:267–284. https://doi.org/10.1016/j.tra.2022.09.010.
  • Hu, T., S. Wang, B. She, M. Zhang, X. Huang, Y. Cui, J. Khuri, et al. 2021. “Human Mobility Data in the COVID-19 Pandemic: Characteristics, Applications, and Challenges.” International Journal of Digital Earth 14 (9): 1126–1147. https://doi.org/10.1080/17538947.2021.1952324.
  • Jiao, J., M. Bhat, and A. Azimian. 2021. “Measuring Travel Behavior in Houston, Texas with Mobility Data During the 2020 COVID-19 Outbreak.” Transportation Letters 13 (5–6): 461–472. https://doi.org/10.1080/19427867.2021.1901838.
  • Lazer, D., R. Kennedy, G. King, and A. Vespignani. 2014. “The Parable of Google Flu: Traps in Big Data Analysis.” Science 343 (6176): 1203–1205. https://doi.org/10.1126/science.1248506.
  • Liu, L., H. J. Miller, J. Scheff, and C. Yang. 2020. “The Impacts of COVID-19 Pandemic on Public Transit Demand in the United States.” PLOS ONE 15 (11): e0242476. https://doi.org/10.1371/journal.pone.0242476.
  • Liu, Z., P. Ciais, Z. Deng, S. J. Davis, B. Zheng, Y. Wang, D. Cui, et al. 2020. “Carbon Monitor, a Near-Real-Time Daily Dataset of Global CO2 Emission from Fossil Fuel and Cement Production.” Scientific Data 7 (1): 392. https://doi.org/10.1038/s41597-020-00708-7.
  • Lu, S., Z. Fang, X. Zhang, S.-L. Shaw, L. Yin, Z. Zhao, and X. Yang. 2017. “Understanding the Representativeness of Mobile Phone Location Data in Characterizing Human Mobility Indicators.” ISPRS International Journal of Geo-Information 6 (1): 7. https://doi.org/10.3390/ijgi6010007.
  • Malik, M. M., H. Lamba, C. Nakos, and J. Pfeffer. 2015. “Population Bias in Geotagged Tweets.” 9th International AAAI Conference on Web and Social Media. https://doi.org/10.1609/icwsm.v9i4.14688.
  • Mislove, A., and S. Lehmann. 2011. “Understanding the Demographics of Twitter Users.” Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media, 4. https://doi.org/10.1609/icwsm.v5i1.14168.
  • Ranjan, G., H. Zang, Z.-L. Zhang, and J. Bolot. 2012. “Are Call Detail Records Biased for Sampling Human Mobility?” ACM SIGMOBILE Mobile Computing and Communications Review 16 (3): 33. https://doi.org/10.1145/2412096.2412101.
  • Sheugh, L., and S. H. Alizadeh. 2015. “A Note on Pearson Correlation Coefficient As a Metric of Similarity in Recommender System.” Presented at the 2015 AI & Robotics (IRANOPEN). https://doi.org/10.1109/RIOS.2015.7270736.
  • Uğurel, E., X. Guan, Y. Wang, S. Huang, Q. Wang, and C. Chen. 2024. “Correcting Missingness in Passively-Generated Mobile Data with Multi-Task Gaussian Processes.” Transportation Research Part C: Emerging Technologies 161:104523. https://doi.org/10.1016/j.trc.2024.104523.
  • Valliant, R., and J. A. Dever. 2011. “Estimating Propensity Adjustments for Volunteer Web Surveys.” Sociological Methods & Research 40 (1): 105–137. https://doi.org/10.1177/0049124110392533.
  • Wang, F., and C. Chen. 2018. “On Data Processing Required to Derive Mobility Patterns from Passively-Generated Mobile Phone Data.” Transportation Research Part C: Emerging Technologies 87:58–74. https://doi.org/10.1016/j.trc.2017.12.003.
  • Wang, F., J. Wang, J. Cao, C. Chen, and X. (Jeff) Ban. 2019. “Extracting Trips from Multi-Sourced Data for Mobility Pattern Analysis: An App-Based Data Example.” Transportation Research Part C: Emerging Technologies 105:183–202. https://doi.org/10.1016/j.trc.2019.05.028.
  • Wang, J., F. Wang, X. J. Ban, and C. Chen. 2019. “Comparative Analysis of Big and Small (Survey) Data for Deriving Human Mobility Patterns.” Presented at the Transportation Research Board 98th Annual MeetingTransportation Research Board. https://trid.trb.org/view/1572533.
  • Wang, W., D. Rothschild, S. Goel, and A. Gelman. 2015. “Forecasting Elections with Non-Representative Polls.” International Journal of Forecasting 31 (3): 980–991. https://doi.org/10.1016/j.ijforecast.2014.06.001.
  • Wang, Z., S. Hale, D. I. Adelani, P. Grabowicz, T. Hartman, F. Flöck, and D. Jurgens. 2019. Demographic Inference and Representative Population Estimates from Multilingual Social Media Data. San Francisco, CA, USA. https://doi.org/10.1145/3308558.3313684.
  • Wesolowski, A., N. Eagle, A. M. Noor, R. W. Snow, and C. O. Buckee. 2013. “The Impact of Biases in Mobile Phone Ownership on Estimates of Human Mobility.” Journal of the Royal Society Interface 10 (81): 20120986–20120986. https://doi.org/10.1098/rsif.2012.0986.
  • Wesolowski, A., N. Eagle, A. M. Noor, R. W. Snow, C. O. Buckee, and S. Gómez. 2012. “Heterogeneous Mobile Phone Ownership and Usage Patterns in Kenya.” PLOS ONE 7 (4): e35319. https://doi.org/10.1371/journal.pone.0035319.
  • Yang, J., A. A. S. Soltan, D. W. Eyre, and D. A. Clifton. 2023. “Algorithmic Fairness and Bias Mitigation for Clinical Machine Learning with Deep Reinforcement Learning.” Nature Machine Intelligence 5 (8): 884–894. https://doi.org/10.1038/s42256-023-00697-3.
  • Yang, Y., Y. Fan, and R. J. B. Wets. 2018. “Stochastic Travel Demand Estimation: Improving Network Identifiability Using Multi-Day Observation Sets.” Transportation Research Part B: Methodological 107 (Supplement C): 192–211. https://doi.org/10.1016/j.trb.2017.10.007.
  • Yuan, Y., M. Raubal, and Y. Liu. 2012. “Correlating Mobile Phone Usage and Travel Behavior – a Case Study of Harbin, China.” Computers, Environment and Urban Systems 36 (2): 118–130. https://doi.org/10.1016/j.compenvurbsys.2011.07.003.
  • Zagheni, E., I. Weber, N. Askitas, F. Professor Klaus, and P. Zimmermann. 2015. “Demographic Research with Non-Representative Internet Data.” International Journal of Manpower 36 (1): 13–25. https://doi.org/10.1108/IJM-12-2014-0261.
  • Zhang, W., Y. Zhao, X. (Jason) Cao, D. Lu, and Y. Chai. 2020. “Nonlinear Effect of Accessibility on Car Ownership in Beijing: Pedestrian-Scale Neighborhood Planning.” Transportation Research Part D: Transport & Environment 86:102445. https://doi.org/10.1016/j.trd.2020.102445.
  • Zhao, R., F. Wang, and W. Zheng. 2022, p. jtr. “Data‐Driven Inference of Interactions Among Multiple Tourist Attractions for Hourly Demand Forecasting.” International Journal of Tourism Research 24 (5): 701–713. https://doi.org/10.1002/jtr.2532.
  • Zhao, Z., S.-L. Shaw, Y. Xu, F. Lu, J. Chen, and L. Yin. 2016. “Understanding the Bias of Call Detail Records in Human Mobility Research.” International Journal of Geographical Information Science 30 (9): 1738–1762. https://doi.org/10.1080/13658816.2015.1137298.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.