1,106
Views
5
CrossRef citations to date
0
Altmetric
Research Articles

Why did a vehicle stop? A methodology for detection and classification of stops in vehicle trajectories

ORCID Icon, ORCID Icon & ORCID Icon
Pages 1953-1979 | Received 07 Aug 2019, Accepted 07 Mar 2020, Published online: 23 Mar 2020

References

  • Agamennoni, G., Nieto, J., and Nebot, E., 2009. Mining GPS data for extracting significant places. In: Proceedings of the 2009 IEEE international conference on Robotics and Automation. Piscataway, NJ: IEEE Press, 1860–1867.
  • Alfaro, E., Gámez, M., and García, N., 2013. {adabag}: an {R} package for classification with boosting and bagging. Journal of Statistical Software, 54 (2), 1–35. doi:10.18637/jss.v054.i02.
  • Altintasi, O., Tuydes-Yaman, H., and Tuncay, K., 2017. Detection of urban traffic patterns from Floating Car Data (FCD). Transportation Research Procedia, 22, 382–391. doi:10.1016/j.trpro.2017.03.057
  • Altman, N.S., 1992. An introduction to kernel and nearest-neighbor nonparametric regression. The American Statistician, 46 (3), 175–185.
  • Ashbrook, D. and Starner, T., 2003. Using GPS to learn significant locations and predict movement across multiple users. Personal and Ubiquitous Computing, 7 (5), 275–286. doi:10.1007/s00779-003-0240-0.
  • Bermingham, L. and Lee, I., 2018. A probabilistic stop and move classifier for noisy GPS trajectories. Data Mining and Knowledge Discovery, 32 (6), 1634–1662. doi:10.1007/s10618-018-0568-8.
  • Boughorbel, S., Jarray, F., and El-Anbari, M., 2017. Optimal classifier for imbalanced data using matthews correlation coefficient metric. PloS One, 12 (6), e0177678. doi:10.1371/journal.pone.0177678.
  • Bradley, A.P., 1997. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30 (7), 1145–1159. doi:10.1016/S0031-3203(96)00142-2.
  • Breiman, L., 1996a. Bagging predictors. Machine Learning, 24 (2), 123–140. doi:10.1007/BF00058655.
  • Breiman, L., 1996b. Arcing classifiers. Annals of Statistics, 26, 123–140.
  • Breiman, L., 2001. Random forests. Machine Learning, 45 (1), 5–32. doi:10.1023/A:1010933404324.
  • Brunauer, R. and Rehrl, K., 2016. Big Data in der Mobilität - FCD Modellregion Salzburg. In: D. Fasel and A. Meier, eds. Big Data: grundlagen, Systeme und Nutzungspotenziale. Wiesbaden: Springer Fachmedien Wiesbaden, 235–267.
  • Brunauer, R. and Rehrl, K., 2014. Deriving driver-centric travel information by mining delay patterns from single GPS trajectories. In: IWCTS 2014 - Proceedings of the 7th ACM SIGSPATIAL International Workshop on Computational Transportation Science. Texas, United States, 25–30.
  • Cao, X., Cong, G., and Jensen, C.S., 2010. Mining significant semantic locations from GPS data. Proceedings of the VLDB Endowment, 3 (1–2), 1009–1020. doi:10.14778/1920841.1920968.
  • Chawla, N.V., et al., 2002. SMOTE: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321–357. doi:10.1613/jair.953
  • Chen, T., et al., 2019. xgboost: Extreme Gradient Boosting.
  • Chen, T. and Guestrin, C., 2016. Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. San Francisco,  California, United States, 785–794.
  • Cohen, J., 1960. A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20 (1), 37–46. doi:10.1177/001316446002000104.
  • Cunningham, P. and Delany, S.J., 2007. K-Nearest neighbour classifiers. Multiple Classifier Systems, 34 (8), 1–17.
  • D’Andrea, A., et al., 2014. A functional road classification with data mining techniques. Transport, 29 (4), 419–430. doi:10.3846/16484142.2014.984329.
  • Diaz-Uriarte, R., 2007. GeneSrF and varSelRF: a web-based tool and R package for gene selection and classification using random forest. BMC Bioinformatics, 8, 328.
  • Diaz-Uriarte, R. and de Andrés, S.A., 2005. Variable selection from random forests: application to gene expression data. arXiv Preprint Q-bio/0503025.
  • Fawcett, T., 2004. ROC graphs: notes and practical considerations for researchers. Machine Learning, 31 (1), 1–38.
  • Fawcett, T., 2006. An introduction to ROC analysis. Pattern Recognition Letters, 27 (8), 861–874. doi:10.1016/j.patrec.2005.10.010.
  • Flach, P.A., 2003. The geometry of ROC space: understanding machine learning metrics through ROC isometrics. In: Proceedings of the 20th international conference on machine learning (ICML-03). Washington D.C., Columbia, United States, 194–201.
  • Freund, Y. and Schapire, R.E., 1997. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55 (1), 119–139. doi:10.1006/jcss.1997.1504.
  • Friedman, J.H., 2001. Greedy function approximation: a gradient boosting machine. Annals of Statistics, 29 (5), 1189–1232. doi:10.1214/aos/1013203451.
  • Giannotti, F., et al., 2007. Trajectory pattern mining. In: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD ’07. New York: ACM Press, 330.
  • Guo, X., et al., 2008. On the class imbalance problem. In: 2008 Fourth international conference on natural computation. Los Alamitos, California, United States, 192–201.
  • Guyon, I. and Elisseeff, A., 2003. An introduction to variable and feature selection. Journal of Machine Learning Research, 3 (Mar), 1157–1182.
  • Hansen, L.K. and Salamon, P., 1990. Neural network ensembles. IEEE Transactions on Pattern Analysis & Machine Intelligence, 12 (10), 993–1001. doi:10.1109/34.58871.
  • Haykin, S., 1994. Neural networks. New York: Prentice hall.
  • Huang, J. and Ling, C.X., 2005. Using AUC and accuracy in evaluating learning algorithms. IEEE Transactions on Knowledge and Data Engineering, 17 (3), 299–310. doi:10.1109/TKDE.2005.50.
  • Japkowicz, N., 2003. Class imbalances: are we focusing on the right issue. In: Workshop on Learning from Imbalanced Data Sets II. Washington D.C., Columbia, United States, 63.
  • Kang, J.H., et al., 2004. Extracting places from traces of locations. In: Proceedings of the 2nd ACM international workshop on Wireless mobile applications and services on WLAN hotspots - WMASH ’04. New York: ACM Press, 110.
  • Keler, A., Krisp, J.M., and Ding, L., 2017. Detecting vehicle traffic patterns in urban environments using taxi trajectory intersection points. Geo-Spatial Information Science, 20 (4), 333–344. doi:10.1080/10095020.2017.1399672.
  • Kotsiantis, S., Kanellopoulos, D., and Pintelas, P., others, 2006. Handling imbalanced datasets: A review. GESTS International Transactions on Computer Science and Engineering, 30 (1), 25–36.
  • Kuhn, M., et al., 2018. caret: Classification and Regression Training.
  • Kuhns, G., et al., 2011. Self evaluation of floating car data based on travel times from actual vehicle trajectories. 2011 IEEE Forum on Integrated and Sustainable Transportation Systems. Vienna, Austria, 109–114.
  • Landis, J.R. and Koch, G.G., 1977. The measurement of observer agreement for categorical data. biometrics, 33 (1), 159–174. doi:10.2307/2529310.
  • Li, T., Zhang, C., and Ogihara, M., 2004. A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression. Bioinformatics, 20 (15), 2429–2437. doi:10.1093/bioinformatics/bth267.
  • Liaw, A. and Wiener, M., 2002. Classification and regression by randomForest. R News, 2 (3), 18–22.
  • Linneberg, C., et al., 1994. Towards semen quality assessment using neural networks. In: Neural networks for signal processing - proceedings of the IEEE workshop. Ermioni, Greece: IEEE, 509–517.
  • Matthews, B.W., 1975. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochimica Et Biophysica Acta (Bba)-protein Structure, 405 (2), 442–451. doi:10.1016/0005-2795(75)90109-9.
  • Mazimpaka, J.D. and Timpf, S., 2016. Trajectory data mining: A review of methods and applications. Journal of Spatial Information Science, 13, 61–99.
  • McCulloch, W.S. and Pitts, W., 1943. A logical calculus of the ideas immanent in nervous activity. The Bulletin of Mathematical Biophysics, 5 (4), 115–133. doi:10.1007/BF02478259.
  • Monard, M.C. and Batista, G.E., 2002. Learning with skewed class distributions. Advances in Logic, Artificial Intelligence, and Robotics: LAPTEC, 85 (2002), 173.
  • Powers, D.M., 2011. Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. Journal of Machine Learning Technologies, 2 (1), 37–63.
  • R Core Team, 2018. R: A Language and Environment for Statistical Computing.
  • Rahmani, M., Jenelius, E., and Koutsopoulos, H.N., 2013. Route travel time estimation using low-frequency floating car data. In: 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013). The Hague, Netherlands: IEEE, 2292–2297.
  • Rehrl, K., et al., 2018b. Towards a national floating car data platform for Austria. In: Proceedings of the 25th World Congress on Intelligent Transportation Systems (ITSWC). Copenhagen, Denmark.
  • Rehrl, K., Brunauer, R., and Gröchenig, S., 2016. Collecting floating car data with smartphones: results from a field trial in Austria. Journal of Location Based Services, 10 (1), 16–30. doi:10.1080/17489725.2016.1169323.
  • Rehrl, K., Gröchenig, S., and Wimmer, M., 2018a. Optimization and evaluation of a high performance open-source map-matching implementation. In: Geospatial Technologies for All. Selected Papers of the 21th AGILE Conference on Geographic Information Science. Lund, Sweden.
  • Schäfer, R.-P., Thiessenhusen, K.-U., and Wagner, P., 2002. Analysis of travel times and routes on urban roads by means of floating-car data. In: ITS World Congress 2002. Copenhagen, Denmark.
  • Schapire, R.E., 1990. The strength of weak learnability. Machine Learning, 5 (2), 197–227. doi:10.1007/BF00116037.
  • Schapire, R.E., et al. 1998. Boosting the margin: A new explanation for the effectiveness of voting methods. The Annals of Statistics, 26 (5), 1651–1686. doi:10.1214/aos/1024691352.
  • Schliep, K. and Hechenbichler, K., 2016. kknn: Weighted k-Nearest Neighbors.
  • Schmid, F. and Richter, K., 2006. Extracting places from location data streams 1. In: International Workshop on Ubiquitous Geographical Information Services. Münster.
  • Schneider, C., et al., 2017. A framework for evaluating stay detection approaches. ISPRS International Journal of Geo-Information, 6 (10), 10. doi:10.3390/ijgi6100315.
  • Venables, W.N. and Ripley, B.D., 2002. Modern applied statistics with S. Fourth. New York: Springer.
  • Visa, S. and Ralescu, A., 2005. Issues in mining imbalanced data sets-a review paper. In: Proceedings of the sixteen midwest artificial intelligence and cognitive science conference. Dayton, Ohio, United States, 67–73.
  • Weiss, G.M. and Provost, F., 2001. The effect of class distribution on classifier learning. Technical Report.
  • Ye, Y., et al., 2009. Mining individual life pattern based on location history. Proceedings - IEEE International Conference on Mobile Data Management. Taipei, Taiwan, 1–10.
  • Yin, L., et al., 2014. Detecting illegal pickups of intercity buses from their GPS traces. 2014 17th IEEE International Conference on Intelligent Transportation Systems, ITSC 2014. Shandong, China, 2162–2167.
  • Zheng, Y., 2015. Trajectory data mining. ACM Transactions on Intelligent Systems and Technology, 6 (3), 1–41. doi:10.1145/2764959.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.