Revealing representative day-types in transport networks using traffic data clustering


Table 3. Sets of calendar-based clustering categories.

Figure 2. Case study of motorway control system sensors in Stockholm, Sweden, active on at least one day in 2017-2018. The sensors with complete data in the case study period are highlighted. (a) Complete network of active sensors. (b) Area around the city center. (c) Zoom-in illustration of the sensor density.

Table 4. Internal evaluation indices for calendar-based clusterings.

Figure 3. Calendar visualization of calendar-based clusterings for the year 2017. White cells are days with missing data.

Figure 4. Calendar visualization of data-driven clusterings computed on the original dataset $X_{T}$. White cells are days with missing data.

Figure 5. Day-type centroids for p-median and a-lex with 10 clusters. (a) Calendar visualization. (b) Aggregated network-wide day-time profiles. (c, d) Space-time matrices of flows across all sensors in the network and all considered time intervals: (c) for p-median and (d) for a-lex.

Figure 6. Total within-cluster variance (TWCV) as a function of the number of clusters across clustering methods. (a) is the original dataset $X_{T}$ and (b) the reduced dataset $\bar{X}_{T}$.

Figure 7. Total cluster dissimilarity (TCD) as a function of the number of clusters across clustering methods. (a) is the original dataset $X_{T}$ and (b) the reduced dataset $\bar{X}_{T}$.

Figure 8. Silhouette score (SC) as a function of the number of clusters across clustering methods. (a) is the original dataset $X_{T}$ and (b) the reduced dataset $\bar{X}_{T}$.

Figure 9. Davies-Bouldin (DB) index as a function of the number of clusters across clustering methods. (a) is the original dataset $X_{T}$ and (b) the reduced dataset $\bar{X}_{T}$.

Figure 10. External validation index based on short-term prediction performance as a function of the number of clusters across clustering methods. (a) is the original dataset $X_{T}$ and (b) the reduced dataset $\bar{X}_{T}$.

Figure 11. External validation index of the clustering $C$ based on exponential smoothing short-term prediction performance, as a function of the number of clusters. (a) is the original dataset $X_{T}$ and (b) the reduced dataset $\bar{X}_{T}$.

Table 5. Best-performing number of clusters according to the considered internal indices, per clustering method. The clusterings considered best and reasonable are highlighted in bold.

Table 6. Total computational time for clustering methods to run 19 clusterings with 2-20 clusters.
