Publication Cover
Transportation Letters
The International Journal of Transportation Research
Volume 14, 2022 - Issue 8
258
Views
5
CrossRef citations to date
0
Altmetric
Research Article

Deep learning– just data or domain related knowledge adds value?: bus travel time prediction as a case study

ORCID Icon, ORCID Icon &
Pages 863-873 | Published online: 21 Jul 2021
 

ABSTRACT

In recent years, deep learning models proved their ability to solve complex problems in the areas such as computer vision and natural language processing, and are receiving a lot of attention within the community of transportation systems as well. Though these are known as data-driven approaches, it is not yet reported whether providing a huge amount of data is sufficient or whether extra domain knowledge added as features will improve their performance. It is reasonable to expect that the performance of deep learning models will be improved by incorporating field-specific knowledge into the problem. This paper tries to address this question by taking Convolutional Neural Networks (CNNs) as a sample deep learning technique and comparing its performance with and without adding extra information about the data as feature input, for the application of bus travel time prediction. To extract extra information, the data are pre-processed using visual and statistical analyses, and the obtained knowledge is incorporated with the deep learning method. For pre-processing heat maps and statistical analysis were conducted using k-means clustering and Davies-Bouldin (DB) score to identify the optimum number of input groups. Further, the accuracy levels were compared with the deep learning method that was built with just data alone as input. The proposed models were evaluated on two selected bus routes, 19B and M1, in the City of Chennai, India. Results show that the provision of domain-related information having a positive impact on the prediction accuracy of up to 3% in selected routes. Performance comparison with existing methods such as historical average, linear regression, ANN, LSTM, and Conv-LSTM was also carried out and it was observed that the proposed method performed better than other existing methods.

Acknowledgments

The authors acknowledge the support of this study as a part of the IMPRINT project funded by SERB, DST, Government of India, through sanction order number IMP/2018/001850.

DATA AVAILABILITY STATEMENT

All data, models, and code that support the findings of this study are available from the corresponding author upon reasonable request.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the Department of Science and Technology (IN) [IMP/2018/001850].

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 273.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.