279
Views
2
CrossRef citations to date
0
Altmetric
Articles

Identifying low-quality patterns in accident reports from textual data

, , ORCID Icon, , &
Pages 1088-1100 | Published online: 13 Sep 2022
 

Abstract

Accident investigation reports provide useful knowledge to support companies to propose preventive and mitigative measures. However, the information presented in accident report databases is normally large, complex, filled with errors and has missing and/or redundant data. In this article, we propose text mining and natural language processing techniques to investigate low-quality accident reports. We adopted machine learning (ML) to detect and investigate inconsistencies on accident reports. The methodology was applied to 626 documents collected from an actual hydroelectric power company. The initial ML performances indicated data divergences and concerns related to the report structure. Then, the accident database was restructured to a more proper form confirming the supposition about the quality of the reports investigated. The proposed approach can be used as a diagnostic tool to improve the design of accident investigation reports to provide a more useful source of knowledge to support decisions in the safety context.

Acknowledgements

The authors thank the National Agency for Research (CNPq), the Foundation of Support for Science and Technology of Pernambuco (FACEPE), Coordenação de Aperfeiçoamento de Pessoal de Nível Superior – Brasil and the ‘Human Resources Program (PRH) da National Oil Company (ANP) and Finep (Brazilian Innovation Agency) – PRH-ANP 38.1: ‘Risk Analysis and Environmental Modeling in Exploration, Development and Production of Oil and Gas’ for financial support through research grants.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was supported by CNPQ [Grant Number 305696/2018-1], [Grant Number 309617/2019-7]; Fundação de Amparo a Ciência e Tecnologia do Estado de Pernambuco [Grant Number APQ-1101-3.08/21]; CAPES [Finance Code 001].

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 279.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.