3,553
Views
10
CrossRef citations to date
0
Altmetric
Research Note

Corporate Risk Disclosure and Audit Fee: A Text Mining Approach

, , &
Pages 583-594 | Received 01 Feb 2016, Accepted 01 Apr 2017, Published online: 23 May 2017
 

Abstract

The aim of this study is to introduce an innovative text mining approach to assess firms' risks using unstructured textual disclosure from annual reports. Specifically, we use Natural Language Processing techniques to extract firms' self-identified risks including financial, strategic, operational, and hazard risks based on an enterprise risk management framework. We examine the association between these four risk measures derived from the risk factor section in 10-K filings and audit fees. The results show that audit fees are significantly and positively related to firm-specific financial, strategic, and operational risks, indicating the informativeness of corporate textual risk disclosures. This study provides direct support for the recent US reporting regulatory requirement of adding a new section on risk factors in corporate annual reports.

Acknowledgements

We very much appreciate the insightful comments and suggestions of Laurence van Lent (editor) and two anonymous referees of the European Accounting Review that greatly improved this paper. We also thanks for Yi Liang, Botao Chen, Alfred Zhu Liu and seminar participants at the 2016 AAA Annual Conference, Mid-Atlantic Region Meeting and CSWIM Meeting. All authors gratefully acknowledge financial support from Saunders College of Business. All remaining errors or omissions are the authors' responsibility.

Notes

1 Unstructured data refer to information that either does not have a pre-defined data model or is not organized in a pre-defined manner. It is typically text-heavy and often includes multimedia content. While some files may have an internal structure, they are still considered unstructured because the data do not fit in a database (Feldman & Sanger, Citation2007). In contrast, structured data like spreadsheets are easily searchable by basic algorithms. Therefore, it is more difficult to analyze unstructured textual documents than structured ones.

2 The number of observations drops by 10,266 due to the calculation of RESTPROB and ACCR. In untabulated analysis, we re-run our tests excluding these two variables and the results still hold.

3 In the main analysis, we use industry-adjusted risks, measured as the difference between the number of risk sentences from the risk factor section and the industry-median of the sentence count number based on the three-digit SIC industry code in a given year. In the untabulated robustness check, we also use raw risks, measured as the count number of risk sentences from the risk factor section. Our results remain qualitatively unaltered.

4 In addition to the total number of sentences in 10-Ks as a control variable, we recalculate the level and change of raw risk measures deflated by the total number of sentences in 10-Ks and re-estimate the model (1). Our main results still hold.

5 Following Doyle et al. (Citation2007), for our pre-404 period, we start with firms that had a 404 audit in the initial year of the 404 audit, that is, the fiscal year ending on or after 15 November 2004. Then, we trace back one year to obtain ICMW in year 2003. In untabulated tests, our main results presented in remain almost identical no matter we code ICMW using IC weakness under SOX 302 in 2003 or for the entire sample period.

6 For example, one standard deviation of increase in financial risk for a median firm is related to increase in audit fee by $18,256 ($1,005,504*0.001*18.156).

7 The insignificant coefficient on the hazard risk could be the results of imprecise estimation due to the low variance of independent variables.

Additional information

Funding

All authors gratefully acknowledge financial support from Saunders College of Business.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.