Research Article

Does the Written Word Matter? The Role of Uncovering and Utilizing Information from Written Comments in Housing Ads

Pages 133-155 | Received 19 Apr 2019, Accepted 15 Jul 2019, Published online: 04 Dec 2020
 

Abstract

The hedonic price model is a popular method for estimating the implicit prices of a property's observed attributes. However, its inputs are limited to numerically quantified information. This study quantifies the unstructured qualitative statements contained in the written descriptions in Multiple Listing Service (MLS) data. These statements describe the features and setting of the house, providing important but typically unused qualitative information. Our approach is unique in that we classify the words in these descriptions into eight groups that reflect previously unmeasured housing quality. The purpose of the study is to test whether these previously unmeasured attributes affect a property's selling price and its time on the market. The dataset consists of 5,160 home sales in Ames, Iowa, between the second quarter of 2003 and the second quarter of 2015. Our findings show that the role of unstructured qualitative text varies: some groups are redundant to the quantitative information already in the models and have no effect, while others, particularly those reflecting the quality of the structure, carry unique information and are important predictors of housing prices and time on the market.

Notes

1 The assumption allows us to decompose a review into (E, O) word pairs. First, identify an E word in the review, then match it to a nearby O word. In this framework, every review can be separated into a set of bi-term word pairs. Within each word pair, the E word is used for scoring and the O word is used to identify topics.
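
The pairing step described in note 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the E and O word lists, the numeric scores, the three-token window, and the function names `extract_biterms` and `score_by_topic` are all hypothetical, since the note does not specify the vocabularies or how "near" is defined.

```python
import re
from collections import defaultdict

# Hypothetical vocabularies: the note does not list the actual E and O words.
# E words carry an (assumed) score; O words name the topic being described.
E_WORDS = {"beautiful": 1, "spacious": 1, "updated": 1, "dated": -1}
O_WORDS = {"kitchen", "yard", "bathroom", "garage"}


def extract_biterms(text, window=3):
    """Return (E word, O word) pairs found in one listing comment.

    Each E word is matched to the closest O word within `window` tokens
    on either side. The window size is an assumption of this sketch.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    pairs = []
    for i, token in enumerate(tokens):
        if token not in E_WORDS:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        # Collect (distance, position) for every O word in the window.
        candidates = [(abs(j - i), j) for j in range(lo, hi) if tokens[j] in O_WORDS]
        if candidates:
            # Closest O word wins; ties go to the earlier position.
            _, j = min(candidates)
            pairs.append((token, tokens[j]))
    return pairs


def score_by_topic(pairs):
    """Sum the assumed E-word scores for each O word (topic)."""
    totals = defaultdict(int)
    for e_word, o_word in pairs:
        totals[o_word] += E_WORDS[e_word]
    return dict(totals)


comment = "Beautiful updated kitchen with a spacious yard."
biterms = extract_biterms(comment)
print(biterms)
print(score_by_topic(biterms))
```

E words with no O word inside the window are simply dropped here; whether the paper discards or handles such cases differently is not stated in the note.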

2 This allocation can be relaxed to incorporate a supervised learning procedure in which we first provide some seed bi-terms and then match the remaining bi-terms to the seeds using a trained model or an online dictionary. This is explored in Im et al. (2019). In this paper, however, the focus is on the utility of the bi-term construction and its interpretation. Manual classification was a feasible approach here given the size of the market studied.

3 The absorption rate is calculated by Flexmls, which covers the entirety of Story County, Iowa.

4 Because Ames is a college town, this may reflect the high number of residents holding Bachelor’s degrees (graduate students) in lower-priced neighborhoods dominated by students.
