273
Views
4
CrossRef citations to date
0
Altmetric
Article

Automated Valuation Services: A case study for Aberdeen in Scotland

&
Pages 154-172 | Received 29 Feb 2020, Accepted 03 Dec 2020, Published online: 01 Mar 2021
 

ABSTRACT

Automated valuation services (AVSs) offered by listings platforms predict market values based on property characteristics supplied by users. We investigate the implementation of such a service for the City of Aberdeen. We fit different market value models with machine learning methods and assess them in a rolling windows procedure that mimics an AVS setting. We also investigate the ease and robustness with which the models can be implemented. We discuss how prediction uncertainty can be measured and reported to users. If implemented in the future, such a service has the potential to improve the transparency of the local housing market.

Acknowledgments

We are grateful to the three anonymous referees and the editor of the journal for suggestions and comments that helped to improve the paper. We thank seminar participants at the Technische Universität Berlin, Fiona Stoddard, and Verity Watson for helpful comments. The usual disclaimer applies.

Disclosure statement

No potential conflict of interest was reported by the authors.

Notes

1. Platforms such as ImmoScout24 and Immowelt-Immonet in Germany, Hometrack and Zoopla in the UK, and Redfin and Zillow in the US, offer such automated valuation services.

2. Keeping details about the implementation private prevents that competitors can copy it. Matysiak (Citation2017) finds that US providers are more open than European providers with respect to information on accuracy.

3. As residential markets differ, it seems likely that models that work well for one market might not do so for another market. The no free lunch theorem from machine learning applies here, see Murphy (Citation2012, p. 24): no single statistical model will be best for each and every application.

4. Examples include: Anglin and Gençay (Citation1996), Parmeter et al. (Citation2007), and Haupt et al. (Citation2010), which examine the same data set but use different models (semi-, non-, or fully parametric). Martins-Filho and Bin (Citation2005) is another example of a semiparametric model. Bourassa et al. (Citation2010) fit different spatial models for a given data set and compare predictions. McCluskey et al. (Citation2013) fit linear, neural net, and geospatial models to a given data set and compare predictions.

5. A small number of characteristics will also make it easy to check whether AVS users request plausible characteristic combinations. Naturally, a larger number of characteristics describe properties better and should result in better predictions.

6. The remaining 26,817 properties were social housing and thus in the public sector, which is not relevant here. The above numbers are collated from Aberdeen City Council (Citation2018), National Records of Scotland (Citation2018), and Registers of Scotland (Citation2018).

7. The ASPC distributes also a printed register of the listings in the Aberdeen area.

8. Erroneous values are obvious mistakes such as a property with zero rooms or location coordinates that are outside Aberdeen City.

9. We conducted the analysis also with separate room categories. The results of this analysis are qualitatively identical to the ones reported here, see the web-appendix for details.

10. The mean squared error on the right-hand side of EquationEq. 2 becomes smaller the more (price determining) characteristics x contains.

11. For instance, with a cubic polynomial basis, d=3, we obtain f(x,θ)=[x,x2,x3]θ.

12. The cubic splines basis is f(x,θ)=[x,x2,x3,|xκ1|3,,|xκK|3]θ. It extends a polynomial with truncated terms |xκk|, which join at knots, κK>>κ1. The larger the number of knots K, the more flexible will f(x,θ) be. θ is constrained further to ensure that the function is linear beyond the boundary knots. The thin plate splines basis is a two-dimensional extension of the cubic splines basis (Wood, Citation2017, Ch. 5).

13. If properties have up to three bathrooms, split sets are {1,(2,3)}, {2,(1,3)} {3,(1,2)}.

14. The random forest is also robust against outliers, as unusual observations will appear in only few re-sampled training samples. We deal with unusual observations by setting bounds, but additional robustness can be useful.

15. We also examined the predictive performance when the windows are rolled forward by one month. The results are comparable to those reported here, see the web-appendix. We did not examine the effect the length of the training sample. The samples covering four quarters (twelve months) produce good in-sample fits for all five models and we believe that improvements, if any at all, would be only marginal.

16. The fractions can be obtained by computing the observation-weighted averages of RER(0.1) and RER(0.2) from Table 3 (boosting machine) and Table 4 (penalised spline).

17. Over the period of our case study, the quality-controlled house price index for Aberdeen declined, on average, by 1.5% per quarter. A statistical model could be used to forecast changes of the price trend and these could then be used to adjust market value predictions. However, this task is separate from finding the best market value model. We do not approach it here.

18. Since we fit the models to location coordinates, a web service also needs to integrate digital maps to match a street address – as provided by the user – to coordinates.

19. The large MSRE is caused by a single property in the test sample that has a floor area just outside the range observed in the training sample.

20. The bootstrap is a third approach for the construction of prediction intervals, which often produces better finite sample results than standard method. We refer to Krause et al. (Citation2020) for a comparison of the bootstrap and standard approaches. For an application of conformal prediction intervals see Bellotti (Citation2017).

Additional information

Notes on contributors

Rainer Schulz

Rainer Schulz studied Economics at Freie Universität Berlin and received his doctoral degree in Economics and Econometrics from the Humboldt-Universität zu Berlin. He is a senior lecturer at the University of Aberdeen Business School. His research has been published, amongst others, in the Journal of Real Estate Finance and Economics, the Journal of Housing Economics, and the Journal of Urban Economics.

Martin Wersing studied Business Economics at Freie Universität Berlin and received his doctoral degree in Economics and Econometrics from Technische Universität Berlin. He is a lecturer at the University of Aberdeen Business School. His research has been published, amongst others, in Empirical Economics, the Journal of Housing Economics, and the Journal of Real Estate Finance and Economics.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.