Abstract
This article considers estimating functional-coefficient models in panel quantile regression with individual effects, allowing the cross-sectional and temporal dependence for large panel observations. A latent group structure is imposed on the heterogeneous quantile regression models so that the number of nonparametric functional coefficients to be estimated can be reduced considerably. With the preliminary local linear quantile estimates of the subject-specific functional coefficients, a classic agglomerative clustering algorithm is used to estimate the unknown group structure and an easy-to-implement ratio criterion is proposed to determine the group number. The estimated group number and structure are shown to be consistent. Furthermore, a post-grouping local linear smoothing method is introduced to estimate the group-specific functional coefficients, and the relevant asymptotic normal distribution theory is derived with a normalization rate comparable to that in the literature. The developed methodologies and theory are verified through a simulation study and showcased with an application to house price data from U.K. local authority districts, which reveals different homogeneity structures at different quantile levels.
Supplementary Materials
The supplemental document contains proofs of the main asymptotic theorems, technical lemmas with proofs, some extensions of the developed method and theory, and additional simulation and empirical results.
Acknowledgments
The authors would like to thank an Associate Editor and two reviewers for the constructive comments, which helped to substantially improve the article. The authors also thank Dr Chaowen Zheng for collecting the empirical data and preparing . The usual disclaimer applies.
Disclosure Statement
The authors report there are no completing interests to declare.
Notes
1 In the main model, we assume that Zt is a continuous random variable with a density function satisfying Assumption 2(ii). This ensures that the maximum distance between two consecutive observations of Zt is of order , indicating that there is a large number of observations in any neighborhood of when T is sufficiently large. This is crucial for the kernel-based nonparametric estimation methodology and theory to be developed in the subsequent sections.
2 When the index variable is Zit which varies over i and t, we estimate the functional coefficients at a set of equidistant grid points , where m is a sufficiently large positive integer. Then, we define .
3 Due to data unavailability, we consider LADs only in England and Wales.
4 Four LADs from England and Wales - Aylesbury Vale, Gloucester, Norwich, and Powys, are excluded due to their outlying values. This gives a total of 335 LADs for the subsequent analysis.
5 More data details can be found in Chen, Shin, and Zheng (Citation2022).
6 The membership of the two groups at is similar to that of the two groups at with a large number of overlapping member LADs (e.g., for Group 1, there are 45 overlapping LADs and for Group 2, there are 247 overlapping LADs).