Abstract
The data employed in urban gain modeling classes are often imbalanced, negatively affecting the accuracy of traditional and standard data mining and machine learning models. This study presents a new framework on the basis of clustering-based modeling and ensemble models to deal with the class imbalance problem in urban gain modeling. The random forest (RF), artificial neural network (ANN) and support vector machine (SVM) models served as the base models for the generation and evaluation of the results within this framework. The changes in urban land-use pattern of Isfahan in Iran in two time intervals of 1994-2004 and 2004-2014 were considered for the modeling. The findings showed that the proposed sampling strategy yields higher Hits and Correct Rejections rates than the strategies applied in previous studies in all three models. In the second part of the proposed framework (ensemble models), there was no substantial difference in the confusion matrix entries.
Disclosure statement
No potential conflict of interest was reported by the author(s).