Abstract
The current algorithms can only effectively fill in fewer missing data when filling out missing data in office automation (OA) cooperative office system, and the filling accuracy is worse when the missing data is large. In order to solve this problem, a missing data filling algorithm based on improved Mahalanobis distance is proposed. It used a top-down recursion to process raw data, got the training set of the observation sample and the associated class labels, and generated readable rules and decision trees. It optimizes the generated decision tree by using the genetic algorithm of the elite strategy, and classifies the missing data through the optimized decision tree. According to the classification results combined with the entropy calculation method in the improved Mahalanobis distance algorithm, there is a single decreasing relationship between entropy size and Mahalanobis distance of missing data, which can reduce the impact on the predictive value of related data, and achieve the missing data filling in OA cooperative office system. The experimental results showed that the proposed algorithm can effectively improve the accuracy of data filling, and the accuracy of data filling can be stable over 80% under different missing rates.
Disclosure statement
No potential conflict of interest was reported by the author.
Additional information
Notes on contributors
![](/cms/asset/4824aa9f-5a0b-472a-a42d-57563b929fc6/tjca_a_1662172_ilg0001.gif)
Jia Yu
Jia Yu (1981-) received her Master's degree in computer science and technology from EAST CHINA Jiaotong University in 2009. She is currently a lecturer in the college of Information Engineering and Artificial Intelligence of EAST CHINA Jiaotong University. Her research interests include database management technology, office automation technology and sensor networks. She presided over and completed a provincial project, participated in a number of national research projects, and published a number of related academic papers.