83
Views
0
CrossRef citations to date
0
Altmetric
Articles

Assessment of feature selection for student academic performance through machine learning classification

, , &
 

Abstract

Regression analysis is used to find the trends in the data. The analysis helps to find the relationship between dependent and independent variables in the dataset. It also suggests the degree of influence of Independent variables towards the prediction of desired outcome. Multiple Linear Regression technique builds a model with more than one predictor by identifying the statistical relationship between them. This paper evaluates and analyzes the performance of multiple linear regression models and suggests a way to improve the model by Feature Selection. The performance of the model with and without backward elimination is analyzed for the Student Academic Performance dataset from Kaggle repository. The optimized model is experimented with various classifiers such as Logistic, KNN, Kernel SVM, Naïve Bayes, Decision Tree and Random Forest and its efficiency is assessed through metrics such as Precision, Recall, FScore and Accuracy.

Subject Classification:

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.