Skip to content

Feature selection is done by using filter methods along with data cleaning.

Notifications You must be signed in to change notification settings

IshtyM/Feature-Selection-and-Data-Cleaning-

Repository files navigation

Feature-Selection-and-Data-Cleaning-

feature selection are used as it enables the machine learning algorithm to train faster and reduces the complexity of a model and makes it easier to interpret. It improves the accuracy of a model if the right subset is chosen and it reduces overfitting. Filter methods are generally used as a preprocessing step. The selection of features is independent of any machine learning algorithms. Instead, features are selected on the basis of their scores in various statistical tests for their correlation with the outcome variable. The correlation is a subjective term here. In this file, filter method i.e. select-k-best i.e. chi2 is used for the feature selection.

Libraries Used:

Pandas, Numpy

Programing Language

Python

IDE Used

Jupyter Notebook

Releases

No releases published

Packages

No packages published