Feature-Selection-and-Data-Cleaning-

feature selection are used as it enables the machine learning algorithm to train faster and reduces the complexity of a model and makes it easier to interpret. It improves the accuracy of a model if the right subset is chosen and it reduces overfitting. Filter methods are generally used as a preprocessing step. The selection of features is independent of any machine learning algorithms. Instead, features are selected on the basis of their scores in various statistical tests for their correlation with the outcome variable. The correlation is a subjective term here. In this file, filter method i.e. select-k-best i.e. chi2 is used for the feature selection.

Libraries Used:

Pandas, Numpy

Programing Language

Python

IDE Used

Jupyter Notebook

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Cleaned Data.xlsx		Cleaned Data.xlsx
Data Set.xlsx		Data Set.xlsx
Feature Selection and Data Cleaning .ipynb		Feature Selection and Data Cleaning .ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Feature-Selection-and-Data-Cleaning-

Libraries Used:

Programing Language

IDE Used

About

Releases

Packages

Languages

IshtyM/Feature-Selection-and-Data-Cleaning-

Folders and files

Latest commit

History

Repository files navigation

Feature-Selection-and-Data-Cleaning-

Libraries Used:

Programing Language

IDE Used

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages