This project demonstrates the application of the k-Nearest Neighbors (KNN) algorithm for classification. The project involves loading a dataset, preprocessing the features, exploring the data through visualizations, building a KNN model, and evaluating its performance.
- Getting Started
- Data
- Data Preprocessing
- Exploratory Data Analysis (EDA)
- Model Building
- Model Evaluation
- Conclusion
## Getting Started

To get started, ensure you have the required dependencies installed. You can install them with:

```bash
pip install pandas numpy seaborn matplotlib scikit-learn
```
## Data

The dataset for this project is stored in the file `KNN_Project_Data.csv`. It is loaded into a pandas DataFrame, and the first five rows are displayed with the `head()` method.
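A minimal loading sketch (the variable name `data` follows the usage above, and the CSV is assumed to sit in the working directory):

```python
import pandas as pd

# Load the dataset and preview the first five rows
data = pd.read_csv('KNN_Project_Data.csv')
print(data.head(5))
```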
## Data Preprocessing

To prepare the features for the KNN algorithm, standardization is applied using `StandardScaler` from scikit-learn. Standardization rescales each feature to a mean of 0 and a standard deviation of 1, which is essential for distance-based algorithms like KNN, where features on larger scales would otherwise dominate the distance computation.
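A sketch of the scaling step, continuing from the snippet above (it assumes `TARGET CLASS`, mentioned in the EDA section below, is the last column of the DataFrame):

```python
from sklearn.preprocessing import StandardScaler
import pandas as pd

# Fit the scaler on the feature columns only (the target is excluded)
scaler = StandardScaler()
scaler.fit(data.drop('TARGET CLASS', axis=1))

# Transform the features and rebuild a DataFrame with the original
# column names (assumes 'TARGET CLASS' is the last column)
scaled_features = scaler.transform(data.drop('TARGET CLASS', axis=1))
df_scaled = pd.DataFrame(scaled_features, columns=data.columns[:-1])
```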
## Exploratory Data Analysis (EDA)

An exploratory data analysis is conducted to visually explore relationships between features. Seaborn's `pairplot` is used, with the `hue` parameter set to `'TARGET CLASS'` to distinguish between classes.
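A minimal version of that plot:

```python
import seaborn as sns
import matplotlib.pyplot as plt

# Color each pairwise scatterplot by class to reveal which feature
# combinations separate the two classes
sns.pairplot(data, hue='TARGET CLASS')
plt.show()
```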
## Model Building

The dataset is split into training and testing sets using the `train_test_split` function from scikit-learn. A KNN classifier is instantiated with `n_neighbors=1` and trained on the training set, as sketched below.
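A sketch of the split-and-fit step, continuing from the scaled features above (`test_size=0.3` and `random_state=101` are illustrative choices, not taken from the original project):

```python
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Split the standardized features and the target into train/test sets
# (test_size and random_state are assumed values for illustration)
X_train, X_test, y_train, y_test = train_test_split(
    scaled_features, data['TARGET CLASS'], test_size=0.3, random_state=101
)

# Train a KNN classifier with a single neighbor and predict on the test set
knn = KNeighborsClassifier(n_neighbors=1)
knn.fit(X_train, y_train)
pred = knn.predict(X_test)
```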
## Model Evaluation

The performance of the KNN model is evaluated using common classification metrics. The confusion matrix and classification report are printed to the console for a detailed assessment:
```python
from sklearn.metrics import classification_report, confusion_matrix

# Summarize prediction quality on the held-out test set
print(confusion_matrix(y_test, pred))
print(classification_report(y_test, pred))
```
## Conclusion

This README provides an overview of the KNN project, covering data loading, preprocessing, exploratory data analysis, model building, and evaluation. Further details and code documentation can be found in the accompanying Python script.