Ground Water Quality Assessment using Machine Learning

This repository contains a machine learning model for assessing the quality of ground water based on various chemical and physical parameters. The model is designed to predict the suitability of ground water for different purposes, such as drinking, livestock, poultry, and crop cultivation.

Project Overview

The project aims to develop and compare the performance of various machine learning algorithms for a multi-class classification problem, where the goal is to predict the ground water quality based on provided features. The features include districts, mandals, villages, latitude, longitude, and various chemical concentrations.

The primary objectives of this project are:

Preprocess and integrate the available ground water quality data.
Implement and train various machine learning models, including Softmax Classification, Decision Trees, Random Forests, Naive Bayes, and K-Nearest Neighbors.
Evaluate and compare the performance of the trained models using metrics such as accuracy, precision, recall, F1-score, and area under the receiver operating characteristic curve (AU-ROC).
Explore additional techniques like feature engineering, ensemble methods (e.g., XGBoost, GBM), and dimensionality reduction to improve model performance.
Investigate the calculation of the Entropy-based Water Quality Index (EWQI) as an alternative to the traditional Water Quality Index (WQI).
Develop a web application for deploying the trained models, allowing users to input chemical compositions and predict ground water quality.

Dataset

The project uses the following datasets provided by the Telangana Open Data Portal:

Repository Structure

The repository is structured as follows:

ground-water-quality-model/
├── data/
│   ├── pre_monsoon/
│   └── post_monsoon/
├── models/
│   ├── softmax.py
│   ├── decision_tree.py
│   ├── random_forest.py
│   ├── naive_bayes.py
│   └── knn.py
├── utils/
│   ├── data_preprocessing.py
│   ├── evaluation.py
│   └── visualization.py
├── app/
│   ├── static/
│   ├── templates/
│   └── app.py
├── requirements.txt
├── README.md
└── LICENSE

data/: Directory for storing the ground water quality datasets.
models/: Directory containing the implementations of various machine learning algorithms.
utils/: Directory with utility functions for data preprocessing, evaluation, and visualization.
app/: Directory for the web application code, including static files and templates.
requirements.txt: File listing the required Python packages and their versions.
README.md: This file, providing an overview of the project and repository.
LICENSE: File containing the license information for the project.

Getting Started

To get started with this project, follow these steps:

Clone the repository: git clone https://github.com/your-username/ground-water-quality-model.git
Install the required Python packages: pip install -r requirements.txt
Preprocess the data by running the appropriate scripts in the utils/ directory.
Train the machine learning models by executing the corresponding scripts in the models/ directory.
Evaluate the trained models using the evaluation functions in utils/evaluation.py.
Explore additional techniques, such as feature engineering, ensemble methods, and dimensionality reduction, as desired.
To run the web application, navigate to the app/ directory and execute python app.py.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
KNN		KNN
Premonsoon		Premonsoon
Softmax_Implementation_And_Data_Preprocessing		Softmax_Implementation_And_Data_Preprocessing
decision_tree_implementation		decision_tree_implementation
postmonsoon		postmonsoon
.DS_Store		.DS_Store
CS361 Machine Learning Project Presentation - Neural Nexus.pptx		CS361 Machine Learning Project Presentation - Neural Nexus.pptx
LICENSE		LICENSE
Neural Nexus - Ground Water Assessment.pdf		Neural Nexus - Ground Water Assessment.pdf
PCA.ipynb		PCA.ipynb
README.md		README.md
Reduced_features_dataset.csv		Reduced_features_dataset.csv
combined_dataset.ipynb		combined_dataset.ipynb
dataset.ipynb		dataset.ipynb
dataset.py		dataset.py
heatmap_ref.png		heatmap_ref.png
label_years.png		label_years.png
multi_class_svm.ipynb		multi_class_svm.ipynb
output.png		output.png
pca.png		pca.png
pca_heatmap.png		pca_heatmap.png
random_forest_classifier.ipynb		random_forest_classifier.ipynb
requirements.txt		requirements.txt
rf.png		rf.png
svm.py		svm.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ground Water Quality Assessment using Machine Learning

Project Overview

Dataset

Repository Structure

Getting Started

License

About

Releases

Packages

Contributors 4

Languages

License

heckop/GroundWater_analyzer

Folders and files

Latest commit

History

Repository files navigation

Ground Water Quality Assessment using Machine Learning

Project Overview

Dataset

Repository Structure

Getting Started

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages