This repository contains a project focused on predicting diamond prices using the Diamonds Price Dataset. With features ranging from carat weight to clarity, the goal is to develop a robust machine learning model that accurately forecasts diamond valuations.
The dataset includes the following columns:
- carat: Diamond weight in carat
- cut: Diamond cutting quality
- color: Diamond color ranging from J (worst) to D (best)
- clarity: Clarity measure from I1 (worst) to IF (best)
- x: Diamond length in mm
- y: Diamond width in mm
- z: Diamond depth in mm
- depth: Percentage depth (z / mean(x,y))
- table: Width of the widest point at the top of the diamond
- price: Target variable representing diamond price
- Data Exploration: Analyze diamond attributes and their correlation with prices.
- Model Development: Train machine learning models to predict diamond prices.
- Evaluation: Assess model performance using appropriate metrics.
- Insights: Extract meaningful insights for diamond valuation.
- Python
- Pandas
- Scikit-learn
- Matplotlib
- Seaborn
-
Clone the repository: git clone https://github.com/wonderakwei/diamonds-price-prediction.git
-
Navigate to the project directory: cd diamonds-price-prediction
-
Install required packages: pip install -r requirements.txt
-
Run the Jupyter Notebook or Python scripts to explore and train models.