Members:
- Anidipta Pal
- Ankana Datta
- Ananyo Dasgupta
Phone calls in no-mobile zones can be disruptive and pose safety risks. To address this, we propose a mobile detector system that identifies individuals talking on phones in restricted areas such as gas stations, train stations, and hospitals. The system will trigger alerts and store details of offenders in a database.
-
📝 Data Collection and Annotation: We collected CCTV and surveillance camera footage, creating a dataset of images at 40 FPS. These images were annotated with bounding boxes around relevant objects using LabelImg.
-
🔧 Image Preprocessing: Images were reshaped, scaled, normalized, and filtered. They were formatted for training, validation, and testing using YOLO's
.yaml
extension, including class labels and bounding box coordinates. -
🛠️ Model Building and Configuration: Two models were developed: YOLOv8x and a hybrid architecture combining YOLOv8 with ResNetv2. These models detect and classify people talking on phones and public place violations. Configurations include hyperparameters like image size, batch size, anchor boxes, and activation functions.
-
📚 Model Training: The dataset is split into training (70%), validation (20%), and testing (10%) sets. Both models are trained to detect objects and minimize detection loss.
-
📊 Evaluation: Models are evaluated using metrics such as precision, recall, and Mean Average Precision (mAP) to measure their performance.
-
💾 Database Connection: The system will connect to an SQL database to store and display information about detected violations in a chart format. (To be developed and integrated into the project)
-
🔧 Fine-Tuning: Model performance is enhanced using additional CNN layers and by training with more epochs to achieve lower log loss and improved accuracy, ROC, and IoU.
-
🔍 Model Testing: Both models are tested on unseen data to evaluate generalization and real-world performance. After training, the model will be deployed and monitored using Azure ML infrastructure.
-
🏥 Hospitals: Detect phone use in hospital wards and cabins to maintain a peaceful environment for patients and visitors.
-
🚦 Pedestrian Crossings: Identify individuals talking on phones while crossing roads to enhance safety.
-
⛽ Gas Stations: Detect phone use at gas stations to prevent safety hazards and preserve fire safety.
-
📚 Libraries: Ensure a quiet environment by detecting phone use and reminding patrons to keep noise to a minimum.
-
🎥 Dataset Collection: Obtaining real-time CCTV footage was challenging as such datasets are not publicly available and are protected.
-
🖼️ Image Processing: Standardizing and normalizing images for training was difficult due to noise and limited availability.
- Matplotlib
- YOLOv8x
- Scikit-Learn
- Pillow
- ResNetV2
- OpenCV
- Streamlit
In this video, we showcase our working model prototype for images, videos, and webcam.