prj.mp4
I have developed a pose-based deep learning model that is capable of recognizing six Indian dynamic signs using the Mediapipe framework for pose estimation. By using a sequential model, LSTM, I was able to achieve a high accuracy rate of 96.2%. The model has been tested in real-time and shows promising results for its ability to recognize dynamic signs.
LSTM networks are an extension of recurrent neural networks (RNNs) mainly introduced to handle situations where RNNs fail. Long Short-Term Memory (LSTM) was brought into the picture. It has been so designed that the vanishing gradient problem is almost completely removed, while the training model is left unaltered. LSTMs provide us with a large range of parameters such as learning rates, and input and output biases.
MediaPipe is a Framework for building machine learning pipelines for processing time-series data like video, audio, etc. This cross-platform Framework works on Desktop/Server, Android, iOS, and embedded devices like Raspberry Pi and Jetson Nano.
- OpenCV
- Mediapipe
- Matplotlib
- Numpy
- TensorFlow
- sklearn
The dataset consists of 5000+ data points. The format of data is .npy, which means numpy arrays.
- Creating Dataset
- Importing the required Libraries
- Specify the path for storing images
- Data Processing
- Designing Model structure
- Training Model
- Evaluation of model on test data
- Download the
.h5
file from your Google Drive
- Importing the required Libraries
- Implementation of Mediapipe
- Calling the Trained Model
- Implementation of Recoginition Model