Human Action Recognition (HAR) aims to understand human behavior and assign a label to each action. It has a wide range of applications, and therefore has been attracting increasing attention in the field of computer vision. Human actions can be represented using various data modalities, such as RGB, skeleton, depth, infrared, point cloud, event stream, audio, acceleration, radar, and WiFi signal, which encode different sources of useful yet distinct information and have various advantages depending on the application scenarios.
Consequently, many existing works have investigated different approaches to HAR using these various modalities.
Our task is to build an image classification model using a CNN that classifies which class of activity a human is performing. We also built a chatbot that can answer queries about videos.
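For orientation, below is a minimal sketch of the kind of CNN classifier this involves, written with Keras; the layer sizes and input resolution are illustrative assumptions rather than the exact architecture used in the training notebook.

```python
# Minimal CNN sketch for 15-way activity classification (hypothetical layer
# sizes and input resolution; the real architecture lives in the notebook).
import tensorflow as tf
from tensorflow.keras import layers, models

NUM_CLASSES = 15           # the dataset has 15 activity classes
IMG_SIZE = (160, 160)      # assumed input size; match it to the notebook

model = models.Sequential([
    layers.Rescaling(1.0 / 255, input_shape=IMG_SIZE + (3,)),
    layers.Conv2D(32, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(128, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dense(NUM_CLASSES, activation="softmax"),
])

model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```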
This repository contains all the code and data used in the project. It includes the dataset from Kaggle (https://www.kaggle.com/datasets/shashankrapolu/human-action-recognition-dataset), the completed Jupyter notebook for training, the Flask app to serve predictions, and other utility scripts. Feel free to modify any aspect of the code to suit your needs.
Once you have cloned the repository, install the dependencies with pipenv using the provided Pipfile and execute all commands through pipenv. Also make sure to set the correct path to the video file in camera.py on line 11. To install pipenv and the dependencies, and to run main.py or flask_main.py, execute the following commands from your terminal or command prompt, substituting the right paths where necessary:
$ cd \path\to\Project\
$ pip install pipenv
$ pipenv install
$ python main.py/flask_main.py
or
$ pipenv run python3 main.py/flask_main.py
Make sure Flask, OpenCV, and TensorFlow are installed as the major dependencies.
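For reference, the Flask app serves predictions over HTTP. Below is a minimal sketch of what such a prediction endpoint could look like; the route name, saved-model path, input size, and preprocessing are illustrative assumptions and may differ from the actual flask_main.py in this repository.

```python
# Hypothetical sketch of a Flask prediction endpoint; the actual flask_main.py
# may differ in routes, model path, and preprocessing.
import tensorflow as tf
from flask import Flask, request, jsonify

app = Flask(__name__)
model = tf.keras.models.load_model("model.h5")   # assumed saved-model path

CLASS_NAMES = ['calling', 'clapping', 'cycling', 'dancing', 'drinking',
               'eating', 'fighting', 'hugging', 'laughing',
               'listening_to_music', 'running', 'sitting', 'sleeping',
               'texting', 'using_laptop']

@app.route("/predict", methods=["POST"])
def predict():
    # Expect an image uploaded in the form field "image".
    file = request.files["image"]
    img = tf.io.decode_image(file.read(), channels=3)
    img = tf.image.resize(img, (160, 160)) / 255.0   # assumed input size
    probs = model.predict(tf.expand_dims(img, 0))[0]
    return jsonify({"activity": CLASS_NAMES[int(probs.argmax())]})

if __name__ == "__main__":
    app.run(debug=True)
```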
The dataset features 15 different classes of human activities and contains more than 12k labelled images, including the validation images. Each image belongs to exactly one activity category, and the images are saved in separate folders named after the labelled classes.
From the above link you can download a zip file named 'Human_Activities.zip'. After you extract this zip file, you will get four files, including the following two folders:
- Train - contains all the images to be used for training your model. In this folder you will find 15 subfolders, namely 'calling', 'clapping', 'cycling', 'dancing', 'drinking', 'eating', 'fighting', 'hugging', 'laughing', 'listening_to_music', 'running', 'sitting', 'sleeping', 'texting', and 'using_laptop', which contain the images of the respective human activities.
- Test - contains 5400 images of human activities. For these images you are required to predict one of the same class names: 'calling', 'clapping', 'cycling', 'dancing', 'drinking', 'eating', 'fighting', 'hugging', 'laughing', 'listening_to_music', 'running', 'sitting', 'sleeping', 'texting', 'using_laptop'.
Download the dataset using
kaggle datasets download -d shashankrapolu/human-action-recognition-dataset
Make sure the directory structure looks like:
.
├── Dataset
│   ├── test
│   │   ├── Image_1.jpg
│   │   ├── Image_10.jpg
│   │   ├── Image_100.jpg
│   │   ├── Image_1000.jpg
│   │   ├── Image_1001.jpg
│   │   └── ...
│   └── train
│       ├── Image_1.jpg
│       ├── Image_10.jpg
│       ├── Image_100.jpg
│       ├── Image_1000.jpg
│       ├── Image_1001.jpg
│       └── ...
In total, the extracted dataset contains about 18.0k files.
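If the train folder is organised into one subfolder per activity class, as described in the dataset section above, the images can be loaded straight from disk, for example with tf.keras.utils.image_dataset_from_directory; the paths, split, and image size below are assumptions to adapt to your setup.

```python
# Sketch of loading the labelled training images from class subfolders.
# "Dataset/train" and the image size are assumptions; match them to your layout.
import tensorflow as tf

train_ds = tf.keras.utils.image_dataset_from_directory(
    "Dataset/train",
    validation_split=0.2,      # hold out 20% of the images for validation
    subset="training",
    seed=42,
    image_size=(160, 160),
    batch_size=32,
)
val_ds = tf.keras.utils.image_dataset_from_directory(
    "Dataset/train",
    validation_split=0.2,
    subset="validation",
    seed=42,
    image_size=(160, 160),
    batch_size=32,
)
print(train_ds.class_names)    # should list the 15 activity classes
```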
We also tried to create a chatbot for answering questions about video context, but it did not work well within the time frame.
At first we tried using the IBM Watson Assistant chatbot to answer questions from retrieval documents, but due to API key and cloud account problems it did not work out. Then, due to time limitations, we created a basic program that answers questions about the video from a hand-built dictionary.
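For illustration, a toy sketch of that dictionary-based approach might look like the following; the questions and answers are placeholders, not the actual entries used in the project.

```python
# Toy sketch of a dictionary-backed Q&A loop about a video; the entries are
# placeholders, not the actual questions/answers used in this project.
video_facts = {
    "what activity is shown": "The video shows a person cycling.",
    "how many people are in the video": "There is one person in the video.",
    "how long is the video": "The clip is about 30 seconds long.",
}

def answer(query: str) -> str:
    key = query.lower().strip().rstrip("?")
    if key in video_facts:
        return video_facts[key]
    # Fall back to a simple substring match when there is no exact hit.
    for question, reply in video_facts.items():
        if key in question or question in key:
            return reply
    return "Sorry, I don't have an answer for that."

if __name__ == "__main__":
    while True:
        q = input("Ask about the video (or 'quit'): ")
        if q.lower() == "quit":
            break
        print(answer(q))
```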