GitHub - PancakeAwesome/ran_two_stream_to_recognize_drives: using Recurrent attention network using spatial-temporal relations for action recognition in driving

Action Recognition using Visual Attention

We propose a soft attention based model for the task of action recognition in videos. We use multi-layered Recurrent Neural Networks (RNNs) with Long-Short Term Memory (LSTM) units which are deep both spatially and temporally. Our model learns to focus selectively on parts of the video frames and classifies videos after taking a few glimpses. The model essentially learns which parts in the frames are relevant for the task at hand and attaches higher importance to them. We evaluate the model on UCF-11 (YouTube Action), HMDB-51 and Hollywood2 datasets and analyze how the model focuses its attention depending on the scene and the action being performed.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
scripts		scripts
src		src
util		util
README.md		README.md
__init__.py		__init__.py
model_ucf11.npz.pkl		model_ucf11.npz.pkl
model_ucf11_0-0001.npz.pkl		model_ucf11_0-0001.npz.pkl
model_ucf11_10.npz.pkl		model_ucf11_10.npz.pkl
run.log		run.log
run_10.log		run_10.log
test.py		test.py
test_results_last20_model_ucf11.txt		test_results_last20_model_ucf11.txt
test_results_last30_model_ucf11.txt		test_results_last30_model_ucf11.txt
test_results_last30_model_ucf11_0-0001.txt		test_results_last30_model_ucf11_0-0001.txt
test_results_last30_model_ucf11_10.txt		test_results_last30_model_ucf11_10.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Action Recognition using Visual Attention

Dependencies

About

Releases

Packages

Languages

PancakeAwesome/ran_two_stream_to_recognize_drives

Folders and files

Latest commit

History

Repository files navigation

Action Recognition using Visual Attention

Dependencies

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages