Deep Text Classifier

An implementation of the document classification model described in Hierarchical Attention Networks for Document Classification (Yang et al., 2016).
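For orientation, here is a minimal NumPy sketch of the attention pooling step from the paper, which is applied once over the words of a sentence and again over the sentences of a document. It is not taken from this repository: the function name `attention_pool`, the shapes, and the random inputs are assumptions made for illustration, and the recurrent encoders that would produce the hidden states are left out.

```python
import numpy as np

def attention_pool(h, W, b, u_ctx):
    """Collapse hidden states h of shape (T, d) into one vector of shape (d,).

    Follows the paper's formulation: u_t = tanh(W h_t + b),
    alpha = softmax(u_t . u_ctx), output = sum_t alpha_t * h_t.
    """
    u = np.tanh(h @ W + b)                  # (T, a) projected hidden states
    scores = u @ u_ctx                      # (T,) similarity to the context vector
    scores = scores - scores.max()          # shift for a numerically stable softmax
    alpha = np.exp(scores) / np.exp(scores).sum()  # (T,) attention weights
    return alpha @ h, alpha                 # weighted sum of the original states

# Hypothetical sizes: 5 time steps, 8 hidden units, 4 attention units.
rng = np.random.default_rng(0)
T, d, a = 5, 8, 4
h = rng.normal(size=(T, d))       # stand-in for encoder outputs
W = rng.normal(size=(d, a))
b = np.zeros(a)
u_ctx = rng.normal(size=a)        # learned context vector in a real model

pooled, alpha = attention_pool(h, W, b, u_ctx)
```

In a full model the same function would run over word states to get a sentence vector, then over sentence vectors to get the document vector that feeds the classifier.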

How to run

Download the Yelp review dataset from https://www.yelp.com/dataset_challenge, then prepare the data and start training:

python3 yelp_prepare.py yelp_academic_dataset_review.json
python3 worker.py --mode=train --device=/gpu:0 --batch-size=30
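As a rough idea of what a preparation step has to produce for a hierarchical model, the sketch below turns one line of the review file (one JSON object per line, with `text` and `stars` fields) into a list of sentences, each a list of word tokens, plus a zero-based label. What `yelp_prepare.py` actually does is not described here, so the function name `review_to_example` and the regex-based splitting are assumptions, not the script's method.

```python
import json
import re

def review_to_example(line):
    """Parse one JSON review line into (sentences of tokens, label)."""
    review = json.loads(line)
    # Naive sentence split: break after ., ! or ? followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", review["text"].strip())
    # Naive tokenisation: lowercase runs of letters, digits and apostrophes.
    tokens = [re.findall(r"[a-z0-9']+", s.lower()) for s in sentences]
    doc = [t for t in tokens if t]        # drop empty sentences
    label = int(review["stars"]) - 1      # 1..5 stars -> classes 0..4
    return doc, label

# Hand-written sample line, not real dataset content.
sample = '{"stars": 4, "text": "Great tacos. Service was slow!"}'
doc, label = review_to_example(sample)
```

A real pipeline would likely use a proper tokenizer and map tokens to vocabulary indices before batching.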

Results

I get 65% accuracy on a dev set (16% of the data) after 3 epochs. The paper reports 71% on Yelp'15. No systematic hyperparameter optimization was performed.
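To make the evaluation setup concrete, here is a small sketch of holding out 16% of the examples as a dev set and scoring predictions by accuracy. The helper names `split_train_dev` and `accuracy`, the shuffling, and the fixed seed are assumptions for illustration; the repository may split and score the data differently.

```python
import random

def split_train_dev(examples, dev_fraction=0.16, seed=0):
    """Shuffle the examples and hold out dev_fraction of them as a dev set."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    n_dev = int(round(len(shuffled) * dev_fraction))
    return shuffled[n_dev:], shuffled[:n_dev]

def accuracy(predictions, labels):
    """Fraction of predictions that match the gold labels."""
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)

# Toy data: integers stand in for prepared reviews.
train, dev = split_train_dev(range(100))
toy_accuracy = accuracy([1, 2, 3, 4], [1, 2, 0, 4])
```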