Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/multi class support #50

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

makinada
Copy link

I used this code for multi-class classification of short Japanese text (which worked quite well, btw. thanks!) I made the following enhancements.

  • Support of multi-class classification in terms of data loading (MultiClassDataLoader)
  • Support of predefined training/cross validation (dev) data sets (MultiClassDataLoader)
  • Support of character-based embedding (CharDataProcessor)

While making these changes, I kept the original implementation as BinaryClassDataLoader and WordDataProcessor. The code is backward compatible by using the combo.

You can mix and swap the classes ((Binary|Multi)ClassDataLoader and (Word|Char)DataProcessor) in train.py and eval.py Any combo works.

I also added a new set of data files for multi-class classification as an example. They're just straight format conversion from the original RT data.

Mak Inada added 2 commits November 28, 2016 15:17
  - multi-classification
  - predefined training/cv data set
  - character-based embedding
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant