This is the README file for the CCG Induction, models, CCGbank Simplification, etc in
Induction
Y. Bisk and J. Hockenmaier, “Simple Robust Grammar Induction with Combinatory Categorial Grammars,”
in Proceedings of the Twenty-Sixth Conference on Artificial Intelligence (AAAI-12), Toronto, Canada,
July 2012, pp. 1643–1649
Model
Y. Bisk and J. Hockenmaier, “An HDP Model for Inducing Combinatory Categorial Grammars,”
Transactions of the Association for Computational Linguistics, pp. 75–88, 2013.
CCGbank Simplification/Evaluation
Y. Bisk and J. Hockenmaier, “Probing the linguistic strengths and limitations of unsupervised
grammar induction,” in Proceedings of the 53rd Annual Meeting of the Association for Computational
Linguistics (Volume 1: Long Papers), Beijing,China, July 2015
Please don't hesitate to contact me or file bugs. I have tried to clean up the code, but may have introduced bugs or deleted something important in the process. Thanks!
git clone https://github.com/ybisk/CCG-Induction.git
- Download and Install Maven
- Move into git repo:
cd CCG-Induction/
- Building a jar file:
mvn package [-DskipTests]
All the classes (and files under resources) are in target/
java -jar target/CCGInduction-1.0-jar-with-dependencies.jar
For training a parser/HMM: java -jar CCGInduction.jar config.txt
For Evaluation:
java -cp CCGInduction.jar CCGInduction.evaluation.PARGDependencies
java -cp CCGInduction.jar CCGInduction.evaluation.CoNLLDependencies
java -cp CCGInduction.jar CCGInduction.evaluation.SupertagAccuracy
java -cp CCGInduction.jar CCGInduction.evaluation.TagEvaluator
For Utilities:
java -cp CCGInduction.jar CCGInduction.utils.JSONFormat
java -cp CCGInduction.jar CCGInduction.utils.ConvertFromAUTO
java -cp CCGInduction.jar CCGInduction.utils.ComputeLexicon