This repository contains the data, configuration files, and scripts needed to reproduce the following test-set accuracies on the ATIS, GEO, and JOBS semantic parsing datasets using the AllenNLP framework:
Model | ATIS | GEO | JOBS |
---|---|---|---|
S2S + attention | 79.9 | 68.9 | 71.4 |
S2S + attention + ELMo | 83.3 | 75.7 | 77.9 |
S2S + attention + OpenAI GPT | 83.3 | 76.8 | 83.6 |
S2S + attention + BERT (Base) | 83.5 | 75.7 | 82.9 |
S2S + attention + BERT (Large) | 83.0 | 73.2 | 80.7 |
Make sure you have AllenNLP installed first!
To train a model:
make train
# ... follow the prompts to specify the path to your model config
# (e.g. experiments/atis/seq2seq.json) and serialization directory.
After training, to generate predictions:
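allennlp predict --output-file [FILENAME] --predictor simple_seq2seq \
--include-package nlpete.data.dataset_readers \
--include-package nlpete.models \
--include-package nlpete.training.metrics \
[SERIALIZED_MODEL] [INPUT_JSONL]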
For example, to generate predictions on ATIS for a model that has been serialized to /tmp/models/atis/seq2seq/run_001:
allennlp predict --output-file predictions/atis/seq2seq.jsonl \
--predictor simple_seq2seq \
--include-package nlpete.data.dataset_readers \
--include-package nlpete.models \
--include-package nlpete.training.metrics \
/tmp/models/atis/seq2seq/run_001/model.tar.gz data/atis/atis_test.jsonl
I have already included the predictions in the "predictions" folder for those who simply want to verify the results without training a model. Once predictions have been generated, the model's accuracy can be computed against the gold outputs of the test set by following the code in results.ipynb.
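For a quick check outside the notebook, the comparison can be sketched roughly as below. This is not the code from results.ipynb, only a minimal exact-match scorer written under assumptions: the field names `predicted_tokens` (prediction file) and `tgt` (gold file) are guesses and are exposed as parameters so they can be changed to match the actual JSONL files, and the helper names `read_jsonl`, `as_tokens`, and `exact_match_accuracy` are hypothetical.

```python
import json
from pathlib import Path


def read_jsonl(path):
    """Read a JSON Lines file into a list of dicts, skipping blank lines."""
    with Path(path).open(encoding="utf-8") as handle:
        return [json.loads(line) for line in handle if line.strip()]


def as_tokens(value):
    """Normalize a whitespace-separated string or a token list to a list of tokens."""
    if isinstance(value, str):
        return value.split()
    return list(value)


def exact_match_accuracy(predictions, gold,
                         pred_key="predicted_tokens",  # assumed field name
                         gold_key="tgt"):              # assumed field name
    """Fraction of examples whose predicted tokens equal the gold tokens exactly."""
    if len(predictions) != len(gold):
        raise ValueError(
            f"{len(predictions)} predictions but {len(gold)} gold examples"
        )
    if not gold:
        return 0.0
    correct = sum(
        as_tokens(pred[pred_key]) == as_tokens(ref[gold_key])
        for pred, ref in zip(predictions, gold)
    )
    return correct / len(gold)


# Example usage (paths taken from the commands above; field names are assumptions):
# preds = read_jsonl("predictions/atis/seq2seq.jsonl")
# gold = read_jsonl("data/atis/atis_test.jsonl")
# print(f"accuracy: {100 * exact_match_accuracy(preds, gold):.1f}")
```

The scorer assumes the prediction and gold files list examples in the same order, one per line. If the numbers it produces differ from the table, the notebook's procedure is the reference.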