This repository is a collection of community resources and contains scripts, tools and helpers for developing spaCy, adding new languages and training new models. Feel free to submit a pull request to contribute.
Directory | Description |
---|---|
corpus-utils |
Scripts used to transform treebanks. Work in progress. |
fabfile |
Utility commands to be used with Fabric. |
jupyter-displacy |
Simple extension to visualise displaCy dependency parses in Jupyter Notebooks. |
spacy-annotator |
Trainable server for named entities that can be connected to a front-end, by @tcrossland |
templates |
Template files for adding spaCy extensions, like new languages. |
training |
Scripts and services for producing spaCy models. |
vocab |
Fabric scripts for creating vocabulary, Brown clusters and word vectors from a Wikipedia and other corpora. |
We always appreciate pull requests! 🙌 For more info on how to contribute to the project, see our contribution guidelines.