
The Classical Language Toolkit


Join the chat at https://gitter.im/kylepjohnson/cltk

About

The Classical Language Toolkit (CLTK) offers natural language processing support for Classical languages. In some areas, it extends the NLTK. The goals of the CLTK are to:

  • compile analysis-friendly corpora in a variety of Classical languages (currently available for Chinese, Coptic, Greek, Latin, Pali, and Tibetan);
  • gather, improve, and generate linguistic data required for NLP (Greek and Latin are in progress, with more in the pipeline);
  • develop a free and open platform for generating reproducible, scientific research that advances the study of the languages and literatures of the ancient world.

Installation

See the installation instructions in the docs.

Documentation

The docs are at docs.cltk.org. More information is available on the CLTK's website, cltk.org.

Corpora, training sets, models, etc.

Corpora are kept in the CLTK's GitHub user group. A language's trained models are found in, e.g., latin_models_cltk and greek_models_cltk. The CLTK imports files and stores them locally on your computer at ~/cltk_data.
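
To see what has been stored locally, the following minimal sketch lists the top-level entries under ~/cltk_data using only the Python standard library. The helper name `list_cltk_data` is hypothetical and is not part of the CLTK's API; the only detail taken from the text above is the directory location.

```python
from pathlib import Path


def list_cltk_data(base=None):
    """Return the sorted names of top-level entries in the CLTK data directory.

    `base` defaults to ~/cltk_data, the location named in this README.
    An empty list is returned if the directory does not exist yet.
    """
    base = Path(base) if base is not None else Path.home() / "cltk_data"
    if not base.is_dir():
        return []
    return sorted(entry.name for entry in base.iterdir())


if __name__ == "__main__":
    # Print one entry per line; prints nothing if no data has been imported.
    for name in list_cltk_data():
        print(name)
```

Passing an explicit `base` makes the helper usable if the data directory has been moved elsewhere.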

Citation

Each major release of the CLTK is given a DOI, a type of unique identifier for digital documents. Include this DOI in your citation, as it will allow your readers to reproduce your scholarship should the CLTK's API or codebase change. To find the CLTK's current DOI, look at the blue DOI button on the repository's home page (README.md). At the end of your bibliographic entry, append "DOI" followed by the current identifier.

Therefore, please cite as follows:

Kyle P. Johnson et al. (2014-2015). CLTK: The Classical Language Toolkit. DOI 10.5281/zenodo.23448

A style-neutral BibTeX entry would look like this:

@Misc{johnson2014,
author = {Kyle P. Johnson et al.},
title = {CLTK: The Classical Language Toolkit},
howpublished = {\url{https://github.com/kylepjohnson/cltk}},
note = {{DOI} 10.5281/zenodo.23448},
year = {2014--2015}
}

Optionally, you may add a version/release number, e.g., v0.0.1.22, to the entry.
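
If you cite the CLTK in several places, a small helper can keep the DOI and version in sync. The sketch below assembles the plain-text citation described above; the function name `format_citation` is hypothetical, and placing the version before the DOI is an assumption, since this README does not prescribe where the version should go.

```python
def format_citation(doi, version=None):
    """Build a plain-text CLTK citation ending in 'DOI <identifier>'.

    `version` is optional; its position in the entry is an assumption,
    not something this README specifies.
    """
    citation = (
        "Kyle P. Johnson et al. (2014-2015). "
        "CLTK: The Classical Language Toolkit."
    )
    if version:
        citation += f" {version}."
    return f"{citation} DOI {doi}"


if __name__ == "__main__":
    print(format_citation("10.5281/zenodo.23448"))
    print(format_citation("10.5281/zenodo.23448", version="v0.0.1.22"))
```

Replace the DOI argument with the current identifier shown on the repository's home page before citing.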

License

The CLTK is Copyright (c) 2015 Kyle P. Johnson, under the MIT License. See 'LICENSE' for details.
