This project contains categorized German words in separate text files.
The experimental version v1 is ready, but the gennum outputs are excluded because of their huge size. If you need them, you can generate the numbers from 0 to 999999 as text with our gennum tool.
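To give an idea of what "numbers in text" means for this range, here is a minimal Python sketch that spells out integers from 0 to 999999 in German. It is not the gennum tool and does not reflect its interface or exact output format. The function names (`spell`, `_below_100`, `_below_1000`) are hypothetical, and writing the result as one lowercase word is an assumption.

```python
# Hypothetical sketch, NOT the gennum tool: spells 0..999999 in German.
ONES = [
    "null", "eins", "zwei", "drei", "vier", "fünf", "sechs", "sieben",
    "acht", "neun", "zehn", "elf", "zwölf", "dreizehn", "vierzehn",
    "fünfzehn", "sechzehn", "siebzehn", "achtzehn", "neunzehn",
]
TENS = {
    2: "zwanzig", 3: "dreißig", 4: "vierzig", 5: "fünfzig",
    6: "sechzig", 7: "siebzig", 8: "achtzig", 9: "neunzig",
}


def _below_100(n, standalone):
    """Spell 1..99. A final 1 is 'eins' only when nothing follows it."""
    if n < 20:
        return "ein" if n == 1 and not standalone else ONES[n]
    tens, ones = divmod(n, 10)
    if ones == 0:
        return TENS[tens]
    # Units come before tens, joined by 'und': 21 -> einundzwanzig.
    unit = "ein" if ones == 1 else ONES[ones]
    return unit + "und" + TENS[tens]


def _below_1000(n, standalone):
    """Spell 1..999."""
    hundreds, rest = divmod(n, 100)
    text = ""
    if hundreds:
        text += _below_100(hundreds, standalone=False) + "hundert"
    if rest:
        text += _below_100(rest, standalone)
    return text


def spell(n):
    """Spell an integer in the range 0..999999 as a single German word."""
    if not 0 <= n <= 999_999:
        raise ValueError("only 0..999999 is supported in this sketch")
    if n == 0:
        return ONES[0]
    thousands, rest = divmod(n, 1000)
    text = ""
    if thousands:
        text += _below_1000(thousands, standalone=False) + "tausend"
    if rest:
        text += _below_1000(rest, standalone=True)
    return text


if __name__ == "__main__":
    for number in (0, 21, 101, 1001):
        print(number, spell(number))
```

Writing all one million values to a file with a loop like this shows why the generated outputs are large enough to be left out of the release.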
You can download the wordlists from the assets of the latest release.
- Lines are sorted and unique in the final output files.
- Files are categorized by word type, with category names in English.
- Word lists may contain words that are incorrect, miscategorized, or meaningless.
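If you merge or edit the lists yourself, the "sorted and unique" property can be restored with a few lines of Python. This is a minimal sketch, not the code used in this repository's tools. The names `normalize_wordlist` and `normalize_file` are hypothetical, and sorting by Unicode code point is an assumption, so the order may differ from the released files.

```python
from pathlib import Path


def normalize_wordlist(lines):
    """Return the unique, non-empty lines in sorted order."""
    # Strip surrounding whitespace and collapse duplicates with a set.
    cleaned = {line.strip() for line in lines}
    cleaned.discard("")
    # Assumption: plain code point order; umlauts sort after 'z'.
    return sorted(cleaned)


def normalize_file(path):
    """Rewrite a UTF-8 wordlist file so its lines are sorted and unique."""
    target = Path(path)
    words = normalize_wordlist(target.read_text(encoding="utf-8").splitlines())
    target.write_text("\n".join(words) + "\n", encoding="utf-8")


if __name__ == "__main__":
    print(normalize_wordlist(["Haus", "Apfel", "Haus", "", " Baum "]))
    # prints ['Apfel', 'Baum', 'Haus']
```

For dictionary-style German ordering, where "Ä" sorts near "A", a locale-aware sort key would be needed instead of the default comparison.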
Tools are located in the tools folder of this repository.
Sources are located in the sources folder of this repository.
- https://danielnaber.de/morphologie/
- https://de.wiktionary.org
- https://dumps.wikimedia.org/mirrors.html
- https://en.wiktionary.org
- https://en.wiktionary.org/wiki/Category:German_lemmas
- https://extensions.libreoffice.org/en/extensions/show/german-de-de-frami-dictionaries
- https://gist.github.com/MarvinJWendt/2f4f4154b8ae218600eb091a5706b5f4
- https://github.com/adbar/German-NLP
- https://github.com/languagetool-org/german-pos-dict
- https://github.com/michmech/lemmatization-lists
- https://www-user.tu-chemnitz.de/~fri/ding/
- https://www.dwds.de/lemma/list
- https://www.koeblergerhard.de/publikat.html
- https://www.openthesaurus.de/about/download
German Categorized Wordlist by YNSRC is licensed under Creative Commons Attribution 4.0 International.
Feel free to use our wordlists in your personal, open-source, or even commercial projects with attribution. However, if you want to use the third-party data sources that we process to generate the wordlists directly, observe the license conditions of their owners.