Vulgaris

Project to analyze Italian Diachronic Language Varieties.

Have a look at the project page - Vulgaris for more details.

Technical report here - accepted at VarDial2020 Workshop, co-located with COLING 2020.

Cite

@inproceedings{zugarini2020vulgaris,
  title={Vulgaris: Analysis of a Corpus for Middle-Age Varieties of Italian Language},
  author={Zugarini, Andrea and Tiezzi, Matteo and Maggini, Marco},
  booktitle={Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects},
  pages={150--159},
  year={2020}
}

Download Script

Disclaimer: we retrieved and analyzed the data from Biblioteca Italiana solely for personal and academic non-commercial purposes. To replicate our analyzes and ease the diachronic language research, we provide the following script that retrieves and organizes the corpus in a convenient structure.

To install all the required dependencies:

pip install -r download_requirements.txt

Then, run the script:

python vulgaris_project.py

By running that script, you declare to respect the following copyright of Biblioteca Italiana: Creative Common License

Perplexity-based Analysis

First you should retrieve the data.

python char_diachronic_lm_exp.py path/to/vulgaris.csv

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
configs		configs
datasets		datasets
distances		distances
experiments		experiments
models		models
utils		utils
README.md		README.md
__init__.py		__init__.py
char_diacrhonic_lm_exp.py		char_diacrhonic_lm_exp.py
download_requirements.txt		download_requirements.txt
vulgaris_project.py		vulgaris_project.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vulgaris

Cite

Download Script

Perplexity-based Analysis

About

Releases

Packages

Contributors 2

Languages

andreazugarini/vulgaris

Folders and files

Latest commit

History

Repository files navigation

Vulgaris

Cite

Download Script

Perplexity-based Analysis

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages