ICS02: 10. Text analysis with R

Gabriel Bodard edited this page Jan 16, 2019 · 17 revisions

Sunoikisis Digital Classics, Spring 2019

Session 10. Text analysis, stylometry and visualisation using R

Thursday Mar 14, 16:00 UK = 18:00 EET

Convenors: Maciej Eder (Kraków), Robert Gorman (University of Nebraska–Lincoln) & Christopher Ohge (University of London)

YouTube link: tba

Slides: tba

Session outline

This session examines some specialist R libraries for text analysis. We will review the tidytext package from the previous session, then look in depth at two complementary forms of text analysis. The first uses the Stylo package to work with larger datasets, and the second shows how to analyse and visualise XML-encoded texts.

  1. Review of tidytext from the previous session (Ohge).
  2. Stylometry: intro to the Stylo package (Eder).
  3. XML library: treebanking and linguistic analyses of encoded texts (Gorman).

Seminar readings

  • (two open access papers)

Further reading

Essay title

  • tba

Exercise

  1. tba