-
Notifications
You must be signed in to change notification settings - Fork 4
ICS02: 10. Text analysis with R
Gabriel Bodard edited this page Jan 16, 2019
·
17 revisions
Thursday Mar 14, 16:00 UK = 18:00 EET
Convenors: Maciej Eder (Kraków), Robert Gorman (University of Nebraska–Lincoln) & Christopher Ohge (University of London)
YouTube link: tba
Slides: tba
This session will examine some specialist libraries in R for text analysis. We will review the tidytext package from the previous session, then examine in depth two crucial (and complementary) forms of text analysis. The first will work with larger datasets with Stylo, and the second will show how to analyse and visualise encoded texts in XML.
- Review tidytext from previous session (Ohge).
- Stylometry: intro to the Stylo package (Eder).
- XML library: treebanking and linguistic analyses of encoded texts (Gorman).
- (two open access papers)
- Büchler, Marco, et al. (2013), "Measuring the Influence of a Work by Text-Reuse." In ed. Dunn/Mahony, The Digital Classicist 2013. Bulletin of the Institute of Classical Studies, Supplement 122. Pp. 63–79.
- Kestermont, Mike & Justin A. Stover (2016), "The Authorship of the Historia Augusta: Two new computational studies." Bulletin of the Institute of Classical Studies 59.2. Pp. 140–157. Available: https://onlinelibrary.wiley.com/doi/epdf/10.1111/j.2041-5370.2016.12043.x
- Eder, M., Rybicki, J., Kestemont, M. 'Stylometry with R.' The R Journal 8/1 (Aug. 2016). Available: https://journal.r-project.org/archive/2016-1/eder-rybicki-kestemont.pdf
- tba
- tba