Patrick Burns, Building a Text Analysis Pipeline for Classical Languages

With large text collections for Ancient Greek and Latin now widely available, classicists are increasingly interested in extracting information systematically from these texts. The fields of information retrieval and natural language processing offer tools and methods to address this, but classical-language support can be limited and researchers must often cobble together separate, sometimes incompatible tools to accomplish basic text analysis tasks. In this chapter, I review the tools currently available for digital philological work on Ancient Greek and Latin and introduce the Classical Language Toolkit, an open-source Python framework that addresses the desideratum of a complete text analysis pipeline for historical languages.
