Index Thomisticus Treebank
Started by Roberto Busa SJ in 1949, the Index Thomisticus is considered as a groundbreaking project in computational linguistics. It is a corpus containing the opera omnia of Thomas Aquinas (118 texts) as well as 61 texts by other authors related to Thomas, for a total of approximately 11 million words, each morphologically tagged and lemmatized by hand.