TOME project 3

News from January 2024

  • The project's first published result, an article in German by Petr Pavlas, appeared in Comenius-Jahrbuch 31. It was produced without the use of distant reading or computational analysis. You can read and download the PDF here.
  • Vojtěch Kaše and his group created four word-embedding models for Noscemus, one for each of the periods 1501-1550, 1551-1600, 1601-1650 and 1651-1700. The models were trained in the same manner as those developed by Sprugnoli et al. (2020), with the most frequent words from Noscemus added to the dictionary. You can view the embeddings here. It is interesting to observe how individual words shift within scientific discourse from one cluster (a semantic genus in a Nominalist sense) to another, and how the proximity and distance between the clusters change over time.
  • Vojtěch Kaše also found and downloaded the entire Corpus Corporum database, which contains 7,819 Latin works from various periods, totaling approximately 500 million words. The Corpus Corporum database is available here.
  • Jo Hedesan and her group continue to build the digital corpus of early modern Latin alchemical prints. So far, they have recognized and carefully cleaned dozens of works.
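
For readers curious how a word's shift between clusters across the period models might be quantified, here is a minimal sketch. It is not the project's actual pipeline: the words, the three-dimensional vectors and the helper names (PERIOD_MODELS, nearest_neighbors, neighbor_overlap) are all invented for illustration. It compares a word's nearest neighbours inside each period's model, which avoids having to align separately trained vector spaces.

```python
import numpy as np

# Toy stand-ins for two period models. The words and vectors are invented
# for illustration and are NOT taken from the Noscemus embeddings.
PERIOD_MODELS = {
    "1501-1550": {
        "spiritus": np.array([0.9, 0.1, 0.0]),
        "anima": np.array([1.0, 0.0, 0.1]),
        "deus": np.array([0.8, 0.2, 0.1]),
        "aer": np.array([0.1, 0.9, 0.2]),
        "vapor": np.array([0.0, 1.0, 0.1]),
    },
    "1601-1650": {
        "spiritus": np.array([0.2, 0.9, 0.1]),
        "anima": np.array([1.0, 0.1, 0.0]),
        "deus": np.array([0.9, 0.1, 0.2]),
        "aer": np.array([0.1, 1.0, 0.1]),
        "vapor": np.array([0.2, 0.8, 0.2]),
    },
}


def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))


def nearest_neighbors(model, word, k=2):
    """Return the k words closest to `word` within a single period model."""
    target = model[word]
    scored = [
        (other, cosine(target, vec))
        for other, vec in model.items()
        if other != word
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [other for other, _ in scored[:k]]


def neighbor_overlap(model_a, model_b, word, k=2):
    """Jaccard overlap of a word's neighbour sets in two period models.

    A value near 1 suggests a stable neighbourhood; a value near 0
    suggests the word has moved to a different cluster.
    """
    a = set(nearest_neighbors(model_a, word, k))
    b = set(nearest_neighbors(model_b, word, k))
    return len(a & b) / len(a | b)


early = PERIOD_MODELS["1501-1550"]
late = PERIOD_MODELS["1601-1650"]
print(nearest_neighbors(early, "spiritus"))
print(nearest_neighbors(late, "spiritus"))
print(neighbor_overlap(early, late, "spiritus"))
```

With these toy vectors, the neighbours of "spiritus" change completely between the two periods, so the overlap is 0. With real models, the same comparison would be run over the shared vocabulary and a larger k.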
