Digitizing a dictionary today from text acquisition to publication:

The case study of Lana's latin dictionary

Authors

  • Francesca Michelone, Università del Piemonte Orientale

DOI:

https://doi.org/10.6092/issn.2532-8816/21242

Keywords:

Digital dictionaries, TEI Lex-0, Latin-Italian Bilingual Dictionary, Nature in the Ancient World, Open Access, #AIUCD 2024

Abstract

This article presents, through a case study, a possible methodology for creating an open-access digital dictionary from an existing print edition. Specifically, it describes the digitization of Italo Lana's Latin dictionary, published in 1978 by the Paravia publishing house. Entries belonging to the semantic domain of nature in the ancient world were extracted from this dictionary. Their content was then enriched through the study of literary works, and the findings of that analysis are recorded in the dictionary's encoding.

The approach builds on best practices currently adopted in digital lexicography and seeks to align with the FAIR principles for data production. Although it was developed within the specific context of Latin lexicography, the methodology offers a replicable model for anyone undertaking a similar project.

Published

2025-07-14

How to Cite

Michelone, F. (2025). Digitizing a dictionary today from text acquisition to publication: The case study of Lana’s latin dictionary. Umanistica Digitale, 9(20), 507–526. https://doi.org/10.6092/issn.2532-8816/21242