CoPhiEditor: The DSL-Based DSE Methodology within the ERC Advanced Grant 885222-GreekSchools
DOI:
https://doi.org/10.6092/issn.2532-8816/21231Keywords:
Domain-Specific Languages, Computational Philology, Digital Philology, DSE tools, AIUCD2023, AIUCD2024Abstract
This paper explores the integration of traditional philological methods with computational approaches, aiming to establish a more effective and rigorous framework for textual studies. To this end, we propose a preliminary set of evaluation criteria—familiarity, transparency, completeness, compactness, consistency, and actionability—for analyzing text editing tools. Using this framework, we suggest the use of Domain-Specific Languages (DSLs) for the digital papyrological editions of the ERC-885222 GreekSchools project that is our case study.
We introduce CoPhiEditor, a software platform designed to support the creation of Digital Scholarly Editions through the use of DSLs. This approach enables scholars to retain familiar workflows while benefiting from machine-actionable data representations and computational functionalities. Key features of CoPhiEditor include its recursive data model, automatic error-checking and suggestions, automatic TEI-conformant serialization, and advanced collaboration capabilities that streamline teamwork on shared projects. Furthermore, its extensible and domain-agnostic architecture broadens its applicability across various scholarly editing contexts. The aim of CoPhiEditor allows philologists to incorporate digital tools without sacrificing their established editorial practices.
References
[1] Bradley, John. 2015. “How about Tools for the Whole Range of Scholarly Activities?” In Global Digital Humanities. ADHO. Sydney, Australia: Paul Arthur, University of Western Sidney. https://dh-abstracts.library.virginia.edu/works/2269.
[2] Nantke, Julia, and Frederik Schlupkothen, eds. 2020. Annotations in Scholarly Editions and Research. Functions, Differentiation, Systematization. Berlin, Boston: De Gruyter. https://doi.org/doi:10.1515/9783110689112.
[3] Robinson, Peter. 2017. “Some Principles for Making Collaborative Scholarly Editions in Digital Form.” Digital Humanities Quarterly 11 (2). http://www.digitalhumanities.org/dhq/vol/11/2/000293/000293.html.
[4] Parr, Terence. 2014. Language Implementation Patterns. Create Your Own Domain-Specific and General Programming Languages. Pragmatic Bookshelf.
[5] Parr, Terence. 2013. The Definitive ANTLR 4 Reference. 2nd ed. Pragmatic Bookshelf.
[6] Fowler, M. 2010. Domain-Specific Languages. Addison-Wesley Signature Series (Fowler). Pearson Education.
[7] Boersma, Meinte. 2021. Domain-Specific Languages Made Easy. Manning Publications Co.
[8] Mugelli, Gloria, Federico Boschetti, Riccardo Del Gratta, Angelo Mario Del Grosso, Fahad Khan, and Andrea Taddei. 2016. “A User-Centered Design to Annotate Ritual Facts in Ancient Greek Tragedies.” Bulletin of the Institute of Classical Studies 59 (2): 103–20. https://doi.org/10.1111/j.2041-5370.2016.12041.x.
[9] Boschetti, Federico, and Angelo Mario Del Grosso. 2020. “L’annotazione di testi storico-letterari al tempo dei social media.” Italica Wratislaviensia (Print) 11:65–99. https://doi.org/10.15804/IW.2020.11.1.03.
[10] Zowghi, Didar, and Vincenzo Gervasi. 2003. “On the Interplay between Consistency, Completeness, and Correctness in Requirements Evolution.” Information and Software Technology, Eighth International Workshop on Requirements Engineering: Foundation for Software Quality, 45 (14): 993–1009. https://doi.org/10.1016/S0950-5849(03)00100-9.
[11] Buzzetti, Dino. 2002. “Digital Representation and the Text Model.” New Literary History 33 (1): 61–88.
[12] Melighetti, Francesco. 2019. “Domain-Specific Languages per Facilitare Annotatori Inesperti Di XML-TEI Nella Codifica Di Documenti Storici.” Tesi di laurea triennale, Pisa: Università di Pisa, corso di laurea in Informatica Umanistica.
[13] Mugelli, Gloria, Giulia Re, and Andrea Taddei. 2020. “Annotazione Digitale Di Testi Antichi. Lingue Antiche e Digital Humanities, Tra Ricerca e Didattica.” Umanistica Digitale 5 (9): 35–60. https://doi.org/10.6092/issn.2532-8816/9962.
[14] Bambaci, Luigi, and Federico Boschetti. 2020. “Encoding the Critical Apparatus by Domain Specific Languages: The Case of the Hebrew Book of Qohelet.” In La Svolta Inevitabile: Sfide e Prospettive per l’Informatica Umanistica., edited by Marco Passarotti, Cristina Marras, and Eleonora Litta. Quaderni Di Umanistica Digitale. Milano: Università Cattolica del Sacro Cuore.
[15] Zunke, Saurabh, and Veronica D’Souza. 2014. “JSON vs XML: A Comparative Performance Analysis of Data Exchange Formats.” IJCSN International Journal of Computer Science and Network 3 (4): 257–61.
[16] Evans, Eric. 2003. Domain-Driven Design: Tackling Complexity In the Heart of Software. Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc.
[17] Sun, Chengzheng, and Clarence Ellis. 1998. “Operational Transformation in Real-Time Group Editors: Issues, Algorithms, and Achievements.” In Proceedings of the 1998 ACM Conference on Computer Supported Cooperative Work, 59–68. CSCW ’98. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/289444.289469.
[18] Mehdi, Ahmed-Nacer, Pascal Urso, Valter Balegas, and Nuno Perguiça. 2014. “Merging OT and CRDT Algorithms.” In Proceedings of the First Workshop on Principles and Practice of Eventual Consistency, 1–4. PaPEC ’14. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/2596631.2596636.
[19] Fischer, Franz. 2013. “All Texts Are Equal, But… Textual Plurality and the Critical Text in Digital Scholarly Editions.” Variants 10:77–92.
[20] Bozzi, Andrea. 2019. Percorsi Di Linguistica e Di Filologia Computazionali. Edited by Maria Sofia Bozzi and Giacomo Ferrari. Edizioni ETS.
[21] Bordalejo, Barbara, and Peter Robinson. 2015. “A New System for Collaborative Online Creation of Scholarly Editions in Digital Form.” In 1st Dixit Convension on Technology, Software, Standards for the Digital Scholarly Edition Workshop. The Hague.
[22] Robinson, Peter. 2016. “The Digital Revolution in Scholarly Editing.” Stockholm University Press, November. https://doi.org/10.16993/baj.h.
[23] Zenzaro, Simone, Angelo Mario Del Grosso, Federico Boschetti, and Graziano Ranocchia. 2022. “Verso La Definizione di Criteri per Valutare Soluzioni Di Scholarly Editing Digitale: Il Caso d’uso GreekSchools.” In Culture Digitali. Intersezioni, Filosofia, Arti e Media. Università del Salento - AIUCD. https://doi.org/10.6092/unibo/amsacta/6848.
[24] Del Grosso, Angelo Mario, Emiliano Giovannetti, and Simone Marchi. 2017. “The Importance of Being... Object-Oriented: Old Means for New Perspectives in Digital Textual Scholarship.” In Advances in Digital Scholarly Editing, 1:269–74.
[25] Siemens, Ray, Meagan Timney, Cara Leitch, Corina Koolen, Alex Garnett, with the ETCL INKE, and PKP Research Groups. 2012. “Toward Modeling the Social Edition: An Approach to Understanding the Electronic Scholarly Edition in the Context of New and Emerging Social Media.” Literary and Linguistic Computing 27 (4): 445–61.
[26] Vogeler, Georg. 2019. “The ‘Assertive Edition.’” International Journal of Digital Humanities 1 (2): 309–22. https://doi.org/10.1007/s42803-019-00025-5.
[27] Ellis, Clarence A., and Simon John Gibbs. 1989. “Concurrency Control in Groupware Systems.” SIGMOD Rec. 18 (2): 399–407. https://doi.org/10.1145/66926.66963.
[28] Zenzaro, Simone, Angelo Mario Del Grosso, Federico Boschetti, and Graziano Ranocchia. 2022. “Verso La Definizione di Criteri per Valutare Soluzioni Di Scholarly Editing Digitale: Il Caso d’uso GreekSchools.” In Culture Digitali. Intersezioni, Filosofia, Arti e Media. Università del Salento - AIUCD. https://doi.org/10.6092/unibo/amsacta/6848.
[29] Zenzaro, Simone, Angelo Mario Del Grosso, Federico Boschetti, and Graziano Ranocchia. 2023. “Ease the Collaboration Making Scholarly Editions: The GreekSchools Case Study.” In La Memoria Digitale: XII Convegno Annuale AIUCD, 230–32. Siena: Alma Mater Studiorum-Università di Bologna. https://doi.org/10.6092/unibo/amsacta/7721
[30] Del Grosso, Angelo Mario, Federico Boschetti, Simone Zenzaro, and Graziano Ranocchia. 2023. “GreekSchools: Making Traditional Papyrology Machine Actionable Through Domain-Driven Design.” In 2023 7th IEEE Congress on Information Science and Technology (CiSt) (IEEE CiSt’23), edited by Mohammed El Mohajir, Mohammed Al Achhab, Badr Eddine El Mohajir, Bernadetta Kwintiana Ane, and Ismail Jellouli, 6. Agadir - Essaouira, Morocco: IEEE Computer Society.
[31] Parr, Terence. 2014. Language Implementation Patterns. Create Your Own Domain-Specific and General Programming Languages. Pragmatic Bookshelf.
[32] Fielding, Roy Thomas. 2000. “Architectural Styles and the Design of Network-Based Software Architectures.” Publication, University of California, Irvine. https://www.ics.uci.edu/~fielding/pubs/dissertation/top.htm.
[33] Del Grosso, Angelo Mario, Davide Albanesi, Emiliano Giovannetti, and Simone Marchi. 2016. “Defining the Core Entities of an Environment for Textual Processing in Literary Computing.” In DH2016 Abstracts, 771–75. Kraków.
[34] Retter, Adam, and Erik Siegel. 2014. eXist: A NoSQL Document Database and Application Platform. United States of America: O’Reilly Media.
[35] Del Grosso, Angelo Mario Del, Simone Zenzaro, Federico Boschetti, and Graziano Ranocchia. 2025. “Bridging Traditional and Digital Papyrology with Domain-Specific Languages: The GreekSchools Case Study.” In The Digital Critical Edition of Greek Papyri: Issues, Projects, and Perspectives, edited by Nicola Reggiani, 125–52. Berlin, Boston: De Gruyter. https://doi.org/doi:10.1515/9783111070162-009.
[36] Rosselli Del Turco, Roberto, Giancarlo Buomprisco, Chiara Di Pietro, Julia Kenny, Raffaele Masotti, and Jacopo Pugliese. 2014. “Edition Visualization Technology: A Simple Tool to Visualize TEI-Based Digital Editions.” Journal of the Text Encoding Initiative 8. https://doi.org/10.4000/jtei.1077.
[37] Cacioli, Erica, Chiara Di Pietro, Sara Maenza, Roberto Rosselli Del Turco, and Simone Zenzaro. 2022. “There and Back Again: What to Expect in the next EVT Version.” In AIUCD 2022 - Culture Digitali. Intersezioni: Filosofia, Arti, Media. Proceedings Della 11a Conferenza Nazionale, Lecce, 2022, edited by Fabio Ciracì, Giulia Miglietta, and Carola Gatto, 212–17. Lecce: Quaderni di Umanistica Digitale.
[38] Venuti, Martina, Angelo Mario Del Grosso, Federico Boschetti, Luigi Tessarolo, Alessia Prontera, Dylan Bovet, Gianmario Cattaneo, and Valeria Melis. 2023. “La ‘Galassia MQDQ’: un concetto di filologia tradizionale, digitale, sostenibile.” Relations 4 (1): 71–120. https://doi.org/10.30687/mag/2724-3923/2023/07/003.
[39] Dombrowski, Quinn, and Seth Denbo. 2013. “TEI and Project Bamboo.” Journal of the Text Encoding Initiative, no. 5. https://doi.org/10.4000/jtei.787.
[40] Dekker, Ronald H., and Gregor Middell. 2011. “Computer-Supported Collation with CollateX: Managing Textual Variance in an Environment with Varying Requirements.” Supporting Digital Humanities, 17–18.
[41] Hagel, Stefan. 2007. “The Classical Text Editor. An Attempt to Providing for Both Printed and Digital Editions.” In Digital Philology and Medieval Texts, 77–84. http://opac.regesta-imperii.de/id/1327309.
[42] Zenzaro, Simone, Daniele Marotta, and Amos Bertolacci. 2018. “CEED: A Cooperative Web-Based Editor for Critical Editions.” AIUCD 2018, 93.
[43] Bodard, Gabriel. 2010. “EpiDoc: Epigraphic Documents in XML for Publication and Interchange.” Latin On Stone: Epigraphic Research and Electronic Archives, 101–18.
[44] Hu, Vincent C., D. Richard Kuhn, David F. Ferraiolo, and Jeffrey Voas. 2015. “Attribute-Based Access Control.” Computer 48 (2): 85–88. https://doi.org/10.1109/MC.2015.33.
[45] Alshuqayran, Nuha, Nour Ali, and Roger Evans. 2016. “A Systematic Mapping Study in Microservice Architecture.” In 2016 IEEE 9th International Conference on Service-Oriented Computing and Applications (SOCA), 44–51. https://doi.org/10.1109/SOCA.2016.15.
[46] Lou, Burnard. 2014. What is the Text Encoding Initiative? Encyclopédie numérique 3. Marseille: OpenEdition Press. http://books.openedition.org/oep/426.
[47] Robinson, Peter M W. 2022. “An Approach to Complex Texts in Multiple Documents.” Digital Scholarship in the Humanities 37 (4): 1179–96. https://doi.org/10.1093/llc/fqab108.
[48] Chagué, Alix, Floriane Chiffoleau, Matthias Gille Levenson, Hugo Scheithauer, and Ariane Pinche. 2024. “Chaînes d’acquisition, de Traitement et de Publication Du Texte.” Consortium Ariane - Axe 1. https://hal.science/hal-04734959.
[49] Morlock, Emmanuelle. 2017. “TeiPublisher for EpiDoc.” In Visible Words: Digital Epigraphy in a Global Perspective. Providence, United States: John Bodel, Brown University and Michèle Brunet, Université Lumière Lyon 2, HiSoMA. https://shs.hal.science/halshs-01773203.
[50] Cayless, Hugh, and Raffaele Viglianti. 2018. “CETEIcean: TEI in the Browser.” In Proceedings of Balisage: The Markup Conference 2018. Vol. 21. Balisage Series on Markup Technologies. https://doi.org/10.4242/BalisageVol21.Cayless01.
[51] Rockwell, Geoffrey, e Stéfan Sinclair. 2016. Hermeneutica: Computer-Assisted Interpretation in the Humanities. Ebook Central. MIT Press.
[52] Tiepmar, Jochen, Christoph Teichmann, Gerhard Heyer, Monica Berti, and Gregory Crane. 2014. “A New Implementation for Canonical Text Services.” In Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH), Gothenburg, Sweden, edited by Kalliopi Zervanou and Cristina Vertan, 1–8. New York, NY, USA: ACL.
[53] Fusi, Daniele. 2017. Tecnologie informatiche per l’umanista digitale. Roma: Edizioni Nuova Cultura. https://doi.org/10.4458/9323.
[54] Buzzetti, Dino. 2023. “Towards an Operational Approach to Computational Text Analysis.” In On Making in the Digital Humanities: The Scholarship of Digital Humanities Development in Honour of John Bradley. UCl Press. https://library.oapen.org/handle/20.500.12657/62237.
[55] Bodard, Gabriel, and Polina Yordanova. 2020. “Publication, Testing and Visualization with EFES: A Tool for All Stages of the EpiDoc XML Editing Process.” Studia Universitatis Babeș-Bolyai Digitalia 65 (1): 17–35. https://doi.org/10.24193/subbdigitalia.2020.1.02.
[56] Assael, Yannis, Thea Sommerschield, Brendan Shillingford, Mahyar Bordbar, John Pavlopoulos, Marita Chatzipanagiotou, Ion Androutsopoulos, Jonathan Prag, and Nando de Freitas. 2022. “Restoring and Attributing Ancient Texts Using Deep Neural Networks.” Nature 603 (7900): 280–83. https://doi.org/10.1038/s41586-022-04448-z.
[57] Jurafsky, Daniel, and James H. Martin. 2024. Speech and Language Processing (3rd Edition). Upper Saddle River, NJ, USA: Prentice-Hall, Inc.
[58] Riemenschneider, Frederick, e Anette Frank. 2023. «Exploring Large Language Models for Classical Philology». In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), a cura di Anna Rogers, Jordan Boyd-Graber, e Naoaki Okazaki, 15181–99. Toronto, Canada: Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.acl-long.846.
[59] Devlin, Jacob, Ming-Wei Chang, Kenton Lee, e Kristina Toutanova. 2019. «BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding». arXiv. http://arxiv.org/abs/1810.04805.
[60] Voelter, Markus. Generic Tools, Specific Languages. Delft University of Technology, 2014.
[61] O’Sullivan, James, Michael Pidd, Bridgette Wessels, Órla Murphy, Michael Kurzmeier, Sophie Whittle, Andrew Prescott, et al. Digital Editing and Publishing in the Twenty-First Century. Scottish Universities Press. Scottish Universities Press, 2025. https://books.sup.ac.uk/sup/catalog/download/4/8/49.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Simone Zenzaro, Angelo Mario Del Grosso, Federico Boschetti, Graziano Ranocchia

This work is licensed under a Creative Commons Attribution 4.0 International License.