We present DanteSources, a focused Digital Library that aims at building innovative services supporting scholars in creating, evolving and consulting a digital encyclopaedia of Dante Alighieri’s works. DanteSources is endowed with Web services that allow extracting and visualizing data about Dante Alighieri’s primary sources, i.e. other authors’ works which Dante referred to in his own works. The heart of the digital library is a knowledge base built on an ontology that was developed in the RDF/S language, representing Dante’s works and the knowledge about them, extracted from some authoritative commentaries. Finally, on top of this graph, we developed DanteSources, which allows users to extract and display the knowledge stored in the knowledge base in the form of charts and tables. The methodology and the tool we developed are easily reusable to represent the knowledge about primary sources of other authors of the Italian or the international literature.
In questo articolo presentiamo DanteSources, una Digital Library il cui scopo è fornire servizi innovativi di supporto alla creazione, evoluzione e consultazione dell'enciclopedia digitale delle fonti di Dante Alighieri. DanteSources è dotato di servizi che consentono l'estrazione e la visualizzazione delle fonti primarie di Dante, i.e. opere di altri autori a cui Dante ha fatto riferimento nelle proprie opere. Il cuore del progetto è una base di conoscenza costruita attorno ad uno Schema RDF rappresentante le opere di Dante e le relative informazioni estratte da autorevoli commentari. A partire dal grafo è stato sviluppata l'applicazione DanteSources, che permette agli utenti di estrarre e visualizzare i contenuti sotto forma di grafici e tabelle. La metodologia e il tool realizzato sono di facile riutilizzo per la rappresentazione di conoscenza relativa a fonti primarie, siano esse italiane o internazionali.
One kind of knowledge that the scholars consider essential for of the study of old literary works is the analysis of their primary sources, i.e. the works of other authors that an author makes reference to in his/her texts. Usually, this knowledge is reported in modern paper commentaries, which the readers use to understand several aspects of the literary works. Having this knowledge available in a machine-readable form instead of in natural language could allow using computers to make inferences that could facilitate and improve the study to scholars, e.g. automatic calculation of the distribution of the primary sources and of the authors cited in an old text.
Our research is part of the Towards a Digital Dante Encyclopedia project, an Italian National Research project supporting scholars in formally expressing the knowledge about primary sources presented in Dante Alighieri’s works and more generally in literary texts. In order to represent this information we (i) created an ontology providing a formal representation of the terms required for expressing knowledge about the primary sources, and (ii) develop a semantic Digital Library (DL) based on the ontology that allows scholars to make inferences on the collected data and visualize them in a friendly and easy way.
In the Digital Humanities field, there are ontologies focusing on different aspects of textual information but, up to now, an ontology for representing knowledge about primary sources of literary texts has not been developed yet. For this reason, we created an ontology in the RDF/S language to represent Dante’s works and the knowledge about their primary sources. To develop the ontology, we reused existing vocabularies where possible, in order to maximize its interoperability, adding our own classes and relationships for the representation of the categories of knowledge that were not addressed by the existing ontologies. For a detailed description of the ontology we refer to .
Then, we populated the ontology with the knowledge extracted from some authoritative commentaries, obtaining an RDF knowledge base. On top of this knowledge base, we developed DanteSources, a focused Digital Library that allows extracting and visualizing information on Dante Alighieri’s primary sources in form of charts and tables and in CSV format. In this paper we describe the DL DanteSources and the methodology we used to develop it.
The paper is structured as follows: reports a review of related works; in the functionalities of DanteSources are described in detail; in conclusions are reported.
In the Semantic Data Modeling field, several ontologies have been developed in order to represent textual knowledge. Up to now, a formal ontology for representing knowledge about the primary sources of literary texts has not appeared yet. So we developed a new ontology taking into account the existing ones, reusing their classes and properties in order to improve its interoperability. In particular, we reused, among others, some classes and properties from authoritative ontologies like FRBRoo , SKOS , Dublin Core, DoCo , and the CIDOC CRM .
In the Digital Humanities field, several projects about Dante Alighieri and his works have been published online. Most of them offer digital editions of Dante's works with commentaries, textual search, and multimedia resources. For example, the Dartmouth Dante Project (DDP) was launched in 1985 with the aim of publishing the full searchable text of the Divine Comedy and several of its commentaries in digital format . On top of the DDP in 2013 the web application Dante Lab was developed. This application allows the concurrent visualization of the original text of the Comedy, some translations and more than 75 commentaries. The Princeton Dante Project includes the full searchable text of the Divine Comedy, Dante’s minor works, several commentaries and multimedia resources . Digital Dante offers the full text of all Dante’s works with commentaries, illustrations and recorded readings. It also features a subproject called Intertextual Dante, the first digital attempt at connecting passages of Dante’s works with the corresponding fragments of cited primary sources. However, the project is limited to the works of the Roman poet Ovid . Recently, two Italian research projects have been published: DanteSearch and Dante Medieval Archive (DaMA). DanteSearch is a complete lemmatization, grammatical and syntactic annotation of Dante’s works allowing users to perform morphological and syntactic queries on the full text of the author's works. On the other hand, DaMA is a digital archive containing the full text of Dante’s works, commentaries and several primary sources in XML-TEI format.
With the advent of the Semantic Web, several projects were started with a focus on the application of the new semantic technologies (RDF, OWL) to the Humanities. For instance, describes a semantic model to connect repertoires of poetic writings. presents an ontology for the 3D visualization of cultural heritage; describe an ontology for annotating geographical places in texts. In this context, we developed DanteSources DL, which uses the technologies of the Semantic Web to represent the knowledge included in the works of Dante Alighieri, focusing on primary sources.
First of all, in order to develop the DL DanteSources, we analyzed the representation requirements about citations of primary sources, in collaboration with a Dante’s expert. Then, on the basis of this analysis, we created an ontology responding to the requirements, providing a formal representation of the terms required for expressing knowledge about the primary sources, using the RDF/S Semantic Web language. After this phase, we developed a semi-automatic tool for populating the ontology and storing the resulting data in a knowledge base. Finally, we developed DanteSources, a web application running on top of the knowledge base that allows making inferences on the contents of the knowledge base and visualizes them in a friendly and easy way, using charts and tables.
The authoritative commentaries used to populate the ontology are the following: Vita Nova , Vita Nuova , De vulgari eloquentia , Convivio , Monarchia , and Rime . DanteSources is developed in Java using JavaScript and Ajax functions. The DL extracts knowledge by running SPARQL queries on the RDF knowledge base. DanteSources shows the knowledge about the primary sources cited by Dante in the form of tables and column bar charts. In particular, we used the Highcharts JavaScript library to implement these charts. Highcharts allows exporting the charts in various well-known formats: PDF, PNG, JPEG, SVG. Furthermore, we implemented an additional JavaScript function allowing users to automatically export and download all the data in CSV format. This feature was particularly appreciated by the scholars since it allows obtaining and managing raw data in a simple and standard format, in order to apply further data analyses in addition to the ones already provided by the DL.
Currently, DanteSources makes available eight different predefined SPARQL queries to extract data. They can be distinguished into three different groups. The first group includes three queries. In order to make these queries, a search form allows users to choose either one Dante's work or all his works and, in addition, a specific subpart of the work (e.g. a book). The queries produce column bar charts regarding the distribution of the works, the authors and the thematic areas cited by Dante. For example, shows the chart of the distribution of the ten most-cited primary sources in Dante’s work Rime.
Further information about primary sources, authors and thematic areas can be visualized by clicking on their names on the charts. In particular, for each primary source the DL shows a table reporting information about: (i) the book, (ii) the chapter, (ii) the paragraph or verse and (iv) the fragment of the Dante's work in which the primary source is cited, (v) the type of reference, (vii) the reference to a fragment of the primary source cited in the commentary, (viii) the thematic area and (ix) the author of the primary source. An example of this table, related to Brunetto Latini’s work Tesoretto, is shown in .
Similarly, when clicking on the name of a cited author, a table reporting all the works of that author cited in the Dante’s text chosen by the user is shown. Furthermore, for each primary source the DL reports the book, the chapter (or poem) and the paragraph (or verse) of the Dante’s text in which the author is cited. shows an example of table related to cited author Brunetto Latini.
The same happened for the thematic areas, indeed clicking on one name the user obtains the following information: the primary sources included in that area, their authors and the book, chapter (or poem) and paragraph (or verse) of the Dante’s work where the thematic area is cited. shows an example of table related to thematic area Aristotelianism.
The three queries of the second group allow visualizing several charts that report the distribution of a particular primary source, a cited author or a thematic area respectively.
The data regarding the distributions are not only available for an entire Dante's work, like Convivio, but also for its subparts like books, chapters or poem. Indeed, by clicking on one bar in the chart representing the distribution of the information onto a Dante’s work, it is possible to visualize information about its subparts. For example, it is possible to visualize the distribution of the selected knowledge in all three books of Dante’s Monarchia and also in each chapter of the first book of Monarchia. shows the distribution of a primary source on three different levels: all Dante’s works (a), one particular Dante’s work (b), a subpart of a Dante’s work (c).
The last group includes two queries that visualize the distribution of the three types of reference to primary sources: explicit, strict and generic, focusing on either on Dante’s works or on all primary sources written by a specific author. Indeed, with the support of Dante’s experts, we defined three types of reference:
1. explicit, if the reference is explicitly made by Dante, e.g. As the Philosopher says at the beginning of the First Philosophy
, where the Philosopher is Aristotle and the First Philosophy is Metaphysics;
2. strict, if the reference is indicated by a scholar and refers to a specific work, e.g. “SI MANUCA: it is the bread of the angels, the manna as called in the Old Testament (Ps. 77, 25 Panem angelorum manducavit homo)”;
3. generic, if the reference is indicated by a scholar, and refers to a concept (e.g. Medieval comments to Aristotle’s works).
shows the distribution of the three types of reference in a specific primary source. By clicking on the bars of the chart, the data regarding the distribution of the types of reference are available both for an entire Dante's work and for its subparts, i.e. books and chapters. shows the distribution of the three types of reference in all primary sources written by a specific author, Albertus Magnus.
In this paper we have presented DanteSources, a Digital Library that allows free access to the knowledge on Dante Alighieri’s primary sources. In order to represent this knowledge, we developed an ontology, expressed in RDF/S, on top of which DanteSources was created. The main advantages of using an ontology instead of a traditional database are the following: (i) it allows researchers to add classes and relationships thereby refining the ontology, (ii) the ontology can be linked to other ontologies to extend the represented domain, (iii) any user can download and use the knowledge base freely, using the paradigm of Linked Data.
Furthermore, this is the first time that the knowledge about Dante’s primary sources is coded in digital format. This availability allows improving and making the scholars’ searches more efficient. Eventually, the digitization of the knowledge about primary sources and its visualization in form of charts and tables allow the scholars to have a complete overview of the data. This simplifies the study by the scholars on the evolution in time of Dante’s cultural background, also in relation with the diverse stages of his biography.
The ontology, the tool to populate it and the developed DL are easily and freely reusable in order to represent the knowledge about primary sources of other authors of the Italian and international literature.
DanteSources was the winner of the Digital Humanities Awards 2015 in the category Best DH Tool or Suite of Tools.
Alighieri, Dante. Convivio.
In Opere, a cura di Gianfranco Fioravanti, Vol. 2. Milano: Mondadori, 2014.
Alighieri, Dante. De Vulgari Eloquentia.
In Opere, a cura di Mirko Tavoni, Vol. 1. Milano: Mondadori, 2011.
Alighieri, Dante. Monarchia.
In Opere, a cura di Diego Quaglioni, Vol. 2. Milano: Mondadori, 2014.
Alighieri, Dante. Rime.
In Opere, a cura di Claudio Giunta, Vol. 1. Milano: Mondadori, 2011.
Alighieri, Dante. Vita nova.
In Opere, a cura di Guglielmo Gorni, Vol. 1. Milano: Mondadori, 2011.
Alighieri, Dante. Vita nuova. a cura di Domenico De Robertis. Milano-Napoli: Ricciardi, 1980.
Bartalesi, Valentina e Carlo Meghini. Using an ontology for representing the knowledge on literary texts: the Dante Alighieri case study.
Semantic Web 8, num. 3 (2017): 385–94.
Doerr, Martin. The CIDOC CRM - An ontological approach to semantic interoperability of metadata,
2002.
González-Blanco, Elena, Levente Seláf, Gimena Del Rio Riande, Clara Martínez Cantón e María Dolores Martos Pérez. Building a metrical ontology as a model to link digital poetic repertoires.
Lausanne, 2014.
Hollander, Robert. The Dartmouth Dante project.
Quaderni d’italianistica 10, num. 1–2 (1989): 287–98.
Hollander, Robert. The Princeton Dante project.
Humanist studies & the digital age 3, num. 1 (2013): 53–59.
Lana, Maurizio, Fabio Ciotti, Diego Magro, Silvio Peroni, Francesca Tomasi e Fabio Vitali. Annotating Texts with Ontologies, from Geography to Persons and Events,
494–96. Lausanne, 2014.
Manola, Frank, Eric Miller e Brian McBride. RDF Primer.
W3C, 2004. https://www.w3.org/TR/2004/REC-rdf-primer-20040210/.
Miles, Alistair, Brian Matthews, Michael Wilson e Dan Brickley. SKOS Core: simple knowledge organisation for the Web.
International conference on Dublin Core and metadata applications, 2005, 3–10.
Riva, Pat, Martin Doerr e Maja Žumer. FRBRoo: enabling a common view of information from memory institutions,
2008.
Shotton, David. CiTO, the citation typing ontology.
Journal of Biomedical Semantics 1, num. 1 (June 22, 2010): S6.
Van Peteghem, Julie. Digital readers of allusive texts: Ovidian intertextuality in the Commedia and the digital concordance on intertextual Dante.
Humanist studies & the digital age 4, num. 1 (2015): 39–59.
Vitale, Valeria. An ontology for 3D visualization in cultural heritage. MP4. Digital classicist. London, 2013. http://www.digitalclassicist.org/wip/wip2013-02vv.html.
http://dublincore.org/documents/dcmi-type vocabulary/index.shtml
http://digitaldante.columbia.edu/digital-projects/intertexual-dante/
http://www.perunaenciclopediadantescadigitale.eu:8080/dantesearch/