Automatic extraction of opera character characteristics through lexical-syntactic patterns

Authors

  • Paolo Bonora, University of Bologna
  • Angelo Pompilio, University of Bologna

DOI:

https://doi.org/10.6092/issn.2532-8816/12426

Keywords:

information extraction, NLP, characters, opera, patterns

Abstract

This paper presents an experiment with rules based on syntactic patterns for extracting interpersonal relationships between characters in the 18th-century opera repertoire. The study applies this unsupervised approach to identify such relationships as they are described in the captions that accompany the lists of characters in librettos. The results show that the proposed solution extracts relations defined through formal ontologies with a precision that allows them to be included in a domain knowledge base without further supervision. The experiment also contributes to a formal model for describing the features of opera characters that are needed to reconstruct their profiles. Given the size of the repertoire under examination and the number of characters involved, automated analysis gives researchers a useful tool to support critical analysis of the sources. The ability to reconstruct the characters' features and their network of relationships is in turn a prerequisite for using the character as a dimension of analysis when reconstructing the complex tradition of this kind of text.
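
To make the idea of pattern-based extraction from character-list captions concrete, the following is a minimal sketch and not the authors' implementation. It substitutes a plain regular expression over surface text for real syntactic patterns, and everything in it is an assumption made for illustration: the relation labels (`childOf`, `parentOf`, `siblingOf`, `loverOf`), the term table, the function name `extract_relation`, and the sample captions, which are invented rather than taken from any libretto.

```python
import re

# Hypothetical mapping from Italian relationship terms to relation labels.
# The labels are illustrative and do not come from the paper's ontology.
RELATION_TERMS = {
    "figlio": "childOf",
    "figlia": "childOf",
    "padre": "parentOf",
    "madre": "parentOf",
    "fratello": "siblingOf",
    "sorella": "siblingOf",
    "amante": "loverOf",
}

# Surface pattern for captions shaped like "<Name>, <term> di <Name>"
# or "<Name>, <term> d'<Name>". A regular expression stands in here
# for the syntactic patterns described in the abstract.
PATTERN = re.compile(
    r"^(?P<subject>[A-Z]\w*),\s+(?P<term>\w+)\s+(?:di\s+|d')(?P<object>[A-Z]\w*)"
)


def extract_relation(caption):
    """Return (subject, relation, object), or None if no rule applies."""
    match = PATTERN.match(caption.strip())
    if match is None:
        return None
    relation = RELATION_TERMS.get(match.group("term").lower())
    if relation is None:
        # The caption has the right shape but the term is not a known
        # relationship word (for example a title such as "re").
        return None
    return (match.group("subject"), relation, match.group("object"))


if __name__ == "__main__":
    # Invented captions, used only to exercise the rule.
    for caption in [
        "Elvira, figlia di Rodrigo",
        "Lucinda, amante d'Ormondo",
        "Rodrigo, re di Castiglia",
    ]:
        print(caption, "->", extract_relation(caption))
```

A rule of this kind returns nothing when a caption does not fit, which favors precision over coverage; that trade-off is what makes the output usable without manual review, at the cost of missing captions phrased differently.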

Published

2021-09-09

How to Cite

Bonora, P., & Pompilio, A. (2021). Automatic extraction of opera character characteristics through lexical-syntactic patterns. Umanistica Digitale, 5(10), 193–210. https://doi.org/10.6092/issn.2532-8816/12426

Section

Articles