StilometrIA alla prova delle scritture collettive: Da “Quaderni piacentini” e “Quindici” a ChatGPT

Marco De Cristofaro; Mariangela Giglio

doi:10.6092/issn.2532-8816/21219

Authors

Marco De Cristofaro Université de Mons https://orcid.org/0009-0002-9723-1448
Mariangela Giglio Università di Bologna

DOI:

https://doi.org/10.6092/issn.2532-8816/21219

Keywords:

Stylometry, Authorship attribution, Large Language Models, GPT-4, Magazines, #AIUCD2024

Abstract

The research aims to explore the ability of GPT-4 to emulate the style of two Italian cultural magazines active in the 1960s, «Quaderni Piacentini» and «Quindici». Using a corpus derived from the early issues of these magazines, the study assesses whether GPT-4 can bypass stylometric analysis by producing text that reflects the editorial strategy of a specific magazine. After using GPT-4 for generating emulative texts, a stylometric analysis was conducted to compare the AI generated texts with the original corpus. Comparison with traditional stylometric methodologies has allowed the identification of aspects where the two journals diverge, and consequently, the nodes on which the model focuses for stylistic and thematic differentiation. The research intends to open new applications on the use of stylometry for the computational analysis of texts related to specific cultural contexts; indeed, until now, the scientific community has focused on the ability of LLMs to faithfully reproduce the styles of different authors. Applying these methodologies to the field of magazines would allow for broader considerations on editorial strategies, reading trends, and the processes of idea circulation.

References

[1] Asor Rosa, Alberto. 1975. «La cultura». In Dall’Unità a oggi. Vol. IV. Storia d’Italia. Torino: Einaudi.

[2] Asprovska, Marijana, e Nathan Hunter. 2024. «The Tokenization Problem: Understanding Generative AI’s Computational Language Bias». Ubiquity Proceedings. https://doi.org/10.5334/uproc.123.

[3] Auerbach, Erich. Mimesis : il realismo nella letteratura occidentale. 2 voll. Torino: Einaudi, 1956.

[4] Baldini, Anna. 2023. A regola d’arte: storia e geografia del campo letterario italiano (1902-1936). Prima edizione. Letteratura tradotta in Italia. Macerata: Quodlibet.

[5] Balestrini, Nanni, a c. di. 2008. Quindici: una rivista e il Sessantotto. Saggi universale economica Feltrinelli. Milano: Feltrinelli.

[6] Baranelli, Luca, e Grazia Cherchi, a c. di. 1977. “Quaderni piacentini” 1962-1968. Milano: Gulliver.

[7] Barthes, Roland. Il grado zero della scrittura. Milano: Lerici, s.d.

[8] Bazzocchi, Marco A. 2021. “Comico e marginalità”. In Cento anni di letteratura italiana 1910-2010, edited by M. A. Bazzocchi. Einaudi.

[9] Berruto, Gaetano. 1987. Sociolinguistica dell’italiano contemporaneo. La nuova Italia Scientifica.

[10] Bortolotto, Francesco, e Davide Paone. 2018. «Una crepa nel sistema: dalla crisi di “Quindici” alla ricostruzione di “Alfabeta”». In Sistema periodico. Il secolo interminabile delle riviste, 189–209. Bologna: Pendragon.

[11] Bortolotto, Francesco, Eleonora Fuochi, Davide Antonio Paone, e Federica Parodi, a c. di. 2018. Sistema periodico. Il secolo interminabile delle riviste. Bologna: Pendragon.

[12] Bourdieu, Pierre. 1992. Les Règles de l’art. Genèse et structure du champ littéraire. Paris: Le Seuil.

[13] Burrows, J. 2002. «“Delta”: A Measure of Stylistic Difference and a Guide to Likely Authorship». Literary and Linguistic Computing 17 (3): 267–87. https://doi.org/10.1093/llc/17.3.267.

[14] Burrows, J. 2007. «All the Way Through: Testing for Authorship in Different Frequency Strata». Literary and Linguistic Computing 22 (1): 27–47. https://doi.org/10.1093/llc/fqi067.

[15] Cadioli, A. 1998. La ricezione. Laterza.

[16] Cadioli, A., Decleva, E. and Spinazzola, V., eds. 1999. La mediazione editoriale. Il Saggiatore/Fondazione Arnoldo e Alberto Mondadori.

[17] Carpi, Umberto. 1981. L’estrema avanguardia del Novecento. Roma: Editori Riuniti.

[18] Eder, M. 2013. «Mind Your Corpus: Systematic Errors in Authorship Attribution». Literary and Linguistic Computing 28 (4): 603–14. https://doi.org/10.1093/llc/fqt039.

[19] Eder, Maciej. 2015. «Does Size Matter? Authorship Attribution, Small Samples, Big Problem». Digital Scholarship in the Humanities 30 (2): 167–82. https://doi.org/10.1093/llc/fqt066.

[20] Eder, Maciej. 2017. «Short Samples in Authorship Attribution: A New Approach». In Digital Humanities Conference. https://api.semanticscholar.org/CorpusID:7574620.

[21] Eder, Maciej, Jan Rybicki, e Mike Kestemont. 2016. «Stylometry with R: A Package for Computational Text Analysis». The R Journal 8 (1): 107. https://doi.org/10.32614/RJ-2016-007.

[22] Fofi, Goffredo, e Vittorio Giacopini, a c. di. 1998. Prima e dopo il ’68: antologia dei Quaderni piacentini. 1. ed. Roma: Minimum fax.

[23] Franzini, Greta, Mike Kestemont, Gabriela Rotari, Melina Jander, Jeremi K. Ochab, Emily Franzini, Joanna Byszuk, e Jan Rybicki. 2018. «Attributing Authorship in the Noisy Digitized Correspondence of Jacob and Wilhelm Grimm». Frontiers in Digital Humanities 5 (aprile):4. https://doi.org/10.3389/fdigh.2018.00004.

[24] Giuliani, Alfredo. 1967. «Le cerimonie sadiche della critica». Quindici, 1967.

[25] Gray, Andrew. 2024. «ChatGPT “contamination”: estimating the prevalence of LLMs in the scholarly literature». arXiv. https://doi.org/10.48550/ARXIV.2403.16887.

[26] Guerriero, Stefano. 2021. «Salotto, laboratorio, dipartimento: la rivista come istituzione letteraria nel secondo Novecento, da “Aretusa” a “Linea d’Ombra”». In Passeurs. La letteratura italiana del SEcondo Novecento fuori d’Italia: ricezione e immaginario (1945-1989). Vol. 49. Liminaires. Bruxelles: Peter Lang.

[27] Haaf, Susanne, Frank Wiegand, e Alexander Geyken. 2013. «Measuring the Correctness of Double-Keying: Error Classification and Quality Control in a Large Corpus of TEI-Annotated Historical Text». Journal of the Text Encoding Initiative, fasc. Issue 4 (marzo). https://doi.org/10.4000/jtei.739.

[28] Holley, Rose. 2009. «How Good Can It Get?: Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs». D-Lib Magazine 15 (3/4). https://doi.org/10.1045/march2009-holley.

[29] Italia, Paola. 2013. Editing Novecento. Salerno.

[30] Jauss, Hans R. 1982. Ästhetische Erfahrung und literarische Hermeneutik. Suhrkamp.

[31] Jones, Cameron R., e Benjamin K. Bergen. 2025. «Large Language Models Pass the Turing Test». https://doi.org/10.48550/ARXIV.2503.23674.

[32] Kestemont, Mike. 2014. «Function Words in Authorship Attribution. From Black Magic to Theory?» In Proceedings of the 3rd Workshop on Computational Linguistics for Literature (CLFL), 59–66. Gothenburg, Sweden: Association for Computational Linguistics. https://doi.org/10.3115/v1/W14-0908.

[33] Kichuk, Diana. 2015. «Loose, Falling Characters and Sentences: The Persistence of the OCR Problem in Digital Repository E-Books». Portal: Libraries and the Academy 15 (1): 59–91. https://doi.org/10.1353/pla.2015.0005.

[34] Kjell, Bradley, W.Addison Woods, e Ophir Frieder. 1994. «Discrimination of Authorship Using Visualization». Information Processing & Management 30 (1): 141–50. https://doi.org/10.1016/0306-4573(94)90029-9.

[35] Köbis, Nils, e Luca Mossink. 2020. «Artificial Intelligence versus Maya Angelou: Experimental evidence that people cannot differentiate AI-generated from human-written poetry». arXiv. https://doi.org/10.48550/ARXIV.2005.09980.

[36] Koppel, Moshe, e Yaron Winter. 2014. «Determining If Two Documents Are Written by the Same Author». Journal of the Association for Information Science and Technology 65 (1): 178–87. https://doi.org/10.1002/asi.22954.

[37] Liang, Weixin, Zachary Izzo, Yaohui Zhang, Haley Lepp, Hancheng Cao, Xuandong Zhao, Lingjiao Chen, et al. 2024. «Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews». arXiv. https://doi.org/10.48550/ARXIV.2403.07183.

[38] Liang, Weixin, Mert Yuksekgonul, Yining Mao, Eric Wu, e James Zou. 2023. «GPT detectors are biased against non-native English writers». arXiv. https://doi.org/10.48550/ARXIV.2304.02819.

[39] Luperini, Romano. 1981. Il Novecento. Torino: Loescher.

[40] Mangoni, Luisa. 1974. L’interventismo della cultura. Intellettuali e riviste del fascismo. Roma-Bari: Laterza.

[41] Mitchell, Melanie, e David C. Krakauer. 2023. «The Debate over Understanding in AI’s Large Language Models». Proceedings of the National Academy of Sciences 120 (13): e2215907120. https://doi.org/10.1073/pnas.2215907120.

[42] Muraca, Giuseppe. 1990. «Cronistoria dei ‘Quaderni piacentini’». In Da Il Politecnico a ‘Linea D’ombra. Poggibonsi: Lalli.

[43] Patat, Alejandro, e Brigitte Poitrenaud-Lamesi, a c. di. 2021. Passeurs: La letteratura italiana del Secondo Novecento fuori d’Italia: ricezione e immaginario (1945-1989). Brussels: Peter Lang.