Hoenen, Armin, Cemre Koc, and Marc Daniel Rahn. “A Manual for Web Corpus Crawling of Low Resource Languages”. Umanistica Digitale 4, no. 8 (January 1, 2020). Accessed June 6, 2025. https://umanisticadigitale.unibo.it/article/view/9931.