Hoenen, Armin, et al. “A Manual for Web Corpus Crawling of Low Resource Languages”. Umanistica Digitale, vol. 4, no. 8, Jan. 2020, doi:10.6092/issn.2532-8816/9931.