Computational Linguistics and the COVID-19 Outbreak
This page is maintained by AILC (the Italian Association for Computational Linguistics). It groups some of the initiatives that the Computational Linguistics community is carrying out to contribute to the fight against COVID-19. Everyone is invited to collaborate by reporting new initiatives. Please do so through our contact form.
CORD-19 – The Allen Institute COVID-19 Open Research Dataset, a collection of Covid-19 scientific papers, weekly updated (March 2020)
Processed CORD-19 – The Allen Institute corpus processed with Sketch Engine (March 2020)
40wita – A dataset of tweets in Italian collected daily by the University of Turin
Corona Corpus – A corpus of texts from online newspapers and magazines in 20 different English-speaking countries and part of the English-Corpora.org suite of corpora
COVID-19 Browser – A semantic search tool on COVID-19 scientific papers developed by Gabriele Sarti and hosted by Area Science Park (April 2020)