Covid NZ Twitter Corpus

The Covid NZ Twitter (CovidNZT) Corpus consists of 40,243 tweets posted between 22 February and 10 November 2020, 1,000 of which have been manually coded. All tweets contain the hashtag #covid19nz, including variations with one or more capital letters. The aim of the project is to study linguistic strategies which tweeters use to express their viewpoints and stance in Covid-related tweets. In particular, one of our points of interests is the use of directives.

Covid19NZ Poster

If you would like to talk to us about this project, please email Andreea Calude.

Downloading the CovidNZT Corpus

The CovidNZT Twitter Corpus can be downloaded here. In accordance with Twitter's terms of service, we can only provide the raw tweet IDs; however, we also include a script for extracting the tweets via the Twitter API. Please follow the instructions in readme.txt.


Media Attention



We graciously acknowledge the generous support of: