COVID Dataset
11 de Março de 2024

Due to the relevance of the COVID-19 global pandemic, we are releasing our dataset of tweets acquired from the Twitter Stream related to COVID-19 chatter. Since our first release we have received additional data from our new collaborators, allowing this resource to grow to its current size. Dedicated data gathering started from March 11th yielding over 4 million tweets a day. We have added additional data provided by our new collaborators from January 27th to March 27th, to provide extra longitudinal coverage. Version 10 added ~1.5 million tweets in the Russian language collected between January 1st and May 8th, gracefully provided to us by: Katya Artemova (NRU HSE) and Elena Tutubalina (KFU). From version 12 we have included daily hashtags, mentions and emoijis and their frequencies the respective zip files. From version 14 we have included the tweet identifiers and their respective language for the clean version of the dataset. Since version 20 we have included language and place location for all tweets.

Titulo em português
Conjunto de dados de menções da COVID no Twitter
Plataforma
Twitter
Tipos de dataset
Tabular
Formato do Dataset
Anonimizados
Repositório
Zenodo
Autores
Juan Banda (Georgia State University)
Tipo de coleta
API
Ferramenta e Método de Coleta
Início da coleta
Final da coleta
Procedimento de Reidratação
Classificação do Dataset
Ciências Sociais Aplicadas
Palavras-chave
Data de criação
11 de Março de 2024
Organizações Financiadoras
ABNT
Banda, J. COVID Dataset. DOI: https://doi.org/10.5281/zenodo.7834392
APA
Banda, J. (2024). COVID Dataset. https://doi.org/10.5281/zenodo.7834392