COVID-19 Resources
The Academic Data Science Alliance is working with partners to pull together data and data science resources related to the COVID-19 pandemic. This is a living list of resources and we welcome additions, suggestions, and collaborations. Please send additions, corrections, comments, and suggestions to us using this feedback form.
Please keep suggestions limited to research and data resources, avoiding opinion pieces and teaching resources.
Note that the resources listed here are provided as-is. Use your professional judgement and best practices to vet these resources before you use them for research and scientific communications.
Datasets and Data Collections
UVA Biocomplexity Institute - COVID-19 Datasets
Corona Data Scraper - contributed and curated data sources for COVID-19 tracking
COVID-19 Tracking Project - managed by Alexis Madrigal from The Atlantic - tracks US state-by-state COVID-19 progression
Johns Hopkins University Coronavirus - COVID-19 Global Cases, by country
Wikidata WikiProject COVID-19 - Project to collect Wikidata resources related to COVID-19 and SARS-CoV-2. Resources may also relate to relevant epidemiological events like 2019–20 COVID-19 pandemic
Data repository for the 2019 Novel Coronavirus Visual Dashboard operated by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE) - https://github.com/CSSEGISandData/COVID-19
NY Times Github Repo https://github.com/nytimes/covid-19-data
Mozilla Firefox - Opening data to understand social distancing – Data@Mozilla
COVID-19 Public Datasets: BigQuery Public Datasets Program - Covid-19 datasets available (free) in BigQuery as part of the Google Public Datasets program.
C3.ai COVID-19 Data Lake - a collection of datasets from governmental, NGO, and commercial sources - requires registration
Postman COVID-19 API Resource Center - List of APIs and Blueprints from Postman
COVID Graph - A platform for developing a knowledge graph on Covid-19. This site also has a number of collected datasets
iDigBio Portal - access to aggregated biodiversity occurrence data for discovering vouchered specimens relevant to COVID vector research
Global Biodiversity Information Facility - Occurrences - access to aggregated biodiversity occurrence data for discovering vouchered specimens relevant to COVID vector research
COVID-19 Data Hub - The goal of the project is to provide the research community with a unified data hub by collecting worldwide fine-grained case data merged with demographics, air pollution, and other exogenous variables helpful for a better understanding of COVID-19.
COVID-19 Data Portal - COVID-19 data platform from the European Commission and EMBL’s European Bioinformatics Institute (EMBL-EBI), together with EU Member States and research partners such as ELIXIR
COVID-19 Geospatial & Situational Awareness Resources from the National Alliance for Public Safety GIS
Open-Access Data and Computational Resources to Address COVID-19 - NIH Office of Data Science Strategy seeks to provide the research community with links to open-access data, computational, and supporting resources
Visualizations Archive Data Repository for CGDV - datasets provided through the Center for Global Data Visualization
SafeGraph COVID-19 Data Consortium - access to SafeGraph point of interest location data for COVID-19 research
JieYingWu - COVID-19_US_County-level_Summaries
COVID-19 Interventions Data - country-level data on timing of interventions for COVID-19
Open ICPSR COVID-19 Data Repository - Inter-university Consortium for Political and Social Research repository for COVID-19 data
Dataset For Exploring The Coronavirus Narrative In Global Online News
Microsoft Bing Coronavirus Query Set - Dataset containing Aggregated and anonymized queries from across the world with Coronavirus intent
Microsoft Bing-COVID-19-Data - A repo for coronavirus related case count data from around the world. The repo will be regularly updated
Microsoft Academic resources and their application to COVID-19 research - COVID search of research literature
COVID-19 Research Database (health data) - The COVID-19 research database is a collection of de-identified data sets made freely available to public health and policy researchers to extract insights for combating the COVID-19 pandemic.
Social Media for Public Health - This site serves as a platform for collecting data resources and publications in the fight against COVID-19. These resources are focused on social media data and how it can be used to prevent the spread of COVID-19. Possible applications include the combating of misinformation, supporting messaging from public health organizations and tracking information about the ongoing COVID-19 pandemic.
Neo4j Knowledge Graph for COVID-19 Data - Community effort to build a Neo4j Knowledge Graph (KG) that links heterogeneous data about COVID-19
Outbreak.info - outbreak.info is a database of COVID-19 and SARS-CoV-2 resources and epidemiology data to easily discover this information. Built by Scripps Research with funding from NIH
Analytic Tools
COVID analyses for R - This repository links to a collection of analyses on and representations of COVID19 data in R
Quantified Flu - citizen science project that tries to address the question of whether our wearables can warn us when we’re getting sick
BoaC: a data infrastructure for analyzing COVID-19 data - a data infrastructure to make it easier to analyze COVID-19 data, e.g. thousands of papers on coronaviruses made available by the Allen Institute for AI (AI2). BoaC provides a web-based interface to perform fine-grained analysis over the data. The data is stored in a structured form and each paragraph and sentence of these papers can be analyzed. They are looking for feedback to improve it.
Coronavirus Act Now - state by state modeled estimates of COVID-19 progression over the coming months. Put together by a team of data scientists with guidance from public health experts.
CurveFit Package for the IHME data visualizations (see below) - Generic curve fitting package with nonlinear mixed effects model
Notebook based visualizations of Johns Hopkins and NY Times datasets (from David Culler at UC Berkeley)
Stanford Medicine COVID-19 Tools - a suite of data visualization tools and calculators to help healthcare providers and policy makers understand demands on the healthcare system
Academic Research Article Collections
Semantic Scholar - Open Research Dataset (article corpus)
UVA Biocomplexity Institute - COVID-19 Publications and Presentations
Outbreak Science Rapid PREreview - rapid peer review for preprints from a number of preprint sources
Free access to ACM's digital library through June 30
Events and Conversations
Henry Wheeler Center for Emerging & Neglected Diseases (CEND) Hackathon - March 25-26, 2020
COVID-19 Global Hackathon (projects due March 30th)
COVID-19 Biohackathon April 5-11 2020
Berkeley Institute for Data Science - COVID-19 Seminars - a collection of presentations and seminars on COVID-19 from BIDS
Columbia University COVID-19 Virtual Symposia - video recordings of weekly symposia for COVID-19 research at Columbia University
ELLIS against Covid-19 | European Lab for Learning - The European Laboratory for Learning and Intelligent Systems (ELLIS) has recently organized an online workshop which has been live streamed (with Joshua Bengio and Bernhard Schölkopf as one of the guest speakers).
University of Washington School of Public Health COVID-19 Updates - recorded webinars, panel discussions, and other events and opportunities from the UW School of Public Health
SDSC Research Data Services - San Diego Supercomputer Center Talks - including some COVID-19 related content
DSA 2020 : Data Science in Action - Speaker Series from the University of Connecticuit - Data Science in Action in Response to the Outbreak of COVID-19
MIDAS COVID-19 webinar series - Speaker Series from the University of Michigan - COVID-19 Data Science Research Special Webinar Series
Challenges
White House OSTP Call for Machine Readable COVID-19 data - combines the Allen Institute corpus (see below) with a call for tools to parse data therein
MIT COVID-19 Challenge - April 3-5 - Build a solution for the COVID-19 crisis
Funding Opportunities
(we are grateful to NYU's Office of Research for many of these links)
US National Institutes of Health
US National Science Foundation
Department of Energy Dear Colleagues Letter 3/12/20 [pdf]
Wellcome: Epidemic Preparedness: COVID-19 funding call
COVID-19 Therapeutics Accelerator (Gates, Wellcome, Mastercard)
Gordon & Betty Moore Foundation: Diagnostic Excellence Initiative (not COVID-specific)
Chan Zuckerberg Initiative: CZI Coronavirus Response
AWS Diagnostic Development Initiative (DDI)
MIT SOLVE Health Security and Pandemics
COVID-19 High-Performance Computing Consortium - industry, government, and academic consortium to provide funding and resources for COVID-19 response
NSF Request for SBIR/STTR Phase I Proposals Addressing COVID-19 - https://www.nsf.gov/pubs/2020/nsf20065/nsf20065.jsp
C3.ai Digital Transformation Institute Request for Proposals
UK Research and Innovation - Open Call to address COVID-19
Mozilla Open Source Software Solutions Fund
Fast Grants has rapid response grants of up to $500,000 for research projects that could help with the COVID-19 pandemic within the next six months
MSI STEM Research and Development Consortium - Funding Opportunities
Data Visualizations
Financial Times COVID-19 Visualizations
An interactive visualization of the exponential spread of COVID-19
UVA Biocomplexity Institute - COVID-19 Dashboards
Tableau’s Covid-19 Data Hub - trusted covid-19 global data from community experts
Institute for Health Metrics and Evaluation COVID-19 forecasting tool
Novel Coronavirus Infection Map - from the University of Washington Humanistic GIS Laboratory
Covidvis - Covidvis is a collaborative effort across computational epidemiology, public health, and visualization researchers at UC Berkeley (EECS, School of Information, and School of Public Health), University of Illinois (Computer Science), and Georgia Tech (Computational Science and Engineering).
rt.live - site to visualize the effective reproduction number (Rt) in the US. Data sourced from covidtracking.com
Computing Resources
COVID-19 High-Performance Computing Consortium - industry, government, and academic consortium to provide funding and resources for COVID-19 response
Open-Access Data and Computational Resources to Address COVID-19 - NIH Office of Data Science Strategy seeks to provide the research community with links to open-access data, computational, and supporting resources (includes some computing resources)
Research Tracking
COVID-19 Social Science Research Project Tracker
Coronavirus Clinical Trials Explorer - mapped and tabular data from clinicaltrials.gov
Causaly Coronavirus Treatment Database - Causaly, an AI/ML literature-based knowledge discovery tool, is offering free access to its identification of 42 possible treatment options for COVID-19.
Support Networks
Open Source Software helpdesk for COVID-19 - community of open source software experts who can assist with software issues for COVID-19 research
Data Against Covid - Resource page for COVID-19 data science assistance and team creation
US Digital Response - pairing government entities with data scientists and teams
Call to Action coordinated by the Royal Society - Rapid Assistance In Modelling The Pandemic: Ramp
Science Responds - resource to match “big science” researchers with COVID-19 researchers
Cornoavirus Tech Handbook - provides a library for technologists, civic organisations, public and private institutions, researchers, educators and specialists of all kinds to collaborate on an agile and sophisticated response to the Coronavirus and sequential impacts.
Collaboratorium for Social Media and Online Behavioral Studies (University of Arkansas, Little Rock) - COVID-19 Misinformation Database - a database and tools for reporting misinformation about COVID-19 and discovering known misinformation
Technology SWAT Teams to Support New York COVID-19 Response - state level effort to match-make technologists with COVID-19 response needs
RDA-COVID19 - Research Data Alliance effort to build guidance for data sharing and data release in health emergencies
Other Collections of Resources
Columbia University Libraries Health Data Guide (including data sets)
HealthIL CouterCorona - HealthIL site that includes challenges and other opportunities in the COVID-19 space
US Digital Response - a collection of tools and services for responding to COVID-19. Includes a match-making functionality for people looking for assistance on COVID-19 related research
COVID Symptom Tracker - Help slow the spread of COVID-19
Mathematical Resources to Help Understand COVID-19 from the Society for Industrial and Applied Mathematics
SAGE Ocean - Coronavirus Resources
#DATA4COVID - NYU GovLab resources for COVID-19 research