Data Science

Data 4 COVID19

This document serves as a repository for data collaboratives seeking to address the spread of COVID-19 and its secondary effects. It invites individuals to share projects that show a commitment to privacy protection, data responsibility, and overall user well-being. A project’s inclusion in this living repository does not indicate endorsement by The GovLab or confirmation of its success in meeting these goals. It is updated regularly with new projects and includes initiatives from around the world organized by area. It is curated by GovLab.

Resources

Looking for

Any skill set can be useful, from project management to design to data science.

How to get involved

  • Find a project that interests you and check out their webpage or contribute to the repository.

Project Open Air (Just One Giant Lab)

Project Open Air/Helpful Engineers, hosted within the broader open science group Just One Giant Lab has created a 5,000 strong volunteer organization, with engineers, scientists and doctors around the world who work to organize themselves into teams to design innovative ways to fight the virus. They are quickly trying to design and arrange the production of pragmatic devices to move life-saving air, extend existing tools, or organize volunteers on the ground to bolster medical workers. They have both hardware and software based projects including designing open source, scalable easy to DIY ventilators and O2 concentrators, sentiment analysis for persuasive public health media, diagnostic apps, and more!

Resources

Looking for

MANY different skills sets: app developers, ML researchers, data scientists, data visualization folks, makers, fabricators, designers, organizers, and especially folks who can volunteer time for project management.

How to get involved

  • See this doc and fill out the intake form
  • Join the Slack and introduce yourself in #introductions
  • Email opensourcehumankind@gmail.com
  • If you are a medical professional, additionally fill out this form

Johns Hopkins Mapping nCOV Project

This team has developed an interactive web-based dashboard hosted by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University, to visualize and track reported cases in real-time. The dashboard illustrates the location and number of confirmed COVID-19 cases, deaths and recoveries for all affected countries. It was developed to provide researchers, public health authorities and the general public with a user-friendly tool to track the outbreak as it unfolds. Additionally, the data (which is verified and updated daily) is available on GitHub. They are interested in conducting additional modeling of the emerging outbreak.

Resources

Looking for

It doesn’t explicitly say on the website, but presumably ML researchers, model-builders, data visualization folks, data scientists.

How to get involved

  • Reach out to the group at COVID19map@jhu.edu or jhusystems@gmail.com
  • Grab the data and build your own project

U.S. Digital Response for COVID-19

The U.S. Digital Response Team was started by three former U.S. Deputy CTOs. This project seeks to connect veted technology, data, and healthcare experts to help meet the data and digital capacity needs of the US Government.

Resources

Looking for

Anyone with relevant experience in healthcare, data, engineering & product development, general management, operations, supply chain/procurement, and more.

How to get involved

Fill out the volunteer application form

Folding at Home

Folding at Home uses computer simulations to understand the proteins that viruses use to reproduce and suppress immune systems. This is done by taking the experimental structures as starting points, we can simulate how all the atoms in the protein move. Doing so can help health researchers find and understand new therapeutic opportunities. The calculations involved in these simulations are enormous, and so volunteers can use their existing compute resources to run folding simulations in the background and contribute to the larger F@H database.

Resources

Looking for

Anyone with a computer! Particularly in the areas of

  • Strategic Product Design
  • Gamification
  • Website Enhancement
  • Client Programming (C++, Web dev, JS, vue.js, JSON API)
  • Project Viewer
  • Stats
  • Protein Viewer
  • Github Issue Triage

How to get involved

Follow the instructions on the COVID-19 page Sign up for the dev team at:

COVID Compare

This project is just kicking off and is working on tracking/mapping the relative risk of Covid19 in the US. 50 cases in Kentucky is not the same as 50 cases in NYC. People aren’t good at normalizing out other variables like population/hospital access so I want to do that for them visually.

Resources

Looking for

Data scientists, model builders, public health folks to help build a quantifier for risk and a predictive tool, web app developers and data visualizers, data collectors.

How to get involved

Reach out to collaborators on GitHub

Help with COVID-19

A website to connect volunteers with open source projects. Users can sort projects by skill type or project focus. Currently lists over 500 projects ranging from telemedicine to community support apps.

Resources

Looking for

Any skill set can be useful, from project management to design to data science

How to get involved

  • Hop on the webstie and find a project that interests you (or add your own)!

COVID-19 Community

COVID-19 Community is a community effort to build a Neo4j Knowledge Graph (KG) that links heterogenous data about COVID-19 to help fight this outbreak! It serves as a sandbox and incubator project and the best ideas will be incorporated into the Covid-19-Net KG.

Resources

Looking for

Data scientists and machine learning developers, particularly datavisualization pros!

How to get involved

  • Run the current project instance
  • Suggest additional data sets or propose queries
  • Add example Jupyter notebooks or data visualizations

COVID-19 Risk

This project is focusing on research that could lead to an app that shows individuals their risk of COVID-19 infection based off GPS data, known cases, and Bluetooth proximity networks. They aim to do so in a way that preserves transparency and personal privacy. This is a group of volunteers researching and designing an open-source app to make this idea a reality. Ideally, this app will take the GPS and Bluetooth data of users in an anonymised way, as well as some information about their potential or confirmed infection status, and use it to the benefit of both them and everybody else. There are several ways this could happen: through secure and anonymous contact tracing, through the creation of a risk heatmap, and through more accurate epidemiological modelling.

Resources

Looking for

Public health or epidemiology researchers, app developers, web developers, legal experts around data security/privacy, software engineers to implement spread models, folks who have worked in fundraising/grant writing.

How to get involved

Email collaborations@covid19risk.com with the role you would like to fill, an introduction to yourself, and links to previous work.

CHIME: COVID-19 Hospital Impact Model for Epidemics

As we prepare for the additional demands that the COVID-19 outbreak will place on our hospital system, our operational leaders need up-to-date projections of what additional resources will be required. Informed estimates of how many patients will need hospitalization, ICU beds, and mechanical ventilation over the coming days and weeks will be crucial inputs to readiness responses and mitigation strategies.To this end, the Predictive Healthcare team at Penn Medicine has developed a tool that leverages SIR modeling to assist hospitals with capacity planning around COVID-19. CHIME allows hospitals to enter information about their population and modify assumptions around the spread and behavior of COVID-19. It then runs a standard SIR model to project the number of new hospital admissions each day, along with the daily hospital census. These projections can then be used to create best- and worst-case scenarios to assist with capacity planning. It is currently being used by a few hospitals and they’re looking to expand!

Resources

Looking for

Primarily Devops folks to help scale up the models/app and automate data collection and Project Managers to help develop documentation and ensure user accessibility. Possibly some modeling folks as well.

How to get involved

COVID-19 Vulnerability Index (CV19 Index)

The CV19 Index is an open source, AI-based predictive model that identifies people who are likely to have a heightened vulnerability to severe complications from COVID-19 (commonly referred to as “The Coronavirus”). The CV19 Index is intended to help hospitals, federal / state / local public health agencies and other healthcare organizations in their work to identify, plan for, respond to, and reduce the impact of COVID-19 in their communities. The CV19 Index is currently deployed in Chicago – Medical Home Network – The largest Medicaid ACO in the country and is undergoing testing in New York and Florida.

Resources

Looking for

You can run the predictive model yourself by following the instructions on the homepage. Unfortunately, the team is not allowed to publicly share the training data set but they are look for volunteers to help out in many areas such as data prep and transformation, developing documentation, and adding more platform support.

How to get involved

Email the team at covid19-info@closedloop.ai

Crowdfight COVID-19

This is an initiative from the scientific community to put all available resources at the service of the fight against COVID-19. It has separate portals for those already doing COVID-19 related research to request resources and for those not yet engaged to volunteer. As of March 21, 2020 this appears to be a (primarily) European initiative.

Resources

Looking for

Almost any skill set, from the ability to transcribe data from text into databases to highly expert. They will try to match people to appropriate projects.

How to get involved

Register on their volunteers page.

COVID Care Map

This project aims to understand, anticipate, and act to support and ramp up our health systems’ capacity (beds, staffing, ventilators, supplies) to effectively care for a rapidly growing number of active COVID19 patients in need of hospitalization and intensive (ICU) care. We do open-source data collection, geospatial analysis, visualizations, and scenario-planning tools aimed at informing resource planning and deployment decisionmaking to support our healthcare providers at local, county, & state scale when and where the needs are greatest. USA-centric at the moment but tools and knowhow are intended to be applicable globally.

Resources

Looking for

Data scientists to collect and analyze open data sets, data visualizers, web developers, and project managers.

How to get involved

  • See the ‘How to Help’ section on GitHub
  • Run the mapping yourself and submit comments