COVID-19 Metrics

Designed by Sumit Khanna. This is open source, but the code is crap and I wouldn't recommend looking at it.

This project came about by being frustrated with all the crappy visualizations out there. The daily updating code on the server was failing too, but I never bothered to fix it and just ran it locally for myself. I graph the fatality data below because it seems more relevant than case numbers that will always just go up with testing. There are much better visualizations out there today. My scripts are probably full of bugs.

I've written some blog posts you might be interested in.


Another measure that is incredibly useful, are the number of fatalities per day

Both of these graphs were made with the 2019-nCoV Johns Hopkins CSSE data sets.

One of the most amazing visualizations is this animation by Aatish Bhatia in collaboration with Minute Physics. I've already talked about my issues with statistics on reported cases, but in this graph, reported cases are placed on a logarithmic scale. A straight line represents exponential growth. The drop-offs can be an indicator of when the exponential growth has ended. The trouble with exponential growth is that you can't tell where the end is while you're within the growth itself. A full explanation can be found in this video.

Age Distribution

The following is an age distribution graph, created by heresacorrection on Reddit for those who died from COVID19 in Italy, Spain, South Korea and China.

Graph of Fatalities by Age Distribution for Spain, South Korea, Italy and China

We've heard reports of people who are healthy, in their 20s or 30s, non-smokers, suddenly needing ventilators due to COVID-19. It is very possible for young healthy people to have a critical reaction to this disease, but most of the numbers we're seeing suggest that it is a very small possibility.



It'd be nice if we could compare this data to the time series data for seasonal flu fatalities. When searching, I came up with information like the following from the CDC Website:

As it does for the numbers of flu cases, doctor’s visits and hospitalizations, CDC also estimates deaths in the United States using mathematical modeling. CDC estimates that from 2010-2011 to 2013-2014, influenza-associated deaths in the United States ranged from a low of 12,000 (during 2011-2012) to a high of 56,000 (during 2012-2013). Death certificate data and weekly influenza virus surveillance information was used to estimate how many flu-related deaths occurred among people whose underlying cause of death on their death certificate included respiratory or circulatory causes.

Here is another infographic grom the CDC page on estimates for the current flu season:

Infographic from CDC webiste on 2019-2020 Flu Season with estimated fatalities at 24,000 – 62,000

People keep comparing SARS-CoV-2 to the Flu, but there really is no comparison. The data for mortality and infection rates for seasonal Flu simply don't exist. Except for Flu mortality in infants, the CDC doesn't track this specifically and uses models to estimate the numbers after the Flu season ends.

Other Deaths

The following is a chart pulled from the CDC report on the leading causes for deaths in 2017 (page 9):

Chart from CDC Report on the Leading Causes of Death in 2017

This report places Flu deaths in the range of around 55,000 in a year for the US. Accidents fall into the 160,000 to 170,000 range for the two years covered. Both cancer and heart disease lead the chart, each killing more than half a million yearly.

More Bad Data

Corriere Della Sera reports that many deaths in Lombardy, Italy may not be getting correctly reported as COVID-19. The following graph show the average death rate, compared to the current death rate and those fatalities reported as COVID-19.

Data plot of a Lombardy region town average deaths for 2015-2019 contrasted to 2020 covid-19 deaths
Data plot of a Lombardy region town:
Blue line average deaths years 2015-2019
Green line official covid-19 deaths
Red line overall 2020 deaths