
Introduction

This is an attempt to use motion charts to visualize the daily growth, since January 2020, in US Covid-19 confirmed cases and deaths by state and by California county. Bubble size shows population, and bubble color shows median age, education, or income.

The visualization tool is https://www.charte.ca/, a motion chart tool. We use it to visualize the growth over time of Covid-19 cases, using bubble size for population or population density and color for education, age, political leaning, income, etc.

Try out the motion charts in the Results section below. A slider at the bottom of each chart moves backward and forward in time, and hovering the mouse over a bubble shows more details.

Method

The Covid-19 statistics are from the Johns Hopkins University (JHU). There are raw data for confirmed cases and deaths by date for each county in each US state. For the US state analysis, the data from the counties are aggregated into the values for each state.
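
The county-to-state aggregation described above can be sketched as follows. This is a minimal Python illustration, not the covid-us.pl script itself. The row layout (state, county, date, cumulative count), the function name aggregate_by_state, and the sample numbers are all assumptions made up for the example.

```python
from collections import defaultdict


def aggregate_by_state(county_rows):
    """Sum county-level cumulative counts into state totals per date.

    county_rows is an iterable of (state, county, date, count) tuples;
    this layout is an assumption, not the actual JHU file format.
    """
    totals = defaultdict(lambda: defaultdict(int))
    for state, _county, date, count in county_rows:
        totals[state][date] += count
    # Convert the nested defaultdicts to plain dicts for easier inspection.
    return {state: dict(by_date) for state, by_date in totals.items()}


# Made-up sample rows, for illustration only.
sample_rows = [
    ("CA", "County A", "2020-04-01", 10),
    ("CA", "County B", "2020-04-01", 5),
    ("CA", "County A", "2020-04-02", 12),
    ("WA", "County C", "2020-04-01", 7),
]

state_totals = aggregate_by_state(sample_rows)
print(state_totals["CA"]["2020-04-01"])  # 15
```

The same function would apply to either the confirmed or the deaths series, since both are described as per-county counts by date.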

For each US state or California (CA) county, we extracted various demographics: a label (the ISO two-character code for US states, an abbreviation for CA counties), population, area, population density, education, income, median age, and political leaning. The demographics were obtained from the following sources for the US.

For CA they were obtained from here:


A Perl script, covid-us.pl, was developed to gather the above information and cast it in a form suitable for the www.charte.ca motion charts and for the correlation data. See here for an example of the output of covid-us.pl for California counties or here for the US. The script also ranks the age, income, and education demographics for each state or county as low, medium, or high based on their tertiles, so that these demographics can be used with charte.ca's grouping feature.
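
The tertile ranking can be illustrated with a short Python sketch. It is not the covid-us.pl code: the function name tertile_labels, the rank-based split, and the sample values are assumptions, and the real script may place boundaries or handle ties differently.

```python
def tertile_labels(values):
    """Label each key 'low', 'medium' or 'high' by rank tertile.

    values maps a state or county label to a numeric demographic.
    Entries are sorted by value and split into three rank-based groups;
    ties are broken by sort order (an assumption of this sketch).
    """
    ordered = sorted(values, key=values.get)
    n = len(ordered)
    labels = {}
    for rank, name in enumerate(ordered):
        if rank * 3 < n:
            labels[name] = "low"
        elif rank * 3 < 2 * n:
            labels[name] = "medium"
        else:
            labels[name] = "high"
    return labels


# Made-up median incomes for six hypothetical regions.
sample_income = {"R1": 41000, "R2": 52000, "R3": 60000,
                 "R4": 67000, "R5": 75000, "R6": 90000}

print(tertile_labels(sample_income))
```

With six entries, the two lowest values are labeled low, the middle two medium, and the top two high.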

Results

US

Motion chart of Covid-19 deaths vs Confirmed cases with bubbles sized by population for US states and colored by political leaning

Motion chart of Covid-19 deaths vs Confirmed cases with bubbles sized by population for US states and colored by % Bachelor degrees

Motion chart of Covid-19 deaths vs Confirmed cases with bubbles sized by population for US states and colored by median income

CA

Motion chart of Covid-19 deaths vs Confirmed cases with bubbles sized by population density for CA Counties and colored by per capita income

Notes

Demographic correlations for CA

We investigated the correlations between the demographics and the confirmed Covid-19 cases for CA. We used the R squared coefficient of determination to characterize the degree of correlation between the various demographics using a linear fit.

Demographic | Income | Education | Cases | Politics | Age | Population density | Population
Per capita income | * | 0.81 | 0.0032 | 0.28 | 0.008 | 0.12 | 0.19
% of people completing college education for CA | 0.81 | * | 0.016 | 0.43 | 0.009 | 0.20 | 0.043
Covid-19 confirmed cases | 0.0032 | 0.016 | * | 0.34 | 0.012 | 0.14 | 0.77
% of registered voters who are registered as Democratic | 0.28 | 0.43 | 0.034 | * | 0.089 | 0.15 | 0.081
Median age | 0.008 | 0.008 | 0.012 | 0.0089 | * | 0.0029 | 0.084
Population density (people/sq mile) | 0.12 | 0.20 | 0.14 | 0.25 | 0.0029 | * |
Population | 0.19 | 0.043 | 0.77 | 0.081 | 0.084 | | *

There is a strong correlation between per capita income and education (R squared of 0.81) and between confirmed cases and population (0.77), and a medium correlation between registered-voter political leaning and education (0.43). The Excel spreadsheet of the analysis of the above demographics and their correlations can be found here.
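
For readers who want to reproduce this kind of figure, the R squared of a straight-line least-squares fit can be computed as in the Python sketch below. The function name r_squared and the sample data are made up for illustration; the actual analysis was done in the spreadsheet. For a simple linear fit, R squared equals the square of the Pearson correlation coefficient, so the value is the same whichever of the two demographics is taken as x.

```python
def r_squared(xs, ys):
    """Coefficient of determination for a least-squares line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Residual and total sums of squares.
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


# Made-up data, for illustration only.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]
print(round(r_squared(xs, ys), 2))  # 0.64
```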

Ungrouped

Deaths vs Confirmed:

  • We tend to use a log-log chart because it makes a wide range of data visible (compare the two charts below) and because both confirmed cases and deaths grow roughly exponentially for most states. Also note that when a count is plotted on a logarithmic scale against time, a straight line means exponential growth, and the steeper the line, the faster the total number of confirmed coronavirus cases or deaths is doubling.

    Linear plot | Log-log plot

  • The first confirmed cases were reported by WA on 1/22/20, IL a day later, and AZ and CA on 1/25/20.
  • The first deaths reported for Washington State were at the start of March.
  • Deaths start to increase markedly in the second week of March.
  • By the start of April, NY followed by NJ were leading in both deaths and confirmed cases.
  • By the end of the second week in April, SD and UT are noticeably below the general line followed by the other states.
  • On 3/30/20 WV was the last state to record a Covid-19 death.
  • At the end of the second week in April, WY appears to be the last state to have more than one Covid-19 death.
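
The link between a straight line on a logarithmic scale and the doubling rate can be made concrete with a small Python sketch. It assumes a series of daily cumulative counts, fits a line to log2(count) against the day index, and reports the reciprocal of the slope as the doubling time. The function name doubling_time_days and the synthetic series are assumptions for illustration, not part of the charts or of covid-us.pl.

```python
import math


def doubling_time_days(cumulative_counts):
    """Estimate the doubling time in days from daily cumulative counts.

    Fits a least-squares line to log2(count) against the day index.
    The slope is the number of doublings per day, so its reciprocal
    is the doubling time. All counts must be positive and growing.
    """
    days = range(len(cumulative_counts))
    logs = [math.log2(c) for c in cumulative_counts]
    n = len(logs)
    mean_day = sum(days) / n
    mean_log = sum(logs) / n
    sxx = sum((d - mean_day) ** 2 for d in days)
    sxy = sum((d - mean_day) * (v - mean_log) for d, v in zip(days, logs))
    slope = sxy / sxx
    return 1.0 / slope


# Synthetic series that doubles every 3 days (not real data).
synthetic = [100 * 2 ** (d / 3) for d in range(10)]
print(round(doubling_time_days(synthetic), 2))  # 3.0
```

A steeper line on the log scale gives a larger slope and therefore a shorter doubling time.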

Deaths/Million Population vs  Confirmed cases/Million Population

  • The leading states in terms of deaths per million population are: NY, NJ, CT, MA, LA, MI, DE, DC, RI.
    The leading states in terms of confirmed cases per million population are: NY, NJ, MA, DE, CT, RI, LA, DC, MI.

    If one does not normalize by population, NY and NJ stand out, followed by MA and MI at the top of the bunch.

    Looking at the trailing states on a log vs log plot, the lowest deaths are for AK, SD, HI, MT, WY, ND. The lowest confirmed cases are for AK, SD, HI, MT, WY, ND.


  • Note that since it is a log-log scale no bubble appears for a state until there is at least 1 confirmed case and 1 death for the state.
  • % Confirmed and deaths both low for AK, VT, NH, ID
  • Cluster of DE, DC and RI with low deaths compared to the % confirmed cases
  • NY, NJ, MA, DE, CT, LA, RI, DC have the highest % confirmed cases.
  • By March 14th, WA, NY, CA, FL were reporting deaths.
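
The per-million normalization and the log-log visibility rule noted above can be sketched in Python as follows. The function names, the (confirmed, deaths, population) tuple layout, and the region labels and numbers are hypothetical; they only illustrate the arithmetic, not charte.ca's internals.

```python
def per_million(count, population):
    """Scale a raw count to a rate per million people."""
    return count * 1_000_000 / population


def loglog_points(regions):
    """Return per-million points that can be drawn on log-log axes.

    regions maps a label to (confirmed, deaths, population).
    A region is skipped until it has at least 1 confirmed case and
    1 death, because the logarithm of zero is undefined.
    """
    points = {}
    for label, (confirmed, deaths, population) in regions.items():
        if confirmed < 1 or deaths < 1:
            continue  # no bubble on a log-log chart yet
        points[label] = (per_million(confirmed, population),
                         per_million(deaths, population))
    return points


# Made-up regions and numbers, for illustration only.
sample_regions = {
    "AA": (5000, 100, 2_000_000),
    "BB": (300, 0, 1_000_000),   # no deaths yet, so no bubble
}

print(loglog_points(sample_regions))  # {'AA': (2500.0, 50.0)}
```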

Grouped data

Coloring the bubbles by each state's political leaning gives the chart below. Covid-19 appears to be impacting Democratic-leaning states the hardest, followed by the swing states.

We can also group the data by age, income, or education tertiles.

Grouped by Income | Grouped by Age | Grouped by Education (Bachelor degree or equivalent)


