
Confounding correlations: Enlightenment and noise in the age of so much data

By Nick Stepro, Chief Product and Technology Officer at Arcadia
Healthcare Analytics, Medical Cost Containment

Data, data, everywhere — but how to make sense of it all? In a playful new interactive data visualization, we look at the challenges of finding meaning among the correlations.

Confounding correlations

As data expands and the tools to process and visualize it grow more powerful, it becomes harder to filter out the noise and find meaning in a sea of numbers. With this piece, Arcadia vice president of product development Nick Stepro has fun with this concept by running over 1,000 combinations of metrics through a correlation matrix. Some of the links he visualizes are obvious; others are completely unexpected.
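To make the idea of a correlation matrix concrete, here is a minimal sketch of correlating every pair of metrics and ranking the pairs by strength. It assumes the Pearson coefficient, which the piece does not specify, and the metric names and values are hypothetical toy numbers, not the CMS or Census data behind the visualization. The helper names `pearson` and `correlation_pairs` are also illustrative.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences.

    Assumes neither sequence is constant (a zero variance would divide by zero).
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def correlation_pairs(metrics):
    """Correlate every unordered pair of metrics, strongest first."""
    pairs = [
        (a, b, pearson(metrics[a], metrics[b]))
        for a, b in combinations(metrics, 2)
    ]
    # Rank by absolute value so strong negative links surface too.
    return sorted(pairs, key=lambda p: abs(p[2]), reverse=True)


# Hypothetical toy values, one entry per county; not real CMS or Census data.
toy_metrics = {
    "risk_score": [0.9, 1.0, 1.1, 1.2, 1.3],
    "cost_per_capita": [8000, 8500, 9100, 9800, 10400],
    "inpatient_stays_per_1000": [240, 255, 270, 300, 310],
    "population_density": [1200, 90, 450, 30, 800],
}

for a, b, r in correlation_pairs(toy_metrics):
    print(f"{a} vs {b}: r = {r:+.2f}")
```

The number of pairs grows quickly: n metrics yield n(n-1)/2 combinations, so, purely as an illustration, 46 metrics would already produce 1,035 pairs. With that many comparisons, some strong-looking correlations are likely to appear by chance alone.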

Among the obvious findings: risk scores, healthcare expenditures, and inpatient utilization are all positively correlated, with costs and inpatient utilization the most strongly correlated of the three. Less obvious: counties with a higher rate of women physicians are more densely populated and have lower per-capita Medicare spending and fewer knee replacements per 1,000. Nurse practitioners are more common in states with less college attendance, and are also associated with higher rates of colonoscopies.


D3.js SVG, with Illustrator
Data from CMS Public Use File, US Census American Community Survey, and CMS NPPES NPI Registry