Happiness and Causal Inference

Howard Wainer

doi:10.1017/CBO9781316424315.006

Introduction

My old, and very dear, friend Henry Braun describes a data scientist as someone who's pretty good with numbers but hasn't got the personality to be an accountant. I like the ambiguity of the description, vaguely reminiscent of a sign next to a new housing development near me, “Never so much for so little.” But although ambiguity has an honored place in humor, it is less suitable within science. I believe that although some ambiguity is irreducible, some could be avoided if we could just teach others to think more like data scientists. Let me provide one illustration.

Issues of causality have haunted human thinkers for centuries, with the modern view usually ascribed to the Scot David Hume. Statisticians Ronald Fisher and Jerzy Neyman began to offer new insights into the topic in the 1920s, but the last forty years, beginning with Don Rubin's 1974 paper in an unlikely venue, have witnessed an explosion in clarity and explicitness about the connections between science and causal inference. A signal event in statisticians’ modern exploration of this ancient topic was Paul Holland's comprehensive 1986 paper “Statistics and Causal Inference,” which laid out the foundations of what he referred to as “Rubin's Model for Causal Inference.”

Causa latet: vis est notissima
[The cause is hidden, but its force is very well known]

Ovid, Metamorphoses, IV c. 5

A key idea in Rubin's model is that finding the cause of an effect is a task of insuperable difficulty, and so science can make itself most valuable by measuring the effects of causes. What is the effect of a cause? It is the difference between what happens when some unit is exposed to some treatment and what would have happened had it not been exposed. This latter condition is a counterfactual and hence impossible to observe. Stated in a more general way, the causal effect is the difference between the actual outcome and some unobserved potential outcome.
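The definition above can be written compactly. The notation here is an assumption borrowed from the standard potential-outcomes convention and does not appear in the text: for unit i, Y_i(1) is the outcome under treatment, Y_i(0) the outcome without it, and T_i records which of the two conditions the unit actually experienced.

```latex
% Assumed notation (not from the text):
%   Y_i(1) = outcome of unit i if exposed to the treatment
%   Y_i(0) = outcome of unit i if not exposed
%   T_i    = 1 if unit i was exposed, 0 otherwise
\begin{align}
  \tau_i &= Y_i(1) - Y_i(0)
    && \text{causal effect for unit } i \\
  Y_i^{\mathrm{obs}} &= T_i \, Y_i(1) + (1 - T_i) \, Y_i(0)
    && \text{the only outcome we ever see}
\end{align}
```

Because only one of Y_i(1) and Y_i(0) is ever observed for a given unit, the other is the counterfactual, and the unit-level effect cannot be computed from data alone.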

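Counterfactuals can never be observed; hence, for an individual, we can never calculate the size of a causal effect directly. What we can do is calculate the average causal effect for a group. This can credibly be done through randomization: when chance alone decides which units receive the treatment, the treated and untreated groups are comparable on average, so the difference between their mean outcomes estimates the average causal effect.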
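A small simulation may make the role of randomization concrete. The sketch below is illustrative only: the outcome distributions, the effect size of about 2, and the function name `simulate_randomized_experiment` are all invented for the example and come from nowhere in the text. A simulation can generate both potential outcomes for every unit, which real data never allow, and then check that the difference in group means recovers the true average effect.

```python
import random
import statistics


def simulate_randomized_experiment(n_units=10_000, seed=0):
    """Compare the true average causal effect with a randomized estimate.

    All numbers are hypothetical and chosen only for illustration.
    """
    rng = random.Random(seed)

    # Both potential outcomes for every unit (knowable only in a simulation).
    y_untreated = [rng.gauss(5.0, 1.0) for _ in range(n_units)]
    unit_effects = [rng.gauss(2.0, 0.5) for _ in range(n_units)]
    y_treated = [y0 + tau for y0, tau in zip(y_untreated, unit_effects)]

    # True average causal effect across all units.
    true_ate = statistics.fmean(unit_effects)

    # Randomization: chance alone picks which half of the units is treated.
    treated_ids = set(rng.sample(range(n_units), n_units // 2))

    # Each unit reveals only one potential outcome; the other stays hidden.
    seen_treated = [y_treated[i] for i in range(n_units) if i in treated_ids]
    seen_control = [y_untreated[i] for i in range(n_units) if i not in treated_ids]

    # Difference in observed group means estimates the average effect.
    estimated_ate = statistics.fmean(seen_treated) - statistics.fmean(seen_control)
    return true_ate, estimated_ate


true_ate, estimated_ate = simulate_randomized_experiment()
print(f"true average effect:      {true_ate:.3f}")
print(f"randomized estimate:      {estimated_ate:.3f}")
```

The estimate uses only the outcomes that would actually be observed, yet it lands close to the true average because random assignment leaves the two groups alike in everything except the treatment.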

3 - Happiness and Causal Inference
