Causality and Causal Inference
Understanding causation has long been one of the central questions in economics. But it is important to understand that causality and causal inference are two different things. Causality defines the relationship between a cause and an effect. Simply, Causality is understanding if A causes B and how much does A cause B. It is a very simple question but one very difficult to answer. The concept of causality is central concept in science, philosophy, economics, and also everyday reasoning.
Causal inference is, however, about beliefs. When can we say that A caused B, when can we say that A did not cause B. The type of causality we are interested in is A causes B if we change the value of A, then B also changes without changing anything else (Ceteris Paribus)
Much of the earlier development looked into causal estimates from a modelling perspective. The biggest jump in this understanding was when Gauss derived the OLS estimator in 1809. The evolution is well explained in the book Causal Inference: The Mixtape by Scott Cunningham. For me, understanding the history behind my work is essential, so I will give you how we reached the potential outcomes framework and then see what it means and its impact.
Emergence of Potential Outcome
The first notion of potential outcomes can be traced back to Neyman in 1923. I will simplify how the notion of potential outcome came from this. In his thought experiment, he divides a farm into ‘m’ number of plots. And with him he has ‘v’ number of different fertilizers. He wants to know what happens when each fertilizer is applied to every plot. But the problem here is that, in reality you can only apply one fertilizer to each plot. For simplicity, lets say we have plot A, B and C and two fertilizer 1 and 2. In his framework he considers the following:
The yield of Plot A if given Fertilizer 1
The yield of Plot A if given Fertilizer 2
The yield of Plot B if given Fertilizer 1
The yield of Plot B if given Fertilizer 2
The yield of Plot C if given Fertilizer 1
The yield of Plot C if given Fertilizer 2
Given the situation, these hypothetical yields for each plot under each fertilizer are a ‘potential outcome’. In this approach, the effect of Fertilizer 1 vs Fertilizer 2 on Plot A would be the difference between it’s yield in these two state. Since we can only observe one of these situations at a time, only one of the potential outcomes can be observed. This is also known as the ‘fundamental problem of causal inference’. Neyman then solves this issue with randomization, which will be our topic in the future. But the takeaway is that his contribution to potential outcomes helps lay the groundwork for estimating causal effects. And thanks to this paper by Donald Rubin, it is now widely used in expressing the causal statements.
Potential Outcome Framework
In the potential outcome framework, the causal effect is the comparison between two different states of the world. In the previous example, the difference in effect of fertilizers 1 and 2 on plot A was the difference between these two states of the world. Let’s understand this better with the help of notations we use in the potential outcomes framework.
Let’s consider a new fertilizer on the market. We want to know its effect on yield. In this case, let's assume there was no fertilizer use before this. We are essentially comparing fertilizer to non-fertilizer yield.
Let’s define a treatment variable, Di, which is a binary variable that takes value 1 if plot i receives fertilizer and 0 if it does not receive fertilizer. Potential outcomes (in this case yeilds) are then defined as Yi1 if uni i received the treatment (fertilizer) and as Yi0 if the unit did not receive treatment. One important thing that you can notice here is that both the outcomes have subscript i, meaning that it indicates two different states for same individual i. They are sometimes represented like Yi(0) and Yi(1). All of this can be represented as follows:
The true causal effect of fertilizer on plot i is therefore given by:
Here these outcomes are hypothetical states of the world, but in reality as I said we can only observe only one state. So, it is impossible to estimate the true causal effect using the equation above. We can identify the observed outcome based on this model by this simple equation.
This equation gives us a realized outcome for each of the plots i. It is impossible to observe both Yi1 and Yi0 for the same plot. This equation gives us a situation like the following, which I like to view as this makes the impossibility of observing both states clearer.
So the bottom line is that, since we cannot observe two states at once, we have the problem of missing data. And we can also call causal inference simply a missing data problem.
Average Treatment Effects
Last thing I want to leave with you today is that, in general we are not concerned with individual effects, rather we are concerned about the average treatment effect. In practicality, policies are aimed at groups of people, or groups of areas. In this case groups of fields. So, the average effect of fertilizer is called the average treatment effect. It can be expressed as following:
And I suggest you to get used to such notation. E here means expectations, and in simple words, it is a mean. Here, the average treatment effect is the average effect of fertilizer on treated group minus the average effect of fertilizer on non-treated groups. Think about what the issue here it. How can there be an effect of fertilizer on non-treated groups? Feel free to answer in the comments or send me an email.
See you in next post!