Synthetic Control

Advantages of synthetic control

The weights are transparent. In synthetic control, the contribution of each untreated group is clearly shown, while in regression, the contribution is implicit unless you do a Bacon decomposition.

The choice of synthetic control requires no post-treatment data.

The following explanation is adpted from (Abadie et al., 2010).

YitY_{it} is the outcome where i=1,,J+1i=1,\dots,J+1 and t=1,,Tt=1,\dots,T. YitNY_{it}^N and YitIY_{it}^I are counterfactual outcome for the untreated and treated. The treated group is denoted by i=1i=1 and the control groups (the “donor pool”) are denoted by i=2,J+1i=2,\dots J+1. The treatment is applied on T0T_{0} where 1T0<T1\leq T_{0}<T.

WW is a (J×1)(J\times 1) vector (w2,w3,,wJ+1)(w_{2},w_{3},\dots,w_{J+1})’ with nonnegative elements. The synthetic control at time tt is produced by j=2J+1wjYjtN\sum_{j=2}^{J+1} w_{j}Y_{jt}^N. The treatment effect at tt is Y1tIj=2J+1wjYjtNY_{1t}^I - \sum_{j=2}^{J+1} w_{j}Y_{jt}^N.

As you can see, the key procedure is to estimate WW. Consider a (k×1)(k\times 1) vector X1=(Z1,Y1(1),Y1(2),,Y1(m))X_{1}=(Z_{1}, Y_{1}^{(1)}, Y_{1}^{(2)},\dots,Y_{1}^{(m)}). Z1Z_{1} is a vector of covariates for the treated. Typically, they’re predictors of Y1Y_{1}. Y1(m)Y_{1}^{(m)} is a combination of pre-treatment Y1Y_{1}. The most obvious choice is Y1(1)=Y11,,Y1(m)=Y1T0Y_{1}^{(1)}=Y_{11}, \dots, Y_{1}^{(m)}=Y_{1T_{0}}, i.e., the Y1tY_{1t} in every pre-treatment year. Conceptually, X1X_{1} captures the characteristics of the treated. Note that there’s no tt subscript in X1X_{1}, indicating it’s an average over the pre-treatment periods. For example, if Z1Z_{1} could be the average per-capita GDP during the pre-treatment years.

Now consider another (k×J)(k\times J) matrix X0X_{0}. X0X_{0} is similar to X1X_{1} but it captures the characteristics of all units in the donor pool. Our weight WW is given by minimizing:

minW(X0X1W)V(X0X1W)(1) \min_{W} (X_{0}-X_{1}W)'V(X_{0}-X_{1}W) \hspace{3em}(1)

i.e., minimizing the characteristics between the treated and the synthetic control. VV is typically a semi-positive diagonal matrix whose elements determines the importance of each covariate in XX. VV is typically given by:

V=argmin(Z1Z0W(V))(Z1Z0W(V))(2)V^*=\arg\min (Z_{1}-Z_{0}W^*(V))'(Z_{1}-Z_{0}W^*(V)) \hspace{3em}(2)

where ZZ is the pre-treatment trajectory of YY, i.e., Z1=(Y11,,Y1T0)Z_{1}=(Y_{11},\dots,Y_{1T_{0}}). So basically Eq(2) says VV is determined by minimizing the prediction error of the outcome.

Now you know how to estimate WW and VV, the rest is straightforward.

The relationship between WW and VV
WW is determined by VV. Once VV is given, we can compute WW with Eq(1).
Only pre-treatment data is needed!

As can be seen, X0X_{0}, X1X_{1}, Z1Z_{1} and Z0Z_{0} are all from pre-treatment periods.

That’s one advantage of synthetic control: It allows researchers to decide on study design without knowing how those decisions will affect the conclusions of their studies.

One can fixed the treatment dates but randomly shuffle the treatment states. In the California Proposition 99 case (Abadie et al., 2010), the treatment date is 1988, the treated state is California, and there’s a donor pool. To do a placebo falsification, the authors:

“In each iteration we reassign in our data the tobacco control intervention to one of the 38 control states, shifting California to the donor pool. That is, we proceed as if one of the states in the donor pool would have passed a large-scale tobacco control program in 1988, instead of California.”

Then, the author re-estimate the model and plot the real and synthetic cigarette consumption before and after the treatment data:

As can be seen from the figure, California is in the “extremely negative side” of the scale.

For a numerical test of the results conveyed in the above figure, one can:

  1. Calculate the pre-treatment prediction error (i.e., t=0T0(Y1tj=2J+1wjYjt)2\sum_{t=0}^{T_{0}} (Y_{1t}-\sum_{j=2}^{J+1} w_{j}Y_{jt})^2, measured in MSE)
  2. Similarly, calculate the post-treatment prediction error.
  3. Calculate the ratio of post- to pre-treatment MSE
  4. Now you have a distribution of ratios, you can compute p-value of the treated state.

When we do a placebo date, we keep the treated and untreated states fixed, but each time choose a different treatment date. Usually, the placebo dates are dates before the treatment date (i.e., we rewind the time). An example is Abadie, Diamond, and Hainmueller (2015).

Nickname
Email
Website
0/500
  • OωO
  • |´・ω・)ノ
  • ヾ(≧∇≦*)ゝ
  • (☆ω☆)
  • (╯‵□′)╯︵┴─┴
  •  ̄﹃ ̄
  • (/ω\)
  • ∠( ᐛ 」∠)_
  • (๑•̀ㅁ•́ฅ)
  • →_→
  • ୧(๑•̀⌄•́๑)૭
  • ٩(ˊᗜˋ*)و
  • (ノ°ο°)ノ
  • (´இ皿இ`)
  • ⌇●﹏●⌇
  • (ฅ´ω`ฅ)
  • (╯°A°)╯︵○○○
  • φ( ̄∇ ̄o)
  • ヾ(´・ ・`。)ノ"
  • ( ง ᵒ̌皿ᵒ̌)ง⁼³₌₃
  • (ó﹏ò。)
  • Σ(っ °Д °;)っ
  • ( ,,´・ω・)ノ"(´っω・`。)
  • ╮(╯▽╰)╭
  • o(*////▽////*)q
  • >﹏<
  • ( ๑´•ω•) "(ㆆᴗㆆ)
  • 😂
  • 😀
  • 😅
  • 😊
  • 🙂
  • 🙃
  • 😌
  • 😍
  • 😘
  • 😜
  • 😝
  • 😏
  • 😒
  • 🙄
  • 😳
  • 😡
  • 😔
  • 😫
  • 😱
  • 😭
  • 💩
  • 👻
  • 🙌
  • 🖕
  • 👍
  • 👫
  • 👬
  • 👭
  • 🌚
  • 🌝
  • 🙈
  • 💊
  • 😶
  • 🙏
  • 🍦
  • 🍉
  • 😣
  • 颜文字
  • Emoji
  • Bilibili
0 comments
No comment