Faced with concerns of endogeneity, researchers frequently turn to instrumental variable (IV) models. Convenient instruments are often constructed from realizations of the endogenous variable in geographically or otherwise proximate units—thus creating spatial instruments, such as regional or global averages, by using other-unit values as instruments for own-unit endogenous predictors. Following the lead of prominent work in political science and economics, such instruments are increasingly widely used (see, e.g., Stasavage Reference Stasavage2005; Acemoglu *et al.*
Reference Acemoglu, Johnson, Robinson and Yared2008; Büthe and Milner Reference Büthe and Milner2008; Boix Reference Boix2011). In this research note, we highlight two problems with this strategy. Jointly, they cast substantial doubt on the validity of spatial instruments.

First, spatial instruments require supporting the presence of one type of spatial relationship while concurrently denying other spatial relationships in the form of spillovers in predictors and interdependence in the outcome.Footnote
^{1}
These alternative pathways are rarely discussed explicitly, yet are likely to exist in many applications. When present, they cause violations of the exclusion restriction.

Second, spatial instruments produce simultaneity in the first stage and therefore are not exogenous—put simply, spatial instruments imply a first stage where left-hand side outcomes are included as right-hand side predictors. Despite the wide use of spatial instruments, this has not been acknowledged as an inherent concern in IV estimation. And while the first problem is a matter of plausibility, theory, and evidence, the second problem is not: the violation of the exclusion restriction arises by construction and is endemic to spatial instruments.

## 1 Spatial Instruments

To clarify the problems with spatial instruments, we consider a simple linear-additive model

where $i=1,2,\ldots ,N$ indexes observations, $y_{i}$ is the outcome, $x_{i}$ the predictor, and $e_{i}$ the error term. The ordinary least squares (OLS) estimator of $\unicode[STIX]{x1D6FD}$ is the ratio of the sample covariance of $x$ and $y$ to the sample variance of $x$ , which yields the probability limit

Equation (2) shows that $\widehat{\unicode[STIX]{x1D6FD}}_{OLS}$ is asymptotically biased if $\text{cov}(x,e)\neq 0$ , that is, if $x$ is endogenous. This result is covered in any introductory econometrics textbook alongside common sources of endogeneity: omitted variable bias, simultaneity, and measurement error.

To address the resulting bias, IV methods are often employed. Following Sovey and Green (Reference Sovey and Green2011), we introduce IV estimation using the following system of equations:

where
$x$
is endogenous if
$\text{cov}(u,e)\neq 0$
.Footnote
^{2}
The variable
$z$
is a valid instrument for
$x$
if the instrument is relevant, such that
$\text{cov}(z,x)\neq 0$
, and if it satisfies the exclusion restriction, such that
$\text{cov}(z,e)=0$
. That is,
$z$
needs to be correlated with the endogenous variable
$x$
, but uncorrelated with the error term in equation (3). Figure 1 depicts the relationships assumed under IV estimation, with no pathways connecting
$z$
to
$y$
but through
$x$
.

Given the difficulty of finding valid instruments, researchers frequently turn to values of the endogenous variable from other units as instruments for own-unit endogenous values. More explicitly, spatial instruments are generated as a function of other-unit values, such that
$z_{i}=\unicode[STIX]{x1D6F4}_{j\neq i}w_{i,j}x_{j}$
, with
$w_{i,j}$
indicating the relationship between units
$i$
and
$j$
. In studies on the effect of democracy on growth, for example, neighboring democracy (Persson and Tabellini Reference Persson and Tabellini2009) or regional waves of democracy (Acemoglu *et al.*
Reference Acemoglu, Naidu, Restrepo and Robinsonforthcoming) have served as instruments. In both instances, a weighted sum of other-unit democracy scores is used to instrument for own-unit democracy. Spatial instruments are not unique to this literature and have become increasingly popular. Since 2004, over twenty articles have appeared in the APSR, AJPS, JOP, BJPS, and IO that rely on spatial instruments, and their use seems to have become more common over time.Footnote
^{3}

Spatial instruments are attractive for at least three reasons. First, they are convenient, because the data already exist in the sample. Researchers need only determine an appropriate weighting scheme to construct the instrument—which might be as simple as calculating a regional or global average. Second, the existing literature often provides reasons, such as diffusion, competitive or peer effects, spillovers, or learning, to expect the instrument to have power. Third, it is argued or assumed that such instruments satisfy the exclusion restriction.

In what follows we challenge whether this latter belief—that instruments based on values of the endogenous variable in other units satisfy the exclusion restriction—is plausible (Section 1.1) or, in fact, even possible (Section 1.2).Footnote
^{4}

### 1.1 Direct and indirect effects of other-unit predictors

To highlight the implicit assumptions made when using spatial instruments, we re-express Figure 1 in the left panel of Figure 2, with $x_{j}$ , other-unit values of $x_{i}$ , inserted for instrument $z_{i}$ . Analogously to unit $i$ , other units are related to their own outcomes $y_{j}$ . As usual with causal path diagrams, note the assumed and implied absence of causal relationships, two of which we have highlighted in the right panel. The right panel displays the same graph, but includes two additional plausible pathways between the instrument and the outcome: directly via spillovers ( $x_{j}\rightarrow y_{i}$ ) and indirectly via interdependent outcomes ( $x_{j}\rightarrow y_{j}\rightarrow y_{i}$ ). For instruments to be valid, both of these pathways must be ruled out explicitly. This is more difficult to do in the context of spatial instruments, where the argument implies that interdependence exists in one variable but not in others.

First, spillovers in $x$ produce an obvious violation of the exclusion restriction. If other-unit predictor values directly affect outcomes, they cannot be valid instruments. Direct spillovers are often expressed as $y_{i}=\unicode[STIX]{x1D6FD}x_{i}+\unicode[STIX]{x1D70C}\sum _{j\neq i}w_{i,j}x_{j}+e_{i}$ , which immediately shows that the exclusion restriction is violated, because the outcome $y_{i}$ is a direct function of $x_{j}$ . Spillovers are common and occur when actions, choices, or policies of one unit directly affect outcomes in other units (Manski Reference Manski1993). For example, unit $j$ ’s policies on industrial emissions affect pollution not just in $j$ but also in $i$ , as pollutants spread via air and water currents. Similarly, import tariffs in unit $j$ directly affect GDP growth in units $i$ which export that product. In short, spillovers exist whenever externalities are generated by a change in $x_{i}$ , and without recourse render spatial instruments invalid.

Second, the exclusion restriction is violated through interdependence in the outcome, such that the outcomes of some units affect the outcomes of other units. Theories of interdependence are ubiquitous in political science. The contagion of conflict and crises, the diffusion of policies, the spread of institutions and ideologies, deepening economic integration and resulting policy coordination all provide examples (Franzese and Hays Reference Franzese and Hays2007). Interdependence in outcomes entails a relationship between $x_{j}$ and $y_{i}$ through $y_{j}$ , and therefore a violation of the exclusion restriction. Interdependence is commonly interpreted as a spatial-autoregressive process, such that $y_{i}=\unicode[STIX]{x1D70C}\sum _{j\neq i}w_{i,j}y_{j}+e_{i}$ , which can be rewritten as $y_{i}=\unicode[STIX]{x1D70C}\sum _{j\neq i}w_{i,j}[\unicode[STIX]{x1D6FD}x_{j}+e_{j}]+\unicode[STIX]{x1D6FD}x_{i}+e_{i}$ , indicating how $x_{j}$ affects $y_{i}$ . The problem is readily visible in Figure 2. If interdependence exists in the outcomes, then $y_{i}$ is a function of outcomes in other units, $y_{j}$ , and therefore of $x_{j}$ .

As an illustration, consider the literature on economic development and democratization. In an effort to isolate the causal effect of income on democracy, Acemoglu *et al.* (Reference Acemoglu, Johnson, Robinson and Yared2008) use trade-weighted world income to instrument for income. Both spillovers and interdependence in outcomes plausibly occur in this example, rendering the instrument invalid. First, as argued in the literature on economic benchmarking, cross-national income comparisons condition citizen’s political actions (Kayser and Peress Reference Kayser and Peress2012), which creates spillovers: the benchmarking argument implies that income in other states directly influences political outcomes, such as demands for political liberalization, and therefore violates the exclusion restriction. Second, a vast literature examines the diffusion of democracies (Gleditsch and Ward Reference Gleditsch and Ward2006), which creates interdependence in the outcomes and a violation of the exclusion restriction.

When spatial instruments are used, these additional pathways have to be ruled out explicitly. Figure 2 visually illustrates the difficulty of arguing for the absence of a relationship between
$y_{j}$
and
$y_{i}$
as well as between
$x_{j}$
and
$y_{i}$
when an analogous, parallel relationship is argued to exist between
$x_{i}$
and
$x_{j}$
.Footnote
^{5}
Where spatial relationships exist in one variable, other variables are likely to exhibit spatial relationships as well.

The assumptions necessary for spatial instruments therefore produce a tension. Spatial dependence has to exist in the endogenous variable, while at the same time other spatial relationships must be ruled out. This is rarely done explicitly in applications, and we believe that for most cases it is difficult to convincingly make such an argument: Given that the use of spatial instruments depends on and argues for spatial relationships in the endogenous variable, it is difficult to also argue for the absence of other spatial relationships.Footnote
^{6}

### 1.2 Spatial simultaneity and instrumental variables

Even assuming the conditions elaborated in Section 1.1 are satisfied, a heretofore unrecognized problem with spatial instruments remains: simultaneity between the instrument and endogenous predictor. Intuitively, if units are related to each other, as implied by spatial instruments, unit $i$ is affected by other units, but other units are also affected by unit $i$ —and, in particular, are a function of the endogenous component of unit $i$ . Thus, other units cannot serve as valid instruments. A valid instrument needs to provide exogenous variation from outside the system of equations; a spatial instrument cannot satisfy this requirement because it is determined within that system.

The first stage of two-stage least squares (2SLS) is an OLS regression of the endogenous variable on the instrument (and all other exogenous predictors). With spatial instruments, this implies

with $w_{i,j}$ indicating the relationship between units $i$ and $j$ . Equation (5) can be expressed as

where
$\mathbf{W}$
is a
$N$
-by-
$N$
connectivity matrix (with zero elements on the diagonal) defining which and to what extent units are related to each other. Recognizing that the first stage takes the form in equations (5) and (6) makes the problem immediately obvious: because
$x$
appears on both the left-hand side and the right-hand side of the equation, the first stage suffers from simultaneity bias.Footnote
^{7}
In particular, this first stage *is* a spatial-autoregressive model, where it is well known that
$\mathbf{W}\mathbf{x}$
is not exogenous to, but simultaneously determined with
$\mathbf{x}$
(Franzese and Hays Reference Franzese and Hays2007). This (spatial) simultaneity *necessarily* produces a violation of the exclusion restriction: feedback from the endogenous predictor to the instrument makes the instrument a function of the source of endogeneity—that is, it makes the instrument itself endogenous.Footnote
^{8}

The problem arising from simultaneity can also be seen by recalling the standard IV model in equations (3) and (4) and supposing that additionally, $z_{i}=\unicode[STIX]{x1D6FC}x_{i}+v_{i}$ . Simultaneity between $x$ and $z$ arises if $\unicode[STIX]{x1D6FC}\neq 0$ . Substitution for $x_{i}$ produces $z_{i}=\unicode[STIX]{x1D6FC}(\unicode[STIX]{x1D6FE}z_{i}+u_{i})+v_{i}$ , showing that $z$ is related to $u$ and, in turn, $e$ . It follows that $\text{cov}(z,e)\neq 0$ , indicating the violation of the exclusion restriction. If $z$ is simultaneously determined with $x$ , it is necessarily related to the error term in the outcome equation and cannot serve as a valid instrument. As spatial instruments are by design simultaneously determined with, and therefore a function of, the endogenous variable, they violate the exclusion restriction.

Note what this implies. First, a spatial instrument is exogenous to the outcome equation if $x$ is exogenous—that is, when no instrument would have been required. Conversely, the endogeneity of the instrument increases in the endogeneity of the variable of interest, and hence is most severe where the instrument is needed most. Second, a spatial instrument is exogenous if there are no simultaneous relationships between units—that is, when the instrument is irrelevant. Thus, the strength of the instrument is, with spatial instruments, no longer independent from the exclusion restriction. When used as instrument for an endogenous variable, a spatial instrument cannot jointly satisfy both conditions required for a valid instrument: if the instrument is strong, it violates the exclusion restriction; if the instrument does not violate the exclusion restriction, it is irrelevant.

## 2 Discussion

We highlighted a simple problem: faced with an endogenous variable, using realizations of that variable in other units as instruments cannot produce unbiased, consistent estimates. First, these spatial instruments require arguing for one type of spatial dependence in the data, while simultaneously ruling out other—often plausible—spatial relationships. Second, by design, these instruments violate the exclusion restriction, because the endogenous variable and the instrument are simultaneously determined. Despite the popularity of spatial instruments in the social sciences, these problems are rarely acknowledged, let alone addressed. Jointly, these two problems imply that spatial instruments cannot be valid instruments. They do not even guarantee an improvement over OLS. We therefore advocate against the use of spatial instruments in their current form, and caution against interpreting estimates based on those instruments as superior to OLS.

More generally, this letter points to a potential misunderstanding in some applications of instrumental variables. Two requirements are typically mentioned for valid instruments: the exclusion restriction and correlation with the endogenous variable. These are often jointly summarized as the requirement that the instrument affects the outcome through and only through the endogenous predictor. As Angrist, Imbens, and Rubin (Reference Angrist, Imbens and Rubin1996, p. 450) emphasize, such language obfuscates the conditions necessary for valid instruments and “hinders assessment and communication of the plausibility of the underlying model.” As causal diagrams as well as the potential outcome framework clarify, the exclusion restriction is more nuanced: there must be no pathway from the instrument to the outcome except through the variable of interest, and there must be no pathway to the instrument. Both are implied by the common assumptions for valid instruments, but the latter is often neglected in applications. Spatial instruments violate this condition and therefore cannot produce valid inferences.

## Supplementary material

For supplementary material accompanying this paper, please visithttps://doi.org/10.1017/pan.2018.10.