1,813
Views
22
CrossRef citations to date
0
Altmetric
Articles

Regression Discontinuity Designs With Sample Selection

Pages 171-186 | Received 01 Mar 2016, Published online: 08 Aug 2017
 

Abstract

This article extends the standard regression discontinuity (RD) design to allow for sample selection or missing outcomes. We deal with both treatment endogeneity and sample selection. Identification in this article does not require any exclusion restrictions in the selection equation, nor does it require specifying any selection mechanism. The results can therefore be applied broadly, regardless of how sample selection is incurred. Identification instead relies on smoothness conditions. Smoothness conditions are empirically plausible, have readily testable implications, and are typically assumed even in the standard RD design. We first provide identification of the “extensive margin” and “intensive margin” effects. Then based on these identification results and principle stratification, sharp bounds are constructed for the treatment effects among the group of individuals that may be of particular policy interest, that is, those always participating compliers. These results are applied to evaluate the impacts of academic probation on college completion and final GPAs. Our analysis reveals striking gender differences at the extensive versus the intensive margin in response to this negative signal on performance.

ACKNOWLEDGMENT

The author would like to thank anonymous referees for valuable comments.

Notes

1 Identification of the standard RD design has been discussed in Hahn, Todd, and van der Klaauw (Citation2001), Lee (Citation2008), and Dong (Citation2016). Inference was discussed by Porter (2003), Imbens and Kalyanaraman (Citation2012), Calonico, Cattaneo, and Titiunik (Citation2014), Cattaneo, Frandsen, and Titiunik (2015), Otsu, Xu, and Matsushita (2015), and Feir, Lemieux, and Marmer (2016). See Cattaneo, Titiuni, and Vazquez-Bare (2016) for a comparison of different inference approaches for the standard RD design.

2 In particular, Staub (Citation2014) discussed bounds under two alternative assumptions. The first assumption assumes that treatment effects are nonnegative for everyone. The second assumption assume that treatment effects are nonnegative for switchers and have the same sign for always participants, and further that one knows that ATE>0 or ATE<0.

3 Y*tY*(t, St) for t = 0, 1.

4 Assume T = h(R, V) for unobservables V, which can be a vector. Without loss of generality, one can write T = h1(R, V)Z + h0(R, V)(1 − Z). The function hz(R, V) for z = 0, 1 describes the treatment assignment below or above the cutoff. Define then Tz(r) ≡ hz(r, V) for z = 0, 1.

5 Alternatively, one could assume that FYt*,St,Θ|R(y,s|r) for any Θ ∈ {A, N, C} is continuous at r0.

6 That is, smoothness conditions need to hold for the selected sample in order for Equation (Equation3) to identify a causal effect for the selected sample.

7 In practice, the fuzzy RD estimator along with its robust bias-corrected inference can be conveniently implemented using the Stata command rdrobust.ado (https://sites.google.com/site/rdpackages/rdrobust).

8 Zhang and Rubin (Citation2003) and Imai (Citation2008) discussed similar bounds in the context of randomized experiments with perfect compliance. See also Lee (Citation2009), Blanco, Flores, and Flores-Lagunes (Citation2013), and Chen and Flores (Citation2014) for construction of bounds in evaluating the effects of Job Corps.

9 In practice, these quantiles can be conveniently estimated by using the RD quantile treatment effect estimator proposed by Frandsen, Frölich, and Melly (Citation2012), after replacing T with ST and (1 − T) with S(1 − T) to deal with sample selection.

10 Fletcher and Tokmouline (Citation2010) also used the THEOP data, but all the data used in this article are obtained and processed independently.

11 In practice, when a student is considered as scholastically deficient, he or she may only be given an academic warning. However, a quick survey administered to the relevant academic deans suggests that students are generally placed on probation in this case.

12 The close-to-cutoff sample is used to produce sample summary statistics and figures only.

13 All our figures are conveniently generated using the Stata command, rdplot.ado. Details can be found in Calonico, Cattaneo, and Titiunik, (2015).

14 Students whose first semester GPAs are exactly 2.0 are not included in our sample, considering possible rounding at this value. We assume that observations away from 2.0 are correctly measured.

15 For notational convenience, in all the tables, I drop C and R = r0 in the conditioning set. Nevertheless, all estimates are among the compliers at the probation threshold.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.