Abstract
A commentary by Thoemmes on Wiedermann and Sebastian’s introductory article on Direction Dependence Analysis (DDA) is responded to in the interest of elaborating and extending direction dependence principles to evaluate causal effect directionality. Considering Thoemmes’ observation that some DDA principles are already well-established in machine learning, we argue that several other connections between DDA and research lines in theoretical statistics, econometrics, and quantitative psychology exist, suggesting that DDA is best conceptualized as a framework that summarizes and extends existing knowledge on properties of linear models under non-normality. Further, Thoemmes articulates concerns about assumptions of error distributions used in our main article and presents an artificial data example in which some DDA tests have suboptimal statistical power. We present extensions of DDA to entirely relax distributional assumptions about errors and describe two sensitivity analysis approaches to critically evaluate DDA results. Both sensitivity approaches are illustrated using Thoemmes’ artificial data example. Incorporating DDA sensitivity results yields correct causal conclusions. Thus, overall, we stay with our initial conclusion that the use of higher moments in causal inference constitutes an exciting open research area.
Article information
Conflict of interest disclosures: Each author signed a form for disclosure of potential conflicts of interest. No authors reported any financial or other conflicts of interest in relation to the work described.
Ethical principles: The authors affirm having followed professional ethical guidelines in preparing this work. These guidelines include obtaining informed consent from human participants, maintaining ethical treatment and respect for the rights of human or animal participants, and ensuring the privacy of participants and their data, such as ensuring that individual participants cannot be identified in reported results or from publicly available original or archival data.
Funding: This work was supported in part by Grant R305A120706 from the Institute of Education Sciences (IES).
Role of the funders/sponsors: None of the funders or sponsors of this research had any role in the design and conduct of the study; collection, management, analysis, and interpretation of data; preparation, review, or approval of the manuscript; or decision to submit the manuscript for publication.
Acknowledgments: The authors would like to thank Phillip K. Wood and the Associate Editor Stephen G. West for their comments on prior versions of this manuscript. The ideas and opinions expressed herein are those of the authors alone, and endorsement by the authors’ institution is not intended and should not be inferred.
Notes
1 We do not embark on a discussion of the conceptual differences between a confirmatory test of an a priori selected causal theory (which is the focus of DDA) and the task of learning causal structures from data alone without a priori expert knowledge - the latter, of course, carries a massive exploratory element and may be prone to incorrect causal conclusions. For a detailed discussion see, e.g., Wiedermann and von Eye (Citation2015b) and Hernán, Hsu, and Healy (Citation2019).
2 In particular Sungur’s (Citation2005) approach using copulas led to another branch of DDA known as copula-based Directional Dependence Analysis (Kim & Kim, Citation2014; for an overview see Wiedermann, Kim, Sungur & von Eye, Citation2019).
3 In essence, a link between IV estimation and so-called higher order covariances of x and y, (for details see the section “Relaxing Distributional Assumptions of the Error” in this rejoinder) exist when defining an instrument (w) as higher powers of the predictor x. In the confounded model,
with
(u being the confounder and
and
denoting error terms), the causal effect
can be estimated through
if
0 and
= 0 (Durbin, Citation1954). Using
, for example, one obtains
which provides an unbiased estimate of the causal effect when u is symmetrically distributed. Evaluating properties of this higher moment estimator (e.g., quantifying biases due to violated distributional assumptions) is material for future work.
4 In the unconfounded model, , x was gamma-distributed with zero mean, unit variance, and a skewness of
= 1.5, the error
followed a standard normal distribution and the causal effect was
= 0.39 representing a medium effect size (Cohen, Citation1988). In the confounded model,
with
, the confounder u was gamma-distributed with zero mean, unit variance, and a skewness of
= 1.5, the two error terms (
and
) were standard normally distributed, and all coefficients were set to 0.39. The sample size was n = 500 for both models.