PhilSci Archive

Simpson's Paradox Beyond Confounding

Dong, Zili and Cai, Weixin and Zhao, Shimin (2024) Simpson's Paradox Beyond Confounding. [Preprint]

This is the latest version of this item.

[img] Text
Dong_Cai_Zhao_SP beyond confounding.pdf

Download (838kB)

Abstract

Simpson's paradox (SP) is a statistical phenomenon where the association between two variables reverses, disappears, or emerges, after conditioning on a third variable. It has been proposed (by, e.g., Judea Pearl) that SP should be analyzed using the framework of graphical causal models (i.e., causal DAGs) in which SP is diagnosed as a symptom of confounding bias. This paper contends that this confounding-based analysis cannot fully capture SP: there are cases of SP that cannot be explained away in terms of confounding. Previous works have argued that some cases of SP do not require causal analysis at all. Despite being a logically valid counterexample, we argue that this type of cases poses only a limited challenge to Pearl’s analysis of SP. In our view, a more powerful challenge to Pearl comes from cases of SP that do require causal analysis but can arise without confounding. We demonstrate with examples that accidental associations due to genetic drift, the use of inappropriate aggregate variables as causes, and interactions between units (i.e., inter-unit causation) can all give rise to SP of this type. The discussion is also extended to the amalgamation paradox (of which SP is a special form) which can occur due to the use of non-collapsible association measures, in the absence of confounding.


Export/Citation: EndNote | BibTeX | Dublin Core | ASCII/Text Citation (Chicago) | HTML Citation | OpenURL
Social Networking:
Share |

Item Type: Preprint
Creators:
CreatorsEmailORCID
Dong, Zilizdong67@uwo.ca0000-0002-3697-1592
Cai, Weixinw3cai@ucsd.edu0009-0001-9906-3700
Zhao, Shiminszhao249@wisc.edu0000-0002-5639-0242
Keywords: Simpson's paradox, causal modelling, DAGs, confounding
Subjects: General Issues > Data
Specific Sciences > Biology
General Issues > Causation
Specific Sciences > Computer Science
General Issues > Evidence
General Issues > Explanation
Depositing User: Weixin Cai
Date Deposited: 21 Aug 2024 10:39
Last Modified: 21 Aug 2024 10:39
Item ID: 23815
Subjects: General Issues > Data
Specific Sciences > Biology
General Issues > Causation
Specific Sciences > Computer Science
General Issues > Evidence
General Issues > Explanation
Date: 2024
URI: https://philsci-archive.pitt.edu/id/eprint/23815

Available Versions of this Item

Monthly Views for the past 3 years

Monthly Downloads for the past 3 years

Plum Analytics

Actions (login required)

View Item View Item