Proc Mach Learn Res. 2024 Jul:235:53597-53618.

DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation

Yinjun Wu¹, Mayank Keoliya², Kan Chen³, Neelay Velingker², Ziyang Li², Emily J Getzen⁴, Qi Long²,⁴, Mayur Naik², Ravi B Parikh⁴, Eric Wong²

Affiliations

¹ School of Computer Science, Peking University, Beijing, China.
² Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA, United States.
³ School of Public Health, Harvard University, Boston, MA, United States.
⁴ Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.

PMID: 39205826
PMCID: PMC11350397

Yinjun Wu et al. Proc Mach Learn Res. 2024 Jul.

Abstract

Designing faithful yet accurate AI models is challenging, particularly for individual treatment effect (ITE) estimation. ITE prediction models deployed in critical settings such as healthcare should ideally (i) be accurate and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers for black-box models lack faithfulness guarantees, and self-interpretable models greatly compromise accuracy. To address these issues, we propose DISCRET, a self-interpretable ITE framework that synthesizes faithful, rule-based explanations for each sample. A key insight behind DISCRET is that explanations can double as database queries that identify subgroups of similar samples. We provide a novel RL algorithm to efficiently synthesize these explanations from a large search space. We evaluate DISCRET on diverse tasks involving tabular, image, and text data. DISCRET outperforms the best self-interpretable models and has accuracy comparable to the best black-box models while providing faithful explanations. DISCRET is available at https://github.com/wuyinjun-1993/DISCRET-ICML2024.
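To make the "explanation as database query" idea concrete, the following is a minimal sketch, not the authors' implementation. It assumes a rule is a conjunction of `(feature, operator, constant)` literals, that the "database" is a list of dictionaries, and that the subgroup effect is estimated by a simple difference in mean outcomes between treated and control rows. The estimator, the function names, and the toy data are all illustrative assumptions.

```python
import operator
from statistics import mean

# Comparison operators a literal may use (an illustrative choice).
OPS = {"<=": operator.le, ">": operator.gt, "==": operator.eq}

def matches(row, rule):
    """Return True if the row satisfies every (feature, op, constant) literal."""
    return all(OPS[op](row[feat], const) for feat, op, const in rule)

def subgroup_effect(rows, rule):
    """Run the rule as a query, then take the difference in mean outcome
    between treated and control rows in the retrieved subgroup.
    Returns None when either arm of the subgroup is empty."""
    group = [r for r in rows if matches(r, rule)]
    treated = [r["outcome"] for r in group if r["treated"] == 1]
    control = [r["outcome"] for r in group if r["treated"] == 0]
    if not treated or not control:
        return None
    return mean(treated) - mean(control)

# Toy table; the values are invented purely for illustration.
rows = [
    {"weight": 1.2, "mom_age": 18, "treated": 1, "outcome": 9.0},
    {"weight": 1.4, "mom_age": 19, "treated": 0, "outcome": 5.0},
    {"weight": 1.3, "mom_age": 17, "treated": 1, "outcome": 8.0},
    {"weight": 2.9, "mom_age": 30, "treated": 1, "outcome": 6.0},
    {"weight": 3.1, "mom_age": 28, "treated": 0, "outcome": 5.5},
]

rule = [("weight", "<=", 1.5), ("mom_age", "<=", 19)]
effect = subgroup_effect(rows, rule)
print(effect)
```

Because the estimate is computed only from the rows the rule retrieves, two samples that receive the same rule necessarily receive the same estimate in this sketch.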


Figures

**Figure 5.**
ATE error curve of DISCRET on the IHDP test split.

**Figure 6.**
Frequency of outcome values on the Uganda dataset.

**Figure 7.**
Concepts extracted from the Uganda dataset.

**Figure 1.**
Motivating examples from the Uganda dataset. We use satellite images to predict how providing economic aid (the treatment) helps to develop remote regions of the country (the outcome). The task is to estimate the ITE for each of the samples $x_{1}$ and $x_{2}$. DISCRET predicts that, because both images show several indicators of rich soil and urbanization, they will have similar ITEs if given aid. Self-interpretable models such as Causal Forest (Athey & Wager, 2019) produce *consistent* ITE estimates (i.e., samples with the same explanations receive the same model predictions, *viz.* 3.97 and 3.97), but have poor accuracy ($\widehat{ITE}_{x_{1}} \ll ITE_{x_{1}} = 4.25$). Black-box models such as TransTEE (Zhang et al., 2022) are accurate but do not produce similar predictions for samples $x_{1}$ and $x_{2}$ with similar explanations when the explanations are sourced from post-hoc explainers such as Anchor (Ribeiro et al., 2018). DISCRET produces both consistent and accurate predictions.

**Figure 2.**
Illustration of DISCRET on the IHDP dataset, which tracks premature infants. Given a sample $x$, DISCRET synthesizes an explanation $L_{1:k}$ by iteratively constructing each literal in the explanation. In particular, DISCRET (i) embeds the given sample and any previously generated literals ($\Theta_{0}$), (ii) passes the embedding to the feature selection network ($\Theta_{1}$) to pick a feature, and then (iii) passes the embedding and the selected feature to the constant selection network ($\Theta_{2}$) to get a thresholding constant. The operator is auto-assigned based on the feature and the sample. DISCRET executes this explanation on the database to find relevant samples, which are used (i) during training to compute a reward function for $\Theta_{0}$, $\Theta_{1}$, and $\Theta_{2}$, and (ii) during testing to calculate the ITE.
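The literal-by-literal loop in this caption can be sketched as follows. This is a hedged illustration, not the paper's code: `pick_feature` and `pick_constant` are hypothetical hand-written stand-ins for the learned networks $\Theta_{1}$ and $\Theta_{2}$, the embedding step ($\Theta_{0}$) is omitted, and the operator rule (choose `<=` or `>` so that the sample satisfies its own literal) is an assumption about what "auto-assigned" means.

```python
# Hypothetical fixed thresholds standing in for the constant selection network.
THRESHOLDS = {"mom_age": 19, "weight": 1.5}

def pick_feature(sample, rule):
    """Stand-in for the feature selector: first unused feature, alphabetically."""
    used = {feat for feat, _, _ in rule}
    return next(feat for feat in sorted(sample) if feat not in used)

def pick_constant(sample, rule, feat):
    """Stand-in for the constant selector: look up a fixed threshold."""
    return THRESHOLDS[feat]

def build_rule(sample, pick_feature, pick_constant, k):
    """Construct k literals one at a time, conditioning each choice on the
    sample and on the literals generated so far.
    Requires k to be at most the number of features in the sample."""
    rule = []
    for _ in range(k):
        feat = pick_feature(sample, rule)
        const = pick_constant(sample, rule, feat)
        # Assumed operator rule: pick the direction the sample itself satisfies.
        op = "<=" if sample[feat] <= const else ">"
        rule.append((feat, op, const))
    return rule

sample = {"weight": 1.2, "mom_age": 25}
rule = build_rule(sample, pick_feature, pick_constant, k=2)
print(rule)
```

In a learned version, the two selectors would be trained from a reward computed on the samples that the finished rule retrieves, as the caption describes.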

**Figure 3.**
Consistency scores (higher is better) for DISCRET and a black-box model (TransTEE) combined with a post-hoc explainer. Our results confirm that DISCRET produces faithful explanations, and importantly, show that post-hoc explanations are rarely faithful, as evidenced by low consistency scores across datasets.
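One simple way to quantify consistency is sketched below. It is an assumed definition for illustration only and may differ from the metric used in the paper: among all pairs of samples that share an identical explanation, it reports the fraction whose predictions agree within a tolerance. The function name, the tolerance, and the toy inputs are hypothetical.

```python
from itertools import combinations

def consistency_score(explanations, predictions, tol=1e-6):
    """Fraction of same-explanation sample pairs whose predictions agree
    within tol. Returns None if no two samples share an explanation."""
    agree = 0
    total = 0
    for i, j in combinations(range(len(explanations)), 2):
        if explanations[i] == explanations[j]:
            total += 1
            if abs(predictions[i] - predictions[j]) <= tol:
                agree += 1
    return agree / total if total else None

# Toy inputs: explanations are labels; predictions are invented ITE estimates.
explanations = ["A", "A", "B", "B", "B"]
consistent = consistency_score(explanations, [3.97, 3.97, 1.0, 1.0, 1.0])
inconsistent = consistency_score(explanations, [3.97, 4.80, 1.0, 1.0, 2.5])
print(consistent, inconsistent)
```

Under this definition, a model whose prediction is a function of the explanation alone always scores 1, whereas a black-box model paired with a post-hoc explainer can score lower.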

**Figure 4.**
DISCRET identifies similar samples across diverse datasets: tabular (IHDP), image (Uganda), and text (EEEC). 1) In the first setting, given a tabular sample $x$ describing a premature infant, DISCRET establishes a rule associating extremely underweight (weight ≤ 1.5) infants born to teenage mothers (mom age ≤ 19) with a history of drug use; such groups likely benefit from childcare visits (treatment) and will have highly improved cognitive outcomes. 2) In the second setting, on satellite images, for a sample $x$, DISCRET discerns a rule based on the presence of concepts like “high soil moisture” (reddish-pink pixels) and the absence of minimal soil (brown pixels), thus characterizing areas with high soil moisture. DISCRET’s synthesized rule aligns with findings that government grants (treatment) are more effective in areas with higher soil moisture content (outcome) (Jerzak et al., 2023b). 3) Likewise, the text setting aims to measure the impact of gender (treatment) on mood (outcome). Given a sentence $x$ where the gendered noun (“Betsy”) does not affect the semantic meaning, DISCRET’s rule focuses on mood-linked words in the sentence, e.g., “hilarious”.


References

1. Achanta R, Shaji A, Smith K, Lucchi A, Fua P, and Süsstrunk S. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(11):2274–2282, 2012.
2. Agarwal R, Melnick L, Frosst N, Zhang X, Lengerich B, Caruana R, and Hinton GE. Neural additive models: Interpretable machine learning with neural nets. Advances in Neural Information Processing Systems, 34:4699–4711, 2021.
3. Anderson DW, Kish L, and Cornell RG. On stratification, grouping and matching. Scandinavian Journal of Statistics, pp. 61–66, 1980.
4. Athey S and Wager S. Estimating treatment effects with causal forests: An application. Observational Studies, 5(2):37–51, 2019.
5. Baeza-Yates R, Ribeiro-Neto B, et al. Modern information retrieval, volume 463. ACM Press, New York, 1999.
