Similar Items: Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability
- Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability
- Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability
- Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability
- Inductive Definition and Domain Theoretic Properties of Fully Abstract
- Foundations of probability-raising causality in Markov decision processes
- Position: Mechanistic Interpretability Must Disclose Identification Assumptions for Causal Claims