Professor Thomas Krödel | University of Hamburg

Thomas Krödel is Professor of Philosophy of Science at the University of Hamburg. His principal areas of research are Philosophy of Science, Epistemology, Metaphysics and Philosophy of Mind.

Date | 8 October 2025

Title | 'Causal Complications for Explainable AI'

Abstract:

How can we explain the behaviour of AI agents such as large language models? A natural approach is to explain their behaviour causally by identifying internal representations of the system and their causal relations to the system’s output.

It turns out, however, that the nature of these representations makes it difficult to determine their causal roles. What seems to be our best theory of causation, namely the interventionist theory, delivers incorrect verdicts about these roles, and there is no quick fix.