One Explanation Does Not Fit All

Sokol, Kacper; Flach, Peter

One Explanation Does Not Fit All

dc.contributor.author	Sokol, Kacper
dc.contributor.author	Flach, Peter
dc.date.accessioned	2021-04-23T09:34:08Z
dc.date.available	2021-04-23T09:34:08Z
dc.date.issued	2020
dc.description.abstract	The need for transparency of predictive systems based on Machine Learning algorithms arises as a consequence of their ever-increasing proliferation in the industry. Whenever black-box algorithmic predictions influence human affairs, the inner workings of these algorithms should be scrutinised and their decisions explained to the relevant stakeholders, including the system engineers, the system’s operators and the individuals whose case is being decided. While a variety of interpretability and explainability methods is available, none of them is a panacea that can satisfy all diverse expectations and competing objectives that might be required by the parties involved. We address this challenge in this paper by discussing the promises of Interactive Machine Learning for improved transparency of black-box systems using the example of contrastive explanations—a state-of-the-art approach to Interpretable Machine Learning. Specifically, we show how to personalise counterfactual explanations by interactively adjusting their conditional statements and extract additional explanations by asking follow-up “What if?” questions. Our experience in building, deploying and presenting this type of system allowed us to list desired properties as well as potential limitations, which can be used to guide the development of interactive explainers. While customising the medium of interaction, i.e., the user interface comprising of various communication channels, may give an impression of personalisation, we argue that adjusting the explanation itself and its content is more important. To this end, properties such as breadth, scope, context, purpose and target of the explanation have to be considered, in addition to explicitly informing the explainee about its limitations and caveats. Furthermore, we discuss the challenges of mirroring the explainee’s mental model, which is the main building block of intelligible human–machine interactions. We also deliberate on the risks of allowing the explainee to freely manipulate the explanations and thereby extracting information about the underlying predictive model, which might be leveraged by malicious actors to steal or game the model. Finally, building an end-to-end interactive explainability system is a challenging engineering task; unless the main goal is its deployment, we recommend “Wizard of Oz” studies as a proxy for testing and evaluating standalone interactive explainability algorithms.	de
dc.identifier.doi	10.1007/s13218-020-00637-y
dc.identifier.pissn	1610-1987
dc.identifier.uri	http://dx.doi.org/10.1007/s13218-020-00637-y
dc.identifier.uri	https://dl.gi.de/handle/20.500.12116/36294
dc.publisher	Springer
dc.relation.ispartof	KI - Künstliche Intelligenz: Vol. 34, No. 2
dc.relation.ispartofseries	KI - Künstliche Intelligenz
dc.subject	Counterfactuals
dc.subject	Explanations
dc.subject	Interactive
dc.subject	Personalised
dc.title	One Explanation Does Not Fit All	de
dc.type	Text/Journal Article
gi.citation.endPage	250
gi.citation.startPage	235

Sammlungen

Künstliche Intelligenz 34(2) - Juni 2020

One Explanation Does Not Fit All

Dateien

Sammlungen