Logo des Repositoriums
 

One Explanation Does Not Fit All

dc.contributor.authorSokol, Kacper
dc.contributor.authorFlach, Peter
dc.date.accessioned2021-04-23T09:34:08Z
dc.date.available2021-04-23T09:34:08Z
dc.date.issued2020
dc.description.abstractThe need for transparency of predictive systems based on Machine Learning algorithms arises as a consequence of their ever-increasing proliferation in the industry. Whenever black-box algorithmic predictions influence human affairs, the inner workings of these algorithms should be scrutinised and their decisions explained to the relevant stakeholders, including the system engineers, the system’s operators and the individuals whose case is being decided. While a variety of interpretability and explainability methods is available, none of them is a panacea that can satisfy all diverse expectations and competing objectives that might be required by the parties involved. We address this challenge in this paper by discussing the promises of Interactive Machine Learning for improved transparency of black-box systems using the example of contrastive explanations—a state-of-the-art approach to Interpretable Machine Learning. Specifically, we show how to personalise counterfactual explanations by interactively adjusting their conditional statements and extract additional explanations by asking follow-up “What if?” questions. Our experience in building, deploying and presenting this type of system allowed us to list desired properties as well as potential limitations, which can be used to guide the development of interactive explainers. While customising the medium of interaction, i.e., the user interface comprising of various communication channels, may give an impression of personalisation, we argue that adjusting the explanation itself and its content is more important. To this end, properties such as breadth, scope, context, purpose and target of the explanation have to be considered, in addition to explicitly informing the explainee about its limitations and caveats. Furthermore, we discuss the challenges of mirroring the explainee’s mental model, which is the main building block of intelligible human–machine interactions. We also deliberate on the risks of allowing the explainee to freely manipulate the explanations and thereby extracting information about the underlying predictive model, which might be leveraged by malicious actors to steal or game the model. Finally, building an end-to-end interactive explainability system is a challenging engineering task; unless the main goal is its deployment, we recommend “Wizard of Oz” studies as a proxy for testing and evaluating standalone interactive explainability algorithms.de
dc.identifier.doi10.1007/s13218-020-00637-y
dc.identifier.pissn1610-1987
dc.identifier.urihttp://dx.doi.org/10.1007/s13218-020-00637-y
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/36294
dc.publisherSpringer
dc.relation.ispartofKI - Künstliche Intelligenz: Vol. 34, No. 2
dc.relation.ispartofseriesKI - Künstliche Intelligenz
dc.subjectCounterfactuals
dc.subjectExplanations
dc.subjectInteractive
dc.subjectPersonalised
dc.titleOne Explanation Does Not Fit Allde
dc.typeText/Journal Article
gi.citation.endPage250
gi.citation.startPage235

Dateien