“Who's there?”: Depicting identity in interaction

Patrick G. T. Healey; Christine Howes; Ruth Kempson; Gregory J. Mills; Matthew Purver; Eleni Gregoromichelaki; Arash Eshghi; Julian Hough

doi:10.1017/S0140525X22001492

“Who's there?”: Depicting identity in interaction

Published online by Cambridge University Press: 05 April 2023

Eleni Gregoromichelaki ,

Arash Eshghi and

Julian Hough

Show author details

Patrick G. T. Healey: Affiliation:
School of Electronic Engineering and Computer Science, Queen Mary University of London, London E1 4NS, UK p.healey@qmul.ac.uk m.purver@qmul.ac.uk j.hough@qmul.ac.uk http://cogsci.eecs.qmul.ac.uk
Christine Howes: Affiliation:
Department of Philosophy, Linguistics, Theory of Science, University of Gothenburg, 41255 Gothenburg, Sweden christine.howes@gu.se eleni.gregoromichelaki@gu.se
Ruth Kempson: Affiliation:
Department of Philosophy, King's College London, London WC2R 2LS, UK ruth.kempson@kcl.ac.uk https://gu-clasp.github.io/people/ruth-kempson/
Gregory J. Mills: Affiliation:
Faculty of Arts, Computational Linguistics (CL), University of Groningen, 9712 EK Groningen, Netherlands School of Computer Science and Mathematics, Kingston University, Surrey KT1 1LQ, UK G.Mills@kingston.ac.uk
Matthew Purver: Affiliation:
School of Electronic Engineering and Computer Science, Queen Mary University of London, London E1 4NS, UK p.healey@qmul.ac.uk m.purver@qmul.ac.uk j.hough@qmul.ac.uk http://cogsci.eecs.qmul.ac.uk Jožef Stefan Institute, Ljubljana, Slovenia
Eleni Gregoromichelaki: Affiliation:
Department of Philosophy, Linguistics, Theory of Science, University of Gothenburg, 41255 Gothenburg, Sweden christine.howes@gu.se eleni.gregoromichelaki@gu.se Department of Philosophy, King's College London, London WC2R 2LS, UK ruth.kempson@kcl.ac.uk https://gu-clasp.github.io/people/ruth-kempson/
Arash Eshghi: Affiliation:
School of Mathematical & Computer Sciences, Heriot-Watt University, Edinburgh EH14 4AS, UK A.Eshghi@hw.ac.uk
Julian Hough: Affiliation:
School of Electronic Engineering and Computer Science, Queen Mary University of London, London E1 4NS, UK p.healey@qmul.ac.uk m.purver@qmul.ac.uk j.hough@qmul.ac.uk http://cogsci.eecs.qmul.ac.uk

Article contents

Abstract
Financial support
Competing interest
References

Rights & Permissions

Abstract

Social robots have limited social competences. This leads us to view them as depictions of social agents rather than actual social agents. However, people also have limited social competences. We argue that all social interaction involves the depiction of social roles and that they originate in, and are defined by, their function in accounting for failures of social competence.

Type: Open Peer Commentary
Information: Behavioral and Brain Sciences , Volume 46 , 2023 , e37

DOI: https://doi.org/10.1017/S0140525X22001492 [Opens in a new window]
Copyright: Copyright © The Author(s), 2023. Published by Cambridge University Press

Clark and Fischer (C&F) provide a timely reminder that there is a large and underappreciated gap between the ambitions of social robotics and the actual social competence of robots (Park, Healey, & Kaniadakis, Reference Park, Healey and Kaniadakis2021). As they demonstrate, natural conversation presents complex challenges that go well beyond current engineering capabilities (see also Healey, Reference Healey, Muggleton and Chater2021). Nonetheless, they also point to parallels in the ways in which people interact with each other and with social robots.

This commentary questions the ontological distinction underlying C&F's discussion. Specifically, does their account of depiction provide a principled basis for their argument that depictions of social agency fundamentally differ from actual social agency?

C&F discuss various examples of depictions of social agents including Laurence Olivier's performance of Hamlet. Depiction in these examples is complex. The character – Hamlet – is based on a mixture of characters from earlier plays (possibly also Shakespeare's son); there are multiple versions of the text of Hamlet; different productions select different parts of those texts, different actors perform those parts differently; direction, costume, staging, scenography vary, and so on. C&F embrace this complexity and use it to characterise various aspects of ways people treat interaction with social robots as performance.

The problem, as we see it, is that C&F's account of depiction is so rich, encompassing so much of human social interaction, that the distinction between actual social agents and depictions of social agents dissolves. As C&F show, there are familiar contexts in which people perform a role, such as hotel receptionist, which also involve derived authority, particular communicative styles and particular costumes and props. These roles are depictions and successful interaction in these cases requires that we recognise and engage with the performance (Eshghi, Howes, & Gregoromichelaki, Reference Eshghi, Howes, Gregoromichelaki, Bernardy, Blanck, Chatzikyriakidis, Lappin and Maskharashvili2022). However, arguably, all human social interaction has these properties (Kempson, Cann, Gregoromichelaki, & Chatzikyriakidis, Reference Kempson, Cann, Gregoromichelaki and Chatzikyriakidis2016). It was Goffman's (Reference Goffman1959) insight that this kind of performative, depictive, dramaturgical description can be applied to any human social interaction.

When the receptionist in C&F's example (target article, sect. 8.1) switches to being someone who grewup in the same region as Clark, this is, in Goffman's terms, a switch from one kind of performed identity to another. It involves, for example, switching to certain kinds of community-specific knowledge, norms, and patterns of language use (see also Clark, Reference Clark, Gumperz and Levinson1996). People have multiple overlapping identities, all involving elements of depiction: different social repertoires, forms of authority, and conventions of interpretation. Moreover, it is unclear why such performances of identity involve depictions rather than indices to contextual features (“contextualisation cues”) that transform the current situation to a new one where the terms of the interaction have changed.

Despite this, we share the intuition that the features of interaction that C&F highlight are important. However, the crucial role that they assign to inference and pretense seems uncharacteristically individualistic, presenting the role of potentially sophisticated robots as passive, and ignoring efforts people make to scaffold the interaction. Our suggestion is that one way to retain a meaningful, explanatory role for depictions is to abandon the assumption of any fundamental discontinuity between authentic and performed social agency and, instead, look at how depiction functions in interaction. Specifically, the way depictions are used as a means of transforming the relation between interlocutors when social performances threaten to break down; they provide a way to account for the gap between a represented social role and the role invoked to explain the performative failure. Returning to C&F's receptionist example, the inability to provide local hotel information leads to the discovery of the receptionist's actual location which prompts the conversation to switch from “customer”-“receptionist” to “people from Rapid City.”

Not all failures emerge at the level of social performance. When we encounter contemporary social robots, there are a variety of ways in which things can go wrong and a variety of stances we can take to explain the failure (cf. Dennett, Reference Dennett1987). We quickly discover the limitations of robot social affordances and this forces us to reason about, for example, who made this thing? (authority); what is it supposed to do? (intention/character); is there hardware failure (base scene)? This applies equally to humans and robots: We sometimes invoke problems with authority (e.g., someone is too junior or too young to answer), intention (e.g. deceit) or hardware problems (someone can't hear, or is too drunk).

There are some empirical advantages to approaching depiction in this way. It restricts the range of possible depictions to things that are actually cited to account for disruptions to interaction rather than the indefinitely many possible forms of social depiction we could imagine. It also provides an index of social competence. The relative frequency with which we invoke interactive depictions or, for example, hardware problems, provides a measure of how sophisticated a social agent is. Embarrassment accompanies the failure of social roles (Goffman, Reference Goffman1967); involving characteristic displays such as blushing, averting eye contact, face touching, and smiling and laughter. Unlike shame, embarrassment also directly implicates other participants in a coordinated understanding of what has failed, how it failed and how to recover from it. Interestingly, robots are not currently designed to systematically recognise or produce signals of embarrassment (Park et al., Reference Park, Healey and Kaniadakis2021).

Our assumption is that what makes an “authentic” social interaction is the ability to detect and recover from failure – something in principle achievable by machines. Machines can participate in interactions where cognitive abilities are distributed across multiple agents and each can compensate for the failures or inadequacy of the other. The centrality of miscommunication (and ability to recover from it) in human–human interaction (Healey, de Ruiter, & Mills, Reference Healey, de Ruiter and Mills2018; Howes & Eshghi, Reference Howes and Eshghi2021) follows from the observation that we never share the same language, skills, or information as anyone we nevertheless successfully interact with (Clark, Reference Clark, Gumperz and Levinson1996). This is obvious in, for example, parent–child or expert/non-expert interactions, but is arguably characteristic of all social exchanges, including interactions with social robots. At present the potential possibilities for divergences may be broader and along different dimensions but this is not, we argue, different in kind.

Financial support

Christine Howes was supported by two grants from the Swedish Research council (VR) 2016-0116 – Incremental Reasoning in Dialogue (IncReD) and 2014-39 for the establishment of the Centre for Linguistic Theory and Studies in Probability (CLASP) at the University of Gothenburg. Purver received financial support from the Slovenian Research Agency via research core funding for the programme Knowledge Technologies (P2-0103) and the project Sovrag (Hate speech in contemporary conceptualizations of nationalism, racism, gender and migration, J5-3102); and the UK EPSRC via the project Sodestream (Streamlining Social Decision Making for Improved Internet Standards, EP/S033564/1).

Competing interest

None.

References

Clark, H. H. (1996). Communities, commonalities, and communication. In Gumperz, J. & Levinson, S. (Eds.), Rethinking linguistic relativity (pp. 324–355). Cambridge University Press.Google Scholar

Dennett, D. C. (1987). The intentional stance. MIT Press.Google Scholar

Eshghi, A., Howes, C., & Gregoromichelaki, E. (2022). Action coordination and learning in dialogue In Bernardy, J.-P., Blanck, R., Chatzikyriakidis, S., Lappin, S. & Maskharashvili, A. (Eds.), Probabilistic approaches to linguistic theory (pp. 357–418). CSLI Publications.Google Scholar

Goffman, E. (1959). The presentation of self in everyday life London. Allen Lane.Google Scholar

Goffman, E. (1967). Interaction ritual: Essays on face-to-face behavior (1st ed.). Doubleday.Google Scholar

Healey, P., de Ruiter, J. P., & Mills, G. J. (2018). Editors introduction: Miscommunication. Topics in Cognitive Science, 10(2), 264–278.CrossRef Google Scholar PubMed

Healey, P. G. T. (2021). Human-like communication. In Muggleton, S. & Chater, N. (Eds.), Human-like machine intelligence (pp. 137–151). Oxford University Press.CrossRef Google Scholar

Howes, C., & Eshghi, A. (2021). Feedback relevance spaces: Interactional constraints on processing contexts in dynamic syntax. Journal of Logic, Language and Information, 30(2), 331–362.CrossRef Google Scholar

Kempson, R., Cann, R., Gregoromichelaki, E., & Chatzikyriakidis, S. (2016). Language as mechanisms for interaction. Theoretical Linguistics, 42(3–4), 203–276.CrossRef Google Scholar

Park, S., Healey, P. G. T., & Kaniadakis, A. (2021). Should Robots Blush? In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 717, pp. 1–14. https://doi.org/10.1145/3411764.344556 CrossRef Google Scholar