Individual and team profiling to support theory of mind in artificial social intelligence

Bendell, Rhyse; Williams, Jessica; Fiore, Stephen M.; Jentsch, Florian

doi:10.1038/s41598-024-63122-8

Download PDF

Article
Open access
Published: 02 June 2024

Individual and team profiling to support theory of mind in artificial social intelligence

Rhyse Bendell^1,4,
Jessica Williams^1,5,
Stephen M. Fiore^2,3 &
…
Florian Jentsch^1,4

Scientific Reports volume 14, Article number: 12635 (2024) Cite this article

5756 Accesses
17 Citations
3 Altmetric
Metrics details

Subjects

Abstract

We describe an approach aimed at helping artificial intelligence develop theory of mind of their human teammates to support team interactions. We show how this can be supported through the provision of quantifiable, machine-readable, a priori information about the human team members to an agent. We first show how our profiling approach can capture individual team member characteristic profiles that can be constructed from sparse data and provided to agents to support the development of artificial theory of mind. We then show how it captures features of team composition that may influence team performance. We document this through an experiment examining factors influencing the performance of ad-hoc teams executing a complex team coordination task when paired with an artificial social intelligence (ASI) teammate. We report the relationship between the individual and team characteristics and measures related to task performance and self-reported perceptions of the ASI. The results show that individual and emergent team profiles were able to characterize features of the team that predicted behavior and explain differences in perceptions of ASI. Further, the features of these profiles may interact differently when teams work with human versus ASI advisors. Most strikingly, our analyses showed that ASI advisors had a strong positive impact on low potential teams such that they improved the performance of those teams across mission outcome measures. We discuss these findings in the context of developing intelligent technologies capable of social cognition and engage in collaborative behaviors that improve team effectiveness.

Humans depart from optimal computational models of interactive decision-making during competition under partial information

Article Open access 07 January 2022

Challenging presumed technological superiority when working with (artificial) colleagues

Article Open access 08 March 2022

Drivers and influence of social conformity on decision making in human-AI teams

Article Open access 13 March 2026

Introduction

The role of human-autonomy teams (HAT;¹) is expanding rapidly in numerous industries, including healthcare, transportation, military operations, space exploration, and manufacturing². Artificially intelligent (AI) autonomous agents excel in many forms of taskwork^3,4 and are advancing to the point that they are nearly able to function as teammates in support of some tasks. But these advances have not been made in what would be considered teamwork. In the organizational sciences, a fundamental distinction is made between teamwork and taskwork. Taskwork is used to describe the form of work needing to be accomplished by a team (e.g., develop a new product; rescue victims from a burning building). Teamwork is used to describe the collaborative processes they need to execute to achieve their objectives (e.g., engage in leadership behaviors, communicate effectively, manage conflict, etc.). Thus, for AI to be capable of teamwork, it needs to understand the attitudes, behaviors, and cognition humans rely on during collaboration^1,5.

Effective social interactions and collaboration in human-agent teaming depends on the development of social intelligence in AI agents whereby agents possess the necessary tools to monitor and engage in the exchange of social information^6,7. This means success relies on the ability of autonomous agents to demonstrate social-cognitive capabilities to collaborate effectively with their human counterparts. More specifically, an ASI agent would have the capability to encode and decode socially communicative information contained within social signals. This requires an agent to perceive social cues (e.g., words spoken, intonation, body language, facial expression, etc.), and accurately interpret meaning from them, then to produce social signals to convey the intended social information^6,8. This is an ongoing, interdisciplinary research challenge cutting across disciplines like computer science, engineering, and psychology requiring advances in computer vision, natural language processing, coupled with theory and methods in the social sciences⁹. Although AI has made substantial progress in more general interactions thanks to advancements in large language models, they remain largely socially ignorant. Until recently, they were unable to learn about a user and adapt their outputs over the course of an interaction. Further, they do not retain these memories or the information they learned in subsequent interaction, even with the same user. Such capabilities are essential for AI to demonstrate the kinds of social-cognitive capabilities foundational to interpersonal interactions.

Part of the difficulty in developing an ASI that is capable of engaging in effective social interactions and collaboration with humans is the need for social intelligence that is comparable to humans. Human-centered AI is increasingly recognized as an important area for improving collaborations between humans and machines¹⁰. Within that space, a crucial component of human-centered AI focuses on the social-cognitive aspects of human intelligence. Through the use of what is typically referred to as Theory of Mind (ToM), a core element of social cognition, humans develop their abilities to make inferences about situations and adapt the way they communicate with consideration to the situational context, such as if they are speaking to a novice or an expert in the topic, or when they’ve determined an individual has false or incomplete beliefs¹¹. ToM is the ability to make these mental state attributions, including inferences and predictions about another, in the course of observing their actions, or during interaction. ToM processes enable us to understand and infer what another is thinking, feeling, and doing, and to interact with others based upon these mental state attributions.

Artificial theory of mind

For artificial, socially intelligent systems to truly be able to effectively engage in social interactions with their human counterparts, they need to be able to infer, interpret, simulate, and predict human mental states, beliefs, and knowledge. To be able to make such mental state attributions, AI must be imbued with an Artificial Theory of Mind (AToM;¹²). By adapting models of human ToM to serve as an analog for an Artificial Theory of Mind (AToM), an agent could utilize these models to determine how to best communicate information in a way that is appropriate considering certain characteristics or features, such as prior experience or existing knowledge, the context of a situation, prior interactions, and skills or capabilities¹², a skill that humans learn through interactions they have over the course of their development. If an agent can either be provided with a priori data (prior to interacting with the human) or learn about a particular human or team through human-agent interactions, an ASI can calibrate their internal models by taking into account existing knowledge of their human counterpart. Further, ASI can more effectively communicate in human-understandable ways, as humans are able to do, by providing clarification and transparency to their decision-making processes and the factors that influence them¹³. Research has shown that increasing transparency in communication and explanations of ASI decision-making fosters trust, finding that the humans understanding of the AI’s behavior and the predictability of the AI’s actions both impact the trust a human places in the AI¹⁴. The capability for an AI to adjust its outputs automatically to reconcile misunderstandings or false beliefs in the human counterpart can be facilitated by AToM processes and can be done in ways analogous to how humans use ToM¹².

Theory of mind in humans is improved through interactions and explanations given by others, which allows humans to develop sensitivity to certain cues in their environment (which may differ across cultures, regions, or background⁶). This sensitivity, based on countless experiences, allows humans to perceive nuanced, situated meanings contained within social interactions, to infer another’s mental state based on observable behavior, and to predict future behavior¹⁵. An agent could be trained through interactions if they are able to maintain a base model that integrates prior experiences for use in present contexts and iterates over these models over time to become more accurate and interact more effectively¹². However, traditional intelligent systems typically are not provided with the information they need to develop meaningful, iteratively updated models of their human counterparts. Further, they do not have the capability to extend those conceptualizations to more complex second-order attributions interrelating teammates’ internal cognitive models, or to models of their teammates’ perceptions of the agent themself as an artificial teammate (e.g., third-order ToM attributions). These capabilities are mostly subconscious in humans, but are especially evident in high-performing teams, where members of the team need to develop internal models of their teammates (e.g., shared mental models or transactive memory systems⁵). In most teams, human members differ in a variety of ways, ranging from the attitudinal (e.g., collective orientation), to the cognitive (e.g., knowledge and expertise), with their internal models containing various aspects of a given teammate such as form of existing knowledge and level of expertise, assigned roles and responsibilities, capabilities, current workload, personality, and more. These internal models form the basis for ToM, and the kinds of mental state attributions that allow the team to coordinate in complex and collaborative activities, where they demonstrate the ability to anticipate other team member’s needs, such as timing information pushes and providing backup behavior¹⁶. It is currently not known if it is feasible, or useful, for agents to acquire this information through multiple interactions in order to learn about the humans with whom they are interacting. Rather, providing an ASI with the relevant information about their human counterparts might help the agent use its AToM to adapt explanations and suggestions to users¹⁷, and to inform the agent’s AToM. The ASI should be provided with socially, and situationally, useful data that enables them to develop the AToM models necessary for collaboration.

Profiling to support artificial theory of mind development

Development of artificial theory of mind will require AI to be provided with not just enough data, but, more importantly, contextually relevant information about their teammates to begin modeling and making attributions about their beliefs and intentions, and predict future actions. We contend that building profiles, an abstract, quantifiable, and machine-readable description of an individual's characteristics, can be used to provide the a priori information an ASI can leverage to be able to inform their internal models of humans. Drawing on concepts and methods from social, cognitive, and organizational sciences, profiles can be developed to help an agent understand human behavior¹⁸. Building profile models is largely specific to the context for the use of the profile, as it would not be feasible to capture everything about an individual, so models must be developed with the intended purpose in mind. Profiling individuals originated with user research in marketing, but the approach is also widely applied in video game-based research^19,20, where profiles of players are able to describe and predict various outcomes and performance measures, including at the team level²¹. In addition to profiling related to task performance, previous work has attempted to understand the features of an individual and a team that are most closely related to their success²². Relevant profile features necessarily shift as a function of a team’s given task and context, but there is evidence that underlying features of individual’s (and by extension the teams that they form) knowledge, skills, and attitudes may play more reliable roles in determining teamwork behaviors and success. This includes features such as personality²³ and social skills²², which are particularly important in team settings²⁴.

The research reported here was conducted to examine factors influencing the successful performance of ad-hoc teams being advised by an agent architecture developed to be socially intelligent. It was part of a large research program, Artificial Social Intelligence for Successful Teams (ASIST), designed as an interdisciplinary initiative bringing together teams of researchers from the computational, social, and organizational sciences to collaborate on the development of AI capable of being a member of a team^25,26. A foundational feature of the program was that the AI not be designed to support what we described as taskwork (e.g., engage in active/mechanical working tasks that contribute to the completion of team objectives), but, rather, target AI capable of teamwork, and supporting team processes. The ASI was to monitor and provide teamwork relevant advisement as humans engaged in the task. The task was a simulated urban search and rescue (USAR) operations, a complex task designed to require coordination between teammates assigned varying roles and provided with varying resources. The primary focus of our work was to study the impact of pairing human teams with one of several artificial, socially intelligent agents and how profiling human teams affected this.

We present findings detailing our novel approach to profiling ad-hoc teams that captures features of team composition that may influence team process, success, and perceptions of artificial socially intelligent advisors. The profiles gauged team members and teams in their potential for teamwork and their potential to successfully execute the USAR task (see²⁷ for a detailed description of the profile components, an overview will follow in the Methods section of this paper). Our primary hypotheses regarding player profiles were that increased individual taskwork potential would be associated with improved performance of the taskwork allocated to a given player’s role and that increased individual teamwork potential would be associated with improved performance of tasks that required communication, coordination, and joint action with team members. Similarly, we anticipated that profiles at the team level, which were the aggregated team level taskwork and teamwork potential, would interact to predict a team’s ability to perform well on taskwork-teamwork measures. The profiles developed in this work were implemented into the cognitive models of the ASI agents (to varying degrees across the agents); however, a specific, detailed description of the agents’ cognitive architectures and how the profiles were implemented across the agents is out of scope for this manuscript. Rather, we discuss the profiles, including potential behavioral markers and characteristics we expected of different profiles, with consideration for the taskwork and teamwork sides of the profiles, provided to the teams of ASI developers, with a focus on their utility as predictors of differences in experimental outcomes and task performance to aid ASI agents in their role as an advisor.

Methods

The experiment reported here was pre-registered on the Open Science Framework²⁸ and the data collected as part of this study and that is used in the analyses presented in this work, has been made publicly available (data available here:²⁹). The experiment manipulated the presence of an ASI teammate serving in an advisory role such that some USAR teams had no advisor, others a human advisor, and the remainder paired with one agent imbued with artificial social intelligences (ASI). The purpose of this at the ASIST programmatic level was to test the effectiveness of six different ASI agents, developed by independent research teams, and as part of a research program testing the effectiveness of social-cognitive architectures developed for teams²⁵. Because our interests were more in understanding how profiles are related to human-agent teaming, for our analyses, we collapsed across the six agents combining them into a group of ASI advised teams. We will discuss agent capabilities and constraints in a later section, but overall, the ASI advisors were designed to closely monitor the actions and communications of the teams with which they operated. Based upon inferences derived from the agents AToM, developed by monitoring their human teammates, the agents provided advice and interventions to improve the team’s process and achieve successful outcomes.

Simulated USAR missions

Teams had to complete two simulated Urban Search and Rescue (USAR) missions executed in a testbed built using the Minecraft game environment (see²⁹ for a full description of the testbed). The testbed was designed to support virtual collaboration, allowing for remote experimentation during the pandemic. As such, teams were not collocated, and experiments were coordinated by remotely connecting participants to the testbed along with virtual meeting software allowing for synchronous communication and video of each team member. Missions were completed in a fixed order with each mission featuring a rescue in a partially collapsed building and included a different perturbation such that, in the first mission, teams experienced new rubble falling into the search area, blocking some areas, and, in the second mission, teams experienced a disruption (or black-out) of a shared map included to support coordination. Each of 113 teams were assigned to one of eight (between-team) conditions, with 15 teams in the No Advisor condition, 14 teams in the Human Advisor condition, and 14 teams in each of the remaining six Artificial Social Intelligence Agents (ASIs) Advisor conditions (i.e., 84 total teams in the ASI advisor condition). Each of the three team members were randomly assigned a role, such that the role assignment order for each team was fixed and distributed in order of connecting to the testbed). The three roles in this study were Medic, Engineer, and Transporter, and each role had unique capabilities, tools, and knowledge related to possible locations of victims. Teammates communicated with each other using virtual meeting voice communications as well as through what we call knowledge externalization tools. These tools were designed to be scaffolding for coordinative communications; that is, testbed tools that afforded externalization of cognition³⁰ via physical markup inside the virtual building and reflected on a mini-map layout of the USAR environment³¹ (e.g., blocks marked with pertinent information and placed in front of a room in the building).

Medic role

One individual on each team was assigned the role of Medic, which involved using a device to triage the building-collapse victims that were distributed throughout the environment to determine what kind of injuries they had. The medic was the only role that could acquire information about whether a victim was type A—abrasions, B—bone damage, or C—critical. After the injuries of a given victim were identified, the medic could then stabilize that victim in preparation for transport. It is relevant to note that victims could be transported before they were stabilized, and that transporting victims was a capacity of all roles on the team; however, the nature of a given victim’s injuries dictated their evacuation point, so the effectiveness with which the Medic acquired and shared this information was foundational to team success as well as efficiency of coordination. Additionally, critical victims required assistance from team members to heal/stabilize, which will be discussed in more detail at the end of this section.

Engineer role

A separate individual on each team was assigned the role of Engineer. Engineers had the slowest base movement speed on the team and were able to break rubble blocks. Clearing rubble was a critical task for mission success because it could open new paths or reveal that a victim was trapped within a pile of rubble. Engineers were also uniquely provided with information about the structural stability of rooms in the USAR environment, and they are shown the locations of ‘threat rooms’ (rooms where rubble was likely to fall and trap teammates) on the shared map. The effectiveness with which they shared this information was important for supporting the risk management process of the team.

Transporter role

The final individual on each of the three-person teams was assigned to the Transporter role which had the fastest base movement speed on the team, and provided participants with the ability to detect at a distance whether there was a victim inside of a room in the environment. The Transporter’s effectiveness at searching the environment for victims and communicating those locations to their team was vital to the team’s ability to coordinate triage, stabilizing, and evacuation work as well as to organize the interdependent team tasks.

As noted above, all of the roles were able to pick up the victims, carry them and set the victims down in the environment, but the victims would only count as “evacuated” for the team if the victim was both successfully stabilize and transported to the correct evacuation zone for that victim type (A, B, or C). Additionally, each participant was provided with the same set of knowledge externalization and communication tools that displayed various symbols and could be placed on the virtual floor (as well as removed) such that teammates could see them in the environment as well as view them on a shared mini-map. The semantic meaning of each marker block was as follows: Victim type A, Victim B, No Victim Here, Critical Victim, Regular Victim, Threat Room, Rubble, and Help Me Here. These allowed teammates to quickly and clearly communicate their task needs and organize interdependent tasking as well as backup behaviors.

Teams were also challenged with a shared, interdependent joint-task that required two teammates to work together to stabilize critical victims. Although the medic role was still required to stabilize a critical victim, an additional teammate was required to be in proximity to the victim in order for that stabilization to be successful. Walking away from the victim or not being close enough would cause the medic’s stabilization action to fail.

Before completing the experimental trials, participants responded to surveys and measures that captured participant individual differences and current dispositions, and after the missions they completed measures regarding their team’s success, perceptions of team process, and ratings of their team’s advisor (in conditions that included an advisor). The surveys relevant to this manuscript are described in more detail below.

Materials and measures

Individual player profiles

Player Profiles are based on a six-component model that is constructed from psychometric, psychographic, and skill elicitation measures that tap individuals’ taskwork-related and teamwork-related potential. Related to what was described earlier, this distinction is based on team theory to differentiate the varied competencies associated with completing a task and those needed to collaborate effectively. The combined model of the player profile includes six components with three tapping taskwork potential and three tapping teamwork potential. This integrated approach attempts to bridge the gap between traditional approaches to understanding/facilitating human behavior and modern methods for implementing artificial agents.

In this study, taskwork potential refers to a set of largely task generic competencies related to performing in a virtual world. They capture one’s ability to navigate and recall pathing, comfort/familiarity with task completion in game-based environments, and task execution in the custom Minecraft testbed; the three components used in constructing the task potential part of the model were intended to capture these facets. Ability to navigate and recall pathing was captured using the Santa Barbara Sense of Direction (SSOD), a validated measure of spatial navigation³² and predictive of ability to successfully learn and navigate both real and virtual environments³³. Comfort and familiarity with task completion in game-based environments was captured using a Video Game Experience Measure (VGE; see Appendix B of the Study 3 Preregistration in:²⁹) which targeted video gaming specific experience and skills related to Minecraft and the USAR gamified task. The third component of the task potential part of the model was more task specific. It was captured through a timed, in-game Competency Test (Comp). Although somewhat task specific, the behaviors were fairly generic in the Minecraft game environment; that is, this was a behavioral measure where each player had to individually complete a task battery requiring that they execute essential game actions necessary to complete the task in the Minecraft testbed (e.g., breaking walls).

The other half of the player profile model, the teamwork potential profile, consists of a set of team generic competencies related to collaboration and interpersonal competencies. This was built to capture an individual’s ability to discern mental states/emotions, group interaction tendencies, and collective engagement and grouping behavior. Ability to discern mental states/emotions was captured using the Reading the Mind in the Eyes Test (RMET). This is a validated measure designed initially to detect subtle deficits in ToM in adults with high-functioning autism, and has also been related to neurotypicals’ ability to make mental state attributions³⁴. Interaction tendencies were captured through the Sociable Dominance scale (SD), a validated measure of sociable dominance in individuals that can predict social interactions; for example, it has been found that individuals high in sociable dominance tend to use reasoning and direct communication strategies with others³⁵. Finally, collective engagement and grouping behaviors were captured using the Psychological Collectivism scale (Collectivism), a validated measure of attitudes individuals have about working in groups, including preferences for being in a group, concern for the group, and whether they tend to comply with group norms and rules³⁶.

Individuals were categorized as high or low in teamwork potential and in taskwork potential based on the combination of their scores on the measures related to that part of the model (see Fig. 1). Specifically, if an individual scored higher on two out of the three measures they would be classified as high, and if they scored low on two out of the three measures they would be classified as low. Possible profile groups included: low taskwork - low teamwork, high taskwork - low teamwork, low taskwork - high teamwork, and high taskwork - high teamwork potential. To provide a concrete example, an individual who scored above the sample median on Video Game Experience measure and above median on the SSOD, would be classified as ‘high’ on task potential. And if that individual scored lower than the median on Sociable Dominance and lower on Reading the Mind in the Eyes, they would be classified as “low” on team potential (see Fig. 2 for examples). Thus, their combined profile would be high taskwork - low teamwork potential.

Team holistic profile formulation

Team profiles were constructed from the combination of individual team member profiles through modal analysis (e.g., the most prevalent taskwork and teamwork potential profiles determined the overall team profile). This allowed us to categorize each entire team as low or high taskwork potential and low or high on teamwork potential in a similar manner to the player classifications. For example, if a team consisted of one player that was categorized as high taskwork but low teamwork, a second that was low taskwork low teamwork, and a third that was classified as high taskwork high teamwork, they would then be collectively categorized as high taskwork (2 of 3 representatives) low teamwork (2 of 3 representatives). Further, to provide more insight into the predictiveness of the profiling technique we additionally provide analyses focused on the far ends of the classification spectrum (e.g., teams that were classified as low taskwork - low teamwork or as high taskwork - high teamwork). These groups were selected for additional analysis because this approach to holistically profiling teams is novel, and we hypothesize that there may be interactions between the teamwork and taskwork profiles that would make interpretation of the low taskwork - high teamwork, high taskwork - low teamwork teams unclear at best.

Artificial social intelligence agents overview

The six ASI agents in this study were individually developed by ASIST program performers/teams, and were instantiated with a performer-defined AToM, implemented as computational models of team attributes, teamwork processes, and their impact on effects in teams (see²⁹ for these descriptions in the study preregistration). The ASI agents acted in an advisory role to support their three human team members with teaming behaviors and engaging in teamwork in the experimental tasks. The agents were required to adhere to certain constraints to maintain the primary goal of supporting teamwork processes rather than taskwork – to this end, the agents were not embodied in the virtual Minecraft-based simulated USAR testbed and were not able, or allowed, to engage in any of the taskwork in this study. The ASI were also constrained with respect to their knowledge of the environment so that it would comparably be realistic in the real world. Specifically, they were not given omniscient knowledge of where every victim was, what the best route to a particular location would be, or where threats (e.g., risk of further building collapse) may exist. However, the ASI were able to observe the actions the human team members took in the virtual environment, see the externalized cognitive artifacts (e.g., marker blocks), and see the field of view for each team member to allow them to perceive what each person has seen or encountered in the environment. This allowed the ASI agents to make inferences about the human team members’ beliefs, intentions, goals, and knowledge based on the observable actions taken in the environment. The different agents used various approaches to this, some utilizing Bayesian approaches to model human perspectives, short-term and long-term planning, aspects of human cognition such as workload or emotion, and track multiple hypotheses³⁷. Other agents use internal models that estimate participant knowledge of relevant known entities spatially through a 2D representation of the experimental environment, using neural network prediction models developed on human-annotated and simulated player data to make inferences and predictions based on accumulating knowledge of, and modeling, each individual team member over time and the team as a whole³⁸. Additionally, the agents were provided access to all surveys taken by participants before completing the experimental task, as well as various analytic components (ACs) that were developed independently by performer teams. To varying degrees, the ACs were developed based upon team theory and designed to augment the ASI architecture. As described, our AC was developed to help the ASI understand player profiles. Other research teams based their ACs on, for example, leadership theory. For example, one AC used pre-experiment surveys and in-game data to identify emergent leadership in human players³⁹. This could then be used by ASI to determine where to direct leader relevant interventions.

The player profiles we described above were implemented in the testbed as an analytic component with the intent to provide machine-readable input about the players, and the team as a whole, through the quantified profile model. The player profile analytic component read the survey and gameplay data used in the model, calculated the profile for each individual and the team overall, and published the player profile model’s output to the message bus used by the agent to receive testbed, survey, and component data. The player profile models were developed during prior program studies (see^25,40) before the inclusion of the ASI agents in the present study. The profiles afforded the ASI the ability to interpret the measures and gameplay data used in the profile construction in the context of the theoretical grounding used to develop the profiles over the course of the ASIST program. The ASI agents integrated the player profile data into their internal models to help inform their predictions, allowing the ASI to consider an individual’s potential capacity for teaming and tasking behaviors into their existing models and calculations. Specification of the technical integration of the player profile models, or any analytic component used in this study, into the various ASI agents AToM and cognitive architectures is beyond the scope of this paper. Interested readers are referred to the publicly available dataset, which also contains all of the code and documentation for the agents used in this study, and for the documentation of the player profile analytic component (see²⁷). Additionally, this study did not manipulate the provision of the player profiles or any other information to the ASIs as a variable. All ASI agents were provided the same access to the testbed, survey, and analytic component output data. Thus, because we were not able to manipulate the provision of profiles within agents and teams, we are unable to comment on the particular impact the profiles had on agent interventions and determinations provided to teams. Rather, our analyses focus on the player profile models predictive power with regards to the experimental task measures and perceptions of the ASI. This allows us to comment on the utility of the a priori information being provided to actual artificial social intelligence agents through the player profile model, and whether they provided the ASI with data that is indicative of overall and specific task performance.

USAR task metrics

Performance on the USAR task was tracked along multiple dimensions. Each of these measures are considered to reflect successful taskwork, successful teamwork, or a combination of taskwork and teamwork.

At the individual level, each role that participants could be assigned was associated with a set of unique or optimal taskwork functions that they could perform as well as a set of teamwork functions in which they could engage to support coordination. Given the nature of the testbed, measures of teamwork were relatively difficult to track because, as mentioned above, human teammates communicated through voice comms, and the natural language associated with those exchanges has not yet been fully processed and analyzed by our team. Accordingly, the measures we employ for this article often are taskwork focused or have taskwork woven into their execution, but several either have teamwork components or are entirely reflective of teamwork actions (see Table 1).

Table 1 Outcome measures related to the execution of taskwork and teamwork elements of teams’ simulated missions.

Full size table

Study sample

This study involved 113 three-person teams completing two 17-minute gamified urban search and rescue missions implemented in Minecraft. All participants engaged in the study remotely from an internet connected computer of their choice and were overseen by experimenters at Arizona State University (see the Study 3 Preregistration in: ²⁹) who carried out the methods in accordance with the approved protocol, and relevant guidelines and regulations. All participants in this study reviewed and completed an informed consent form prior to participation. The study was reviewed and approved by the Arizona State University Institutional Review Board. The data sharing agreement and approval for data analysis by researchers at the University of Central Florida were overseen, reviewed, and approved by the University of Central Florida Institutional Review Board. All local approvals were further submitted to the Army Human Research Protections Office (AHRPO) for supplemental review.

Analysis populations

The analyses reported here relate to multiple different groups of participants as a function of the focus on individual versus team outcomes, the availability of complete data for player profiling, the availability of complete data for team profiling, and the exclusion of groups to support the inspection of outcomes related to ASI advisors specifically. More detailed study information on this study and the data repository can be found at²⁸ and²⁹ respectively. See Table 2 below for demographics descriptives for each subset of the data.

Table 2 Descriptive features of each dataset analyzed in the following results section.

Full size table

Results

We report the results of our analyses in two parts: first, we examined the outcomes related to individual players’ profiles and their respective performance as teammates, and second, we examined the team-level outcomes with particular attention to the impacts of teaming with an autonomous, socially intelligent agent. At the individual player level, we tested sets of hypotheses related to the interaction of the teamwork and taskwork components of our player profiling approach. Those interactions were also tested at the team level, but, in addition, we tested the interaction of teams’ holistic profiles with the type of advisor that assisted them in their missions. It is important to note that we only considered players’ and teams’ performances and outcomes associated with the second mission that they completed (see Methods), and that analyses focused on the artificial socially intelligent agents necessarily incorporated only those teams which interacted with them. Full details of the analyses reported here including descriptive statistics are available in the supplementary information (see file: Supplementary Information.docx, also available at: ²⁵).

Shapiro-Wilke Tests were run for the dependent variables, and though the results were significant for a subset, inspection of the Kurtosis and Skewness ratios indicated that these deviations were small (e.g., no more than a ±4 Skewness or Kurtosis ratio, which is greater in magnitude than the ±2 standard, but is not extreme deviation) and we proceeded with analysis of variance as our sample sizes are generally considered sufficient to ensure that ANOVAs are likely to be robust to those assumption violations. All of the following analyses employ an alpha cutoff of p < 0.05, and interpretations of effect size are based on field standards such that η² < 0.06 is considered a small effect, between 0.06 and 0.14 and is considered a moderate effect, and η² > is considered a large effect⁴¹.

Individual player profiles

Our primary hypotheses regarding the player profiles were that increased individual taskwork potential would be associated with improved performance of the taskwork allocated to a given player’s role and that increased individual teamwork potential would be associated with improved performance of tasks that required communication, coordination, and joint action with team members (see Table 3). Additionally, we hypothesized that players demonstrating increased teamwork potential would report more positive perceptions of their teams and their advisors (see Table 4). Unless otherwise stated, the analyses reported here used α = 0.05 cut-off. To test performance on individual’s taskwork across the individual teamwork and taskwork potential profiles, a two-way ANOVA was conducted with each of the role-respective measures.

Table 3 Individual profiles: teamwork measure analyses. These results demonstrate that the taskwork and teamwork dimensions of the profiles separately predict differences in individual task and team activity performance.

Full size table

Table 4 Perceptions of advisors as a function of player profile dimensions. Note that row 1 reflects all players that received advisement (either from human or ASI advisors), and row 2 and 3 reflect players that worked specifically with an ASI. These findings highlight that the profile dimensions (taskwork and teamwork potential) predict differences in players’ overall team process perceptions, and that the two dimensions interact to predict differences in perceptions of ASI advisors.

Full size table

Individual player profiles and performance

For measures of individual taskwork-teamwork performance (e.g., Medic taskwork-teamwork measure 1, Transporter taskwork-teamwork measure 1) the effect of individual teamwork potential profile was found to be statistically significant, indicating that participants with high teamwork potential profiles performed better on taskwork-teamwork measures as compared to those with low teamwork potential profiles (see Table 3). Additionally, for one of these measures (Medic measure 1, critical victims healed) individual taskwork potential profile was found to be significant with a moderate effect size such that participants with high taskwork potential outperformed those with low taskwork potential. In contrast, taskwork potential profile was not found to be predictive of the taskwork measure for one of the roles (Engineer taskwork measure 1). Last, there was no significant interaction effect between individual taskwork potential profile and individual teamwork potential profile on the individual performance measures which indicate that teamwork potential and taskwork potential may have influenced wholly separate components of teammates’ tasking and coordinative capacity.

Individual player profiles and perceptions

Outcomes of ANOVAs conducted on individual participants’ perceptions of their team processes and, separately, their ratings of their ASI advisors showed an interesting interplay between the teamwork and taskwork profile dimensions. To be clear, the analysis of team processes incorporates all teams that were profiled whereas the reflections on ASI teammates includes only teams that worked with an artificial socially intelligent advisor. For perceptions of team process, results showed that both taskwork and teamwork potential profiles predicted team process ratings; however, there was not an interaction between the two, indicating that individuals who were higher in taskwork potential viewed their team’s processes more positively and those higher in teamwork potential viewed their processes more positively but that the two did not combine to yield substantially higher ratings (or lower ratings in the case of low potentials).

Participants’ ratings of their advisors were not associated with a particular pattern of results overall. No significant effects were found for taskwork potential or teamwork potential on participants’ ratings of whether they viewed their artificial socially intelligent advisor to be dependable. In contrast, a significant effect was found for the interaction of taskwork potential and teamwork potential for ratings of whether participants viewed their ASI advisor as reasonable. Notably, participants with a high teamwork potential profile but low taskwork potential profile rated their advisors most highly, whereas those with a low teamwork potential profile and a low taskwork potential profile rated their advisor least highly (see Fig. 3). This suggests that having low taskwork potential coupled with high teamwork potential altered how one valued the ASI contributions.

Team Profiles

Our prior section examined how profiles, at the individual level, were related to performance. In this section, we describe analyses where those profiles are combined to the team level. That is, here we examine the predictive utility of emergent team profiles generated from modal analysis of the profiles of each of the players on a given team (see Methods). First, we tested the effects and interactions between the teamwork and taskwork components of the team profiles (see Table 5), and second, we examined the effect of team profiles specifically when teams worked with an artificial socially intelligent agent (see Table 6), and then as they manifested with each category of advisor (no advisor, human advisor, and artificial socially intelligent advisors; see Table 7).

Table 5 Analyses at the team level show that the interaction of the taskwork and teamwork dimensions of the profile are predictive of performance differences across teams. Particularly, those teams higher in taskwork and teamwork potential demonstrated improved performance outcomes.

Full size table

Table 6 Team taskwork and teamwork potential: team process and ASI perceptions. Similar to the individual level analysis of ASI perceptions, teams that were collectively lower in taskwork potential but high in teamwork potential demonstrated greater ratings of the ASI advisors.

Full size table

Table 7 Team holistic profiles and advisor types: mission performance measures. These results highlight the critical finding that ASI advisors significantly improved the performance of low taskwork, low teamwork potential teams.

Full size table

Team profiles and performance

Outcomes of a series of ANOVAs testing the effects of teamwork and taskwork potential profiles at the teams level revealed a consistent pattern of results demonstrating that the interaction of teamwork and taskwork profiles was predictive of team performance outcomes. Although the teamwork-taskwork combined measures employed for evaluating teams’ performances (see Table 5) showed no statistically significant main effect of taskwork profile nor of teamwork profile, each of the three revealed a significant and small-to-moderately sized effect for the taskwork teamwork profile interaction. As anticipated, these analyses revealed that teams that were categorized as high in taskwork as well as high in teamwork potential performed best overall (see Fig. 4). As predicted, for most measures, teams that were low in taskwork and teamwork potential performed worst. But, for one measure (mission score %), teams low in taskwork potential but high in teamwork potential performed worst whereas teams high in both performed best.

Team profiles and perceptions

Similar to the pattern of results for team performance, results of ANOVAs conducted to test the effects of team profiles on perceptions of team process and ASI advisors revealed that the interaction of taskwork profile and teamwork profile predicted differences in participants responses. Again here, the analysis of team processes incorporates all teams that were profiled whereas the reflections on ASI teammates includes only teams that worked with an artificial socially intelligent advisor. For these analyses, there were no main effects of either taskwork potential or teamwork potential (see Table 6). Notably, although the interaction effect was significant, the pattern of responses across profiles was different than anticipated. Instead of high taskwork and high teamwork teams responding most positively, the most positive responses were provided by teams that were low in taskwork potential but high in teamwork potential. This pattern is the opposite of what was found in the analyses of team performance in which low taskwork high teamwork teams were found to have generally performed worst.

Interactions between advisor types and teams’ profiles (low-low, high-high profiled teams)

The prior section demonstrated the utility of our profiling technique when it comes to showing associations between team and task potential and various process and performance measures. We turn next to our examination of the relationship between profiles and process and performance depending on the nature of advising provided the teams. Particularly, we analyze how team process and performance was affected by the presence of either a human or machine advisor imbued with artificial social intelligence. To forecast our findings and provide context to the reader, the primary outcome that these analyses highlight is a positive impact of ASI advisors on low potential teams.

Following the approach described in the most recent section, these analyses tested the effects and interactions between teams’ holistic profiles (e.g., the combination of their taskwork potential and teamwork potential classifications) and the type of advisor with which they worked during their missions (see Table 7). Note that for the following analyses only teams at the extreme ends of the profile dimensions were examined, specifically teams that were categorized as High Taskwork, High Teamwork potential and those categorized as Low Taskwork, Low Teamwork potential.

Outcomes from ANOVAs conducted to examine the main effects of team holistic profiles and advisor types on team performance measures revealed a consistent, moderately strong main effect of team holistic profile. Analysis of teamwork measure 1 (risk management failure costs) showed a strong main effect of advisory type such that teams that did not work with an advisor were performed worse at risk management (i.e., poorer at communicating threat presence knowledge). Analyses of taskwork-teamwork measure 1 and teamwork measure 2 (mission score %, knowledge externalization) both showed significant and strong interaction effects between profiles and advisor types. Over all of the mission performance measures, teams that were high in taskwork and teamwork potential performed best (or most appropriately, in the case of teamwork measure 2: knowledge externalization) by a large margin as compared to teams that were low in taskwork and teamwork potential. The effect of holistic profile was generally augmented by the presence of an advisor, and most strongly when teams worked with a human advisor. Importantly, the presence of an ASI advisor helped teams low in team and task potential as much as the human advisor when compared to the no advisor condition. Teams that were high in taskwork and teamwork potential that worked with a human advisor performed best by a moderate degree compared to most other groups, and a small margin compared to teams that were high in taskwork and teamwork that worked with an artificial social intelligence advisor (Figs. 5, 6).

Discussion

This study set out to understand how profiling individuals on teams based on task and team potential, are predictive of team performance and how profiles differently predict team outcomes dependent on advisors who were either human, or agents imbued with artificial social intelligence. The results largely supported our primary hypothesis that individuals and teams could be profiled based on psychometric, psychographic, and brief competency test data and that higher potential on the taskwork and teamwork components of those profiles would be associated with improved performance. Our findings revealed that teamwork profiles were predictive of differences in measures that involved teamwork elements as well as some that were considered to involve primarily taskwork elements. These were also influential with respect to participants’ perceptions of their team and the artificial socially intelligent advisors with whom they worked. Taskwork profiles were also found to be predictive of some differences, though primarily on measures that were considered to be a combination of taskwork-teamwork elements. Both taskwork and teamwork profiles were found to interact to predict differences in several measures of individual and team performance as well as of perceptions of team process and ASI dependability and reasonableness.

Most importantly, our findings demonstrated that it will be critical to assign artificial, socially intelligent advisors to teams that are likely to benefit from their assistance. Inspection of the effect of advisor type in the context of low taskwork, low teamwork versus high taskwork, high teamwork potential teams revealed that only the former benefited from the assistance of ASI teams; however, the performance of the low potential teams was significantly improved by the advice of the ASIs. Relevant to selecting teams that may be appropriate for human-agent teaming, our profile analyses were able to detect differences in team behaviors as a function of their profiles and advisement. Specifically, outcomes of our analysis of the interaction between teams’ holistic profiles and the nature of the advisor with which they worked (e.g., no advisor, a human advisor, an ASI advisor) revealed that those holistic profiles were significantly and moderately-to-strongly predictive of differences in team performance. Additionally, profiles interacted with their advisor type such that profiles manifested differently when teams worked with different advisors. Particularly, teams tended to perform best when they had high potential profiles and worked with a human advisor, and worst when they had low potential profiles and did not have an advisor. Teams that had high potential profiles and worked with an ASI advisor performed near, but not quite at the level of high potential teams that worked with a human advisor. Most relevant to consideration of how human-centered artificial intelligence can improve applications, we showed how ASI improved performance for the low task – low team potential profiles. This suggests that our profiling approach can be a useful component in the development of intelligent systems because they focus on human capabilities and potential; that is, our profiles can provide artificial intelligence with information enabling a more accurate theory of mind of their human teammates.

Finally, an important contribution of this work is that we combined traditional attitudinal and perception measures found in the Human-AI literature and human-computer interaction literature with objective measures of process and performance. Further, unlike the majority of current studies on AI that use canned/scripted responses or “Wizard of Oz” setups where humans play the role of AI, our participants interacted with actual artificial intelligence. Thus, the profiling approach we developed and tested can be adapted across task domains to further research on how AI imbued with social intelligence affects both perception and process. There are some known limitations associated with this work primarily stemming from the experimental control and clarity of measures associated with a testbed that can support remote interactions between three remotely located human team members as well as an artificial socially intelligent agent. First, the USAR task and testbed represent a novel tasking environment developed by a team of SMEs to examine these and similar research questions, but they are relatively untested and not deeply explored. Ideal team behaviors in this task are unknown, as are optimal individual behaviors, and the full range of factors that may influence behaviors. Due to practical limitations in data collection, the sample size for human teams and no-advisor teams is also relatively small (only 14 each), and not all of the individuals on these teams provided complete data for profiling. Lastly, there were multiple ASIs that were paired with teams in this experiment, and each behaved somewhat differently with respect to their development of artificial theory of mind and the leveraging of that building block to assist teams. The sample size of complete data for each individual ASI is also small. As a result, some analyses that we would be interested in conducting are not feasible due to the relatively random outcomes regarding the profiles of individuals and teams that worked with each ASI.

In sum, this research examined an approach for helping artificial socially intelligent agents understand their human team members and begin to develop artificial theory of mind. We consider these to be fundamental building blocks required for the development of AI that can operate as effective teammates by attending to the social as well as the functional aspects of working on a hybrid team. Our player profiles are grounded in social science theory, and the findings presented here provide evidence that they offer meaningful insight into behavioral differences between participants. We also showed that the player profiles were useful for understanding the differences between participants’ perceptions of their team process as well as of the human and AI-enabled advisors from whom they received advice while completing missions. Further, we demonstrated that the individual player profiles can be used to construct team profiles, which are, themselves, predictive of team success with respect to overall performance as well as more nuanced dimensions such as coordination, error rate, and risk management success. Perhaps most importantly, we found that team profiles interacted with the type of advisor working with teams such that there were differences in performance, knowledge externalization behaviors, and perceptions of team process depending on whether teams worked with no advisor, a human advisor, or an artificial socially intelligent advisor. Critically, we found that ASI advisors were able to elevate performance of teams low in taskwork and teamwork potential such that they performed as well as if they had a human advisor. This suggests useful applications for both the profile approach and AI implementation in that artificial social intelligence may be best applied in situations for teams low in team and task potential. It may be worth noting that ASI advisors were always rated relatively more negatively than human advisors (note limitations), though not low relative to the provided scales. This opens up an important area of research in that there may be many reasons for this including the interaction modalities employed by advisors, the content and style of the advice provided by advisors, and because some teams performed worse when working with ASI advisors. Currently, it is unclear whether that outcome is causally linked to the ASIs, is an artifact of the interaction modality, the sample, or may be associated with differences in the missions performed by teams working with no advisor, the human advisor, or one of the ASI advisors.

Data availability

All data used in the present manuscript may be found at https://doi.org/10.48349/ASU/QDQ4MH²⁹. The data files employed for the analyses reported here as well as information regarding the preregistered hypotheses tested in this manuscript can be found at osf.io/t26kd ²⁵.

References

McNeese, N. J., Demir, M., Cooke, N. J. & Myers, C. Teaming with a synthetic teammate: Insights into human-autonomy teaming. Hum. Factors 60(2), 262–273 (2018).
Article PubMed Google Scholar
Misuraca, G., van Noordt, C., & Boukli, A. The use of AI in public services: Results from a preliminary mapping across the EU. In Proceedings of the 13th International Conference on Theory and Practice of Electronic Governance, Athens, 90–99 (2020). https://doi.org/10.1145/3428502.3428513
Musick, G., O’Neill, T. A., Schelble, B. G., McNeese, N. J. & Henke, J. B. What happens when humans believe their teammate is an AI? An investigation into humans teaming with autonomy. Comput. Hum. Behav. 122, 106852 (2021).
Article Google Scholar
Phillips, E., Ososky, S., Grove, J. & Jentsch, F. From tools to teammates: Toward the development of appropriate mental models for intelligent robots. Proc. Hum. Factors Ergonom. Soc. Annu. Meet. 55(1), 1491–1495. https://doi.org/10.1177/1071181311551310 (2011).
Article Google Scholar
Cuevas, H. M., Fiore, S. M., Caldwell, B. S. & StRAtER, L. Augmenting team cognition in human-automation teams performing in complex operational environments. Aviat. Space Environ. Med. 78(5), B63–B70 (2007).
PubMed Google Scholar
Fiore, S. M. et al. Toward understanding social cues and signals in human-robot interaction: Effects of robot gaze and proxemic behavior. Front. Psychol. 4, 859. https://doi.org/10.3389/fpsyg.2013.00859 (2013).
Article PubMed PubMed Central Google Scholar
Wiltshire T. J., Lobato E. J., Velez J., Jentsch F., & Fiore S. M. An interdisciplinary taxonomy of social cues and signals in the service of engineering robotic social intelligence. In Unmanned Systems Technology XVI (International Society for Optics and Photonics, 2014). https://doi.org/10.1117/12.2049933
Best, A., Kapalo, K. A., Warta, S. F., & Fiore, S. M. Clustering social cues to determine social signals: Developing learning algorithms using the "n-most likely states" approach. In Unmanned Systems Technology XVIII (Vol. 9837, 187–201). SPIE (2016).
Joo H., Simon T., Cikara M., & Sheikh Y. Towards social artificial intelligence: Nonverbal social signal prediction in a triadic interaction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 10873–10883 (2019).
Ozmen Garibay, O. et al. Six human-centered artificial intelligence grand challenges. Int. J. Hum. Comput. Interact. 39(3), 391–437 (2023).
Article Google Scholar
Oguntola, I., Hughes, D., & Sycara, K. Deep interpretable models of theory of mind. In 2021 30th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN) (Vancouver: IEEE), 657–664 (2021). https://doi.org/10.1109/RO-MAN50785.2021.9515505
Williams, J., Fiore, S. M. & Jentsch, F. Supporting artificial social intelligence with theory of mind. Front. Artif. Intell. 5, 763 (2022).
Article Google Scholar
Chatzimparmpas, A. et al. The state of the art in enhancing trust in machine learning models with the use of visualizations. Comput. Graph. Forum 39(3), 713–756 (2020).
Article Google Scholar
Akula, A. R., Liu, C., Saba-Sadiya, S., Lu, H., Todorovic, S., Chai, J. Y., & Zhu, S. C. X-tom: Explaining with theory-of-mind for gaining justified human trust (2019). arXiv preprint arXiv:1909.06907.
Vinanzi, S., Patacchiola, M., Chella, A. & Cangelosi, A. Would a robot trust you? Developmental robotics model of trust and theory of mind. Philos. Trans. R. Soc. B 374, 20180032. https://doi.org/10.1098/rstb.2018.0032 (2019).
Article Google Scholar
Mathieu, J. E., Luciano, M. M., D’Innocenzo, L., Klock, E. A. & LePine, J. A. The development and construct validity of a team processes survey measure. Organ. Res. Methods 23(3), 399–431 (2020).
Article Google Scholar
Lyons J. B. Being transparent about transparency: A model for human-robot interaction. In 2013 AAAI Spring Symposium Series, Stanford, CA (2013).
Bendell, R., Williams, J., Fiore, S. M., & Jentsch, F. Supporting social interactions in human-AI teams: Profiling human teammates from sparse data. In Proceedings of the Human Factors and Ergonomics Society Annual Meeting (Vol. 65, No. 1, 665–669). (SAGE Publications, 2021).
Bakkes, S. C., Spronck, P. H. & van Lankveld, G. Player behavioural modelling for video games. Entertain. Comput. 3(3), 71–79 (2012).
Article Google Scholar
Jiang, J., Maldeniya, D., Lerman, K. & Ferrara, E. The wide, the deep, and the maverick: Types of players in team-based online games. Proc. ACM Hum. Comput. Interact. 5(CSCW1), 1–26 (2021).
Google Scholar
Nascimento Junior, F. F. D., Melo, A. S. D. C., Da Costa, I. B., & Marinho, L. B. Profiling successful team behaviors in league of legends. In Proceedings of the 23rd Brazilian Symposium on Multimedia and the Web, 261–268 (2017).
Weidmann, B. & Deming, D. J. Team players: How social skills improve team performance. Econometrica 89(6), 2637–2657 (2021).
Article Google Scholar
Tang, K. H. D. Personality traits, teamwork competencies and academic performance among first-year engineering students. High. Educ. Skills Work Based Learn. 11(2), 367–385 (2021).
Article Google Scholar
Morgeson, F. P., Reider, M. H. & Campion, M. A. Selecting individuals in team settings: The importance of social skills, personality characteristics, and teamwork knowledge. Pers. Psychol. 58(3), 583–611 (2005).
Article Google Scholar
DARPA. Artificial Social Intelligence for Successful Teams (ASIST). (2019). Retrieved from https://www.darpa.mil/program/artificial-social-intelligence-for-successful-teams
Artificial Social Intelligence for Successful Teams. Artificial Social Intelligence for Successful Teams (ASIST) (2023). https://artificialsocialintelligence.org/
Bendell, R., Williams, J., Fiore, S. M., & Jentsch, F. University of Central Florida: ASIST Study 3 Findings (2023). Retrieved from osf.io/t26kd
Huang, L., Freeman, J., Cooke, N., Colonna-Romano, J., Wood, M. D., Buchanan, V., & Caufman, S. J. Exercises for Artificial Social Intelligence in Minecraft Search and Rescue for Teams (2022). https://doi.org/10.17605/OSF.IO/JWYVF.
Huang, L. et al. Artificial social intelligence for successful teams (ASIST) Study 3 (ASU library research data repository; V4) [data set, study procedure and materials]. ASU Libr. Res. Data Repos. https://doi.org/10.48349/ASU/QDQ4MH (2022).
Article Google Scholar
Fiore, S. M. & Wiltshire, T. J. Technology as teammate: Examining the role of external cognition in support of team cognitive processes. Front. Psychol. 7, 1531 (2016).
Article PubMed PubMed Central Google Scholar
Corral, C. C., Tatapudi, K. S., Buchanan, V., Huang, L., & Cooke, N. J. Building a synthetic task environment to support artificial social intelligence research. In Proceedings of the Human Factors and Ergonomics Society Annual Meeting (Vol. 65, No. 1, 660–664) (SAGE Publications, 2021).
Hegarty, M., Richardson, A. E., Montello, D. R., Lovelace, K. & Subbiah, I. Development of a self-report measure of environmental spatial ability. Intelligence 30(5), 425–447 (2002).
Article Google Scholar
Carbonell-Carrera, C., Gunalp, P., Saorin, J. L. & Hess-Medler, S. Think spatially with game engine. ISPRS Int. J. Geo-Inf. 9(03), 159 (2020).
Article Google Scholar
Baron-Cohen, S., Jolliffe, T., Mortimore, C. & Robertson, M. Another advanced test of theory of mind: Evidence from very high functioning adults with autism or Asperger syndrome. J. Child Psychol. Psychiatry 38(7), 813–822 (1997).
Article CAS PubMed Google Scholar
Kalma, A. P., Visser, L. & Peeters, A. Sociable and aggressive dominance: Personality differences in leadership style?. Leadersh. Q. 4(1), 45–64 (1993).
Article Google Scholar
Jackson, C. L., Colquitt, J. A., Wesson, M. J. & Zapata-Phelan, C. P. Psychological collectivism: A measurement validation and linkage to group member performance. J. Appl. Psychol. 91(4), 884 (2006).
Article PubMed Google Scholar
Robertson, P., Cerys, D., Shrobe, H., & Katz, B. DOLL/MIT Study 2 Capability Preregistration (2021). https://doi.org/10.17605/OSF.IO/E8329
Sycara, K., Lewis, M., & Hughes, D. CMU-RI Study 3 Preregistration (2022). https://doi.org/10.17605/OSF.IO/YJ52E
Davoodi, T., Diego-Rosell, P., Maese, E., & Debusk-Lane, L. Gallup Study 3 Results (2022). https://doi.org/10.17605/OSF.IO/9ZE2M
Williams, J., Bendell, R., & Fiore, S. M. UCF TA2 - ASIST Study 2 Results Registration (2021). https://doi.org/10.17605/OSF.IO/K49H3
Lakens, D. Calculating and reporting effect sizes to facilitate cumulative science: A practical primer for t-tests and ANOVAs. Front. Psychol. 4, 863. https://doi.org/10.3389/fpsyg.2013.00863 (2013).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This material is based upon work supported by the Defense Advanced Research Projects Agency (DARPA) under Contract No. W911NF-20-1-0008. Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of DARPA or the authors’ affiliated University.

Author information

Authors and Affiliations

Team Performance Laboratory, Institute for Simulation and Training, University of Central Florida, Orlando, FL, 32816, USA
Rhyse Bendell, Jessica Williams & Florian Jentsch
Cognitive Sciences Laboratory, Institute for Simulation and Training, University of Central Florida, Orlando, FL, 32816, USA
Stephen M. Fiore
Department of Philosophy, University of Central Florida, Orlando, FL, 32816, USA
Stephen M. Fiore
Department of Psychology, University of Central Florida, Orlando, FL, 32816, USA
Rhyse Bendell & Florian Jentsch
School of Modeling, Simulation, and Training, University of Central Florida, Orlando, FL, 32816, USA
Jessica Williams

Authors

Rhyse Bendell
View author publications
Search author on:PubMed Google Scholar
Jessica Williams
View author publications
Search author on:PubMed Google Scholar
Stephen M. Fiore
View author publications
Search author on:PubMed Google Scholar
Florian Jentsch
View author publications
Search author on:PubMed Google Scholar

Contributions

R.B., J.W., and S.F. conceived of the models tested in this manuscript. RB handled the preparation of data and analysis. R.B. and S.F. participated in the interpretation of data. R.B. and J.W. prepared this manuscript. S.F. and F.J. supervised and advised the writing of this manuscript. All authors provided critical feedback and helped shape this research. All authors contributed to the article and approved the submitted version.

Corresponding author

Correspondence to Rhyse Bendell.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1. (download CSV )

Supplementary Information 2. (download CSV )

Supplementary Information 3. (download CSV )

Supplementary Information 4. (download CSV )

Supplementary Information 5. (download CSV )

Supplementary Information 6. (download CSV )

Supplementary Information 7. (download CSV )

Supplementary Information 8. (download CSV )

Supplementary Information 9. (download CSV )

Supplementary Information 10. (download CSV )

Supplementary Information 11. (download CSV )

Supplementary Information 12. (download CSV )

Supplementary Information 13. (download CSV )

Supplementary Information 14. (download CSV )

Supplementary Information 15. (download CSV )

Supplementary Information 16. (download CSV )

Supplementary Information 17. (download DOCX )

Supplementary Information 18. (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bendell, R., Williams, J., Fiore, S.M. et al. Individual and team profiling to support theory of mind in artificial social intelligence. Sci Rep 14, 12635 (2024). https://doi.org/10.1038/s41598-024-63122-8

Download citation

Received: 26 July 2023
Accepted: 23 May 2024
Published: 02 June 2024
Version of record: 02 June 2024
DOI: https://doi.org/10.1038/s41598-024-63122-8

This article is cited by

Digital humanistic program to manage premature frailty in young breast cancer survivors with gender perspective
- Yun Hu
- Joshua Wiley
- Eun-Ok Im
npj Digital Medicine (2025)

Subjects

Abstract

Similar content being viewed by others

Introduction

Artificial theory of mind

Profiling to support artificial theory of mind development

Methods

Simulated USAR missions

Medic role

Engineer role

Transporter role

Materials and measures

Individual player profiles

Team holistic profile formulation

Artificial social intelligence agents overview

USAR task metrics

Study sample

Analysis populations

Results

Individual player profiles

Individual player profiles and performance

Individual player profiles and perceptions

Team Profiles

Team profiles and performance

Team profiles and perceptions

Interactions between advisor types and teams’ profiles (low-low, high-high profiled teams)

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links