Multiscale synchronisation dynamics reveals the impact of an improvisatory approach to performance on music experience

Nozawa, Takayuki; Sas, Madalina I.; Dolan, David; Rajpal, Hardik; Rosas, Fernando E.; Timmermann, Christopher; Mediano, Pedro A. M.; Honda, Keigo; Amano, Shunnichi; Miyake, Yoshihiro; Jensen, Henrik J.

doi:10.1038/s41598-025-90271-1


Article
Open access
Published: 24 March 2025


Takayuki Nozawa^1,2,
Madalina I. Sas³,
David Dolan⁴,
Hardik Rajpal³,
Fernando E. Rosas^3,5,
Christopher Timmermann³,
Pedro A. M. Mediano³,
Keigo Honda¹,
Shunnichi Amano¹,
Yoshihiro Miyake¹ &
…
Henrik J. Jensen^1,3

Scientific Reports volume 15, Article number: 10097 (2025)


Abstract

Experiences of collective creative activities play an essential role in human societies, yet these experiences are particularly hard to capture, making their scientific study challenging. In a classical music concert-experiment performed by a string quartet, we contrast a Let-go performance mode, characterised by a more creative and improvisatory approach that encourages risk-taking and spontaneous expression, with a more Strict mode which requires adhering closely to the score, common in many Western classical music performance environments. We investigate the experience of audience members by analysing their subjective reports and movement patterns. Our results show that during performances in Let-go mode, movement synchronization was reduced between performers and audience members in shorter timescales, while the synchronization and its temporal variability were enhanced in longer timescales. Furthermore, these differences in the synchronization dynamics are predictive of changes in the audience’s perception of music. These results provide a first step towards the quantification of some of the fundamental aspects of collective music experiences. Specifically, the reported findings demonstrate the relevance of the often-neglected multiscale coordination between audiences and performers, and explain how this rich tapestry of physical behaviour is connected with the quality of the collective music experience.

Introduction

Collective activities involving shared experiences are ubiquitous in human culture. They are believed to play crucial roles in strengthening social bonds, sense of group belonging, and social cohesion^1,2. Empirical investigations have extensively explored the impact of interpersonal synchronization of physical activity on these social dynamics. Such synchronization is strongly associated with collective subjective experiences^3,4,5. These include feelings of unity and perceived social bonding⁶, and the enhanced group experience in shared social and ritual celebrations^7,8. Moreover, synchronization has been linked to positive objective outcomes such as the occurrence of face-to-face communication and the emergence of functional roles in various types of verbal and non-verbal interaction^9,10.

Among collective activities, music making and listening occupy an important place in all known human societies^11,12, and often reveal, even within unique, culture-specific approaches, universal elements of expressing and perceiving emotional cues¹³. As a joint activity, ensemble music making requires high levels of empathy^13,14,15, coordination, and synchrony¹⁶, which support the emergence of leadership¹⁷, improvisation^18,19, and group states of flow^20,21. Moreover, it is known to engage audiences in a participatory, reciprocal relationship with the performers^22,23.

Within musical praxis, musical improvisation is a very complex, creative and—when performed in an ensemble—highly social process, which requires years of training²⁴. Improvisation has widespread appeal, as evidenced by its varied expressions across different cultures and musical genres^25,26.

Yet, from the early 20th century until recently, the mainstream of Western classical music performance largely adhered to notation-based performance. Performers aimed to follow the score strictly and accurately, striving for the best and most expressive performance while avoiding spontaneous, improvisatory elements²⁷.

In the last three decades, the concept of re-integrating a more improvisational approach to music performance has been regaining attention in Western classical music research, teaching, and to some extent, practice. This can be seen, for example, in the creation, back in 2008, of the METRIC network: a project focusing on re-integrating improvisation in European conservatoires²⁸. This project included 18 actively involved, internationally renowned European conservatoires. Overall, compared to the last quarter of the previous century, there has been a vast increase in the volume of research focusing on the phenomenology, teaching and performance-related parameters of classical improvisation in Western art music. Importantly, several studies call into question the omission of improvisation from classical music performance and training, as improvisation has been shown to enhance the musical experience of both performers and audiences^29,30. However, this remains an exception rather than the norm in Western classical concert culture at large, unlike in several non-European music cultures such as Arabic and Indian art music.

We refer to the two performance modes described above as Strict and Let-go, respectively³⁰. Namely, the Strict mode aims to follow the written score closely, avoiding any gesture not directly indicated by the written text. In contrast, Let-go mode actively incorporates a creative, ‘beyond the score’ approach³¹ to interpretation with a real-time improvisatory concept that involves risk-taking and spontaneous expression^29,30. Strict and Let-go performance modes are the two ends of a continuum, in which the Let-go end represents an improvisational state of mind. This continuum enables us to assess different degrees of strictness vs. improvisatory approach. Daniel Leech-Wilkinson’s reference to the “norms” (and the policing applied to ensure their maintenance)³² is similar to what we refer to as Strict, while the other end, Let-go or improvisational state of mind, belongs to what Leech-Wilkinson refers to as “... and how to escape them”³². For young performers, participating in international music competitions involves adhering to these norms, in addition to technical mastery. While some manage to keep a personal voice and narrative, many choose to avoid any risk-taking, aim for precision in executing the written score, and keep spontaneity to a minimum. The association of the Strict mode of playing with international competitions came up during conversations with the musicians who participated in the concert experiment.

It is important to clarify what we mean by Let-go, and its place in the spectrum of improvisatory performance. Within the continuum of composition-improvisation used in many studies of improvisation²⁵, the focus is very often on the dimension of “what”—the notes being played, or the musical contents of the piece being played. In our study, we broaden the focus to explore the level of “how” of performance, which encompasses a wide spectrum of performance dynamics. This includes the interaction between performers and the audience, as well as the performers’ real-time decisions and adaptations. In particular, Let-go mode involves a unique state of flow²⁰ the performers find themselves in, and their phenomenology in the moment of performance. Contrary to the Strict mode in relation to performance, the “improvisational state of mind” exists within a flow-state characterised by risk-taking, more spontaneity and a more creative approach. The extra risk-taking and application of creativity does not necessarily have to result in changing any notes; it can be manifested in far-reaching changes—compared to the norms—of performance-related parameters: timing (tempi, rubati), dynamics (changes of loudness and how predictable or not these changes are), and timbre³⁰.

In the context of Western classical music, coordination of physical movements between performers has been investigated^33,34, yet previous studies have rarely examined whether and how physiological or movement synchrony occurs between music performers and their audiences, or among listeners themselves. Recently, it has been shown that audiences synchronise on physiological signals such as heart and respiration rate in many situations of music-listening^35,36,37,38,39. But regarding their physical motion, audiences of classical music are still mostly assumed to be passive and static, in contrast to audiences in other musical genres^40,41.

In this paper, we challenge this assumption and explore the effect of adopting an improvisational approach to performance on the collective motion of seated audience members attending a classical chamber music concert. For this purpose, we developed a concert-experiment where two classical repertoire pieces were each played twice, once in the Strict and once in the Let-go performance mode, in order to compare and contrast the two modes.

Although many psychological studies of music and improvisation tend to focus on short segments of a few measures being performed by many musicians^42,43, the current experimental design allows us to address the improvisational character of performance and study its effects on the musical experience more comprehensively in an ecological, naturalistic manner.

During the experiment, we measured the spontaneous movement of the audience. Although it may be subtle, we expect their physical activity to be linked to their experience of the music, as well as to the movements of performers. Thus, we hypothesise that the degree of movement synchrony in the audience reflects the way the audience perceives the different performance modes. Specifically, we hypothesise:

1. Let-go would be perceived by the audience as more innovative and improvisatory than Strict performances. This is in line with previous work^29,30, and here we aim to confirm the results with a larger group;
2. Let-go would induce higher movement synchrony (between performers and audience and within the audience) than Strict performances. Furthermore, Let-go would also induce larger temporal variability in the degree of movement synchrony, due to the additional moments in which the audience faces unpredictability or “surprise” in the music as it unfolds;
3. the effects on the audience’s perception and movement synchrony would be correlated. Specifically, higher movement synchrony and its temporal variability would be positively associated with the degree to which the audience perceives the music to be innovative and stimulating. We posit that such performances, by virtue of their novelty and engagement, capture the audience’s attention and foster a deeper connection with the performers and, indirectly, among the audience members themselves.

In addition to these main hypotheses, we also explore the role of psychological absorption in the audience’s perception of the music. Since psychological absorption is associated with the enjoyment of music⁴⁴, we aim to test whether individual differences in the absorption trait interact with the effect of performance modes. Furthermore, we ask whether visual perception of the performance is essential for the audience to appreciate the Let-go performance mode. To test this possibility, we blindfolded a subgroup of audience members and investigated whether they responded differently to the two performance modes.

Results

A classical concert-experiment showcasing a string quartet performance was arranged. During the event, the quartet presented two repertoire pieces: Mozart’s string quartet KV. 421 no. 15 (exposition of the first movement) and Haydn’s Op. 76 no. 1 (third movement). Each piece was played twice: once in Strict mode and once in Let-go mode. For Strict mode, we instructed the quartet to aim for the level required in international competitions and perform to the best of their ability. In contrast, the performers were asked to adopt an improvisational state of mind when playing in Let-go mode³⁰. Comparing Let-go with Strict performances of the same repertoire piece by the same performers allows us to experimentally manipulate collective musical experiences while controlling for other factors such as piece characteristics and individual differences. The order of Strict and Let-go modes was exchanged between the pieces. The audience was not informed about the nature or the order of the two modes. The concert was attended by 42 audience members. Unlike in other classical music studies, the audience consisted of younger listeners (most under 30) and a female majority. For more demographic data, see supplementary material Sec. A.1. Questionnaire responses and movement data were collected in order to investigate how the two performance modes affect the audience’s experience. Details of the experimental design can be found in “Methods”.

Performance ratings

As a first step in our analysis, to test our hypothesis 1, we investigated the subjective experience of audience members as reflected by questionnaire responses given after each pair of performances. Questionnaire scores were analysed via multilevel models that included experimental variables as fixed effects and participant IDs as random effects (see “Statistical tests”).

Results in Fig. 1a reveal that the audience was receptive to the performance mode, rating the Let-go performances as significantly more Improvisatory (standardized coefficient $\beta =0.41$, $t_{40}=3.78$, $p<0.001$), Innovative ($\beta =0.31$, $t_{40}=2.69$, $p=0.011$), Risk-taking ($\beta =0.44$, $t_{40}=3.75$, $p<0.001$), and Emotionally Engaging ($\beta =0.27$, $t_{40}=2.44$, $p=0.017$) than the Strict ones. This is in accordance with our hypothesis 1 and previous work³⁰. In contrast, no significant difference was observed in how Musically Convincing the two renditions were.

To quantify to what extent these ratings reflect either a unified factor or different aspects of the audience’s experience, we performed a principal component analysis (PCA) to evaluate how much variance in questionnaire scores can be explained as being part of a single factor. Results show that the first principal component (PC1)—mainly consisting of the Improvisatory, Innovative, Risk-taking, and Emotionally Engaging items—accounts for 43.9% of the variance (see Fig. 1b, c). Furthermore, the value of PC1 is significantly higher for the Let-go than the Strict mode ($\beta =0.39$, $t_{40}=3.12$, $p=0.003$; Fig. 1a right), supporting the conclusion that it captures a principal axis that differentiates between performance modes. See the supplementary Table S2 for the full statistical information.
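As a rough illustration of how the variance share of a first principal component can be obtained, the sketch below applies power iteration to the correlation matrix of a small ratings table. The function names are hypothetical and not taken from the study, and standardising the questionnaire items before the PCA is an assumption made here for simplicity.

```python
import math


def correlation_matrix(rows):
    """Pearson correlation matrix of the columns of `rows` (one row per response)."""
    n, k = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(k)]
    sds = [math.sqrt(sum((r[j] - means[j]) ** 2 for r in rows) / (n - 1))
           for j in range(k)]
    z = [[(r[j] - means[j]) / sds[j] for j in range(k)] for r in rows]
    return [[sum(zr[a] * zr[b] for zr in z) / (n - 1) for b in range(k)]
            for a in range(k)]


def pc1_variance_share(rows, iters=500):
    """Fraction of total variance carried by the first principal component."""
    c = correlation_matrix(rows)
    k = len(c)
    # Arbitrary start vector; power iteration assumes it is not orthogonal
    # to the leading eigenvector.
    v = [1.0 + 0.01 * j for j in range(k)]
    for _ in range(iters):
        w = [sum(c[a][b] * v[b] for b in range(k)) for a in range(k)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient of the unit vector v gives the leading eigenvalue.
    eigenvalue = sum(v[a] * sum(c[a][b] * v[b] for b in range(k)) for a in range(k))
    # The trace of a correlation matrix equals the number of items.
    return eigenvalue / k
```

With two perfectly correlated items the share approaches 1, and with two uncorrelated items it approaches 0.5, which gives a sense of scale for a reported share such as 43.9% across several items.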

To evaluate the potential effect of visual cues on the difference of experience between Strict and Let-go, 13 audience members were blindfolded. Incorporating the blindfolding factor in our multilevel models showed neither a significant main effect of sight nor significant interactions with performance mode. A significant 3-way interaction was, however, observed for the PC1, Improvisatory, and Risk-taking ratings. See the supplementary Table S2 for the full statistical information, and Fig. S2 for the visualization.

As an additional control, we investigated whether the difference in the ratings between performance modes could be related to individual differences in psychological absorption among the audience members, as this trait has previously been linked to higher engagement with music⁴⁵. Results show that absorption has a positive effect on the audience ratings in general, but it neither differentiates the mode of performance nor interacts with it. We observe a significant effect on the audience’s Innovative ($\beta =0.25$, $t_{39}=2.30$, $p=0.027$) and Emotionally Engaging ($\beta =0.30$, $t_{39}=3.14$, $p=0.003$) ratings, suggesting that higher absorption is indeed associated with a more positive emotional experience regardless of the mode of performance. It is worth noting that the Musically Convincing rating and absorption are not significantly related ($\beta =-0.01$, $t_{39}=-0.07$, $p=0.947$), and also that higher-absorption participants are likely to find the piece more Familiar ($\beta =0.27$, $t_{39}=2.38$, $p=0.022$). (See Section B.2 in the supplementary material for detailed results.)

Movement synchrony

The second step in our analysis is to investigate the movement patterns of audience members, in particular the synchrony among listeners and with the performers and how they are affected by the performance modes (hypothesis 2). For this purpose, we carried out quantitative analyses using accelerometer data collected from the audience and performers.

Wavelet transform coherence analysis

We start with the degree of synchronisation across the entire spectrum of physical movement, considering the synchrony of movements between performers and audience (P–A sync) and also between audience members (A–A sync) over a wide range of timescales (Fourier periods). For this, we employ the wavelet transform coherence (WTC) in the time-frequency domain⁴⁶, which has been widely used to evaluate interpersonal movement synchrony in various types of interactions, including in a musical context^18,47,48,49,50,51.
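To make the idea of coherence at a given timescale more concrete, here is a deliberately simplified sketch that evaluates wavelet coherence between two signals at a single Fourier period. It is not the pipeline used in the study: the Morlet parameter `OMEGA0 = 6`, the boxcar smoothing in time only (standard WTC also smooths across scales), the smoothing width, and all function names are assumptions made for illustration.

```python
import cmath
import math

OMEGA0 = 6.0  # Morlet centre frequency; a common default, assumed here


def morlet_cwt(x, dt, period):
    """Complex Morlet wavelet coefficients of x at one Fourier period (seconds)."""
    # Convert the Fourier period to the corresponding Morlet scale.
    scale = period * (OMEGA0 + math.sqrt(2.0 + OMEGA0 ** 2)) / (4.0 * math.pi)
    half = int(math.ceil(4.0 * scale / dt))  # truncate the Gaussian envelope
    kernel = []
    for k in range(-half, half + 1):
        u = k * dt / scale
        psi = math.pi ** -0.25 * cmath.exp(1j * OMEGA0 * u) * math.exp(-0.5 * u * u)
        kernel.append(psi.conjugate() * math.sqrt(dt / scale))
    n = len(x)
    out = []
    for t in range(n):
        acc = 0j
        for k in range(-half, half + 1):
            m = t + k
            if 0 <= m < n:  # samples outside the recording are treated as zero
                acc += x[m] * kernel[k + half]
        out.append(acc)
    return out


def boxcar(values, width):
    """Centred moving average; works for real or complex sequences."""
    n, h = len(values), width // 2
    out = []
    for t in range(n):
        lo, hi = max(0, t - h), min(n, t + h + 1)
        out.append(sum(values[lo:hi]) / (hi - lo))
    return out


def coherence_at_period(x, y, dt, period, smooth_periods=3.0):
    """Time course of squared wavelet coherence (0 to 1) at one period."""
    wx = morlet_cwt(x, dt, period)
    wy = morlet_cwt(y, dt, period)
    width = max(1, int(round(smooth_periods * period / dt)))
    cross = boxcar([a * b.conjugate() for a, b in zip(wx, wy)], width)
    pxx = boxcar([abs(a) ** 2 for a in wx], width)
    pyy = boxcar([abs(b) ** 2 for b in wy], width)
    return [abs(c) ** 2 / (px * py) if px * py > 0 else 0.0
            for c, px, py in zip(cross, pxx, pyy)]
```

Averaging the returned time course gives a mean synchrony value for that period, and repeating the call over a grid of periods would trace out a synchrony spectrum. Two identical signals yield a coherence of 1 at every time point, while unrelated signals generally stay below that.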

When analysing synchrony at different timescales among audience members and between performers and audience, our results show that in both cases the audience exhibits higher synchrony in the Strict mode only at shorter timescales, while during the Let-go performances higher synchrony is seen at longer timescales (see Fig. 2).

Short timescales correspond to rhythmic elements of the piece as well as physiological signals such as breathing, and hence we refer to the synchrony that dominates in Strict as ‘beat-sync’. In contrast, the longer timescales, which dominate in Let-go, correspond to longer musical gestures related to higher-level semantics and musical expression⁵², and we therefore describe the corresponding synchrony as ‘music-sync’.

In addition to the average P–A and A–A sync, we studied temporal variability of the P–A and A–A sync at each timescale. Results show that the audience exhibits significantly more variability of synchronisation at longer timescales during the Let-go performances (see Fig. 3). We refer to the temporal variability of the synchrony in these longer timescales as ‘music-sync variability’. No significant differences were observed at shorter timescales.

Additional analyses showed no effects of blindfolding on the different types of synchrony and no significant interaction between visibility and performance mode. (See Section C.4 in the supplementary material for details.)

Breathing-related pattern analysis

One of the primary drivers of the beat-sync, especially at the period range of 3–5 s, is assumed to be the audience’s respiration⁵³. In order to deepen our understanding of the nature of this breathing-related component, we conducted further analysis on breathing-related patterns extracted from the accelerometer signals of each audience member, using alternative measures to complement the WTC analysis above.

We first investigated the diversity of breathing-related patterns exhibited by each individual by calculating their entropy rate (ER), which is a well-established information-theoretic metric of pattern diversity⁵⁴. Results reveal an increase in ER of breathing-related patterns during the Let-go performance ($\beta =0.72$, $t_{40}=5.89$, $p<0.001$, see Fig. 4), suggesting increased variability, which has been related to increased arousal with positive valence⁵⁵.
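For readers unfamiliar with the entropy rate, the following minimal sketch estimates it for a symbol sequence as the conditional entropy of the next symbol given the preceding `order` symbols. The estimator used in the study is not specified in this section, so the plug-in estimator, the up/down symbolisation, and the function names here are all assumptions for illustration.

```python
import math
from collections import Counter


def symbolise(signal):
    """Turn a real-valued signal into 1 (rising) or 0 (not rising) symbols."""
    return [1 if b > a else 0 for a, b in zip(signal, signal[1:])]


def entropy_rate(symbols, order=2):
    """Plug-in estimate (bits/symbol) of H(next symbol | previous `order` symbols)."""
    # Count every block made of a context of length `order` plus the next symbol.
    joint = Counter(tuple(symbols[i:i + order + 1])
                    for i in range(len(symbols) - order))
    # Context counts are marginalised from the same blocks, which keeps
    # every conditional probability at or below 1.
    context = Counter()
    for block, count in joint.items():
        context[block[:-1]] += count
    total = sum(joint.values())
    return -sum(count / total * math.log2(count / context[block[:-1]])
                for block, count in joint.items())
```

A perfectly periodic sequence gives an entropy rate of zero once the context is long enough to determine the next symbol, whereas more diverse patterns push the value towards 1 bit per symbol for binary data.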

By studying the level of synchrony between the breathing-related patterns of pairs of audience members via the phase locking value (PLV)⁵⁶, we observe a significantly higher degree of synchrony in Strict than Let-go ($\beta =-0.51$, $t_{40}=-4.98$, $p<0.001$). We also investigated the presence of higher-order synchronisation, namely at the level of triplets instead of pairs, among the audience members, but did not find significant differences (see Section C3 in the supplementary material for these results). The larger pairwise synchrony observed in Strict, which is contrary to our hypothesis 2, can be interpreted as arising from the more regular rhythms that characterise this performance mode and can be linked to the higher synchrony in beat-sync and the higher regularity of tempo in the Strict mode.
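The phase locking value has a compact definition: the magnitude of the time-averaged unit phasor of the phase difference between two signals. The sketch below assumes the instantaneous phases (in radians) have already been extracted, for example with a Hilbert transform, which is not shown; how phases were obtained in the study is not described in this section, and the function name is hypothetical.

```python
import cmath


def phase_locking_value(phase_a, phase_b):
    """PLV in [0, 1]: 1 for a constant phase lag, near 0 for unrelated phases."""
    if len(phase_a) != len(phase_b) or not phase_a:
        raise ValueError("phase series must be non-empty and of equal length")
    # Average the unit phasors of the phase difference, then take the magnitude.
    mean_phasor = sum(cmath.exp(1j * (a - b))
                      for a, b in zip(phase_a, phase_b)) / len(phase_a)
    return abs(mean_phasor)
```

Two breathing-related patterns that keep a fixed lag give a PLV of 1 regardless of the size of the lag, which is why the measure captures consistency of timing rather than simultaneity.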

Correlation between ratings and synchrony

As a final step in our investigation, we studied whether the differences in synchrony found in the previous section were predictive of the subjective experience as reported in the questionnaire ratings. For this, we built multilevel models using the various questionnaire items as dependent variables, and mean synchrony (either beat- or music-sync) or music-sync variability as independent variables, while accounting for subject ID using a random intercept (see “Methods” for details).

Results show that higher music-sync variability is most significantly associated with higher subjective scores (see Table 1), in particular the PC1 and items of the audience ratings that constitute perceived innovativeness of the performance. In contrast, increases in the average beat-sync were negatively associated with PC1 and the Improvisatory and Risk-taking ratings. This indicates that higher synchrony at this timescale was linked to lower audience perception of the performance as Improvisatory. The significant associations between beat-sync and subjective scores were observed only in P–A sync and not in A–A sync. Changes in average music-sync between audience members or musicians were not associated with differences in the subjective scores.

Table 1 Association between audience ratings and mean movement synchrony in shorter and longer timescales, and the temporal variability of synchrony in the longer timescale. Values in the “r” columns represent correlation coefficients, which indicate the strength and direction of the associations. Values in the “$t_{125}$” columns represent t test statistics with 125 degrees of freedom, which are obtained by the multilevel models. Significant associations with $p<0.05$ are marked by bold face.


Overall, this suggests that having more dynamic synchrony at the scale of the musical gestures related to higher-level semantics and musical expression is associated with the distinctive experience provided by the Let-go performance, while having higher synchrony with musicians in the beat-sync scale is more characteristic of the less improvisatory experience.

It is worth noting that when focusing solely on Haydn’s composition (which musicians regarded as more successfully differentiated between Let-go and Strict modes), more statistically significant associations between mean beat-sync, mean music-sync, and music-sync variability with the ratings are observed, as shown in Table S4 in the supplementary material.

Music performance analysis

Assuming that music serves as the primary driver of the collective behaviour observed in the audience, our findings on movement synchrony and the audience’s experience can be better contextualised by examining specific attributes of the musical performance. For this purpose, we conducted exploratory analyses based on the performers’ subjective reports, listening tests and quantitative performance analysis of the recordings in Sonic Visualiser⁵⁷. See Methods for more details.

A brief study of the pieces’ dynamics using the loudness curve reveals that both Let-go performances had a larger dynamic range and showed higher pattern complexity, which is in line with a higher level of unexpectedness in terms of performed expressive gestures and phrasing, and also in accordance with subjective reports by the musicians. Moreover, we observe that the results related to pattern complexity of the loudness curves are consistent with the mode of performance, with higher values in the Let-go performance mode.

Spectral entropy, on the other hand, which relates to the complexity of timbre, seems more consistent with the subjective reports of the musicians themselves, who maintained that the modes of performance were not as clearly differentiated when performing Mozart’s quartet. (See Table 2 as well as Limitations below.)
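As an illustration of what a spectral entropy measures, the sketch below computes the normalised Shannon entropy of the power spectrum of a short signal frame. This is one common definition and is an assumption here: the exact computation used for the values in Table 2 is not described in this section, and the function names are hypothetical.

```python
import cmath
import math


def power_spectrum(x):
    """One-sided power spectrum (bins 1..N/2, DC excluded) via a direct DFT."""
    n = len(x)
    spectrum = []
    for k in range(1, n // 2 + 1):
        coeff = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spectrum.append(abs(coeff) ** 2)
    return spectrum


def spectral_entropy(x):
    """Normalised spectral entropy: 0 for a pure tone, 1 for a flat spectrum."""
    power = power_spectrum(x)
    total = sum(power)
    if total == 0:
        return 0.0  # silent frame: no spectral content to distribute
    probs = [p / total for p in power if p > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return entropy / math.log2(len(power))
```

A sound whose energy sits in a few partials scores low, while a noisier, spectrally richer timbre scores high. The direct DFT is quadratic in the frame length, so this form is only suitable for short frames.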

Table 2 Quantitative comparison between the modes of performance using tools from music performance analysis. Results in bold show consistent differences between the two modes in both pieces. LUFS: loudness units relative to full scale.


Finally, we studied the tempi of the performances using graphs of tempo curves generated in Sonic Visualiser. For each performance, the musicians established a beat of rhythmic reference based on the tempo, time signature and mode of performance (see Methods).

When analysing the tempo curves of the performance of Mozart’s quartet, we took a crotchet (1/4) as the beat of rhythmic reference. Measurements revealed that the average duration of a crotchet is 0.74 s in the Strict mode and 0.744 s in the Let-go mode. This makes the mean duration of a half-bar approximately 1.5 seconds, and the mean duration of a bar approximately 3 seconds. When analysing the tempo curve of Haydn’s minuet performance, we measured the tempo in relation to one whole bar of 3/4 as the beat of rhythmic reference. This movement comprises three sections, minuet-trio-minuet, each with a slightly different tempo, so we computed the average bar duration for each section. The average durations of a bar of three crotchets are 0.6, 0.885, and 0.551 seconds in Strict, and 0.631, 0.958, and 0.585 seconds in Let-go, all of which fall within the timescales included in beat-sync.
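The arithmetic linking these beat durations to half-bar and bar durations can be checked with a few lines. The durations below are the ones quoted in the text; treating a Mozart bar as four crotchets is an assumption that is consistent with the half-bar and bar values stated above, and the helper names are hypothetical.

```python
def bar_seconds(beat_seconds, beats_per_bar):
    """Duration of a group of beats, given the duration of one beat."""
    return beat_seconds * beats_per_bar


def beats_per_minute(beat_seconds):
    """Tempo in beats per minute for a given beat duration in seconds."""
    return 60.0 / beat_seconds


# Mean crotchet durations (seconds) reported for Mozart's quartet.
MOZART_CROTCHET = {"Strict": 0.74, "Let-go": 0.744}
# Mean bar durations (seconds) reported for Haydn's minuet, trio, minuet.
HAYDN_BAR = {"Strict": [0.6, 0.885, 0.551], "Let-go": [0.631, 0.958, 0.585]}

for mode, crotchet in MOZART_CROTCHET.items():
    half_bar = bar_seconds(crotchet, 2)
    bar = bar_seconds(crotchet, 4)  # assumes four crotchets per bar
    print(f"Mozart {mode}: half-bar {half_bar:.3f} s, bar {bar:.3f} s, "
          f"{beats_per_minute(crotchet):.1f} crotchets per minute")

for section, strict, letgo in zip(("minuet", "trio", "minuet"),
                                  HAYDN_BAR["Strict"], HAYDN_BAR["Let-go"]):
    print(f"Haydn {section}: Let-go bar is {100 * (letgo / strict - 1):.1f}% longer")
```

Every Haydn section has a longer mean bar in Let-go than in Strict, and the Mozart half-bar and bar durations sit close to the 1.5 s and 3 s values mentioned in the text.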

At times, the musicians created one uninterrupted gesture made of two or four bars in the case of Haydn’s minuet and a half-bar, one bar or sometimes two bars in the case of Mozart’s quartet. The duration of such musical gestures would range between 1 and 4 seconds. Therefore, in both the Haydn and Mozart performances, the mean durations of beats of rhythmic reference and of musical gestures comprising 2 or 4 such beats correspond to timescales within beat-sync. Moreover, timescales around 0.8, 1.5, and 3 seconds show significant peaks in synchrony, and this tempo analysis suggests they may be related to the beat of rhythmic reference. These beat-sync peaks are higher during the Strict performances, where the musicians follow a more even and rigid rhythmic pulse.

On average, tempi were slightly slower in Let-go performance modes, and more importantly, we found that tempo variability is consistently higher in Let-go than in Strict performance mode (see Table 2). The differences in synchrony scales are also supported by the difference in tempo variability in Strict as compared to Let-go. This result agrees with the musicians’ observation that the Strict performance mode is characterised by more rigidly even short-term beats and higher synchrony at a single, short-term beat level. It is also related to the fact that the audience may find these more rigidly even beats more predictable, likely causing the higher movement synchrony in short beat-sync timescales during Strict performances.
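One simple way to put a number on tempo variability is the coefficient of variation of successive beat durations. The measure actually reported in Table 2 is not defined in this section, so this choice is an assumption, and the beat durations below are invented purely for illustration and are not measurements from the study.

```python
import statistics


def tempo_cv(beat_durations):
    """Coefficient of variation (SD / mean) of a series of beat durations."""
    mean = statistics.mean(beat_durations)
    return statistics.pstdev(beat_durations) / mean


# Hypothetical beat durations in seconds, for illustration only.
even_beats = [0.74, 0.75, 0.73, 0.74, 0.74, 0.75]
flexible_beats = [0.70, 0.80, 0.68, 0.79, 0.72, 0.77]

print(f"even pulse CV:     {tempo_cv(even_beats):.3f}")
print(f"flexible pulse CV: {tempo_cv(flexible_beats):.3f}")
```

Dividing by the mean makes the measure comparable between performances with different average tempi, which matters here because Let-go performances were slightly slower overall.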

To explore the longer, music-sync timescales emergent in the movement synchrony, it is important to distinguish between the short-term metronomic beat (such as quavers or crotchets, characterised by evenness between notes) and a broader structural pulse associated with longer, more variable durations. The latter is a deeper structural pulse which often governs longer musical phrases and facilitates more significant expressive potential. During the Let-go performance mode, in which an improvisational state of mind is applied, the length of expressive gestures marked by the deeper structural pulse can change according to the performer’s expressive intentions. Therefore, gestures and phrases may have longer durations. (We would like to note that musicians in general, not only conductors, often refer to the specific beat they focus on while performing as a conducting beat. This is often the beat of rhythmic reference explained above. However, in the Let-go performance mode, the deeper structural pulse sometimes functions as the conducting beat.) For example, in both performances of Mozart’s quartet, a 4-bar phrase would last on average 10–12 seconds, which is within the music-sync timescale for P–A sync. This 4-bar phrase is written in a classical-style, symmetric periodic phrase structure, which acts as a “call and response” and would be governed by the deeper pulse of a half-bar or whole bar rather than the metronomic crotchet or quaver pulse. Longer durations in music-sync can be associated with even longer phrasing structures.

While listening to the performance recordings, the performers found that arriving together at the phrases’ goal points was more convincing in the Let-go performance mode, despite spontaneous deviations from some of the score’s instructions in terms of timing, dynamics, and at times actual notes (improvised elaborated repeats or cadential moments, for example). Those changes represent an element of the unknown and a higher level of risk-taking, which makes the stronger cohesion in phrasing far from obvious, and supports the higher music-sync and music-sync variability during Let-go.

Discussion

This study investigated how the improvisational approach to performance, termed the Let-go mode, impacts the audience’s collective experience in comparison to the Strict performance mode, which represents the mainstream approach in Western classical music performance. The analysis focused on three aspects: (1) the audience’s ratings for the experiential part; (2) movement synchrony between the performers and audience, and among audience members, for the movement part; and (3) the relationship between the ratings and synchrony to integrate the experiential and movement aspects. We also conducted additional follow-up analyses to deepen our understanding of the mechanisms and factors influencing the impact of the improvisatory approach.

Consistent with previous studies³⁰, the audience rated Let-go performances higher than their Strict counterparts in multiple experiential dimensions, suggesting that the experiment was successful in inducing differentiated musical experiences in the audience. The result supports our hypothesis 1. In particular, the audience perceived the higher Improvisatory, Innovative and Risk-taking character of Let-go performances, while considering both performances as Musically Convincing. Additional analyses show no effects of blindfolding on ratings, suggesting that the music itself, rather than visual cues, acted as a driver for the collective subjective experience. Moreover, results show that performance ratings are also related to the psychological trait of absorption, but this does not explain away the effect of the performance mode. Absorption has been previously linked to the enjoyment of music⁵⁸, yet it does not seem to affect collective engagement in the Let-go performance.

To explore the relationship between qualitative aspects of musical performance and audiences’ experience, we also gathered subjective accounts of the performance from the musicians. All members of the quartet recognised that during the first pair of performances (of Mozart’s piece), the discomfort they felt initially due to the experimental equipment affected their state of flow, causing their first performance to be more rigid than intended for a Let-go performance. In the second pair of performances (of Haydn’s piece), they felt more together and more emotionally connected during Let-go, and also recognised many more moments of significant emotional expression in the Let-go mode.

Interestingly, looking into the effect of compositions on the audience’s ratings, we find that the differences in PC1 and the Improvisatory, Innovative and Risk-taking ratings between the two performances of the piece by Mozart are weaker than those for the piece by Haydn, as revealed by significant interactions between the performance mode and composition factors by the multilevel models (Improvisatory: $\beta =-0.67$, $t_{40}=-3.67$, $p<0.001$; Innovative: $\beta =-0.59$, $t_{40}=-3.60$, $p<0.001$; Risk-taking: $\beta =-0.62$, $t_{40}=-2.97$, $p=0.005$; see supplementary Section B.1 for details). These results indicate that the audience perceived the differences between the Let-go and Strict performances of Haydn’s composition, but they were not as sensitive to the difference between Let-go and Strict performances of Mozart’s composition. Importantly, this is in accordance with the musicians’ subjective reports on their own performance.

Regarding our hypothesis 2, the results reveal that an improvisatory approach to performance affects movement synchrony in the audience in opposite directions, depending on the timescale. Specifically, Let-go performances reduce synchrony compared with Strict ones at shorter timescales, while they enhance synchrony at longer timescales. Short timescales can be associated with the rhythmic pulse and physiological responses to it, which are clearer in the Strict rendition of the music, and longer timescales with longer musical macrostructures and musical phrases⁵².

Our findings, therefore, suggest that collective music experience is embodied in a multiscale adaptive interaction between the performers and audiences, with these spanning a longer temporal horizon in Let-go renditions than in Strict ones. Similar time-scale dependency of the movement synchrony has also been observed in different forms of social interactions, including collaborative team problem solving⁵⁰ and joke telling⁴⁷.

It is worth noting that synchrony was observed for both blindfolded and sighted audience members, which suggests that, in terms of mechanisms, audience members modulated their movement synchrony with the performers mainly via auditory rather than visual information, in line with previous results³⁰. This suggests, in turn, that performance-to-audience synchronisation was primary, and that audience-to-audience synchronisation emerged mainly indirectly, mediated by the former rather than by direct interaction between audience members.

Our analysis of synchrony in movement patterns was not restricted to the average degree of synchrony, but also considered the variance of synchrony during the performance. Results show that Let-go performances increase the temporal variability of synchronisation on longer timescales. Combined with the results for average synchrony, this means that the Let-go performances increased longer-timescale synchrony at specific moments rather than evenly throughout the performances. In other words, they enhanced the dynamical transition between a convergent (in-sync) phase, where the movements of audience members and performers are well-coordinated, and a divergent (out-of-sync) phase, where their movements are less coordinated. From a dynamical systems perspective, this temporal variability in synchrony could be interpreted in terms of meta-stability, which captures the temporal dynamics between a high-synchrony phase, where the elements involved are coordinated and integrated into quasi-stable low-dimensional states, and a low-synchrony phase, where the elements are segregated, leading to transient uncoordinated states⁵⁹. In our musical context, Let-go performance would make the music less predictable for the audience at certain moments, which could lead to higher temporal variability in the degree of synchrony. Interestingly, in the context of dyadic interpersonal coordination, it has been suggested that meta-stability, moving in and out of synchrony, is a characteristic of well-functioning interactions⁶⁰. Along these lines, we speculate that the temporal variability of synchrony could signify how much the musical interaction between performers and audience is functioning in an adaptive and flexible manner.

In contrast, the decrease in shorter-timescale synchrony and the increase in the diversity of breathing-related patterns under Let-go performance took place more evenly over the whole performances, as shown by the less significant changes in temporal variability. This confirms the idea that the origins of the shorter- and longer-scale sync are different, and further suggests that the shorter-scale sync corresponds to the low-level musical components (shorter beats and meters) and autonomic responses to them, which exist throughout the performances, while the longer-scale sync corresponds to the temporally organized higher-level hierarchical musical structures that are fundamental for the narrative generated by the music performance.

To test our hypothesis 3, we investigated the relationship between the audience’s subjective ratings of the performances and the collective movement synchronization. The statistical associations found between changes in psychological ratings and patterns of collective movement suggest that these may be reflecting different manifestations of the same underlying process. Interestingly, results show that higher synchrony in the shorter timescale was negatively associated with the audience’s perception of the innovativeness of performances, which further supports the idea that the shorter-scale synchrony may reflect rather automatic and unconscious alignment to low-level structural/syntactic aspects of the music. That is, the more standard and predictable a performance was (especially in the Strict mode), the easier it may have been for the audience to become physically and automatically entrained to it. At the same time, the high predictability may have placed the performance below the optimal zone of uncertainty for music pleasure^61,62,63, giving the audience the impression that the performance was less innovative. In contrast, higher synchrony and its temporal variability in the longer timescale were positively associated with the audience’s innovative experience. Thus, the longer-scale synchrony may reflect the audience’s absorption in the dynamics of higher-level musical expression or semantics, which is enriched by the Let-go performance mode.

The synchrony scales emerging in the audience can also be interpreted through a musicological and performance analysis perspective. Short timescales are associated with the average duration of beats of reference that create the pulsation of rhythm. The rhythm is more pronounced and metronomic, with more even and rigid beats and less tempo variability in the Strict musical performances, enhancing beat-sync. The rhythm of the music has also been previously shown to act as a driver of physiological rhythms such as breathing^37,41. In contrast, the longer timescales are associated with freer musical gestures, based on deeper, structural pulses in the music, allowing more possibilities in terms of phrasing, articulating and ability to deviate from expectations in Let-go^52,64. The higher music-sync in Let-go, as well as the higher temporal music-sync variability, may be caused by the audience’s synchronised response to the spontaneous and unplanned arrival of the ensemble at the same point in the music, crafting moments of peak emotional expression. Previous work has also shown that the audience shows higher physiological synchrony during important structural moments in the music³⁷. In future work, we hope to relate both structurally relevant and subjectively intense expressive moments in the music with an analysis of movement synchrony focusing on the time domain.

Here, in terms of musicology, these findings may encourage us to revisit the distinction between the structural design of a composition, and the micro- and macrostructural patterns generated by performers^31,65. We argue that performers who apply an improvisational state of mind³⁰ use a similar kind of generative processes inherent in composition⁶⁶ during the spontaneous creative processes of performance, whether they are performing a repertoire work or freely improvising⁶⁴. Further performance study-based explorations of music performance parameters such as tempo and dynamics in important structural moments in both score and performance, and how they are linked to the subjective experience of musicians and audience, are an important avenue for future work.

The findings in this study have not only scientific significance in and of themselves but also potential applications. Since verbalising and sharing collective creative experiences (exemplified here by a collective music experience) is very difficult, designing and evaluating such experiences is a particularly challenging task. Usually, this endeavor relies heavily on intuitive judgement from domain experts in the target field, such as music directors or concert organisers. The current results involving the objective movement synchrony provide a first step towards the quantification of some aspects of these ephemeral experiences, opening the possibility for sensing technologies to evaluate their elusive yet important aspects. This evaluative information could help the design process by accelerating the try-and-evaluate loop and facilitating knowledge sharing. Furthermore, real-time utilisation of the synchrony information could enhance the experience of all participants immediately. It may also be possible, in some specific artistic contexts, to enrich the audience’s musical experience by converting the synchronisation state into dynamic visual effects or sensations for other modalities and presenting them along with the music.

Limitations

A major limitation of the current study is the sample size. Although the number of audience members who participated in the study is almost double that of our previous study³⁰, the concert was held only once. It will be important to check whether the findings are reproduced across multiple concerts with different audience samples.

The small sample size could have been especially problematic in exploring the effect of the audience’s vision, which was investigated by blindfolding a subset of audience members. In comparison to within-subject factors such as performance mode and composition, effects of between-group factors are statistically harder to detect with small samples. In addition, we assigned the blindfolding manipulation based on the seating of the audience members (see Fig. S1). This design choice led to unbalanced group sizes (13 blindfolded and 29 non-blindfolded audience members). These problems may have hindered us from detecting a potentially significant effect of the audience’s vision. Previous studies have shown the role of vision in music appreciation. For example, people depend heavily on visual information when making judgments about music performance⁶⁷. Thus, further experiments with larger sample sizes would be needed to draw firm conclusions about the relative impact of vision versus audition on the appreciation of improvisatory performances. In the same way, it is possible that a larger sample size may also reveal interactions between the absorption trait, which is a between-subject variable, and the perception of music with improvised elements.

Other aspects yet to be investigated in the future include the effects of audience members’ personality traits and social relationships between them. For example, a recent study has shown that people’s neural responses to viewing naturalistic stimuli are generally more synchronised with peers who have higher personality similarity⁶⁸. Another study indicated that pairs of strangers, friends, and lovers viewing a series of video clips show different physiological synchrony patterns depending on the emotion elicited by the videos, with a general tendency for higher synchrony between stranger pairs⁶⁹. While the current study did not probe into these aspects, it would be interesting to explore how the composition of audience members might enhance objective synchronisation and subjective musical experience in response to different types of performance approaches.

A technical limitation comes from the way we extracted and measured breathing signals. In physiological synchrony research, dedicated sensors are often used to study breathing rate. In our study, we instead used discrete wavelet transforms and wavelet reconstruction to extract the signals with frequencies associated with breathing rate⁷⁰ from the accelerometer signals. This workflow has been used in previous research, but it requires an extra preprocessing step and the signals are not measured directly. Therefore, the breathing results will need to be validated in future studies using a more direct measurement, such as a respiration belt, in order to avoid contamination by body movement.

A final limitation comes from the difficulty of organising naturalistic concert-experiments and of balancing a realistic improvisational state of mind for the performers against robust collection of experimental data. The experimental equipment affected the musicians’ state of flow, causing the first performance of Mozart to be more constrained and less innovative, even though the performers were aiming to play in a Let-go, improvisational state of mind. This resulted in a diminished effect of the mode of performance in the first pair of pieces. Allowing the musicians to get used to the experimental equipment and context will help improve the robustness of future experiments.

Conclusions

In conclusion, this research uncovers the relevance of the often-neglected multiscale coordination between audience and performers, and reveals its deep connections with the quality of the collective subjective experience. Our results provide quantitative evidence that illuminates how a collective music experience is embodied in a multiscale dynamical interaction, which expands the group flow aspects of the relationships between the improvising musicians^19,20 to a complex dialogue with audiences that is enhanced by the innovative, risk-taking and unexpected qualities of improvisatory performance. Last but not least, the reported results highlight the importance of regarding collective creative activities as physically embodied experiences, suggesting that a rich tapestry of physical behaviour underlies the shared experience even in audiences that are often regarded as passive.

Methods

Experimental procedure

The concert/experiment involved the Portorius String Quartet, who performed movements from Mozart (String Quartet No. 15 in D Minor, K. 421, first movement: Allegro moderato) and Haydn (String Quartet in G Major, Hob.III:75, Op. 76, No. 1, third movement: Menuetto: Presto) as well as improvised pieces in different performance modes (Table S1). Specifically, for the repertoire works, the same piece was performed twice, in each of the two modes, Strict and Let-go, varying the order, allowing us to better isolate the effect of performance mode on the audience.

The two repertoire pieces were chosen as they are both from the classical period and their phrase structure lends itself to more straightforward creative work when performed in Let-go, but they contrast with each other in mood and musical energy. Mozart’s piece is more introverted and complex from a contrapuntal point of view. Haydn’s piece is more extroverted and varied from a rhythmic point of view. The four improvisation pieces (pieces 3–6 in Table S1) were fully improvised and very different from each other, making them difficult to compare in terms of performance modes. Therefore, they were excluded and only the four repertoire performances were analysed in this study. Audio-video recordings of the four repertoire performances are shared on the Open Science Framework server (https://osf.io/ar64j/).

Prior to the concert, all members of the quartet had taken part in Professor David Dolan’s course Interpretation through Improvisation at the Guildhall School of Music and Drama in London⁷¹. The method taught during the course involves a creative approach to studying and performing repertoire works, engaging with structural, harmonic, rhythmic and motivic reductions while maintaining an improvisational state of mind, encouraging risk-taking and allowing spontaneous deviations from the score in coordination with ensemble partners. As such, the string quartet was able to adopt the different states of mind and approaches required to perform under the two different modes.

The concert experiment was conducted in a recital room in the Guildhall School of Music and Drama (see Fig. S1).

Participants

Audience members were recruited via posters on bulletin boards and an online call for participation. Fifty adult volunteers attended the concert experiment as audience members. They were mainly graduate students and staff of Imperial College London or their families and friends, with a wide range of experience with classical music. Of these, 8 subjects encountered issues with the physical motion recording or did not provide the subjective ratings of the performances. Therefore, the data from the remaining 42 subjects were analysed. We note the diversity of nationalities: 12 different nationalities from Europe, 4 different nationalities from Asia, plus 2 Australians and 1 Mexican. Moreover, unlike the usual distribution of classical music attendees, most of our participants were under 30 and female. A large proportion were familiar with playing music, but only a subset were formally trained or actively practising. Further details on the characteristics of the audience members are provided in the supplementary materials (Section A.1). In order to investigate the role of the audience’s vision, 13 out of the 42 audience members listened to the performances wearing blindfolds.

Measurements

Body motion acceleration

The performers’ head motions were measured with inertial measurement units (IMUs; TSND151; ATR-Promotions, Japan) placed on the middle of their forehead, attached to the fNIRS brain activity measurement device (HOT-1000; NeU, Japan). The audience members’ body motion fluctuations were measured with IMUs contained in the smartphones (Zenfone 3 Laser; ASUSTek, Taiwan) that they wore around their necks⁷². The sampling frequency was 100 Hz for both sensors, and the signals were then downsampled to 50 Hz.

Questionnaires

Before the study, audience members filled in a psychometric questionnaire to assess their psychological trait of absorption⁷³, as this has been previously related to the enjoyment of music⁴⁴, as well as susceptibility to altered states of mind and even psychedelic experiences⁴⁵.

After each pair of successive performances, the audience members rated their subjective evaluation of each performance on five items: how they felt each performance to be (1) Improvisatory, (2) Innovative, (3) Emotionally Engaging, (4) Musically Convincing, and (5) Risk-taking. These items were identical to the ones used in the previous studies^29,30. Two additional items were added, where the audience members were asked to rate their degree of (6) familiarity with the piece and (7) sleepiness. The rating for each of these seven items was given on a six-level Likert scale, ranging from 0: “not at all/none” to 5: “totally/completely”. In the questionnaire, the pair of performances were labeled as “performance 1” for the earlier one and “performance 2” for the later one, and the items were presented for each of them.

The collected rating data contained a small number of missing values: of the 1176 values in total, comprising 168 observations (42 members $\times$ 2 pieces $\times$ 2 repetitions with different modes) on the seven questionnaire items, the “Improvisatory”, “Convincing”, “Familiar”, and “Sleepy” items had one missing value each, and the “Risk-taking” item had two. For each subject and each performance there was no more than one missing value. These missing values were imputed using the missForest algorithm, a random forest-based multiple imputation scheme⁷⁴, as the PCA on the questionnaire scores requires a complete dataset.
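
The counts quoted above can be checked with a few lines of arithmetic. The snippet below is only a bookkeeping sketch: the variable names are ours, and the numbers are taken directly from the text.

```python
# Bookkeeping check of the rating counts stated in the text.
members, pieces, modes, items = 42, 2, 2, 7

observations = members * pieces * modes   # one observation per member and performance
total_values = observations * items       # one value per observation and questionnaire item

# Missing values per item, as reported in the text.
missing = {"Improvisatory": 1, "Convincing": 1, "Familiar": 1, "Sleepy": 1, "Risk-taking": 2}
n_missing = sum(missing.values())
missing_fraction = n_missing / total_values

print(observations, total_values, n_missing)
print(f"fraction of values imputed: {missing_fraction:.2%}")
```

Under these counts, the imputation touches 6 of 1176 values, roughly half a percent of the rating data.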

Analysis

Wavelet synchrony analysis

To evaluate synchrony between physical activity, triaxial head acceleration data of the musicians (from IMU sensors) and body acceleration data of the audience (from smartphones) were converted to a one-dimensional time series of the Euclidean norm of acceleration:

$$\begin{aligned} a(t) = \sqrt{a^2_x(t)+a^2_y(t)+a^2_z(t)} \end{aligned}$$
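
As a minimal illustration of this reduction, the sketch below computes the norm sample by sample. The function name and the three-sample input are hypothetical and are not part of the original pipeline.

```python
import math

def acceleration_norm(ax, ay, az):
    """Return a(t) = sqrt(ax(t)^2 + ay(t)^2 + az(t)^2) for each sample."""
    if not (len(ax) == len(ay) == len(az)):
        raise ValueError("all three axes must have the same number of samples")
    return [math.sqrt(x * x + y * y + z * z) for x, y, z in zip(ax, ay, az)]

# Hypothetical three-sample recording (arbitrary units), for illustration only.
norm = acceleration_norm([3.0, 0.0, 1.0], [4.0, 0.0, 2.0], [0.0, 1.0, 2.0])
print(norm)  # [5.0, 1.0, 3.0]
```

Because the norm discards direction, the resulting series does not depend on how each sensor happened to be oriented on the body.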

Then, we evaluated movement synchrony of each pair of signals by using the WTC⁴⁶ of their acceleration norm time series. WTC finds regions in time-frequency space where two time series covary, but do not necessarily have high power. WTC has been used to evaluate interpersonal movement synchrony in various types of interactions^18,48,50 and is defined as⁷⁵:

$$\begin{aligned} R^2(t, s) = \frac{\vert S (s^{-1}W^X(t, s) W^{Y*}(t, s)) \vert ^2}{S ( s^{-1} \vert W^X(t, s)\vert ^2) S (s^{-1} \vert W^Y(t, s)\vert ^2)} \end{aligned}$$

where $W^X$ and $W^Y$ refer to the wavelet transforms of the two signals, t and s refer to the time sample and wavelet scale, respectively, and S is a smoothing operator in time and scale. Wavelet scale s is directly associated with a Fourier period⁷⁵, which is used to discuss scales of synchrony. Results were computed using the open-source wavelet-coherence Matlab package⁷⁶, with its default setting of the Morlet wavelet $\psi _0(\eta ) = \pi ^{-\frac{1}{4}}e^{i\omega _0 \eta }e^{-\frac{1}{2}\eta ^2}$ with the parameter $\omega _0 = 6$ as the wavelet function, and the scale resolution of 12 scales per octave.
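
To make the link between wavelet scale and Fourier period concrete, the following sketch evaluates the Morlet wavelet given above, builds a geometric scale grid with 12 scales per octave, and converts scales to Fourier periods using the standard Morlet relation $\lambda = 4\pi s / (\omega _0 + \sqrt{2 + \omega _0^2})$. It is not the Matlab package used in the study: the function names are ours, and the smallest scale of 0.04 s and the number of octaves are assumptions chosen only for illustration.

```python
import cmath
import math

OMEGA0 = 6.0            # Morlet parameter stated in the text
SCALES_PER_OCTAVE = 12  # scale resolution stated in the text

def morlet(eta, omega0=OMEGA0):
    """Morlet wavelet psi_0(eta) = pi^(-1/4) * exp(i*omega0*eta) * exp(-eta^2 / 2)."""
    return math.pi ** -0.25 * cmath.exp(1j * omega0 * eta) * math.exp(-0.5 * eta * eta)

def fourier_period(scale, omega0=OMEGA0):
    """Fourier period equivalent to a Morlet wavelet scale."""
    return 4.0 * math.pi * scale / (omega0 + math.sqrt(2.0 + omega0 ** 2))

def scale_grid(smallest_scale, n_octaves, per_octave=SCALES_PER_OCTAVE):
    """Geometric grid s_j = s_0 * 2^(j / per_octave), j = 0 .. n_octaves * per_octave."""
    return [smallest_scale * 2.0 ** (j / per_octave)
            for j in range(n_octaves * per_octave + 1)]

# Assumed values for illustration: smallest scale 0.04 s, spanning three octaves.
scales = scale_grid(0.04, 3)
periods = [fourier_period(s) for s in scales]
print(f"{len(scales)} scales, periods from {periods[0]:.3f} s to {periods[-1]:.3f} s")
```

For $\omega _0 = 6$ the conversion factor is close to 1.03, so scale and Fourier period are nearly interchangeable, which is why the timescales of synchrony can be discussed directly in seconds.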

To calculate the synchrony between performers and audience members (P-A sync) for each performance, first, for each performer X and audience member Y, the WTC coefficient $R^2(t, s)$ was time-averaged over the duration of the performance ($\langle R^2(s) \rangle$). Then, for each audience member Y, the coefficients were averaged over all the four performers (Xs), resulting in a measure of how much each listener was in sync with the performers on average, at each timescale s, for each performance.

Similarly, to quantify the synchrony between audience members (A-A sync), we first temporally averaged the WTC coefficient $R^2(t, s)$ over the duration of the performance ($\langle R^2(s) \rangle$) for all the audience pairs (X, Y). Then, for each audience member X, the coefficients with all other audience members (Ys) were averaged. This provides a measure of how much each audience member was in sync with the other audience members on average, at each timescale, for each performance.
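
The two averaging steps for P-A and A-A sync can be sketched as follows. This is an illustrative outline only: it assumes the coherence values $R^2(t, s)$ have already been computed and are stored as nested lists (time samples by scales) in dictionaries keyed by pair, and all names and toy values are hypothetical.

```python
from statistics import fmean

def time_average(r2):
    """Average R^2(t, s) over time; r2 is a list of time samples, each a list over scales."""
    n_scales = len(r2[0])
    return [fmean([sample[s] for sample in r2]) for s in range(n_scales)]

def mean_profile(profiles):
    """Average several per-scale profiles <R^2(s)> into a single profile."""
    n_scales = len(profiles[0])
    return [fmean([p[s] for p in profiles]) for s in range(n_scales)]

def pa_sync(coherence, performers, listener):
    """P-A sync of one listener: mean over performers of the time-averaged coherence."""
    return mean_profile([time_average(coherence[(p, listener)]) for p in performers])

def aa_sync(coherence, audience, member):
    """A-A sync of one audience member: mean over all other audience members."""
    profiles = []
    for other in audience:
        if other == member:
            continue
        # Coherence is symmetric in the pair, so either key order may be stored.
        key = (member, other) if (member, other) in coherence else (other, member)
        profiles.append(time_average(coherence[key]))
    return mean_profile(profiles)

# Toy data: two time samples and two scales per performer-listener pair.
pa_coherence = {("P1", "A1"): [[0.25, 0.5], [0.75, 1.0]],
                ("P2", "A1"): [[0.0, 0.25], [0.5, 0.25]]}
pa = pa_sync(pa_coherence, ["P1", "P2"], "A1")

# Toy data: one time sample and two scales per audience pair.
aa_coherence = {("A1", "A2"): [[0.5, 0.5]],
                ("A1", "A3"): [[1.0, 0.0]],
                ("A2", "A3"): [[0.0, 0.5]]}
aa = aa_sync(aa_coherence, ["A1", "A2", "A3"], "A2")
print(pa, aa)
```

Each call returns one value per timescale, so every audience member contributes a single synchrony profile per performance to the subsequent statistical models.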

Due to the similar duration of the repertoire pieces (between 120 and 140 seconds), the same wavelet scales (or Fourier periods) can be used to discuss all pieces. We restrict the range of relevant periods to ${>}$ 0.5 s, as the audience’s physical activity at timescales below this would have no musically meaningful counterpart.

The synchrony analysis is conducted in order to identify ranges of frequencies (or bands) where there are significant differences in the audience’s degree of synchrony between the performance modes. Averaging the per-subject wavelet coefficients in these bands provides a measure of synchrony in that band, which can be used further to test how the synchrony in these different bands is correlated with the audience’s subjective perception (hypothesis 3), as well as how the effect of the performance modes on the synchrony is affected by other factors such as the audience’s sight (blindfolding) and the composition of the pieces (see the explanation on the multilevel model below).

Breathing rate analysis

To further investigate A–A sync, the breathing rate of participants was extracted from the front (z-axis) of the triaxial acceleration data by using a continuous wavelet transform⁷⁰. The wavelet coefficients in the relevant scales for breathing (3–5 s) were then used to reconstruct the respiration signals⁵³, producing a time series that can be analysed with stationary methods, due to the oscillatory nature of breathing. Due to the nature of this extraction procedure, we note that there is a possibility that a part of the breathing rate signals might have resulted from body-swaying to the music.
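
For orientation, the 3–5 s band quoted above can be restated in frequency and in breaths per minute. The helper below is a simple unit conversion with a name of our choosing; it does not perform the wavelet reconstruction itself.

```python
def period_to_rate(period_s):
    """Convert an oscillation period in seconds to (frequency in Hz, cycles per minute)."""
    return 1.0 / period_s, 60.0 / period_s

for period in (3.0, 5.0):
    hz, per_minute = period_to_rate(period)
    print(f"{period:.0f} s -> {hz:.3f} Hz, {per_minute:.0f} breaths per minute")
```

The band therefore covers roughly 12 to 20 breaths per minute.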

To investigate synchrony of breathing, average pairwise PLV⁵⁶ was computed and averaged for each subject. PLV is a measure of phase synchrony between a pair of oscillatory signals, calculated using their average phase difference. To obtain the mean synchrony for a subject, their mean PLV with all other audience members is computed.
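
A minimal sketch of the PLV computation is given below, using the common definition as the magnitude of the time-averaged unit phasor of the phase difference. It assumes that instantaneous phases have already been extracted from the reconstructed breathing signals (for example with a Hilbert transform, a step not shown here); the function names and the synthetic phases are ours.

```python
import cmath
import math

def plv(phase_a, phase_b):
    """Phase-locking value: magnitude of the mean unit phasor of the phase difference."""
    if len(phase_a) != len(phase_b):
        raise ValueError("phase series must have equal length")
    phasors = [cmath.exp(1j * (a - b)) for a, b in zip(phase_a, phase_b)]
    return abs(sum(phasors) / len(phasors))

def mean_plv(phases, subject):
    """Mean PLV of one subject with every other subject in the dictionary."""
    others = [name for name in phases if name != subject]
    return sum(plv(phases[subject], phases[o]) for o in others) / len(others)

# Synthetic phases for illustration: y lags x by a constant offset, z runs at twice the rate.
x = [2.0 * math.pi * k / 100 for k in range(100)]
phases = {"x": x, "y": [p + 0.5 for p in x], "z": [2.0 * p for p in x]}

print(round(plv(phases["x"], phases["y"]), 6))  # constant lag: fully locked
print(round(plv(phases["x"], phases["z"]), 6))  # drifting phase difference: unlocked
print(round(mean_plv(phases, "x"), 6))
```

A constant phase lag yields a PLV of 1 regardless of the size of the lag, whereas a phase difference that drifts uniformly around the circle yields a PLV near 0.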

To investigate variability in breathing, ER was computed on each listener’s reconstructed breathing signals using the state space estimator⁵⁴. This measure uses a vector autoregressive model to estimate the ER of continuous signals and has been shown to be data-efficient and calibrated against other measures such as Lempel-Ziv complexity (LZc).

Music performance analysis

To quantitatively study the music, a number of techniques for the analysis of performance⁷⁷ are used on the audio data, focusing, in particular, on tempo and dynamics.

To understand the timescales of movement synchrony emerging between the performers and the audience, it is important to establish a performance-related beat of reference, which is the conducting beat to which the performers relate. This beat of musical or expressive reference is related to the tempo of the performance as well as to the time signature, and functions as the smallest unit of musical meaning for the given performance. Note that Haydn’s 3rd movement from Op. 76 No. 1 performed in this concert (marked Presto and written in 3/4) is much faster than Mozart’s 1st movement from K. 421 (marked Allegro moderato and written in 4/4). In Haydn’s piece, the performers related to one whole bar as a basic beat of musical reference, whereas in Mozart’s piece, half a bar was the essential beat of musical reference. At times, the musicians created one uninterrupted gesture made of two or four bars in the case of Haydn’s movement and one or two bars in the case of Mozart. This has been confirmed through critical listening sessions conducted independently by each quartet member and by David Dolan, a co-author of this article. This is why, when comparing mean and variability between modes of performance, we relate to beats of musical reference and gestures rather than metronomic pulses of crotchets or quavers.

Analysis of the tempo requires quantitatively measuring the duration of each bar. This was done using the open-source software Sonic Visualiser⁵⁷. From these data, the mean and standard deviation of the bar durations were computed. These durations were interpreted with respect to the time signature of the piece.
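
The tempo summary can be sketched as follows, assuming that bar onset times (in seconds) are available as a plain list, for instance after annotating the recording. The onset values are hypothetical, and the use of the sample standard deviation is our assumption, as the text does not specify which estimator was used.

```python
from statistics import mean, stdev

def bar_durations(onsets):
    """Durations between consecutive bar onsets, in seconds."""
    return [later - earlier for earlier, later in zip(onsets, onsets[1:])]

# Hypothetical bar onsets in seconds, for illustration only.
onsets = [0.0, 1.5, 3.0, 5.0, 6.5]
durations = bar_durations(onsets)

mean_duration = mean(durations)
sd_duration = stdev(durations)  # sample standard deviation (assumed estimator)
print(durations, mean_duration, sd_duration)
```

A larger standard deviation for the same piece indicates freer timing from bar to bar, which is the kind of contrast expected between Let-go and Strict renditions.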

To analyse the dynamics of the performance, first the dual-channel audio was converted to mono, by averaging the two channels, then the loudness curve was computed in loudness units (LUFS)⁷⁸ using the mzPowerCurve plugin in Sonic Visualiser⁵⁷ at a sample rate of 200, with a moving window of 480 samples.

As this study makes use of complex pieces of music, often two minutes long, some techniques for analysis of timbre, such as pitch, are difficult to interpret here. Instead, we made use of entropy as a measure of music complexity⁷⁹. Lempel-Ziv complexity and spectral entropy were computed using lziv_complexity and spectral_entropy, respectively, in the package antropy⁸⁰: the former on the loudness curve, the latter both on the loudness curve and directly on the mono WAV audio data.
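
To illustrate what spectral entropy measures, the sketch below computes the Shannon entropy of the normalised power spectrum with a direct discrete Fourier transform. It is a simplified stand-in written for clarity, not the antropy implementation used in the study; the mean removal, the choice of frequency bins and the normalisation are our assumptions.

```python
import cmath
import math

def power_spectrum(x):
    """Power at the positive-frequency DFT bins of the mean-removed signal (O(n^2) DFT)."""
    n = len(x)
    centre = sum(x) / n
    centred = [v - centre for v in x]
    spectrum = []
    for k in range(1, n // 2 + 1):
        coeff = sum(v * cmath.exp(-2j * math.pi * k * t / n) for t, v in enumerate(centred))
        spectrum.append(abs(coeff) ** 2)
    return spectrum

def spectral_entropy(x, normalize=True):
    """Shannon entropy (in bits) of the normalised power spectrum."""
    power = power_spectrum(x)
    total = sum(power)
    if total == 0.0:
        raise ValueError("signal has no spectral power after mean removal")
    probs = [p / total for p in power if p > 0.0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return entropy / math.log2(len(power)) if normalize else entropy

# Two synthetic extremes: a pure tone (one dominant bin) and an impulse (flat spectrum).
tone = [math.sin(2.0 * math.pi * 4 * t / 64) for t in range(64)]
impulse = [1.0] + [0.0] * 63
print(round(spectral_entropy(tone), 6), round(spectral_entropy(impulse), 6))
```

A pure tone concentrates its power in one bin and gives an entropy near 0, whereas a signal with a flat spectrum gives the maximum normalised value of 1; musical signals fall in between.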

Statistical tests

To study the differences in ratings, as well as in the average synchrony and temporal variability of synchrony at each period (timescale) between performances, we estimated a three-way mixed-effect multilevel model that includes the performance mode and composition as within-subject factors and blindfolding as a between-group factor with fixed effects, and subject as a random effect. We primarily focused on the main effect of the performance mode, as it directly addresses our hypotheses 1 and 2. The main effect of blindfolding and its interaction with performance mode address the impact of visual information available to the audience. The composition was included as a controlling variable, so its effects were of little interest at the stage of experimental design. Using the lme4 package⁸¹ in R statistical software, the multilevel model is expressed as

$$\begin{aligned} \texttt{DV} \sim {}&\texttt{Blindfold * Composition * Mode + (1|Subject)} \\ &\texttt{+ (1|Composition:Subject) + (1|Mode:Subject)} \end{aligned}$$

where DV represents the dependent (target) variable.

When DV is a subjective rating for each performance from audience members, the main effect of Mode in the model addresses hypothesis 1. When DV is synchrony, its temporal variability, or a measure derived from the breathing-related patterns, the main effect of Mode addresses hypothesis 2. All the fixed-effect independent variables are zero-centered before estimation⁸². Statistical significance of the variables was tested using the lmerTest package⁸³. To quantify the fit of the entire model, marginal and conditional $R^2$ values are calculated using the MuMIn package⁸⁴. Diagnostic analysis of the multilevel models was conducted using the DHARMa package⁸⁵. By checking the distribution of residuals and its variance over the predictor variables or fitted values, we have confirmed normality and homoscedasticity, with no conspicuous deviations or outliers.
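
The effect of zero-centering on an unbalanced factor can be seen in a small worked example. The sketch assumes that each binary factor is first coded as 0/1 and then shifted by its mean; the exact coding used in the study is not stated here, so this coding, like the function name, is an assumption.

```python
def zero_center(codes):
    """Subtract the mean so that the coded variable averages to zero."""
    centre = sum(codes) / len(codes)
    return [c - centre for c in codes]

# Group sizes from the text: 13 blindfolded (coded 1) and 29 sighted (coded 0).
blindfold = [1] * 13 + [0] * 29
centered = zero_center(blindfold)

print(round(centered[0], 3), round(centered[-1], 3))  # blindfolded vs sighted code
```

With 13 blindfolded and 29 sighted members, the centered codes are 29/42 and -13/42, so main effects of the other factors are evaluated at the sample average of blindfolding rather than at one of its levels.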

In the analyses of movement synchrony, to correct for multiple testing over many timescales, false discovery rate (FDR) control via the Benjamini-Hochberg procedure⁸⁶ was applied to the p-values.
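
The Benjamini-Hochberg step-up procedure can be sketched as below. The function name, the example p-values and the FDR level of 0.05 are assumptions for illustration; the level used in the study is not stated in this section.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return (BH-adjusted p-values, rejection flags), both in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone adjusted values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted, [q <= alpha for q in adjusted]

# Hypothetical p-values, one per timescale.
p_values = [0.041, 0.001, 0.2, 0.008, 0.039]
adjusted, rejected = benjamini_hochberg(p_values)
print([round(q, 5) for q in adjusted], rejected)
```

In this toy case, two p-values that fall below 0.05 before correction (0.039 and 0.041) are no longer significant after it, which is the protection needed when one test is run per timescale.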

For each subject and performance, the mean movement synchrony values in the timescales with significant Let-go < Strict difference were averaged into the beat-sync measure, those with significant Let-go > Strict difference were averaged into the music-sync measure. Similarly, synchrony variability values in the timescales with significant Let-go > Strict were averaged into the music-sync variability measure.
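
The band-averaging step can be outlined as follows. All numbers are hypothetical: the dictionaries stand for the estimated Let-go minus Strict effect and the FDR-adjusted p-value at each period, and the 0.05 threshold is an assumption.

```python
def select_band(mode_effect, q_values, direction, alpha=0.05):
    """Periods with a significant Mode effect of the given sign (+1: Let-go > Strict)."""
    return [p for p in sorted(mode_effect)
            if q_values[p] <= alpha and direction * mode_effect[p] > 0]

def band_average(sync_by_period, band):
    """Average one subject's synchrony values over the periods in a band."""
    if not band:
        raise ValueError("band contains no periods")
    return sum(sync_by_period[p] for p in band) / len(band)

# Hypothetical per-period results (period in seconds -> value).
mode_effect = {0.5: -0.02, 1.0: -0.03, 2.0: 0.001, 4.0: 0.04, 8.0: 0.05}
q_values = {0.5: 0.01, 1.0: 0.03, 2.0: 0.80, 4.0: 0.02, 8.0: 0.04}

beat_band = select_band(mode_effect, q_values, direction=-1)   # Let-go < Strict
music_band = select_band(mode_effect, q_values, direction=+1)  # Let-go > Strict

# Hypothetical synchrony profile of one subject in one performance.
sync = {0.5: 0.25, 1.0: 0.375, 2.0: 0.5, 4.0: 0.5, 8.0: 0.75}
beat_sync = band_average(sync, beat_band)
music_sync = band_average(sync, music_band)
print(beat_band, music_band, beat_sync, music_sync)
```

The same selection applied to the variability estimates would give the music-sync variability measure.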

To investigate how these measures were predictive of the subjective evaluations by the audience (hypothesis 3), multilevel models of the form rating $\sim$ sync + (1|Subject) were tested. Here, sync represents either the beat-sync, music-sync, or music-sync variability after centering-within-cluster. This analysis is equivalent to the within-subject repeated measures correlations⁸⁷, and evaluates how the within-subject variances in the sync and rating are consistently correlated over the four performances (Let-go and Strict performance modes for both pieces) or over the two performances with the different modes for each piece, separately.
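
The idea behind centering-within-cluster can be illustrated with a toy example in which each subject's mean is removed from both variables before they are correlated. This sketch is not the multilevel model itself; the function names and data are hypothetical, and the Pearson correlation of the centered values is shown only to convey the within-subject logic.

```python
import math
from collections import defaultdict

def center_within_cluster(values, clusters):
    """Subtract each cluster's (subject's) own mean from its values."""
    sums, counts = defaultdict(float), defaultdict(int)
    for v, c in zip(values, clusters):
        sums[c] += v
        counts[c] += 1
    return [v - sums[c] / counts[c] for v, c in zip(values, clusters)]

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Two hypothetical subjects with four performances each.
subject = ["S1"] * 4 + ["S2"] * 4
sync = [1.0, 2.0, 3.0, 4.0, 10.0, 11.0, 12.0, 13.0]
rating = [2.0, 3.0, 4.0, 5.0, 0.0, 1.0, 2.0, 3.0]

raw_r = pearson(sync, rating)
within_r = pearson(center_within_cluster(sync, subject),
                   center_within_cluster(rating, subject))
print(round(raw_r, 3), round(within_r, 3))
```

In this toy example the raw correlation is negative because the subject with higher synchrony gives lower ratings overall, whereas the within-subject correlation is perfectly positive; centering removes such between-subject differences so that only consistent within-subject covariation is evaluated.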

Multilevel models of the form rating $\sim$ Absorption * Mode + (1|Subject) are used to quantify the effect of absorption on performance ratings and its interaction with the performance mode.

Data availability

Data generated as part of this study are available to other investigators exclusively for the purposes of basic research exploring the relationship between ensemble music experience and physical as well as brain activities. Data are available from the corresponding author on reasonable request.

References

Castano, E., Yzerbyt, V. & Bourguignon, D. We are one and I like it: The impact of ingroup entitativity on ingroup identification. Eur. J. Soc. Psychol. 33, 735–754. https://doi.org/10.1002/ejsp.175 (2003).
Bak-Coleman, J. B. et al. Stewardship of global collective behavior. Proc. Natl. Acad. Sci. 118. https://doi.org/10.1073/pnas.2025764118 (2021).
Rennung, M. & Göritz, A. S. Prosocial consequences of interpersonal synchrony: A meta-analysis. Z. Psychol. 224, 168–189. https://doi.org/10.1027/2151-2604/a000252 (2016).
Vicaria, I. M. & Dickens, L. Meta-analyses of the intra- and interpersonal outcomes of interpersonal coordination. J. Nonverb. Behav. 40, 335–361. https://doi.org/10.1007/s10919-016-0238-8 (2016).
Mogan, R., Fischer, R. & Bulbulia, J. A. To be in synchrony or not? A meta-analysis of synchrony’s effects on behavior, perception, cognition and affect. J. Exp. Soc. Psychol. 72, 13–20. https://doi.org/10.1016/j.jesp.2017.03.009 (2017).
Nozawa, T. et al. Prior physical synchrony enhances rapport and inter-brain synchronization during subsequent educational communication. Sci. Rep. 9, 12747. https://doi.org/10.1038/s41598-019-49257-z (2019).
Konvalinka, I. et al. Synchronized arousal between performers and related spectators in a fire-walking ritual. Proc. Natl. Acad. Sci. 108, 8514–8519. https://doi.org/10.1073/pnas.1016955108 (2011).
Kettner, H. et al. Psychedelic communitas: Intersubjective experience during psychedelic group sessions predicts enduring changes in psychological wellbeing and social connectedness. Front. Pharmacol. 234. https://doi.org/10.3389/fphar.2021.623985 (2021).
Higo, N. et al. Interpersonal similarity between body movements in face-to-face communication in daily life. PLoS One 9, e102019. https://doi.org/10.1371/journal.pone.0102019 (2014).
Varni, G., Volpe, G. & Camurri, A. A system for real-time multimodal analysis of nonverbal affective social interaction in user-centric media. IEEE Trans. Multimed. 12, 576–590. https://doi.org/10.1109/tmm.2010.2052592 (2010).
Trehub, S. E., Becker, J. & Morley, I. Cross-cultural perspectives on music and musicality. Philos. Trans. R. Soc. B Biol. Sci. 370, 20140096–20140096. https://doi.org/10.1098/rstb.2014.0096 (2015).
Mehr, S. A., Singh, M., York, H., Glowacki, L. & Krasnow, M. M. Form and function in human song. Curr. Biol. 28, 356–368.e5. https://doi.org/10.1016/j.cub.2017.12.042 (2018).
Clarke, E., DeNora, T. & Vuoskoski, J. Music, empathy and cultural understanding. Phys. Life Rev. 15, 61–88. https://doi.org/10.1016/j.plrev.2015.09.001 (2015).
Laird, L. Empathy in the classroom. Music Educ. J. 101, 56–61. https://doi.org/10.1177/0027432115572230 (2015).
Cho, E. The relationship between small music ensemble experience and empathy skill: A survey study. Psychol. Music. 030573561988722. https://doi.org/10.1177/0305735619887226 (2019).
Keller, P. E. Joint action in music performance. In Enacting Intersubjectivity: A Cognitive and Social Perspective on the Study of Interactions (Morganti, F., Carassa, A. & Riva, G. eds.). Vol. 17 (IOS Press, 2008).
Chang, A., Livingstone, S. R., Bosnyak, D. J. & Trainor, L. J. Body sway reflects leadership in joint music performance. Proc. Natl. Acad. Sci. U S A 114, E4134–E4141. https://doi.org/10.1073/pnas.1617657114 (2017).
Walton, A. E., Richardson, M. J., Langland-Hassan, P. & Chemero, A. Improvisation and the self-organization of multiple musical bodies. Front. Psychol. 06. https://doi.org/10.3389/fpsyg.2015.00313 (2015).
Noy, L., Levit-Binun, N. & Golland, Y. Being in the zone: physiological markers of togetherness in joint improvisation. Front. Hum. Neurosci. 9. https://doi.org/10.3389/fnhum.2015.00187 (2015).
Csikszentmihalyi, M. Flow: The Psychology of Optimal Experience (Harper and Row, 1990).
Shehata, M. et al. Team flow is a unique brain state associated with enhanced information integration and neural synchrony. bioRxiv (Cold Spring Harbor Laboratory). https://doi.org/10.1101/2020.06.17.157990 (2020).
Brand, G., Sloboda, J., Saul, B. & Hathaway, M. The reciprocal relationship between jazz musicians and audiences in live performances: A pilot qualitative study. Psychol. Music 40, 634–651. https://doi.org/10.1177/0305735612448509 (2012).
Toelle, J. & Sloboda, J. A. The audience as artist? The audience’s experience of participatory music. Musicae Sci. 25, 102986491984480. https://doi.org/10.1177/1029864919844804 (2019).
Jansen, E. Complexity and musical improvisation. Music Sci. 1, 205920431877980. https://doi.org/10.1177/2059204318779807 (2018).
Nettl, B. Thoughts on improvisation: A comparative approach. Music. Q. LX, 1–19. https://doi.org/10.1093/mq/lx.1.1 (1974).
Matare, J. Creativity or musical intelligence? A comparative study of improvisation performance by European and African musicians. Think. Skills Creativ. 4, 194–203. https://doi.org/10.1016/j.tsc.2009.09.005 (2009).
Creech, A. et al. Investigating musical performance: Commonality and diversity among classical and non-classical musicians. Music Educ. Res. 10, 215–234. https://doi.org/10.1080/14613800802079080 (2008).
Metricimpro.eu. Metric—Modernizing European Higher Music Education Through Improvisation.
Dolan, D., Sloboda, J., Jensen, H. J., Crüts, B. & Feygelson, E. The improvisatory approach to classical music performance: An empirical investigation into its characteristics and impact. Music Perform. Res. 6 (2013).
Dolan, D. et al. The improvisational state of mind: A multidisciplinary study of an improvisatory approach to classical music repertoire performance. Front. Psychol. 9, 21. https://doi.org/10.3389/fpsyg.2018.01341 (2018).
Cook, N. Beyond the Score: Music as Performance (Oxford University Press, 2013).
Leech-Wilkinson, D. Challenging performance: Classical music performance norms and how to escape them. Version 2.2.1 (6.ix.24). https://challengingperformance.com/the-book/ (2020).
Volpe, G., D’Ausilio, A., Badino, L., Camurri, A. & Fadiga, L. Measuring social interaction in music ensembles. Philos. Trans. R. Soc. B Biol. Sci. 371, 20150377. https://doi.org/10.1098/rstb.2015.0377 (2016).
Chang, A., Kragness, H. E., Livingstone, S. R., Bosnyak, D. J. & Trainor, L. J. Body sway reflects joint emotional expression in music ensemble performance. Sci. Rep. 9. https://doi.org/10.1038/s41598-018-36358-4 (2019).
Bernardi, N. F. et al. Increase in synchronization of autonomic rhythms between individuals when listening to music. Front. Physiol. 8, 785. https://doi.org/10.3389/fphys.2017.00785 (2017).
Ardizzi, M., Calbi, M., Tavaglione, S., Umiltà, M. A. & Gallese, V. Audience spontaneous entrainment during the collective enjoyment of live performances: Physiological and behavioral measurements. Sci. Rep. 10, 3813. https://doi.org/10.1038/s41598-020-60832-7 (2020).
Czepiel, A. et al. Synchrony in the periphery: inter-subject correlation of physiological responses during live music concerts. Sci. Rep. 11. https://doi.org/10.1038/s41598-021-00492-3 (2021).
Tschacher, W. et al. Physiological synchrony in audiences of live concerts. Psychol. Aesth. Creativ. Arts. https://doi.org/10.1037/aca0000431 (2021).
Madsen, J. & Parra, L. C. Cognitive processing of a common stimulus synchronizes brains, hearts, and eyes. PNAS Nexus 1, pgac020. https://doi.org/10.1093/pnasnexus/pgac020 (2022).
Burger, B., Thompson, M. R., Luck, G., Saarikallio, S. & Toiviainen, P. Influences of rhythm- and timbre-related musical features on characteristics of music-induced movement. Front. Psychol. 4, 183. https://doi.org/10.3389/fpsyg.2013.00183 (2013).
Ellamil, M., Berson, J., Wong, J., Buckley, L. & Margulies, D. S. One in the dance: Musical correlates of group synchrony in a real-world club environment. PLoS One 11, e0164783. https://doi.org/10.1371/journal.pone.0164783 (2016).
Vieillard, S. et al. Happy, sad, scary and peaceful musical excerpts for research on emotions. Cognit. Emot. 22, 720–752. https://doi.org/10.1080/02699930701503567 (2008).
Donnay, G. F., Rankin, S. K., Lopez-Gonzalez, M., Jiradejvong, P. & Limb, C. J. Neural substrates of interactive musical improvisation: An fMRI study of “trading fours” in jazz. PLoS ONE 9, e88665. https://doi.org/10.1371/journal.pone.0088665 (2014).
Rhodes, L. A., David, D. C. & Combs, A. L. Absorption and enjoyment of music. Percept. Motor Skills 66, 737–738. https://doi.org/10.2466/pms.1988.66.3.737 (1988).
Haijen, E. C. H. M. et al. Predicting responses to psychedelics: A prospective study. Front. Pharmacol. 9. https://doi.org/10.3389/fphar.2018.00897 (2018).
Grinsted, A., Moore, J. & Jevrejeva, S. Application of the cross wavelet transform and wavelet coherence to geophysical time series. Nonlinear Process. Geophys. 11, 561–566. https://doi.org/10.5194/npg-11-561-2004 (2004).
Schmidt, R., Nie, L., Franco, A. & Richardson, M. J. Bodily synchronization underlying joke telling. Front. Hum. Neurosci. 8, 633. https://doi.org/10.3389/fnhum.2014.00633 (2014).
Issartel, J., Bardainne, T., Gaillot, P. & Marin, L. The relevance of the cross-wavelet transform in the analysis of human interaction – A tutorial. Front. Psychol. 5, 1566. https://doi.org/10.3389/fpsyg.2014.01566 (2015).
Fujiwara, K. & Daibo, I. Evaluating interpersonal synchrony: Wavelet transform toward an unstructured conversation. Front. Psychol. 7, 516. https://doi.org/10.3389/fpsyg.2016.00516 (2016).
Wiltshire, T. J., Steffensen, S. V. & Fiore, S. M. Multiscale movement coordination dynamics in collaborative team problem solving. Appl. Ergon. 79, 143–151. https://doi.org/10.1016/j.apergo.2018.07.007 (2019).
Schirmer, A., Lo, C. & Wijaya, M. When the music’s no good: Rhythms prompt interactional synchrony but impair affective communication outcomes. Commun. Res. 50, 30–52. https://doi.org/10.1177/00936502211015900 (2023).
Godøy, R. I. & Leman, M. Musical Gestures (Routledge, 2010).
Bernardi, L., Porta, C. & Sleight, P. Cardiovascular, cerebrovascular, and respiratory changes induced by different types of music in musicians and non-musicians: the importance of silence. Heart 92, 445–452. https://doi.org/10.1136/hrt.2005.064600 (2005).
Mediano, P. A. et al. Spectrally and temporally resolved estimation of neural signal diversity. bioRxiv 2023-03. https://doi.org/10.1101/2023.03.30.534922 (2023).
Krumhansl, C. L. An exploratory study of musical emotions and psychophysiology. Can. J. Exp. Psychol./Rev. Can. Psychol. Exp. 51, 336. https://doi.org/10.1037/1196-1961.51.4.336 (1997).
Aydore, S., Pantazis, D. & Leahy, R. M. A note on the phase locking value and its properties. Neuroimage 74, 231–244. https://doi.org/10.1016/j.neuroimage.2013.02.008 (2013).
Cannam, C., Landone, C. & Sandler, M. Sonic visualiser. In Proceedings of the 18th ACM International Conference on Multimedia. https://doi.org/10.1145/1873951.1874248 (2010).
Høffding, S. A Phenomenology of Musical Absorption (Palgrave Macmillan, Cham, 2018).
Hancock, F. et al. Metastability demystified—The foundational past, the pragmatic present, and the potential future. Preprints. https://doi.org/10.20944/preprints202307.1445.v1 (2023).
Mayo, O. & Gordon, I. In and out of synchrony – Behavioral and physiological dynamics of dyadic interpersonal coordination. Psychophysiology 57, e13574. https://doi.org/10.1111/psyp.13574 (2020).
Gold, B. P., Pearce, M. T., Mas-Herrero, E., Dagher, A. & Zatorre, R. J. Predictability and uncertainty in the pleasure of music: A reward for learning?. J. Neurosci. 39, 9397–9409. https://doi.org/10.1523/JNEUROSCI.0428-19.2019 (2019).
Stupacher, J., Matthews, T. E., Pando-Naude, V., Foster Vander Elst, O. & Vuust, P. The sweet spot between predictability and surprise: Musical groove in brain, body, and social interactions. Front. Psychol. 13, 906190. https://doi.org/10.3389/fpsyg.2022.906190 (2022).
Vuust, P., Heggli, O. A., Friston, K. J. & Kringelbach, M. L. Music in the brain. Nat. Rev. Neurosci. 23, 287–305. https://doi.org/10.1038/s41583-022-00578-5 (2022).
Pressing, J. The micro- and macrostructural design of improvised music. Music Percept. 5, 133–172. https://doi.org/10.2307/40285390 (1987).
Levin, R. Improvizing Mozart. Bull. Am. Acad. Arts Sci. 55, 87–90 (2002).
Lerdahl, F. & Jackendoff, R. S. A Generative Theory of Tonal Music (Reissue, with a New Preface) (MIT Press, 1996).
Tsay, C.-J. Sight over sound in the judgment of music performance. Proc. Natl. Acad. Sci. 110, 14580–14585. https://doi.org/10.1073/pnas.1221454110 (2013).
Matz, S. C., Hyon, R., Baek, E. C., Parkinson, C. & Cerf, M. Personality similarity predicts synchronous neural responses in fMRI and EEG data. Sci. Rep. 12, 14325. https://doi.org/10.1038/s41598-022-18237-1 (2022).
Bizzego, A. et al. Strangers, friends, and lovers show different physiological synchrony in different emotional states. Behav. Sci. 10, 11. https://doi.org/10.3390/bs10010011 (2020).
Phan, D., Bonnet, S., Guillemaud, R., Castelli, E. & Thi, N. P. Estimation of respiratory waveform and heart rate using an accelerometer. In 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 4916–4919. https://doi.org/10.1109/IEMBS.2008.4650316 (IEEE, 2008).
Dolan, D. “Interpretation Through Improvisation” Course at the Guildhall School of Music and Drama. https://www.gsmd.ac.uk/study-with-guildhall/music/performance-and-collaboration/centre-for-creative-performance-classical. Accessed 01 June 2024.
Nozawa, T., Uchiyama, M., Honda, K., Nakano, T. & Miyake, Y. Speech discrimination in real-world group communication using audio-motion multimodal sensing. Sensors 20, 2948. https://doi.org/10.3390/s20102948 (2020).
Tellegen, A. & Atkinson, G. Openness to absorbing and self-altering experiences (“absorption”), a trait related to hypnotic susceptibility. J. Abnorm. Psychol. 83, 268–277. https://doi.org/10.1037/h0036681 (1974).
Stekhoven, D. J. & Bühlmann, P. MissForest—Non-parametric missing value imputation for mixed-type data. Bioinformatics 28, 112–118. https://doi.org/10.1093/bioinformatics/btr597 (2012).
Torrence, C. & Compo, G. P. A practical guide to wavelet analysis. Bull. Am. Meteorol. Soc. 79, 61–78. https://doi.org/10.1175/1520-0477(1998)079%3C0061:APGTWA%3E2.0.CO;2 (1998).
Grinsted, A. Cross wavelet and wavelet coherence toolbox. http://grinsted.github.io/wavelet-coherence/. Accessed 01 June 2024.
Bowen, J. A. Tempo, duration, and flexibility: Techniques in the analysis of performance. J. Musicol. Res. 16, 111–156. https://doi.org/10.1080/01411899608574728 (1996).
International Telecommunication Union (ITU). Recommendation ITU-R BS.1770-5: Algorithms to measure audio programme loudness and true-peak audio level. https://www.itu.int/rec/R-REC-BS.1770/en (2023). Accessed 01 June 2024.
Margulis, E. H. & Beatty, A. P. Musical style, psychoaesthetics, and prospects for entropy as an analytic tool. Comput. Music J. 32, 64–78. https://doi.org/10.1162/comj.2008.32.4.64 (2008).
Vallat, R. Antropy: Entropy and complexity of (EEG) time-series in Python. https://github.com/raphaelvallat/antropy. Accessed 01 June 2024.
Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48. https://doi.org/10.18637/jss.v067.i01 (2015).
Yaremych, H. E., Preacher, K. J. & Hedeker, D. Centering categorical predictors in multilevel models: Best practices and interpretation. Psychol. Methods 28, 613–630. https://doi.org/10.1037/met0000434 (2023).
Kuznetsova, A., Brockhoff, P. B. & Christensen, R. H. B. lmerTest package: Tests in linear mixed effects models. J. Stat. Softw. 82. https://doi.org/10.18637/jss.v082.i13 (2017).
Bartoń, K. MuMIn: Multi-Model Inference (R Package version 1.47.5). https://CRAN.R-project.org/package=MuMIn. Accessed 01 June 2024.
Hartig, F. DHARMa: Residual Diagnostics for Hierarchical (Multi-Level / Mixed) Regression Models (R package version 0.4.6). https://CRAN.R-project.org/package=DHARMa (2022). Accessed 01 June 2024.
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B (Methodological) 57, 289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x (1995).
Bakdash, J. Z. & Marusich, L. R. Repeated measures correlation. Front. Psychol. 8, 456. https://doi.org/10.3389/fpsyg.2017.00456 (2017).

Acknowledgements

We thank the Guildhall School of Music and Drama for allowing us to use their facilities and for the support of their IT team. This work was partially supported by the Center of Innovation Program (Grant Number JPMJCE1309) from JST, Japan, and also by KAKENHI (Grant Numbers JP17H01753, JP20H03553, and JP21K19787) from JSPS/MEXT, Japan. M.S. acknowledges a scholarship by Splunk.

Author information

Authors and Affiliations

Tokyo Institute of Technology, Tokyo, Japan
Takayuki Nozawa, Keigo Honda, Shunnichi Amano, Yoshihiro Miyake & Henrik J. Jensen
University of Toyama, Toyama, Japan
Takayuki Nozawa
Imperial College London, London, UK
Madalina I. Sas, Hardik Rajpal, Fernando E. Rosas, Christopher Timmermann, Pedro A. M. Mediano & Henrik J. Jensen
Guildhall School of Music and Drama, London, UK
David Dolan
University of Sussex, Brighton, UK
Fernando E. Rosas

Corresponding authors

Correspondence to Takayuki Nozawa, Madalina I. Sas or Pedro A. M. Mediano.

Ethics declarations

Ethical approval

This study was approved by the Human Subjects Research Ethics Review Committee, Tokyo Institute of Technology (Approval No. 2019101) and ethical approval was also confirmed by the Imperial College Research Ethics Committee. The study was conducted according to the Declaration of Helsinki. Written informed consent was obtained from all participants.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Nozawa, T., Sas, M.I., Dolan, D. et al. Multiscale synchronisation dynamics reveals the impact of an improvisatory approach to performance on music experience. Sci Rep 15, 10097 (2025). https://doi.org/10.1038/s41598-025-90271-1

Received: 14 November 2023
Accepted: 11 February 2025
Published: 24 March 2025
Version of record: 24 March 2025
DOI: https://doi.org/10.1038/s41598-025-90271-1
