Introduction

The integration of positive psychology (PP) into second language (L2) education has prompted a significant shift in research focus (Derakhshan, 2022). Previously, the majority of scholarly attention was on cognitive aspects, such as language learning strategies (Pawlak and Kiermasz, 2018). However, the emphasis has broadened to include the three pillars of PP: positive institutions, positive characteristics, and positive emotions (Seligman, 2011). These pillars are interconnected: positive institutions (e.g., work environment) provide the structural foundation that nurtures positive characteristics (e.g., resilience, self-efficacy) in individuals, which in turn foster positive emotions (e.g., enjoyment, pride) (Seligman, 2011). Conversely, inadequate institutional support may hinder the development of positive characteristics and trigger negative emotions (e.g., anxiety, anger), highlighting the need to systematically measure institutional support to understand its cascading effects on emotions. Initially, the focus was on the positive characteristics and emotional experience of both L2 teachers and L2 learners (Derakhshan et al., 2023; Wang et al., 2022; Zhao et al., 2024; Zhao and Wang, 2024). As PP continues to be incorporated into L2 education, research has expanded to explore positive institutions in recent years (Greenier et al., 2023; Wu and Zeng, 2024; Zeng et al., 2024), which, in the context of L2 teaching, relates to the idea of supportive work environment (SWE).

The concept of SWE initially emerged in organizational psychology (Bosma et al., 2020) and refers to an environment where employees feel valued, respected, and supported in their roles (Ángeles López-Cabarcos et al., 2022). In educational settings, SWE specifically encompasses structural and interpersonal resources that enable teachers to perform their roles effectively (Wu and Zeng, 2024). Subsequently, similar concepts such as school climate (Oder and Eisenschmidt, 2018) and school working conditions (Toropova et al., 2021) were introduced into the field of teacher education. Consequently, researchers in the field of L2 education typically use two methods to measure SWE. The first method involves using scales that assess employees’ perceived SWE (e.g., Wu and Zeng, 2024). The second method employs scales designed specifically for teachers in a general sense (Greenier et al. 2023). Despite the use of various scales, it is certain that SWE is a multidimensional concept as noted by Wang and Degol (2016).

L2 teachers face unique workplace challenges that set their SWE needs apart from those in other fields. The inherent complexity of L2 teaching makes them more susceptible to job-related stressors. For non-native teachers, these challenges are amplified by L2 anxiety and low self-confidence. Ershadi et al. (2024) found that non-native teachers often feel marginalized, view policymakers as sources of inequality, and struggle with pedagogical inefficacy. The nature of L2 teaching, along with the delayed outcomes of L2 learning (Sudina and Plonsky, 2021), increases teachers’ risk of frustration, burnout, and turnover (King et al., 2024). Occupational stress is a well-documented threat to L2 teachers’ psychological well-being and professional growth (Mercer, 2023). MacIntyre et al. (2019) highlight the vulnerability of L2 teachers to stress, particularly from unstable work conditions, job insecurity, and identity crises. Hence, Hiver and Dörnyei (2017) describe L2 teaching as a profession in crisis, overwhelmed by various pressures. Despite the uniqueness, existing scales, which are validated for teachers in a general context, may therefore inadequately capture these nuances, underscoring the need for a tailored measurement tool.

Besides, through a detailed examination of scales used in general education (e.g., Burden and Fraser, 1994; Dang et al., 2024; Johnson et al., 2007) and those applied in L2 research (e.g., Greenier et al., 2023; Wu and Zeng, 2024; Zeng et al., 2024), we identified several limitations. First, the psychometric properties of these scales, such as convergent and divergent validity, have not been thoroughly evaluated. Second, many of these scales have an excessive number of items or factors, resulting in lengthy questionnaires that could negatively impact completion rates (Schoeni et al., 2013). To address these issues and meet the growing demands of L2 research, this study first aims to develop a streamlined SWE scale that maintains robust psychometric properties, offering a more effective tool for future L2 research. Furthermore, given the significance of SWE in influencing teachers’ mental health, we also examined the links between specific SWE dimensions and L2 teachers’ emotional experiences—defined as teachers’ subjective affective states during work, including positive emotions (e.g., enjoyment, pride) and negative emotions (e.g., anxiety, anger) that arise from job-related interactions and tasks (Frenzel, 2014).

Literature review

Supportive work environment in education: concept and dimensions

The construct of SWE emerged from organizational psychology in the early 20th century, with its significance gaining recognition as industrialization advanced. During this period, scholars increasingly noted that the work setting exerts a profound influence on employees’ physical and mental well-being, as well as their work efficiency (Zhong and House, 2012). This understanding was further solidified by the Hawthorne studies (Hassard, 2012), which highlighted the pivotal role of the work environment in shaping employees’ behaviors and attitudes. Conceptually, SWE refers to a collection of organizational cultures and management practices designed to foster a positive, healthy, and safe working environment, ultimately promoting employees’ well-being and productivity (Ángeles López-Cabarcos et al., 2022). Among early conceptualizations, Broad and Newstrom (1992) identified four core components of SWE: perceived organizational support, perceived climate, supervisory relationship, and peer group interaction, each capturing distinct aspects of how support operates within organizational structures.

The introduction of SWE into educational contexts holds significant importance, as it directly enriches teachers’ professional experiences, fosters their growth, and elevates overall educational quality (Wu and Zeng, 2024). However, the academic community remains divided regarding both the terminology used to describe this construct and its constituent components. In terms of terminology, some scholars adopt labels rooted in organizational psychology, such as “SWE” (Zeng et al., 2024), while others use terms like “school climate” (Ryberg et al., 2020) or “working conditions” (Toropova et al., 2021). We opted for “SWE” because “school climate” typically encompasses perceptions from both teachers and students, diluting focus on the workplace experience specific to educators, and “working conditions” fails to emphasize the positive, supportive attributes central to our inquiry.

Regarding the components of SWE, scholarly perspectives vary widely. Zeng et al. (2024) strictly adhere to the four-dimensional framework proposed by Broad and Newstrom (1992), while Masoom (2021) categorizes it to three components: organizational encouragement, supervisory encouragement, and workgroup support. Toropova et al. (2021) expand the construct further, identifying five distinct aspects: student discipline, leadership support, school resources, teacher cooperation, and teacher workload. From another angle, Yada and Savolainen (2023) measure SWE across five dimensions: affiliation, innovation, participatory decision-making, resource adequacy, and student support. Given our goal of developing a streamlined SWE scale with fewer factors and items, Masoom’s (2021) three-dimensional model was selected for its balance of comprehensiveness and conciseness, making it the most suitable foundation for our study.

To make this framework more applicable to school contexts—where organizational psychology terms, often derived from corporate settings, may not fully capture the unique dynamics of educational environments—we adjusted the dimension labels while maintaining the essence of Masoom’s (2021) model (see Fig. 1). Specifically, “organizational encouragement” was redefined as “school support” to better reflect the educational setting, aligning with terms like “school resources” (Toropova et al., 2021) and “resource adequacy” (Yada and Savolainen, 2023), which emphasize institutional support in schools. “Supervisory encouragement” was renamed “supervisor support” to broaden its scope, acknowledging that school supervisors (e.g., principals, department heads) play roles beyond mere “encouragement”, including instructional leadership, administrative oversight, and fostering professional growth (Harris and Jones, 2023). Lastly, “workgroup support” was changed to “colleague support” to clarify that it encompasses both teaching and non-teaching staff (e.g., administrative personnel) who collaborate with teachers in their daily activities, reflecting the inclusive nature of the “workgroup” concept in Masoom’s (2021) model and echoing Toropova et al.’s (2021) emphasis on “teacher cooperation” as a key aspect of SWE in schools.

Fig. 1
figure 1

Three core components of SWE.

Benefits of supportive work environment to L2 teachers

Previous research in general education has extensively documented the benefits of SWE for teachers, including enhanced job satisfaction (Toropova et al., 2021), increased self-efficacy (Jang et al., 2023), improved well-being (Dreer, 2024), higher retention rates (Li and Yao, 2022), and ultimately, better academic outcomes for students (Wartenberg et al., 2023). Studies specifically focusing on L2 teachers have emerged more recently (Zeng et al., 2024), driven by the growing influence of PP in L2 research (Wang et al., 2021). Traditionally, L2 research emphasized the inhibitory effects of negative factors (e.g., negative emotions) on students’ language learning (Zhao and Danping, 2024), but MacIntyre and Mercer (2014) highlighted that L2 learning requires sustained effort, motivation, resilience, and external support—concepts aligned with PP’s focus on positive emotions, characteristics, and institutions. As PP further integrates into L2 research, attention to L2 teachers has grown, with researchers emphasizing that SWE directly impacts their capacity for continuous learning and adaptation to evolving L2 teaching methodologies (Greenier et al., 2023; Ma and Wang, 2024; Wu and Zeng, 2024). Specifically, SWE provides necessary resources for professional development and facilitates cultural exchange through shared diverse perspectives, thereby strengthening teachers’ instructional readiness.

This instructional readiness is particularly vital for cultivating creativity, as L2 classrooms uniquely foster student creativity by exposing learners to diverse cultural perspectives, sparking curiosity, imagination, and openness—key components of creativity (Wu and Zeng, 2025). Given teachers’ centrality in this process, their mental health, especially emotional experiences, critically affects their creativity cultivation capacity (Su et al., 2024). Research confirms SWE significantly shapes these emotions: Adequate SWE fosters positive emotions like enthusiasm (Greenier et al., 2023), hope, and pride (Wu and Zeng, 2024), enhancing L2 teachers’ ability to cultivate students’ creativity. Conversely, insufficient SWE triggers negative emotions such as anxiety and anger, hindering these outcomes. Wu and Zeng (2024) explain this SWE-emotion link using Pekrun and Perry’s (2014) Control-Value Theory (CVT), which posits emotions stem from appraisals of control over and the subjective value of meaningful tasks. While control-value appraisals are central, CVT recognizes distal antecedents like environmental antecedents like SWE, which influence emotions through these appraisals. However, treating SWE as a single construct by Wu and Zeng (2024) leaves unexplored the specific links between different SWE dimensions and various teacher emotions, limiting targeted interventions from an SWE perspective.

Limitations in measuring supportive work environment in education

Despite the vital role of the SWE in promoting positive teacher outcomes, assessing this construct remains challenging. The existing literature identifies two primary methods for evaluating SWE. The first involves aggregating items from various studies to define its components. For instance, Zeng et al. (2024) drew from Tripathi and Kalia (2022) to cover areas like perceived organizational support, perceived climate, etc. Similarly, Wu and Zeng (2024) integrated these elements to assess SWE. However, this approach raises psychometric concerns: Zeng et al. (2024) did not provide model fit statistics for their aggregated scale, while Wu and Zeng (2024) presented only overall model fit indices, lacking essential details such as subscale reliability and evidence of convergent and discriminant validity. The absence of these validation steps reduces the scale’s effectiveness in capturing the specific nuances of SWE, potentially distorting its relationship with teacher outcomes.

The second method relies on the School-Level Environment Questionnaire (SLEQ) (Burden and Fraser, 1994) and its revised versions (Dang et al., 2024; Johnson et al., 2007). However, the psychometric suitability of both the original and modified versions of SLEQ has been questioned. While the original SLEQ, consisting of 56 items across eight factors, has been widely used, its developers did not conduct psychometric testing such as exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) (Burden and Fraser, 1994). Johnson and Stevens (2001), after analyzing a sample of U.S. elementary school teachers, removed three factors due to low item loadings. However, the psychometric properties of their revised 35-item, five-factor model remain uncertain, as the CFA model only met fit criteria after around 30 times for residual adjustments, indicating possible overfitting.

In a subsequent revision, Johnson et al. (2007) removed 14 items, resulting in a five-factor model that showed good internal consistency among U.S. elementary and secondary school teachers. However, questions remain regarding the model fit, as important fit indices, such as chi-square to degrees of freedom ratio, Tucker-Lewis Index (TLI), and Standardized Root Mean Square Residual (SRMR), were not reported; only Comparative Fit Index (CFI) and Root Mean Square Error of Approximation (RMSEA) were provided. More recently, Dang et al. (2024) revalidated the original SLEQ with high school teachers in Vietnam, adding a “principal leadership” factor (similar to “supervisor support”), suggesting that the internal structure of SLEQ may differ across cultural contexts. Their study confirmed that the revised nine-factor model demonstrated strong reliability and validity. However, this model includes 38 items, which may be too lengthy for studies focusing on multiple constructs. Overall, the evolution of SLEQ demonstrates its structural instability and cultural sensitivity, limiting its generalizability across different educational and national contexts. Moreover, none of these versions have been validated in China or among university teachers, raising doubts about the relevance of existing SLEQ versions for L2 teaching in China.

These limitations—particularly the contextual reliance of SLEQ and the lack of thorough validation for ad-hoc scales—diminish the rigor and reliability of SWE research in L2 education. Consequently, this study aims to create the Supportive Work Environment Scale (SWES) assess its psychometric properties specifically among L2 teachers (see Fig. 2). Additionally, to inform emotion-focused interventions for improving L2 teachers’ mental health, this research also explores the relationship between SWE dimensions and their emotional experiences (i.e., enjoyment, hope, anxiety, and anger). The following research questions guide this study:

Fig. 2
figure 2

Steps for the development and validation of the SWES.

RQ1: What are the psychometric properties of the SWES?

RQ2: How do specific SWE factors relate to L2 teachers’ emotional experiences?

Methodology

Participants

The study recruited 753 L2 teachers from educational institutions across 11 provinces in China (Zhejiang, Hunan, Guangdong, Jiangxi, Jilin, Liaoning, Shandong, Sichuan, Guizhou, Henan, and Anhui), covering primary to tertiary education. To avoid overfitting and ensure the generalizability of the model (Thompson, 2004), participants were divided into two independent samples collected at different stages.

Sample 1, consisting of 351 participants, was initially recruited, and a total of 378 responses were collected. However, 27 responses were excluded due to either uniform selection of the same option (e.g., all choosing “Neutral”) or incomplete submissions, resulting in a response rate of 92.9% (351/378). This sample completed the original 26-item SWES for EFA. Following the EFA, the scale was refined to 20 items.

Sample 2, consisting of 402 participants, was recruited afterward, with 418 responses collected initially. Out of these, 16 responses were excluded based on the same criteria (uniform responses or incompletion), yielding a response rate of 96.2% (402/418). This group completed both the refined 20-item SWES and measures of L2 teachers’ emotional experiences, which were used for CFA and other psychometric evaluations.

The two-phase data collection approach was implemented to minimize participant burden, as collecting all data in one batch would require completing a large number of items (including both the original SWES and emotional experience items). Splitting the collection into two stages helped reduce the response burden.

Adequate sample size is essential for the validity of factor analysis, and de Vet et al. (2011) recommend a participant-to-item ratio of at least 10:1. Given that the initial SWES comprised 26 items, Sample 1 with 351 participants met the required ratio. For the refined SWES with 20 items, Sample 2 consisting of 402 participants also satisfied the recommended ratio. Data were collected via the Wenjuanxing online platform, with all questionnaires administered in Chinese. The detailed demographic characteristics are presented in Table 1.

Table 1 Demographic characteristics of two samples.

Ethical guidelines were strictly followed: participants were informed of the research purpose, assured of data confidentiality, and granted the right to withdraw at any time. The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of Hunan Normal University.

Instruments

The supportive work environment scale

In developing the SWES, strict adherence to the scale development principles outlined by DeVellis (2016) was maintained. The initial phase involved an extensive review of existing literature focusing on the three core components of SWE: school support, supervisor support, and colleague support. This review specifically targeted scales measuring related constructs from key sources:

  1. 1.

    Wu and Zeng (2024): Items related to our dimensions were identified within their scales for perceived organizational support (8 items), supervisory relationship (7 items), and peer group interaction (8 items).

  2. 2.

    Johnson et al. (2007): Items were primarily drawn from their dimensions of “collaboration” (6 items) and “school resources” (4 items).

  3. 3.

    Dang et al. (2024): As the full item set was unavailable, we referenced the single example items provided in their paper for each of the dimensions: affiliation, resource adequacy, and principal leadership (1 item per dimension).

  4. 4.

    Masoom (2021): Items were selected from the dimensions of organizational encouragement (5 items), supervisory encouragement (4 items), and workgroup support (6 items).

Given the conceptual overlap in the dimensions across sources, items were synthesized and modified. Specifically, the subject of items was consistently adapted to refer to “the school”, “the supervisor”, or “the colleague” to align with our target dimensions. This process generated an initial item pool of 31 items.

Following the foundational methodological guidance of Lawshe (1975) and McKenzie et al. (1999), which emphasizes the importance of involving at least five experts to ensure content validity, a rigorous evaluation of the scale’s clarity was conducted. This assessment was carried out by a distinguished panel of six experts from diverse backgrounds, including a professor specializing in psycholinguistics, another in teacher education, a psychometrics specialist, and three experienced L2 teachers from primary, middle, and high schools.

The expert feedback was systematically collected and analyzed, resulting in a comprehensive review of the scale items. Based on the consensus of the expert panel, five items were deemed unsuitable for inclusion and were subsequently excluded, while other items were refined according to the experts’ recommendations. Consequently, a revised draft of the scale was developed, comprising 26 items. Participants were asked to respond in Chinese. These items are presented on a 5-point Likert scale, ranging from 1 (strongly disagree) to 5 (strongly agree). The Chinese version can be seen in our uploaded supplementary material.

The achievement emotions questionnaire–teachers

The Achievement Emotions Questionnaire–Teachers (AEQ-T) by Hong et al. (2016) was used to assess participants’ emotional experiences. It includes four subscales: enjoyment (4 items), pride (4 items), anxiety (4 items), and anger (3 items), totaling 15 items. Each subscale operates independently and uses a 4-point Likert scale (1 = “strongly disagree”, 4 = “strongly agree”). Since only the English version was available, a translation and back-translation procedure was employed. The AEQ-T was translated from English to Chinese and then back to English by three bilingual researchers and two experts. Participants responded in Chinese. The Chinese AEQ-T in this study demonstrated strong reliability and validity, with Cronbach’s α of 0.747 for enjoyment, 0.852 for pride, 0.851 for anxiety, and 0.724 for anger (Viladrich et al., 2017). The construct validity indices were: Chi-square per degree of freedom (χ2/df) = 2.578, CFI = 0.983, TLI = 0.970, RMSEA = 0.063, and SRMR = 0.035 (Hu and Bentler, 1999). The detailed item content in both English and Chinese is provided in the supplementary material we uploaded.

Data analysis

Data from Sample 1 underwent analysis using SPSS 26.0. Initial Pearson correlation analysis was conducted to determine correlations between each item and the total score of the SWES. Items with adjusted correlation coefficients below 0.3 were excluded from further analysis. The reliability was assessed through Cronbach’s α, with items removed if their exclusion notably increased α. EFA prerequisites included a KMO measure exceeding 0.60 and Bartlett’s test of sphericity significant at p < 0.05 (Carpenter, 2018). Promax oblique rotation was employed for factor extraction, retaining components with eigenvalues greater than 1 (Worthington and Whittaker, 2006), and varimax rotation facilitated factor interpretation (Morrison, 2009). Criteria for item deletion during EFA involved factor loadings below 0.50, presence of cross-loadings, communalities below 0.20, and factors comprising fewer than three items (Carpenter, 2018).

Data from Sample 2 were analyzed using AMOS 24.0. CFA assessed construct validity based on criteria including χ2/df, CFI and TLI, RMSEA, and SRMR (Hu and Bentler, 1999). Internal consistency of the SWES was assessed using Cronbach’s α, with above 0.70 indicating high reliability (Viladrich et al., 2017). Convergent and discriminant validity were assessed using the guidelines of Fornell and Larcker (1981). An AVE score of ≥0.5 was considered indicative of satisfactory convergence, while sufficient discriminant validity was confirmed when the square root of the AVE for each subscale exceeded its intercorrelations. Besides, Pearson correlation analysis using SPSS 26.0 was conducted to examine the associations between SWE and emotional experiences.

Results

Item analysis

Item analysis conducted on the data from Sample 1 indicated that item 1, along with all items proposed for Professional Development (28, 29, 30, and 31), had adjusted correlation coefficients with the total score below 0.3, resulting in their exclusion. The remaining items demonstrated stronger associations with the total score, with correlation coefficients varying between 0.453 and 0.762. Moreover, the initial SWES exhibited a high Cronbach’s α of 0.942, and the removal of any item would result in a decrease in this internal consistency. Consequently, no item was omitted. As a result of the item analysis process, a total of 26 items were retained for further evaluation.

Exploratory factor analysis

EFA was performed on data from Sample 1. The suitability of the data for EFA was confirmed by a high KMO measure of sampling adequacy (0.957) and a significant Bartlett’s Test of Sphericity (χ² = 7054.897, df = 325, p < 0.001). Based on the EFA results, six items were excluded due to problematic cross-loadings, categorized as follows:

Items cross-loading on two pre-specified factors: Item 8 (“The school supervisors do not exploit me”), Item 10 (“The school supervisors make me feel like an important member of the team”), Item 11 (“The school supervisors respect L2 teachers”), and Item 22 (“L2 teachers are allowed to impact several key school issues”)—all loading on both “school support” and “supervisor support”;

Item 23 (“Colleagues collaborate closely and complete tasks on time”), which cross-loaded on two other pre-specified factors: “supervisor support” and “colleague support”;

Item 25 (“Colleagues often engage in conversations unrelated to work”), which cross-loaded on “colleague support” and an unspecified (non-predefined) factor.

After removing these items, the remaining items formed three distinct factors, collectively accounting for 69.766% of the total variance. The scree plot supporting this factor solution is presented in Fig. 3.

Fig. 3
figure 3

The scree plot from EFA.

Factor 1 (school support) comprised 8 items, factor 2 (supervisor support) included 7 items, and factor 3 (colleague support) retained 5 items. Besides, the communality of all remaining items exceeded 0.2. Detailed results are presented in Table 2.

Table 2 The items, factor loadings, and factors of the SWES (n = 351).

Confirmatory factor analysis

Using the pattern obtained from EFA, a 20-item scale was administered to the data from Sample 2, and CFA was conducted. First, we examined the item loadings in the unstandardized estimation, with items exhibiting non-significant loadings excluded (Kline, 2016). None of the 20 items showed non-significant unstandardized estimates, and all standardized estimates exceeded 0.45, so all items were retained in the model.

Next, we evaluated the model fit of the initial CFA model, which yielded the following results: χ²/df = 4.515, CFI = 0.906, TLI = 0.893, RMSEA = 0.094, and SRMR = 0.052. These indices indicated that the initial model did not meet the established fit criteria (Hu and Bentler, 1999), which specify that a well-fitting model should have χ²/df < 3, CFI > 0.90, TLI > 0.90, RMSEA < 0.08, and SRMR < 0.08.

To improve model fit, we then examined modification indices with a threshold of 10, focusing on those consistent with existing literature—specifically, correlations between error terms of items from the same component, justified by content overlap (Kline, 2016). Following Harrington (2008), we adjusted only the highest modification index at a time, repeating this process until the model fit improved adequately.

The final modified CFA model (as depicted in Fig. 4) showed acceptable fit to the data, with fit indices meeting the criteria: χ²/df = 2.947, CFI = 0.949, TLI = 0.941, RMSEA = 0.070, and SRMR = 0.042. These values indicate that the modified model fits the data well, aligning with the guidelines proposed by Hu and Bentler (1999).

Fig. 4: The final modified CFA model with standardized estimates (n = 402).
figure 4

Note. School school support, Supervisor supervisor support, Colleague colleague support.

Convergent and discriminant validity

We evaluated the convergent and discriminant validity of the SWES using data from Sample 2. As presented in Table 3, all AVE scores were above 0.5, demonstrating strong convergence within the overall scale. Additionally, the square root of each subscale’s AVE (bolded in Table 3) surpassed its correlations with other subscales, confirming the discriminant validity of the SWES. These findings collectively validate the structural integrity of the SWES, supporting both convergent and discriminant validity.

Table 3 Convergent and discriminant validity of the SWES (n = 402).

Reliability analysis

An internal consistency analysis was performed on the data from Sample 2 to evaluate the reliability of the SWES. The results demonstrated strong reliability, with an overall Cronbach’s α of 0.953 for the scale. Specifically, Cronbach’s α for individual factors were also high: 0.923 for school support, 0.931 for supervisor support, and 0.852 for colleague support.

Correlations with emotional experiences

Table 4 presents the results of Pearson correlation analysis using data from Sample 2. It is important to clarify that these correlations were based on basic sum-score correlations, not latent correlations. The analysis revealed significant relationships between SWE and emotional experiences, with positive associations to enjoyment and pride, and negative associations to anxiety and anger. Specifically, enjoyment was most strongly correlated with school support (r = 0.449, p < 0.05), pride with supervisor support (r = 0.384, p < 0.05), anxiety with colleague support (r = −0.357, p < 0.05), and anger with school support (r = −0.277, p < 0.05).

Table 4 Correlations between SWES and emotional experiences (n = 402).

Discussion

In modern education, the significance of teachers’ work environments has drawn considerable global attention, generating substantial academic interest (Dang et al., 2024; Masoom, 2021; Toropova et al., 2021). Hence, it is crucial to create a scientifically grounded, reliable tool to evaluate how teachers perceive the supportiveness of their work environment. This could aid in designing effective educational policies. Additionally, as PP becomes increasingly incorporated into L2 education, many researchers have identified the work environments of L2 teachers as a central element of this framework (Wu and Zeng, 2024; Zeng et al., 2024; Zhang et al., 2023; Zhi and Derakhshan, 2024). However, despite the growing demand for such instruments, there remains an absence of rigorously validated scales in current L2 research. To fill this gap, this study introduces the SWES. Given that the SWES was developed and validated among L2 teachers, further cross-disciplinary studies are needed to explore its broader applicability to other educational settings.

EFA was conducted on the data from Sample 1, revealing three factors that account for a total of 20 items. These factors correspond to the concepts of school support, supervisor support, and colleague support. Together, these three factors explain 66.75% of the total variance, substantiating our decision to use a three-factor model to represent the perceived SWE among L2 teachers. This variance explained exceeds rates commonly reported in social science studies (nearly 55%) (Peterson, 2000). While this supports the model’s utility, we emphasize that measurement quality requires holistic evaluation beyond variance explained, considering model complexity and practical utility. Critically, the removal of cross-loading items (Items 8, 10, 11, 22, 23, 25) was not solely statistically driven but also theoretically grounded (Boateng et al., 2018). These items exhibited ambiguous conceptual alignment with our tripartite framework distinguishing support sources (school, supervisor, colleague). For instance, Items 8, 10, 11, and 22 referenced “school supervisors” while loading on both school support (an institutional-level construct) and supervisor support (an interpersonal-level construct), thereby blurring the theoretical boundaries between these dimensions. Similarly, Item 23 (colleague collaboration) cross-loaded on supervisor support, conflating distinct support agents. Item 25 (non-work conversations) loaded on an unspecified factor unrelated to our framework. Retaining these items would have compromised the theoretical integrity of the distinct support sources central to our model.

Breaking it down, Factor 1, “school support”, includes 8 items that assess L2 teachers’ perceptions of the support they receive from their institution, covering aspects like personal welfare, respect, assistance, and tolerance for mistakes. The factor scores reflect how supported teachers feel by the school. Items related to assistance, with high factor loadings, suggest that teachers highly value the school’s role in providing solutions to challenges (Bogler and Berkovich, 2022). Factor 2, “supervisor support”, consists of 7 items evaluating feedback, communication, collaboration, recognition, respect, and fairness from leadership. The scores reveal how teachers perceive support from their supervisors. High factor loadings for items related to respect and fairness reflect the multifaceted nature of teachers’ roles, which include teaching, research, and communication with families, thereby increasing their expectations for fair treatment. This is consistent with findings by Zhang et al. (2023) and Taamneh et al. (2024), which stress that responsible leadership, work recognition, and justice are crucial for teachers’ emotional well-being. Finally, Factor 3, “colleague support”, consists of 5 items focused on communication, collaboration, and relationships with colleagues. The scores show the extent to which teachers feel supported by their peers. Items concerning communication and friendliness, with relatively high factor loadings, underscore the importance of these aspects for effective collaboration and knowledge sharing, as supported by Akinyemi et al. (2020) and Kolleck et al. (2021).

The CFA outcomes subsequently validated the structural model of the SWES, thereby substantiating its theoretical underpinnings anchored in a tripartite factor configuration (Hu and Bentler, 1999). Furthermore, analyses of convergent and discriminant validity provided strong evidence for the psychometric robustness of the SWES. The satisfactory convergent validity indicates that the items within each factor (school support, supervisor support, colleague support) consistently measure their intended underlying construct. Crucially, the established discriminant validity demonstrates that these three factors, while interrelated components of the broader SWE construct, are statistically distinct and capture unique aspects of teachers’ perceptions of support from different sources within their work environment. This distinction is theoretically meaningful, as it confirms that support from the institution, supervisors, and colleagues represents empirically separable facets of the work environment experienced by L2 teachers. To evaluate the reliability of the SWES and its constituent factors, we employed Cronbach’s α. The results revealed that both the overall scale and its three subscales exhibited Cronbach’s α exceeding 0.7, indicating acceptable reliability (Viladrich et al., 2017).

We also analyzed the correlation between SWE and the emotional experiences of L2 teachers. The results revealed significant positive correlations with emotions such as enjoyment and pride, while negative correlations with anxiety and anger. To explain these patterns, we apply Pekrun and Perry’s (2014) CVT, which attributes achievement emotions to two core appraisal processes: individuals’ evaluations of their control over teaching activities and outcomes, and the subjective value assigned to these activities—categorized as intrinsic (inherent meaning) or extrinsic (instrumental consequences). CVT suggests that these appraisals are influenced by distal factors, such as the environment. SWE, as conceptualized here, likely enhances teachers’ sense of control over professional tasks while reinforcing intrinsic value (e.g., personal fulfillment from teaching) or extrinsic value (e.g., career advancement), fostering positive emotions and reducing negative ones. For the focal emotions, CVT specifies distinct antecedent patterns: Enjoyment arises from high perceived control and high intrinsic value (positive engagement in the activity itself); Pride comes when success is attributed to one’s capabilities and effort (control appraisal) and holds significant achievement value (intrinsic self-worth or extrinsic recognition); Anxiety results from low perceived control over high-value outcomes or failure scenarios with high costs; Anger occurs when high-value goals are obstructed by external, unreasonable barriers despite high perceived capability. This framework helps clarify our findings.

Specifically, enjoyment was most strongly linked to school support, suggesting that institutional resources and care can enhance teachers’ sense of control over their work environment while reinforcing the intrinsic value of their instructional role, which contributes to greater happiness and satisfaction (Ashley et al., 2011). In contrast, pride was most closely associated with supervisor support, indicating that recognition and constructive feedback play a key role in boosting teachers’ sense of accomplishment (extrinsic value of outcomes) and competence (control), which fosters pride (Yang et al., 2022). Anxiety, on the other hand, was most strongly correlated with colleague support, revealing that lack of peer collaboration can reduce teachers’ control over classroom challenges and amplify the external pressures to avoid failure, contributing to feelings of anxiety. Lastly, anger was most associated with school support, suggesting that unsupportive environments are perceived as obstacles to goal achievement (lowering control) and reducing the extrinsic value of efforts, which leads to anger (Miron-Spektor and Rafaeli, 2009). Overall, these findings imply that environmental factors (SWE) serve as indirect influences on L2 teachers’ emotions, shaping them through their impact on context-specific control-value appraisals.

It is noteworthy that while the SWES’s dimensions (i.e., school, supervisor, and colleague support) and specific items (e.g., “The school places significant importance on my well-being”) may hold relevance beyond L2 education, its unique value lies in its validation among L2 teachers and its integration with PP frameworks in L2 research to interpret relationships with emotional experiences—contexts that underscore its targeted relevance for L2 education research and practice. In other words, while the SWES has wider potential, the conceptual and functional dynamics of these support dimensions (such as their connection with emotions) are deeply rooted in the unique context of L2 education. Therefore, applying it across disciplines requires empirical validation to ensure its relevance. Ultimately, the SWES is a specialized tool for exploring core support structures and their outcomes in the distinct professional landscape of L2 teaching.

Implications and limitations

This study makes several contributions to the field. First, the SWES provides a concise tool to assess L2 teachers’ perceived supportiveness of their work environment, with established psychometric properties. However, since the SWES was designed for L2 teachers, its potential application in other fields would require empirical verification of its psychometric properties in those specific contexts. Besides, the correlations observed between SWE dimensions and emotional experiences also suggest preliminary practical implications. Schools may consider implementing institutional support mechanisms to enhance teachers’ sense of control and value, which could contribute to enhanced enjoyment. Supervisory practices focused on recognition and constructive feedback may support teachers’ sense of accomplishment. Fostering collaborative relationships among colleagues could help mitigate anxiety triggers, while addressing systemic barriers might reduce anger. If implemented, these approaches could enhance teacher well-being in conjunction with other institutional interventions. However, these implications should be interpreted with caution due to the study’s methodological constraints.

Specially, the limitations of this study should be considered as follows. First, the exclusive reliance on self-reported data introduces the risk of common method bias and social desirability effects, which could inflate the observed correlations (Podsakoff et al., 2003). Future research could incorporate objective institutional records, such as official documents outlining school support policies and log data on resource allocation. By triangulating these sources with self-reported data, the robustness of the findings would be enhanced. Second, the cross-sectional design prevents drawing causal inferences about the relationships between SWE and emotional experiences. Longitudinal studies could help track changes in SWE and emotional experiences over time, offering more clarity on the directional relationships between these variables. Additionally, we did not conduct measurement invariance tests across subgroups (e.g., gender, educational levels) due to insufficient sample sizes. Consequently, conclusions regarding the scale’s equivalence across groups are limited. Future research with larger and more balanced samples is necessary to confirm the measurement invariance of the SWES. Finally, the omission of student support—an important SWE dimension (Toropova et al., 2021; Yada and Savolainen, 2023)—represents a significant conceptual gap that needs to be addressed in future studies.

Conclusion

This study developed and validated the SWES among teachers, which includes three dimensions: school support, supervisor support, and colleague support. The results indicate that the SWES demonstrates acceptable reliability and validity within the L2 teacher population, providing an effective tool for assessing L2 teachers’ perceptions of their work environment. However, the SWES may have overlooked the potentially key dimension of “student support,” which should be included and tested in future research.

The study also provides preliminary evidence suggesting that SWE may help improve L2 teachers’ positive emotions (i.e., enjoyment and pride) and alleviate negative emotions (i.e., anxiety and anger). However, due to the use of self-reported data and a cross-sectional design, the interpretation of these findings should be cautious. Future research, if it addresses these limitations, could provide stronger evidence to inform policies and practices aimed at improving teacher well-being and potentially optimizing teaching effectiveness.

Moreover, it should be clarified that while the wording of the SWES items is neutral and could potentially be applied to other teacher populations after further validation, it is crucial to emphasize that the development and validation of the scale were specifically focused on L2 teachers. As such, the main contribution of this study lies in providing the L2 research community with a validated tool to assess SWE as a critical factor in the L2 teaching context and its connection to emotional experiences, a key aspect of L2 teacher well-being.