Introduction

Football is among the most widely practiced sports globally and occupies a central position in youth development systems across cultures. In recent decades, research has increasingly examined the environments and experiences that shape athletes’ developmental trajectories. Within this broader landscape, women’s football has expanded rapidly in participation and visibility, accompanied by growing scientific attention1. Understanding player development requires examining the pathways and experiences accumulated during childhood and adolescence2,3, particularly how multiple contexts interact to shape both athletic and personal growth. Contemporary models of youth sport development emphasize diversified and interconnected experiences in the early years, integrating developmental phases rather than prioritizing early specialization4,5.

The Personal Assets Framework (PAF)4 conceptualizes long-term development through three interrelated components: appropriate settings, referring to the physical and competitive environment; personal engagement in activities, encompassing diversified sport participation; and quality social dynamics, reflecting relationships with coaches, peers, parents, and other stakeholders. The interaction of these components is proposed to foster adaptive developmental outcomes6. During childhood and adolescence, deliberate play (informal, intrinsically motivated activity) and deliberate practice (structured training) jointly shape skill acquisition and psychosocial development. The model posits that deliberate play should predominate during the sport experimentation phase, facilitating autonomy, enjoyment, and broad motor and cognitive development6. Early diversified engagement has also been linked to enhanced decision-making, motor coordination, and tactical learning7.

However, empirical findings remain mixed. While some evidence suggests greater diversification in girls’ early sport participation3,8, studies of elite performers indicate higher volumes of early sport-specific practice among those reaching professional levels9. These tensions underscore the need for context-sensitive investigation of developmental pathways.

In Brazil, youth development in women’s football faces structural constraints, including limited infrastructure, inconsistent youth programming, and persistent social barriers to female participation10. Evidence from Brazilian handball, analyzed through the PAF lens, suggests that strong social dynamics may partially compensate for precarious environmental conditions during early development (Lima et al., 2022)24, highlighting the interdependence between relational and contextual factors in sustaining engagement.

Despite the growth of women’s football, systematic analyses of the developmental environments of Brazilian female players remain limited. Existing assessment tools have largely been developed in male populations or within sociocultural contexts that differ substantially from Brazilian women’s football. Consequently, gender-specific and contextual dimensions of development may be insufficiently captured. Similar limitations have been reported internationally; even in England, female talent development environments remain comparatively underexamined, and stakeholder perspectives are often incompletely integrated2.

The Participation History Questionnaire (PHQ)11 illustrates this limitation. Developed and validated in male, English-speaking samples, it reflects assumptions that may not align with the structural and cultural realities of Brazilian women’s football. Such instruments risk overlooking the specific constraints, opportunities, and social configurations shaping female developmental pathways. There remains a need for culturally grounded, gender-sensitive measures capable of capturing early sport experiences within women’s football.

Accordingly, this study aimed to develop and gather validity evidence for the Questionnaire on Sports Experiences in Women’s Football (QEEF-Fem), grounded in the PAF. The instrument was designed to assess three central dimensions: personal engagement, quality social dynamics, and appropriate settings, among Brazilian female players aged 12–17 years. We hypothesized that the QEEF-Fem would demonstrate a theoretically coherent factorial structure consistent with the PAF and adequate internal consistency for exploratory research. By providing a context-sensitive psychometric tool, this study seeks to advance the empirical assessment of the sport experimentation phase and inform evidence-based practice in women’s football.

Methods

Study design

This study employed a psychometric design to develop and gather validity evidence for a questionnaire grounded in the Personal Assets Framework (PAF)4. Although participants were aged 12–17 years at data collection, the instrument was designed as a retrospective measure to capture perceptions of experiences during the sport experimentation phase. The questionnaire uses Likert-type response scales assessing agreement, satisfaction, or frequency across three PAF dimensions: Personal Engagement (diversified sport participation), Quality Social Dynamics (relationships with coaches, peers, parents, and administrators), and Appropriate Settings (physical and competitive characteristics of the practice environment).

Validation procedures followed the Standards for Educational and Psychological Testing12. Consistent with contemporary recommendations for construct validation13, the process integrated theoretical (content validity), empirical (face validity), and statistical (construct validity and internal consistency) evidence.

Item development was guided by an analytical matrix derived from the PAF. An initial pool of 45 items was generated (Personal Engagement: 11; Quality Social Dynamics: 15; Appropriate Settings: 19). Higher scores indicate more positive developmental experiences, and no items required reverse coding. Validation proceeded in four stages: I) face validity to assess clarity and comprehensibility; II) content validity through expert evaluation of linguistic and theoretical adequacy; III) construct validity via exploratory factor analysis (EFA); and IV) internal consistency assessed using Cronbach’s alpha, composite reliability (CR), and McDonald’s omega.

Face validity

Face validity assessed whether the appearance of the questionnaire is understandable and appropriate for its users. However, the literature recommends that this step be accompanied by content validity14. For this validation stage, the questionnaire was administered to a pilot group of 45 young female players from a football club in the state of Paraná who are part of the youth categories of a professional club, aged between 12 and 17 years. Higher scores are considered positive.

Content validity

Content validity is understood as fundamental for assessing the degree of representativeness of the instrument in relation to the theoretical constructs it proposes to measure15. This stage was conducted with the participation of experts selected for convenience, similar to that carried out in the study by Tozetto et al.16, based on the following criteria: a) Hold a PhD in Physical Education,b) Work as a university professor; c) Have publications on youth training in sports; d) Have publications focused on youth training and development in football; and e) Have previous experience with other validation processes.

The experts were contacted by email, accompanied by an invitation and a brief description of the research. Thus, ten experts (n = 10) participated in this stage. Responses to the items were obtained on a five-point Likert scale, ranging from 1 to 5 (1 = not relevant; 5 = relevant). Based on the 45 questions initially developed, the Content Validity Index (CVI) was used, considering questions above 0.80 to be acceptable17.

Construct validity

Construct validity seeks to establish a correspondence between the proposed theoretical structure and the empirical hypothesis investigated18. Thus, in the present study, construct validity was examined using data from 391 female athletes, allowing the evaluation of the relationship between the theoretical constructs of the Personal Assets Framework and the proposed measurement model. To this end, as it is a new instrument adapted to a specific culture, EFA was employed. Thus, we sought to verify the consistency between the theoretical basis and the empirical evidence obtained, consolidating the evidence of construct validity13.

Internal consistency of items

Internal consistency was assessed based on the stability of responses, following recommendations (Hayes and Coutts, 2020)26. The objective was to verify whether the set of responses obtained reflects the same underlying theoretical constructs (Hayes and Coutts, 2020). Thus, Cronbach’s alpha was used to measure item homogeneity, CR to assess latent construct consistency, and McDonald’s omega to provide a more accurate reliability estimate based on factor loadings. The results range from 0 to 1, with values closer to 1 indicating greater internal consistency among the items18.

Data collection procedures

The athletes answered the questionnaire in person and collectively during visits to the clubs. The following inclusion criteria were adopted: I) being between 12 and 17 years old (considering the sports specialization phase); II) athletes linked to football teams in the states of Paraná, Santa Catarina, and Rio Grande do Sul that competed in competitions organized by the respective state federations. Thus, the data collection period took place between June and September 2025, with the aim of applying the questionnaire and understanding the contexts of women’s football practice. The sample included 391 athletes representing 15 professional football clubs. The clubs were selected by convenience sampling, based on accessibility and institutional availability to participate. This depended on formal authorization and logistical feasibility, resulting in the inclusion of clubs that were accessible at the time of data collection.

The sample size was determined based on widely adopted psychometric guidelines for EFA, which recommend a minimum ratio of 10 participants per item to ensure stable factor solutions19. Considering that the initial version of the instrument comprised 38 items, the final sample of 391 athletes exceeded this recommendation. This sample size was therefore considered adequate for exploratory purposes and allowed for potential data loss due to incomplete responses or exclusion criteria.

Ethical aspects of research

This project was submitted to the Human Research Ethics Committee (CEPSH) of the Federal University of Santa Catarina and received a favorable opinion (Process No. 7,517,417). All ethical procedures were followed, with the signing of the Free and Informed Consent Form (FICF) by those responsible and the Free and Informed Assent Form (FIAF) by the participants. Since the participants were minors, written informed consent was obtained from their parents or legal guardians, and assent was obtained from all participants before data collection. Therefore, all procedures performed in this study were conducted in accordance with the relevant CEPSH guidelines and regulations.

Data analysis

The data were organized in a Microsoft Excel spreadsheet and analyzed using JASP 0.95.2.0 software. Face validity was assessed using Aiken’s V coefficient20. Content validity was assessed using the content validity index (CVI), considering items with a score of 4 or 5 by at least 80% of the experts15. The construct validity stage was examined using EFA, considering adjustment estimators with three factors, using the Weighted Least Squares (WLS) factor method with oblique rotation and a polychoric correlation matrix, in order to evaluate the factor structure of the instrument21. Items were excluded based on the following criteria: (1 factor loadings < 0.30, (2 cross loadings between factors; and (3 the balance between statistical stability and theoretical relevance. The model was verified using Kaiser–Meyer–Olkin (KMO) ≥ 0.600 and Bartlett’s test (p < 0.001), indicating sampling adequacy and matrix factorability. Additionally, all items presented Skewness and Kurtosis values within the established limits (Skewness between − 3 and + 3 and Kurtosis between − 10 and + 10), supporting the adequacy of the data distribution for factor analysis18. Thus, Cronbach’s alpha (α), McDonald’s omega (ω), and CR were used to assess data reliability, adopting values ≥ 0.600 as acceptable for exploratory studies22 (Fig. 1).

Fig. 1
Fig. 1
Full size image

Process of constructing and gathering validity evidence.

Results

Face validity

The process of obtaining evidence of the instrument’s validity began with the face validation stage, based on the application of the questionnaire to 45 football players aged between 12 and 17 years, members of the youth categories of a professional women’s football team. Table 1 describes the results obtained in the face validity stage, considering the responses on a Likert scale from 1 to 5, and presents the mean values, standard deviation, and Aiken’s V coefficient. Considering the responses obtained in the seven items evaluated, a positive assessment of the participating athletes was observed.

Table 1 Face Validity of the questionnaire, considering a scale from 1 to 5.

Content validity

Content validity was assessed by 10 PhD in Physical Education. The CVI was used, considering questions above 0.80 to be acceptable. Thus, of the 45 questions previously prepared, 3 questions had an average CVI below 0.80 and were removed (average CVI = 0.73) in the first round of assessment. Another 15 questions underwent a new round of validation after textual adjustments suggested by the experts (average CVI = 0.86). In the second round of evaluation, the experts suggested the exclusion of 4 questions (average CVI = 0.73), leaving the questionnaire with 38 questions (Table 2).

Table 2 CVI Assigned by experts during the content validity stage.

Construct validity

The Shapiro–Wilk test was applied to the 38 items included in the EFA, confirming the asymmetry between the data. All items presented Skewness and Kurtosis within the established limits. The coefficients of variation (CV) indicated a predominance of low values ​​among the items, revealing greater homogeneity in the responses. Some items presented a higher CV, evidencing greater heterogeneity. The data indicated are represented in Table 3.

Table 3 Description of values by mean, standard deviation, coefficient of variation, variation, and Shapiro–Wilk test (p < 0.05).

The Bartlett Sphericity test (X2 = 2480.715; df = 153, p < 0.001) and the Kaiser–Meyer–Olkin (KMO) = 0.702 indicated adequacy and interpretability of the correlation matrix of the items. The correlation between the factors in the final model was: factor 1 and factor 2: 0.128; factor 1 and factor 3: 0.384; factor 2 and factor 3: 0.221. Thus, the AFE was constructed based on the three scales recommended by the theoretical structure of the model (Personal Engagement; Quality social dynamics; Context of Practice). Thus, 17 questions had factor loadings above 0.300 are presented at Table 4.

Table 4 Factor structure of the questionnaire.

Internal consistency

The internal consistency of the unifactorial model was assessed using Cronbach’s alpha (α = 0.745) and McDonald’s omega (ω = 0.705), indicating adequate to good reliability within an exploratory context. For the trifactorial model, Cronbach’s alpha values ranged from 0.561 to 0.669, while McDonald’s omega ranged from 0.554 to 0.618, reflecting borderline to acceptable internal consistency. CR values ranged from 0.619 to 0.714, which are considered adequate for exploratory studies. Overall, these results suggest that the instrument demonstrates reliability levels appropriate for exploratory research, while highlighting the need for further confirmatory analyses in future studies.

Discussion

This study developed and provided initial validity evidence for a psychometric instrument designed to assess the sport experimentation phase of young female football players within the framework of the PAF. Across sequential validation procedures: face, content, construct validity, and internal consistency the resulting 17-item instrument demonstrated acceptable psychometric properties for exploratory use.

Face validity analyses indicated strong agreement regarding clarity and relevance, supporting the instrument’s comprehensibility among athletes aged 12–17 years. High Aiken’s V coefficients suggest that the items were interpreted consistently, reinforcing their practical adequacy. These findings align with previous validation research indicating that perceived clarity enhances respondent engagement and data quality14.

Exploratory factor analysis supported a three-factor structure consistent with the theoretical dimensions of the PAF. Sphericity and sampling adequacy indices confirmed the suitability of the data for factor analysis. Although item Q16 presented cross-loadings, its retention was theoretically justified, and its removal did not substantially improve model fit or reliability estimates. Nonetheless, this item warrants re-examination in future confirmatory analyses. Overall, the factorial solution parallels structures reported in related validation studies (e.g.,23), lending further support to the coherence of the proposed model.

Internal consistency indices were within acceptable thresholds for exploratory research. While some coefficients were modest (particularly in factors with fewer items) such values are consistent with early-stage scale development and with the influence of item quantity on reliability estimates22. Importantly, reliability estimates did not contradict the theoretical structure, suggesting that the dimensions capture related but distinguishable aspects of developmental experience.

Conceptually, the instrument operationalizes the core mechanisms proposed by the PAF (i.e. personal engagement, quality social dynamics, and appropriate settings) within the specific context of Brazilian women’s football. Prior qualitative work has supported the relevance of the PAF in Brazilian athlete development24, the present findings extend this evidence by providing a structured quantitative measure aligned with the model.

These results also contribute to broader discussions regarding gender-sensitive and culturally grounded assessment in sport. International research has highlighted the relative underrepresentation of female talent development environments2 and the central role of communication and social support in shaping female athletes’ trajectories (Gledhill and Harwood, 2019,25 Hendry et al., 2019)27. Within Brazil, structural constraints and sociocultural barriers further underscore the need for contextually responsive instruments10. The QEEF-Fem addresses this gap by integrating international theoretical frameworks with local developmental realities.

Practically, the instrument may assist practitioners in monitoring perceived developmental experiences, including the balance between structured practice and diversified engagement, as well as relational and environmental conditions within youth programs. However, interpretations should remain cautious given the exploratory design and region-specific sample.

Several limitations warrant consideration. The absence of temporal stability assessment (test–retest) precludes conclusions regarding response consistency over time. Additionally, the sample was restricted to clubs from southern Brazil, limiting generalizability. The relatively small number of items per factor may also constrain content breadth and partially explain moderate reliability estimates. Future research should expand the item pool, conduct confirmatory factor analyses in independent samples, test measurement invariance, assess temporal reliability, and explore item response theory approaches.

Within these boundaries, the QEEF-Fem represents an initial step toward a context-sensitive assessment framework for women’s football. By providing structured evidence aligned with developmental theory, it contributes to strengthening the empirical basis of athlete development research in Brazil.

Conclusion

This study developed and compiled different validity evidence for the Questionnaire on Sports Experiences in Women’s Football (QEEF-Fem), providing robust evidence of its reliability and validity. The results confirm that the QEEF-Fem is a culturally adapted and scientifically rigorous tool, capable of generating accurate information about the developmental trajectories of female athletes. In addition to its methodological contribution, the questionnaire offers practical applications for coaches, pedagogical coordinators, sports clubs, and public policy makers, supporting the planning of training programs and development strategies sensitive to the realities of women’s football in Brazil. The QEEF-Fem represents a groundbreaking advancement in women’s soccer research and practice, strengthening the scientific basis for athlete development and contributing to more inclusive and evidence-based sports policies.