Introduction

Numeracy is a developmental process beginning in infancy, fueled by genetic and environmental factors, which involves practical concepts such as number sense, quantity, and basic operations (e.g., addition).1 These form a foundation for math, which comprises an array of more formal, abstract subjects (e.g., algebra, geometry).1 Numeracy and math are both vital in the technology-based economy, yet difficulties are prevalent and increasing.2 Many children arrive at kindergarten (typically age 5-6 in the US) with low numeracy skills and consequent math readiness, particularly those from socioeconomically marginalized backgrounds, which has worsened since the COVID-19 pandemic.2

The American Academy of Pediatrics (AAP) recommends school readiness promotion and developmental screening during well-visits.3,4 However, general developmental screening (e.g., Survey of Well-Being of Young Children, SWYC) does not assess numeracy5 and while several preschool numeracy screeners have been validated in early childhood education settings (e.g., K-SEALS, PENS), none have been studied or utilized in pediatric practice.6,7 Reasons include time constraints, cost, need for specialized kits, and non-synchrony with typical workflows. The Number Farm (TNF) is a children’s book-based measure that screens numeracy skills that are most often learned in the preschool period. TNF was inspired by a book-based approach to emergent literacy screening validated during pediatric primary care.8,9 The objective of this study was to pilot TNF during well-child visits for feasibility, family acceptance, and psychometric validity, particularly with respect to home math and reading environments.

Methods

Numeracy measures

The Number Farm (TNF) is a 14-page illustrated, rhyming children’s book (Fig. 1b). A Power Point-rendered version was pilot tested for clarity with families and colleagues prior to the book’s publication (Fig. 1a). TNF screener includes 11 scripted items that refer to the book’s content (e.g., covers, scenes) yet are distinct from its text (0–28 points). Numeracy domains and weightings were selected to mirror those in the validated Preschool Early Numeracy Screener (PENS) developed by Purpura et al., cited as vital for school readiness and success in math.6 TNF items were designed in consultation with numeracy experts and a review of relevant literature. Item distribution reflective of component skills and corresponding sections of the book is summarized in Table 1.

Fig. 1: Sample content from The Number Farm.
Fig. 1: Sample content from The Number Farm.The alternative text for this image may have been generated using AI.
Full size image

Top (a) is from the mockup version and bottom (b) is from the published book, for Item 5 (1:1 counting, sets). The 11-item scripted assessment is based on the book’s covers and interior illustrations but is distinct from its text.

Table 1 The number farm skill and scoring distribution

The PENS was used as the “gold standard” comparison measure, involves 25-items with an estimated administration time of 5 minutes, and has been validated for preschool age.6,10 The PENS assesses the same numeracy domains as TNF.

Participants/Setting

This study was based at an academic pediatric primary care clinic serving largely Medicaid-eligible families in Cincinnati, OH. Recruitment was by convenience sampling of families presenting for well-visits meeting these criteria: child age 36 to 59 months, no developmental disorder likely to confer cognitive delay (e.g., Trisomy 21, autism), parent/child fluent in English, no acute infectious illness. The study was approved by the sponsoring institution’s Institutional Review Board. Prior to screening, written informed consent was obtained, and families were provided with a gift card after data collection.

Screening and assessments

Pediatric residents collected all data in the exam room before or after the physician conducted the visit, mindful of clinic flow. Following informed consent, TNF and PENS were administered to the child in predetermined, random order, noting screening time and barriers (e.g., interruption by staff). Three-item, Likert-based TNF impression surveys developed by the study team based on prior work were then administered to the child and parent.8,9 Next, surveys of home math activities and attitudes (Likert-based, developed by the study team), home reading environment (StimQ READ11), screen time (ScreenQ12) and demographics (e.g., sex, preschool enrollment) were administered to the parent.

Statistical analyses

Descriptive statistics were computed for all variables. TNF items were analyzed using modern theory Rasch analyses,13 providing estimates of smoothness, modality, difficulty, polarity, response sufficiency, and item-fit. A preliminary estimate of TNF reliability was calculated using Cronbach’s alpha (αCr), and concurrent validity was estimated using Pearson correlation coefficients between TNF and PENS scores. Pearson coefficients were also computed between TNF total score and demographic, home reading and math environment, and digital media use measures. Analyses were conducted using SAS v9.4 and Winsteps v5.6.4 software.

Results

Sample characteristics

Thirty-one children participated, mean age = 50 months (±4; 36–71), 17 boys and 14 girls. Barriers to screening were rare (under 10%) and most often included interruption by clinic staff to initiate the visit. In each of these instances, screening was successfully completed after the physician had finished. TNF administration time was under 10 minutes.

Screener total scores and TNF item-level analysis

Mean TNF score was 11 (±8; 0-26) and PENS score was 10 (±8; 0-25). TNF and PENS scores were not normally distributed (Shapiro-Wilk p = 0.03 and 0.02, respectively), just below the p = 0.05 threshold.

TNF item-level statistics are provided in Table 2. Rasch estimates of item difficulty ranged from 2.72 (less difficult; item 5, numeral names) to 1.26 (more difficult; item 10, addition) and were evenly distributed between harder and easier items, with a slight concentration of moderately difficult items (1, 9, 12). Item fit statistics were within generally accepted ± 2 standard deviations, suggesting no outliers likely to influence the distribution of scores. Point-measure correlations ranged from 0.35 (item 1, concept of number) to 0.88 (item 5, 1-1 counting of sets) and were generally between 0.52 and 0.70 across all items, suggesting moderate relationships between each item and the entire TNF scale. Items 4 (set comparison) and 5 (ordinality) represent exceptions, with relatively high point-measure correlations of r = 0.82 and r = 0.88, respectively.

Table 2 Item analysis and summary statistics for the number farm (Rasch Analysis)

Reliability and validity

For reliability, internal consistency was rCo-α = 0.94, considered “excellent.”14 For concurrent validity, the Pearson correlation between TNF and PENS total scores was 0.94 (p < 0.001). Figure 2 is a scatter plot of TNF and PENS scores.

Fig. 2
Fig. 2The alternative text for this image may have been generated using AI.
Full size image

Scatter plot of The Number Farm (TNF) vs. PENS total scores (n = 31).

TNF scores were strongly correlated with child age (rρ = 0.72, p < 0.001) and preschool enrollment (rρ = 0.38, p < 0.05) but not with child sex. TNF scores were positively correlated with reported number of children’s books at home (rρ = 0.48), days/week of shared reading (rρ = 0.64) and frequency of playing math-related games (rρ = 0.39; all p’s < 0.05). Reported parental enjoyment of math was negatively correlated with parental difficulties in math (rρ =−0.41, p < 0.05). There were trends towards negative correlation between TNF scores and reported parental difficulties in math (rρ = −0.28, p = 0.12) and positive correlation with reported parental enjoyment of math (rρ = 0.24, p = 0.20). TNF scores were not significantly correlated with digital media use (ScreenQ score).

Survey responses/satisfaction

Ninety-seven percent of children reported their experience with TNF as “Fun.” Parent impression of TNF was also favorable, with 100% rating screening as “helpful,” 75% rating time required as “just right” (10% too much), 94% stating that they learned new information about the child’s skills, and 97% that they were inspired to work more with their child in math at home. Eighteen parents (58%) were at least somewhat surprised by their child’s performance. Of these, 15 (83%) were surprised at how well their child performed, and 3 (17%) were surprised at how poorly (17%).

A majority of parents reported that they were somewhat familiar with the math skills needed for kindergarten (55%, 10% Not at All), and 52% were somewhat worried about their child’s math readiness (35% Not at All).

Discussion

With trusted, frequent access to families during early childhood, pediatricians are poised to conduct developmental screening and provide guidance promoting school readiness, as recommended by the AAP.4 However, while early literacy screening has shown promise,8 there is currently no established approach to numeracy screening in pediatric practice. Further, commonly used screening measures (e.g., Ages & Stages Questionnaire, SWYC) do not specifically assess numeracy skills. This reflects an important gap and opportunity, as a large and increasing number of children arrive at kindergarten (typically age 5-6 in the US) with low math readiness, particularly those from socioeconomically disadvantaged households.4,15,16 The goal of this study was to test a children’s book-based approach to numeracy screening during pediatric primary care.

Psychometric properties of TNF were strong, particularly for a new measure capturing a complex cognitive domain. We attribute this to an evidence-based conceptual model involving well-defined component numeracy skills,17 and a book-based approach that has been successful in screening emergent literacy skills at this age.8 TNF and PENS total scores were highly correlated (rρ = 0.94), suggesting strong concurrent validity, and internal consistency was excellent (rCo-α = 0.94), suggesting high coherence among screener items.

In terms of internal psychometrics, Rasch analyses of TNF items were also promising. TNF item and total scores were all positively correlated, with all response options selected by no less than 6% of the sample (an accepted standard), and monotonic ordering for each response option (i.e., full credit, partial credit, no credit, in a uniform direction). Item difficulty was balanced between easier and harder items, with addition (item 10) and numeral comparison (item 9) scoring most difficult, and 1-1 counting (item 5) and verbal counting (item 4) scoring easiest (Table 2). This is consistent with expected skill trajectories, where more abstract concepts and mental manipulation thereof (i.e., concept of number size, addition) were harder, while rote concepts (e.g., counting aloud, counting small groups) were easier.1,17 Item 5 (1-1 counting of sets) was by far the easiest item, potentially attributable to the small number of animals in each set tested (1 to 4 per set). Item revision to optimize these metrics is planned for a subsequent version of TNF.

Item-measure correlations were largely moderately positive, below 0.80, suggesting measure coherence and that each item contributed uniquely to the TNF total score. Exceptions were items 4 (verbal counting) and 5 (counting sets), which were also the easiest. This finding may reflect the influence of overall cognitive development on total score, rather than mastery of specific skills, evidenced by the strong correlation between TNF score and child age. Revision of these items, as mentioned above, may help minimize this effect.

TNF scores were strongly correlated with child age and preschool enrollment but not significantly with sex, consistent with developmental maturation, established benefits of preschool,4,18 and lagging sex disparities in math abilities.19 They were marginally positively correlated with reported parental enjoyment of math and marginally negatively correlated with parental reported difficulties in math, likely reflecting varying parental abilities and/or educational experiences. They were significantly, positively correlated with the reported number of children’s books at home, days/week of shared reading and playing math-related games, each reflecting empowering ways for parents to help the child at home.18 They were not correlated with digital media use, surprising considering the potential displacement of parent-child interaction and recent evidence of negative effects.20 However, such evidence is scant in terms of risks or benefits of digital media, warranting further study.

Responses to parent and child surveys were strongly favorable, suggesting that TNF screening may be feasible, useful, and enjoyable during well-visits. Administration time was under 10 minutes, which is excessive for most clinics, yet attributable to the research stage of this work. Published work involving literacy screening has found that screening time improves substantially with practice, which seems a reasonable investment, especially if reimbursable as developmental surveillance,3 when utilized by embedded school readiness staff,5,9 in specialty clinics where visits are often longer, and/or as a pediatric resident teaching tool.21 Further refinement of TNF to improve efficiency is planned.

This study has limitations. The sample size was small and collected at an academic primary care clinic serving largely socioeconomically disadvantaged, primarily Black families, limiting generalizability, though this is a vital demographic in terms of school readiness.4 TNF was administered by pediatric residents functioning as clinical research coordinators. Future studies are needed to explore TNF screening during usual clinic workflows (e.g., via medical assistants, clinicians).21 This study also has strengths, including randomization of TNF/PENS screening to minimize fatigue effects, robust psychometric analyses, and leveraging of a successful book-based approach to literacy screening.8,9 TNF is family-centered, inexpensive ($3.00/book, given to the child to take home), requires minimal training, and is scalable within existing programs (e.g., Reach Out and Read22), with substantial advantages relative to kit-based measures. Altogether, this pilot provides compelling data that TNF may be an appealing option for early numeracy screening in pediatric settings in the preschool age range (3–5 years old), complementing existing school-based tools. More expansive studies are warranted, with a goal to help align efforts by pediatricians and early childhood educators in this vital school readiness domain, including timely screening with familiar, reliable tools and tailored guidance for families to help their child succeed.

For more information about The Number Farm or to order for research or clinical use, visit http://www.bluemanateepress.com/nonprofit/ or email press@bluemanateebooks.com or John.Hutton@utsouthwestern.edu.