Introduction

Dental caries is one of the most common chronic diseases. It results from the interplay of cariogenic bacteria, dietary carbohydrates, inadequate nutrition, and various social factors across different ages and socioeconomic backgrounds [1]. Early childhood caries (ECC) is defined as the presence of one or more decayed, missing (due to caries), or filled tooth surfaces in any primary tooth in children under 6 years old [2]. ECC can lead to severe pain, tooth loss, low self-esteem, poor academic performance, sleep disturbances, and reduced quality of life [3]. Considering that oral diseases can adversely affect the development of permanent teeth, establishing a healthy oral environment early in life is critical for lifelong oral health. Therefore, preventing ECC is essential for children to maintain oral health, supporting healthy development, and reduce future dental treatment costs [4].

Most research on ECC prevention has focused on preschool children aged 3–5 years, whereas longitudinal evidence on ECC risk factors among infants and toddlers (0–2 years) remains limited. In 2021, the Beijing Stomatological Disease Prevention and Control Institute launched an oral health care program targeting infants and young children. Leveraging Beijing’s well-established primary healthcare network, the program aims to provide cost-effective early ECC screening, dietary and behavioral counseling, and preventive and/or restorative interventions for local children, ultimately reducing the incidence of early childhood caries.

Modern caries management emphasizes personalized prevention based on caries risk assessment (CRA). The AAPD guidance on pediatric caries risk assessment and management indicates that CRA can be used to understand an individual patient’s pathogenic and protective factors, and thereby to tailor the selection and frequency of preventive and/or therapeutic measures [5]. Meanwhile, some studies have suggested that baseline caries experience is a significant determinant of subsequent incident caries. Baseline caries status may reflect the cumulative effects of multiple cariogenic factors, many of which are also associated with future caries risk [6]. Consequently, the same oral health behavior may exert effects of differing magnitude among children with different “baseline caries status”. If only the average main effects of behaviors are estimated, without evaluating effect modification, important heterogeneity may be obscured, potentially limiting the ability of risk assessment and intervention strategies to achieve genuine stratification and precision prevention.

This study was designed to address two questions: (1) the determinants of ECC risk among infants and toddlers in Beijing, including hereditary factors as well as dietary and oral hygiene behaviors; and (2) whether baseline caries experience exerts a modifying effect on the associations between other factors and ECC risk. The overarching aim was to enable the program to identify children who are susceptible to caries and to support risk-stratified management and more efficient allocation of healthcare resources.

Materials and methods

Study population

This study was conducted between 2021 and 2023 and was nested within two routinely implemented community-based oral public health programs. The baseline survey was embedded in the “Beijing Comprehensive Health Care Programme for Children Aged 0–3 Years”. Clinical examinations were conducted in a clinical setting, and questionnaire data were collected from parents through a WeChat mini-program. The program was designed for children aged 0–3 years and their parents in communities across Beijing. Inclusion criteria were: (1) the child had at least one healthy primary tooth; (2) written informed consent provided by the parent; and (3) the child was generally healthy and able to cooperate with oral examination. Exclusion criteria were: (1) the child had no healthy teeth; (2) the child was uncooperative during examination; (3) the caregiver was unable to complete the questionnaire; or (4) the child had systemic disease. The follow-up examination was embedded in the “Beijing Fluoride Foam Caries Prevention Programme” and was conducted 12–24 months after baseline, using the same inclusion and exclusion criteria and under the same examination conditions as the baseline examination.

Questionnaire survey

The parent questionnaire covered the child’s general information, parents’ caries status, feeding practices, dietary habits, oral hygiene behaviors, and history of dental attendance. (The questionnaire items are detailed in Table 1).

Table 1 Oral examination results in the study children (baseline and follow-up).

Oral examination

Oral examinations were conducted in community dental clinics under the supervision of the Beijing Stomatological Disease Prevention and Control Institute. Children’s caries status was measured using the dmft index (decayed, missing due to caries, and filled primary teeth) according to WHO criteria. A tooth was recorded as decayed (dt) if it had a visible cavity, undermined enamel, or detectably softened wall or floor. Teeth extracted due to caries were recorded as missing (mt), and teeth permanently filled with no decay were recorded as filled (ft). Each child’s baseline DMFT score ranged from 0 to 20.

Indicator definition

Incident caries was defined as the appearance of decay on a previously unaffected primary tooth. Δdmft was defined as the difference between the dmft score at follow-up and at baseline. Caries incidence was defined as the percentage of children with Δdmft > 0.

Quality control

Oral examinations were conducted in multiple community primary healthcare facilities by local clinicians. To standardize caries assessment across examiners, we developed a standardized examination protocol and diagnostic criteria in accordance with the World Health Organization (WHO) standards from the Oral Health Surveys: Basic Methods (5th edition). Prior to data collection, clinicians received centralized training (online/offline) and were provided with a manual specifying examination procedures, instruments and lighting conditions. During fieldwork, regular supervision and query-resolution procedures were used to address ambiguous cases.

Data analysis

Data analysis was performed using SPSS 30.0 (IBM, Armonk, NY, USA), Stata 18.0 (StataCorp, College Station, TX, USA), and Python 3.12.7 with the statsmodels and scikit-learn libraries. Pearson’s chi-square test was used to evaluate differences in caries incidence between categorical groups. Because the caries increment (Δdmft) was not normally distributed, a non-parametric Mann–Whitney U test was used to compare the distribution of Δdmft between groups. We conducted Vuong and likelihood-ratio tests to select the best-fitting model among Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial models. The zero-inflated negative binomial (ZINB) model was chosen as optimal for our data. Variance inflation factors (VIF) were calculated under a linear regression framework to check multicollinearity. We included all candidate predictor variables in both the negative binomial component and the zero-inflation component of the ZINB model. Stepwise backward elimination based on the Akaike Information Criterion (AIC) was applied to simplify the model, stopping when the minimum AIC was achieved. To assess whether the associations between risk factors and subsequent caries risk varied by baseline caries status, a separate model was fitted for children who were caries-free at baseline (Model 2), and, in the full sample, the effects of these risk factors were compared between children who were caries-free at baseline and those with caries at baseline (Model 3).

Ethical approval and consent

The study protocol was approved by the Ethics Committee of Beijing Stomatological Hospital, Capital Medical University (Project No. CSA-ICP2022-07). The protocol included the research plan and informed consent forms. Before enrollment, all parents signed informed consent forms, and they received written feedback on the oral examination results for their child.

Results

Study participants, ECC characteristics, and univariate analyses

At baseline, a total of 1195 eligible children were enrolled. The follow-up rate was 76.9%, and the 276 children lost to follow-up were mainly lost due to school transfer or absenteeism. No significant differences were observed between children lost to follow-up and those who completed follow-up with respect to questionnaire data or baseline oral examination findings (χ² test, P > 0.05). A total of 919 children were included in the analysis, of whom 480 (52.2%) were boys and 439 (47.8%) were girls. At the baseline examination, the children’s mean (SD) age was 2.3 (0.4) years. The caries prevalence at baseline was 30.8%, and the mean (SD) dmft score was 1.29 (2.43). At the follow-up examination, the mean (SD) age was 3.8 (0.4) years, and the caries prevalence had increased to 46.5%, with a mean (SD) dmft of 2.24 (3.39). The mean (SD) interval between the two exams was 1.5 (0.3) years. Over the study period, caries incidence was 29.8% and the mean (SD) Δdmft was 0.94 (1.94). The distribution of Δdmft was highly skewed (skewness = 2.68) (Table 1).

Univariate analyses (Table 2) showed that children who frequently fell asleep with bottle, less frequently rinsed their mouth after eating, consumed snacks ≥2 times/day, consumed candy ≥1 time/day, brushed less often before bedtime, or had caries at baseline all had significantly higher caries incidence (χ² test, P < 0.001) and greater Δdmft (Mann–Whitney U test p < 0.001).

Table 2 Children distribution according to the caries incidence (n = 919).

zero-inflated negative binomial (ZINB) regression of ECC risk factors

Vuong’s tests indicated that a zero-inflated Poisson model fit significantly better than a standard Poisson model (z = 9.89, P < 0.001), and a zero-inflated negative binomial (ZINB) fit better than a standard negative binomial (z = 5.18, P < 0.001). A likelihood ratio test further showed that the ZINB model (with dispersion parameter α ≈ 0.45) provided a superior fit over the zero-inflated Poisson model (P < 0.001). Variance inflation factors (VIF) were calculated under a linear regression framework to check multicollinearity; all predictors had VIF < 1.18 (mean 1.08), indicating no significant multicollinearity.

After stepwise reduction, the final ZINB model for the full sample (Table 3, Model 1) achieved an AIC of 2101.92 which reduced from 2123.72. The results from the negative binomial part indicated that snack intake frequency ≥2/day (IRR = 1.43; 95% CI 1.13–1.82; p < 0.01) and less frequent bedtime brushing (IRR = 1.37; 95% CI 1.08–1.74; p < 0.05) significantly predicted larger Δdmft. In the zero-inflation component, frequent bottle use at bedtime (OR = 0.42; 95% CI 0.21–0.83; p < 0.05), snack frequency ≥2/day (OR = 0.65; 95% CI 0.44–0.96; p < 0.05), candy frequency ≥1/day (OR = 0.38; 95% CI 0.22–0.66; p < 0.001), less frequent bedtime brushing (OR = 0.61; 95% CI 0.40–0.92; p < 0.05), and having caries at baseline examination (OR = 0.38; 95% CI 0.26–0.57; p < 0.001) were all associated with a higher probability of being in the “susceptible to new caries” group.

Table 3 Model 1: Risk factors for ECC in all surveyed children (ZINB regression model, n = 919).

For the subset of children who were caries-free at baseline (Table 4, Model 2), a separate ZINB model was fitted. After model reduction, the final Model 2 had an AIC of 1190.52, which reduced from 1201.64. The results from the negative binomial part indicated that significant predictors of Δdmft were: at least one parent with caries (IRR = 1.53; 95% CI 1.12–2.09; p < 0.01), frequent saliva-sharing behavior (IRR = 1.39; 95% CI 1.03–1.89; p < 0.05), frequent bottle use at bedtime (IRR = 1.59; 95% CI 1.08–2.35; p < 0.05), less frequent post-meal mouth rinsing (IRR = 1.51; 95% CI 1.10–2.08; p < 0.05), presence of additives in daily drinking water (IRR = 1.97; 95% CI 1.35–2.89; p < 0.001), and less frequent bedtime brushing (IRR = 1.60; 95% CI 1.18–2.18; p < 0.05). In the zero-inflated part, children whose snack intake frequency ≥2/day had a higher probability of being susceptible to new caries (OR = 0.56; 95% CI 0.35–0.90; p < 0.05).

Table 4 Model 2: Risk factors for ECC of baseline caries-free children (ZINB regression model, n = 636).

Interaction analyses (Table 5) showed that for the negative binomial component, the effects of parents’ caries status (β = –0.89; p < 0.001), saliva-sharing frequency (β = –0.62; p < 0.01), post-meal rinsing frequency (β = –0.74; p < 0.01), presence of additives in daily drinking water (β = –0.61; p < 0.05), and bedtime brushing frequency (β = –0.51; p < 0.05) on Δdmft were significantly greater in children without baseline caries than in those with baseline caries. In the zero-inflation component, having a history of dental attendance had a significantly stronger positive effect on the probability of new caries among children without baseline caries (β = –1.09; p < 0.05). Complete results of the interaction model (Model 3) are provided in Table 6.

Table 5 Interaction effects of baseline caries status with other variables.
Table 6 Model3 ZINB model in the full sample incorporating interaction terms between baseline caries status and other variables (n = 919).

Discussion

In this study of children aged 1–2 years in Beijing, we identified several risk factors significantly associated with the development of early childhood caries (ECC) and demonstrated a significant modifying effect of baseline caries status on the impact of these risk factors. Using a zero-inflated negative binomial (ZINB) regression model, we found that certain behavioral factors (e.g., frequent snack consumption and inadequate oral cleaning) were associated with a significantly higher risk of incident ECC. Notably, however, the strength of these associations depended on the child’s baseline caries status. Among children who were caries-free at baseline, adverse dietary habits and suboptimal oral hygiene behaviors were closely associated with new caries onset during follow-up, whereas among children with caries at baseline, the associations between these behavioral factors and further increments in caries were markedly attenuated or non-significant. These findings highlight baseline caries experience as an effect modifier, reshaping the observed relationships between specific oral health behaviors and subsequent caries progression.

The relationship between children’s oral health behaviors and caries has been well-studied, but most research has focused on preschool-aged children (3–5 years old) and is often cross-sectional in design [7]. Longitudinal studies on oral hygiene behaviors and caries risk in infants and young toddlers are relatively few. Tashiro et al. [8], Syafri̇za et al. [9], and Wigen et al. [10] conducted oral examinations and questionnaire surveys among infants and toddlers aged 0–2 years and investigated the associations between the above factors and new developed caries during follow-up. These and most similar studies primarily relied on univariate comparisons or multivariable logistic regression for analysis, which may have limited statistical power. In the present study, we initially attempted to use a multivariable logistic regression to predict caries incidence, but due to the highly unbalanced outcome (many children had no incident caries), the logistic model’s sensitivity was effectively 0. Given the large number of zero values in dmft and the over-dispersion of incident dmft score, Böhning et al. [11] and Lewsey et al. [12] introduced the zero-inflated negative binomial model to dental caries research. This model assumes the population consists of “not susceptible to new caries” children and “susceptible” children and models each subpopulation separately. Compared to traditional linear models or standard Poisson regression, the ZINB model has been shown to provide a better fit for caries count data [13]. To our knowledge, the present study is the first to apply a ZINB model in a longitudinal study of caries risk among infants and young toddlers.

Our findings confirm that baseline caries experience is a strong predictor of future caries risk, consistent with previous studies [14]. In Model 1, children who were caries-free at baseline had 0.38 times the odds of being classified as caries-susceptible compared with those with caries at baseline (OR = 0.38). In Model 3, when all behavioral variables were at their reference levels, children with caries at baseline exhibited approximately a four-fold greater caries increment during follow-up than those who were caries-free at baseline (IRR = 4.08). Our results further indicate that baseline caries experience is not merely an independent risk factor; rather, it represents an integrated reflection of tooth susceptibility to caries, the plaque micro-ecosystem, diet, hygiene practices, fluoride use, dental treatment, and other caries-related factors [15]. Accordingly, baseline caries experience may, to some extent, “reshape” the relative contribution of other behavioral factors to incident caries risk. Among children who were caries-free at baseline, parental caries history, rinsing after meals, and daily drinking-water habits were significantly associated with caries increment. However, these associations were attenuated among children with caries at baseline and, in some instances, even appeared as a reversed “protective” effect. Saliva-sharing behaviors—an established pathway for the vertical transmission of mutans streptococci—showed significant differences in their association with caries increment between children with different baseline caries status [16]. This is consistent with the findings of O’Sullivan and colleagues, who reported that high baseline mutans streptococci levels more strongly predicted caries increment among children who were initially caries-free [17]. Collectively, these findings provide statistical support for developing stratified and graded prevention strategies: for children who are caries-free at baseline, behavioral interventions and family-based oral hygiene management are critical for reducing future caries risk.

The frequency of bedtime brushing, the frequency of snacking, and the frequency of falling asleep with a bottle were all significant risk factors in both Model 1 and Model 2. Snacks commonly introduce sticky carbohydrates [18, 19]; nocturnal bottle use results in prolonged carbohydrate exposure during periods of low oral activity [20, 21]; and infrequent bedtime toothbrushing promotes dental plaque accumulation [22, 23]. The increased caries risk attributable to these factors has been confirmed in multiple studies. Notably, bedtime toothbrushing showed a significant association with the caries incidence among children with caries at baseline, but not with their caries increment; in contrast, the pattern was reversed among children who were caries-free at baseline. This suggests that once the disease process has been initiated, behavioral modification alone may be insufficient to halt further progression. In such cases, clinical preventive measures should be incorporated, such as fissure sealants and professionally applied fluoride [5].

In Model 3, we observed that some poor oral habits were associated with negative coefficients for the dmft increment. This does not mean that these behaviors are protective against caries; rather, it likely reflects the combined effects of index event bias and a ceiling effect. On one hand, both baseline caries status and caries increment are influenced by numerous risk factors. This interrelation can introduce index event bias, causing the contribution of a given behavior to the caries increment to be significantly underestimated or even appear reversed [24, 25]. On the other hand, children with caries at baseline may have already accumulated a high dmft score, leaving limited room for new lesions—a ceiling effect [26]. As a result, their dmft increment during the follow-up is small, and in the model this manifests as a “spurious protective association” for that behavior. Understanding these biases is important when interpreting the model results, to avoid misclassification of harmful behaviors as seemingly protective in certain subgroups.

We did not observe a significant protective effect of floss use or toothpaste use, which contrasts with some previous studies [27]. This discrepancy may be partly explained by limitations inherent in the questionnaire survey. To maintain broad program coverage and practical feasibility, the questionnaire could not be overly complex. As a result, information on the fluoride concentration of the toothpaste, frequency of use, and the effectiveness of toothbrushing was not further verified. In addition, interproximal contacts in the primary dentition typically become established more tightly at around 3–4 years of age; therefore, in this younger cohort, the benefits of flossing may not be statistically detectable. With respect to dental attendance, a history of dental visits showed a protective tendency in the caries-free group but was associated with higher risk in the baseline caries-present group. This pattern may reflect indication bias, whereby dental visits among caries-free children are typically preventive, whereas those among children with existing caries are more often treatment-driven [28]. Future risk assessment should distinguish preventive attendance from problem-oriented visits.

This study was nested within two large-scale community-based oral public health programs routinely implemented in Beijing, whose primary objective was to reduce the prevalence of dental caries among children through sustainable public health services. Owing to the high population mobility in the study area, the rate of loss to follow-up was relatively high. In addition, the study sample was not derived from a standard probability sampling procedure; therefore, the representativeness of the sample with respect to the target population remains uncertain. At present, some longitudinal studies of dental caries have been conducted on the basis of large-scale registry data, electronic health records, or routine public oral health service databases, with methodological emphasis typically placed on data source, outcome definition, follow-up pathway, and statistical control rather than examiner calibration specifically implemented for research purposes [29, 30]. Similarly, although formal examiner calibration was not available in the present study, the data were generated through a large-scale public health service process and therefore still retain research value. In the context of the long-term implementation of public health programs, this exploratory work could in the future be extended to larger samples and longer observation periods to further enhance the robustness of the findings.

A further methodological consideration in the present study was the choice of caries diagnostic criteria. Compared with the WHO criteria, ICDAS is more sensitive in detecting early carious lesions; however, it is also more complex and requires more extensive training and examiner calibration to ensure reliable and consistent assessments. In large-scale primary care or community-based oral screening, the WHO criteria are more feasible to implement and more likely to yield stable and consistent examination results, although they may underestimate early lesions. Despite this limitation, significant associations were still identified between the selected caries risk factors and caries outcomes, indicating that the main findings of this study were not substantively affected by the diagnostic criteria adopted.

Compared with other caries risk studies, we collected relatively rich questionnaire data; however, as the information relied on parental self-report, it may be subject to recall bias and social desirability bias. In terms of social background characteristics, the questionnaire in the present study recorded only parental caries status. Some studies have suggested that maternal caries status may represent a more direct family-level marker of a child’s caries risk, as it may capture the combined influence of maternal oral health behaviors, knowledge, and other social determinants [31, 32]. However, a considerable body of literature has also demonstrated significant associations of ECC risk with factors such as maternal educational level, household income, and parental smoking. Therefore, the assessment of sociodemographic and parental factors in this study remains incomplete and should be expanded in future research. Furthermore, in order to reduce demands on human resources, financial resources, and time while maximizing program coverage, no biological assessments were performed and no objective plaque index was recorded, which also constrained interpretation of the potential mechanisms underlying the model [33, 34]. Baseline caries was analyzed as a binary variable rather than as a continuous dmft score, which may have obscured a gradient in disease severity [15, 30].

Conclusion

This cohort study of children aged 1–2 years in Beijing further elucidates the multifactorial etiology of early childhood caries (ECC) and highlights baseline caries status as a strong determinant of risk. We observed a modifying effect of baseline status on multiple cariogenic factors, which may help to optimize risk-stratified early intervention strategies and promote personalized oral health management from the earliest stages of life.