Introduction

The prevalence of gingivitis in children is high, affecting half of the population by age 4–5 years, and some reports suggest that the prevalence reaches nearly 100% by adolescence1. However, periodontitis in children may be associated with systemic or immune disorders, or may result in aggressive periodontitis of unknown etiology, but these are quite rare1,2. According to many periodontists, bleeding on probing is the most reliable method for assessing a patient’s gingival status in clinical practice3. Once children can cooperate with bleeding on probing, it is recommended that they begin regular probing to detect early signs of periodontal disease and prevent its progression.

Gingivitis in children is frequently observed, mainly due to poor oral hygiene. Saliva is an effective tool for objectively evaluating biological data related to health, including information about the oral cavity, which can also be used to evaluate gingivitis4,5. Liquid after gargling can be collected more easily than saliva, making it particularly useful when testing children. However, few studies have attempted to evaluate the state of gingivitis in children using liquid after gargling.

Testing methods for periodontal disease include kits that can detect blood-derived hemoglobin from inflamed gingiva in saliva or liquid after gargling6,7. Hemoglobin can hold oxygen in an aerobic environment (oxy form), while hemoglobin without oxygen in an anaerobic environment (deoxy form) has a high buffering capacity8. Current hemoglobin detection kits are accurate and highly sensitive, but they only allow quantitative assessment of hemoglobin.

The recently developed Salivary Multi Test (SMT; Lion Dental Products Co. Ltd, Tokyo, Japan) uses saliva or liquid after gargling to obtain multiple items of information about oral health in a short time9. The SMT can measure six items: cariogenic bacteria, acidity, buffering capacity, white blood cells, protein, and ammonia. Cariogenic bacteria, acidity, and buffering capacity are used as evaluation items for dental caries risk, white blood cells and protein are used as evaluation items for periodontal condition, and ammonia is used as an evaluation item for oral hygiene9. The SMT is easy to apply to young children who are able to gargle10.

In the present study, we developed a novel method for specifically and simply assessing the risk of gingival inflammation in children by combining a hemoglobin detection kit and six items from the SMT, using liquid after gargling.

Results

Subjects

Subjects were 76 children (50 boys and 26 girls) who visited the Pediatric Dentistry Department of Hiroshima University Hospital. The subjects ranged in age from 3 to 19 years, with a mean age of 8.5 ± 3.9 years (mean age ± standard deviation) and a median age of 7 years.

Relationship between hemoglobin concentration and PMA index

Gingival inflammation of the maxillary anterior teeth was evaluated using PMA index. Each region was assigned a score of 1 for the presence of inflammation and 0 for the absence of inflammation, with the sum of these scores calculated as the maximum of 34. Subjects’ scores ranged from 0 to 24 points, with the distributions being 0 (21.1%), 1–5 (28.9%), and 6–10 (30.3%), with less than 10% of subjects scoring 11 or more points (Fig. 1A).

Fig. 1
figure 1

Distribution of papillary marginal attached (PMA) values and hemoglobin concentrations by Perioscreen and their relationships. (A) Distribution of PMA values. (B) Distribution of hemoglobin concentrations. (C) Relationship between PMA value and hemoglobin concentration. Each circle represents the data of a subject. Bars indicate the mean PMA score. Data expressed as the mean ± standard deviation. ***P < 0.001 between the groups.

Hemoglobin levels were measured in the liquid after gargling using Perioscreen, and the percentages of subjects with hemoglobin concentrations of 0 μg/mL, 2 μg/mL, and 5 μg/mL were distributed between 30 and 40% (Fig. 1B). The PMA values were significantly higher in subjects with hemoglobin concentrations of 2 μg/mL (P = 0.0010) and 5 μg/mL (P < 0.0001) compared to subjects with a hemoglobin concentration of 0 μg/mL (Fig. 1C).

Relationship between SMT items and PMA index

The liquid collected after gargling was analyzed using an SMT device to quantify six factors (cariogenic bacteria, acidity, buffering capacity, white blood cells, protein, and ammonia), each with a minimum score of 0 and a maximum score of 100. The score ranges for each parameter were as follows (minimum score—maximum score, mean ± standard error): cariogenic bacteria 0–60, 20.0 ± 2.0; acidity 0–100, 55.6 ± 2.6; buffering capacity 0–70, 20.9 ± 1.7; white blood cells 0–79, 33.9 ± 2.6; protein 0–88, 38.7 ± 2.0; and ammonia 11–100, 45.0 ± 2.2. The data from the regression analysis between the PMA index and SMT items are shown in Table 1. The slope of the correlations between the PMA index and SMT items showed significant differences for four of the six SMT items (acidity, buffering capacity, protein, and ammonia) (P < 0.05) (Figs. 2A and 3F). Of these, buffering capacity and protein had a coefficient of determination of 0.25 or more.

Table 1 Data from the regression analysis between the PMA index and SMT items.
Fig. 2
figure 2

Relationship between papillary marginal attached (PMA) value and Salivary Multi Test (SMT) items. SMT items: cariogenic bacteria (A), acidity (B), buffering capacity (C), white blood cells (D), protein (E), and ammonia (F). Each circle represents the data of a subject.

Fig. 3
figure 3

Relationship between the Perioscreen hemoglobin concentration and Salivary Multi Test (SMT) items. SMT items: cariogenic bacteria (A), acidity (B), buffering capacity (C), white blood cells (D), protein (E), and ammonia (F). Each circle represents the data of a subject. Bars represent the mean SMT value. Data expressed as the mean ± standard deviation. *P < 0.05 and ***P < 0.001 between the groups.

Relationship between hemoglobin concentration and SMT items

Next, we analyzed the correlation between the hemoglobin concentration obtained from the Perioscreen and the six SMT items (Figs. 3A–F). The item of protein increased depending on the hemoglobin concentration and there were significant differences between each group (0 μg/mL vs 2 μg/mL P = 0.0347; 0 μg/mL vs 5 μg/mL P < 0.0001; 2 μg/mL vs 5 μg/mL P = 0.0151). However, the other five items did not differ significantly between each hemoglobin concentration. There was a significant difference between buffering capacity and protein in the linear trend test (buffering capacity P = 0.0146; protein P < 0.0001).

Comparison of PMA index, hemoglobin concentration and SMT items between each age group

The subjects were divided into the following groups based on age (age range, mean age ± standard deviation); preschoolers (3–5 years, 4.4 ± 0.7 years) (n = 18), middle childhood (6–12 years, 8.1 ± 2.0 years) (n = 44), and adolescents (13–19 years, 15.0 ± 1.8 years) (n = 14). Data for each age group regarding evaluation items related to oral health are shown in Table 2. Gingival inflammation measured using the papillary marginal attached (PMA) index was significantly higher in the middle childhood (P = 0.0010) and adolescent groups than in the preschoolers (P = 0.0001). In addition, the white blood cells and protein levels in the liquid after gargling with Perioscreen (Sunstar Inc., Osaka, Japan) or SMT were significantly higher in the middle childhood group than in the preschoolers (white blood cells P = 0.0498; protein P = 0.0451). Furthermore, the hemoglobin, buffering capacity, and protein levels in the liquid after gargling with Perioscreen or SMT were significantly higher in the adolescent group than in the preschoolers (hemoglobin P = 0.0104; buffering capacity P = 0.0063; protein, P = 0.0007).

Table 2 Test data for each age group.

Receiver operating characteristic (ROC) analysis for the diagnosis of gingival inflammation

The hemoglobin concentration as measured by the Perioscreen and the two SMT items of buffering capacity and protein showed correlations with the PMA index; we decided to focus on these for further analysis. We performed receiver operating characteristic (ROC) analysis using hemoglobin, buffering capacity, and protein as explanatory variables, with PMA values as the outcome (low PMA score group (0–5, n = 38) and high PMA score group (6–24, n = 38)). For hemoglobin, a cutoff value of 1 obtained using the Youden index yielded a sensitivity of 0.947, specificity of 0.553, and area under the curve (AUC) of 0.787 (Fig. 4A). For buffering capacity, a cutoff value of 21 yielded a sensitivity of 0.658, specificity of 0.789, and AUC of 0.744 (Fig. 4B). For protein, a cutoff value of 36 yielded a sensitivity of 0.763, specificity of 0.737, and AUC of 0.780 (Fig. 4C).

Fig. 4
figure 4

Receiver operating characteristic (ROC) curves for the identification of gingivitis using each evaluation method. (A) Hemoglobin: area under the curve (AUC) = 0.787, cutoff value 1, sensitivity = 0.947, specificity = 0.553. (B) Buffering capacity: AUC = 0.744, cutoff value 21, sensitivity = 0.658, specificity = 0.789. (C) Protein: AUC = 0.780, cutoff value 36, sensitivity = 0.763, specificity = 0.737.

Relationship between hemoglobin concentration, SMT items, and PMA index

The items of buffering capacity and protein were divided into two groups based on the median value of all subjects (buffering capacity 17 and 18, protein 35 and 36). Hemoglobin concentration was divided into two groups, with 0 μg/mL as negative and 2 μg/mL and 5 μg/mL as positive.

In the analysis focusing on hemoglobin and buffering capacity, the hemoglobin-positive / high buffering capacity group (18) had significantly higher PMA values than both of the hemoglobin-negative groups (buffering capacity17 P < 0.0001; buffering capacity18 P = 0.0028) and the hemoglobin-positive / low buffering capacity group (17) (P = 0.0276) (Fig. 5A). In the analysis focusing on hemoglobin and protein, the hemoglobin-positive / high protein group (36) had significantly higher PMA values than the hemoglobin-negative / low protein group (35) (P < 0.0001) (Fig. 5B). Regarding the relationship between buffering capacity and protein, the high buffering capacity / high protein group had significantly higher PMA values than the low buffering capacity / low protein group (P < 0.0001) and the high buffering capacity / low protein group (P = 0.0057) (Fig. 5C).

Fig. 5
figure 5

Relationship between papillary marginal attached (PMA) value, the Perioscreen hemoglobin concentration, and Salivary Multi Test (SMT) items of buffering capacity and protein. (A) Relationship between PMA value, hemoglobin concentration, and buffering capacity. (B) Relationship between PMA value, hemoglobin concentration, and protein. (C) Relationship between PMA value, buffering capacity, and protein. (D) Relationship between PMA value, hemoglobin, buffering capacity, and protein. Each circle represents the data of a subject. Bars represent the mean PMA value. Data expressed as the mean ± standard deviation. *P < 0.05, **P < 0.01, and ***P < 0.001 between the groups.

We then focused on the three items of hemoglobin, buffering capacity, and protein. The group that was hemoglobin-positive and with high values for both buffering capacity and protein had significantly higher PMA values than the groups that were hemoglobin-negative and with low values for protein (buffering capacity17 P < 0.0001; buffering capacity18 P = 0.0041), and the group that was hemoglobin-positive and with low values for both buffering capacity and protein (P = 0.0418) (Fig. 5D).

Multivariate ROC analysis for the diagnosis of gingival inflammation

We performed a multivariate ROC analysis based on a logistic regression model, combining variables including hemoglobin, buffering capacity, and protein, with PMA score as the outcome (Table 3, Supplementary Fig. 1). The AUCs for hemoglobin and buffering capacity, hemoglobin and protein, and buffering capacity and protein were 0.848, 0.834, and 0.819, respectively, and all of which had higher AUCs than models using any single variable (0.744–0.787). Furthermore, the AUC for the combined hemoglobin, buffer capacity, and protein variables was 0.856, which was significantly higher than the AUC for any single variable (vs hemoglobin P = 0.035; vs buffering capacity P = 0.032; vs protein P = 0.050).

Table 3 Multivariate Receiver Operating Characteristic (ROC) analysis using hemoglobin, buffering capacity, and protein as explanatory variables, either singly or in combination, with gingivitis as the outcome measured by papillary marginal attached (PMA) value.

Discussion

Kits that can test for blood-derived hemoglobin in gingival crevicular fluid using saliva or liquid after gargling have been used mainly in adults6,7. Recent studies in adults have reported that detection of hemoglobin in gingival crevicular fluid can be a useful marker for detecting subtle damage to periodontal tissues that may be a sign of the onset of periodontal disease, even in cases where bleeding on probing is not a problem11. We aimed to analyze whether hemoglobin in gingival crevicular fluid can be detected in children, who have fewer teeth than adults, shallower gingival sulci, and little severe periodontitis. Our results show that the Perioscreen detected 2 μg/mL or 5 μg/mL of hemoglobin in the liquid after gargling of approximately 70% of children. The PMA values were significantly higher in hemoglobin-positive subjects than in hemoglobin-negative subjects, suggesting that this is a simple and effective method for screening gingival inflammation.

Unexpectedly, the SMT item of buffering capacity, which is a criterion for evaluating dental caries, positively correlated with the PMA value. Hemoglobin has a buffering capacity, and structurally, hemoglobin that does not hold oxygen (deoxy form) has a higher buffering capacity than hemoglobin that holds oxygen (oxy form)8. In the present study, correlations were found between hemoglobin and the PMA value, and between buffering capacity and the PMA value. These results suggest that the buffering capacity value of the SMT index can include hemoglobin leaked from periodontal tissues. It is possible that the buffering capacity value increases when hemoglobin increases quantitatively due to periodontal tissue destruction or when hemoglobin qualitatively changes to the deoxy form under anaerobic conditions in the periodontal pocket.

The SMT items of white blood cells and protein were used to evaluate periodontal status9. There was no correlation between white blood cells and PMA value. White blood cells, including neutrophils, macrophages, B cells, and substances released from these cells, are activated by infection with highly pathogenic bacteria such as Porphyromonas gingivalis, causing inflammation and progression of periodontitis12,13,14,15. Unlike adults, children rarely suffer from periodontitis, and highly pathogenic periodontal bacteria are rarely detected16,17,18. In the present study, none of the subjects had severe periodontitis, and it is possible that there was little induction of white blood cells associated with bacterial infection in the periodontal tissues.

Proteins in the gingival crevicular fluid are biochemical indicators that increase in the early stages of periodontal disease19. Gingival crevicular fluid contains proteins such as defensins, annexins, transferrin, lactoferrin, and albumin20,21. In the present study, the SMT item of protein correlated with PMA value, and it became clear that protein measurement is effective for evaluating gingivitis in children. Additionally, the Perioscreen hemoglobin value correlated with the SMT item of protein, possibly because hemoglobin is the main iron-containing protein in blood22. SMT protein was detected when no hemoglobin was detected by the Perioscreen probe, suggesting that the SMT detects proteins other than hemoglobin in gingival crevicular fluid.

ROC analysis allows for visual and statistical understanding of diagnostic accuracy and is widely used in the medical field23,24. The results of the ROC analysis suggested that hemoglobin, buffering capacity, and protein may all be valuable biomarkers for identifying gingivitis. Hemoglobin has a high sensitivity (0.947), making it useful for screening purposes; however, its low specificity (0.553) may lead to false positives. Buffering capacity has a relatively high specificity (0.789), potentially contributing to improved positive predictive value, but the somewhat low sensitivity (0.658) means caution is advised when using it alone. Protein exhibited a good balance between sensitivity (0.763) and specificity (0.737), demonstrating its usefulness as a diagnostic indicator for gingivitis.

When the influence of the SMT items of buffering capacity or protein on the PMA index was analyzed in combination with the Perioscreen hemoglobin, the PMA value was significantly higher in the high buffering capacity group than in the low buffering capacity group in the hemoglobin-positive group. Additionally, the PMA value was significantly higher in the high protein group than in the low protein group in the hemoglobin-positive group. These results suggest that by combining the Perioscreen and the SMT, in addition to the existing quantitative evaluation for hemoglobin6,7, it is possible to qualitatively evaluate hemoglobin by focusing on buffering capacity, and to detect a variety of proteins. Furthermore, when subjects were divided into groups based on the three items of hemoglobin, buffering capacity, and protein, the group that was hemoglobin-positive with high buffering capacity and high protein had the highest PMA values. These results indicate that determining buffering capacity and protein using the SMT in addition to conventional hemoglobin testing is effective in detecting subjects at highest risk of periodontal disease.

Because hemoglobin, buffering capacity, and protein each have different sensitivity and specificity, combining them may enable more accurate gingivitis detection. Therefore, we performed a multivariate ROC analysis25,26 that combined these indices to examine their accuracy in identifying gingivitis. Among single variables, hemoglobin (AUC = 0.787) demonstrated the highest discrimination performance, followed by protein (AUC = 0.780) and buffering capacity (AUC = 0.744). However, combining variables significantly improved discrimination accuracy, with hemoglobin + buffering capacity (AUC = 0.848), hemoglobin + protein (AUC = 0.834), and buffering capacity + protein (AUC = 0.819) all outperforming the individual models. Furthermore, the model using all three items showed the highest discrimination accuracy (AUC = 0.856), suggesting that it may be a useful predictive model for detecting gingivitis.

Our study has a limitation due to a small sample size of 76 subjects and the fact that there were more males than females. Clinical studies containing essential insights into the diagnosis, risk assessment, and prevention of oral diseases in pediatric dentistry often involve small sample sizes of several dozen27,28,29. Meanwhile, systematic reviews and meta-analyses of papers with small sample sizes, ranging from tens to several dozen, have shown promising associations30. Although our study’s small sample size limits its statistical power, we believe that our novel diagnostic approach, focusing on hemoglobin, buffering capacity, and protein, may play a helpful role in evaluating pediatric gingivitis. To confirm the findings of this study, further studies with larger sample sizes and increased statistical power are needed.

Another limitation is that we used PMA to evaluate the subjects’ gingival inflammation. While PMA can assess the prevalence of gingival inflammation, it is not suitable for determining the severity. To assess the severity of subjects’ gingival inflammation, it is preferable to evaluate bleeding on probing, which is recommended by many periodontists as the most reliable method3. In future developments, evaluating bleeding on probing to detect early signs of periodontal disease and prevent its progression is needed.

A possible mechanism for detecting gingivitis risk based on the results of our study is presented in Fig. 6. The Perioscreen detects hemoglobin derived from the gingival crevicular fluid of children with gingivitis. The SMT not only detects proteins including hemoglobin, but also detects the buffering capacity of hemoglobin, especially in the deoxy form. Our findings suggest that a combination of the Perioscreen and the SMT is associated with the prevalence of gingival inflammation and is effective as a novel tool for diagnosing gingival disease in children.

Fig. 6
figure 6

A possible mechanism for assessing gingivitis risk by the detection of hemoglobin, buffering capacity, and protein from gingival crevicular fluid using the Perioscreen and the Salivary Multi Test (SMT).

Methods

Subjects

This study was approved by the Hiroshima University Epidemiological Research Ethics Committee (Approval Number: No. E-D56-4). The study was performed in accordance with the Declaration of Helsinki. The subjects were 76 children (50 boys and 26 girls, aged 3–19 years, 8.5 ± 3.9 years (mean age ± standard deviation), median 7 years) who visited the pediatric dentistry department of Hiroshima University Hospital. Parents and guardians were informed of the purpose of this study and provided written informed consent. Subjects were recruited consecutively, and those who had received antibiotics within the past month or had abnormalities in the oral mucosa were excluded.

Evaluation of gingival condition

Gingival condition was evaluated by dentists with at least 5 years of experience in pediatric dentistry using the PMA index in accordance with previous studies31,32,33. Briefly, the labial gingiva between the upper or lower canines (between the primary canines in the case of primary dentition) was divided into three areas: the interdental papilla, the marginal gingiva, and the attached gingiva. Each area was assigned a score of 1 when there was inflammation, and a score of 0 when there was not, and the sum was calculated as the PMA value (the minimum score was 0 and the maximum score was 34).

Perioscreen test

Eating, drinking, and undertaking oral hygiene were not permitted for at least 2 h prior to the test, and the liquid after gargling was collected after rinsing with 3 mL of distilled water for 10 s. The liquid after gargling was dropped onto test paper coated with anti-human hemoglobin monoclonal antibody-conjugated gold colloid provided with the Perioscreen kit34. After 5 min, the color change of the test paper due to the antigen–antibody reaction (0 μg/mL, 2 μg/mL, 5 μg/mL) was judged by comparing it to graded samples.

SMT test

The liquid after gargling was dropped onto a test strip attached to the SMT device and analyzed for 5 min to quantify each factor (cariogenic bacteria, acidity, buffering capacity, white blood cells, protein, and ammonia). The minimum and maximum scores for each item were 0 and 100, respectively.

ROC analysis

We divided 76 subjects into two groups: a low PMA score group (0–5; n = 38) and a high PMA score group (6–24; n = 38). With PMA as the outcome and hemoglobin, buffering capacity, and protein as individual variables, we performed ROC analysis to determine whether each variable helped identify gingivitis. The Youden index method, which maximizes the sum of sensitivity and specificity, was used to determine cutoff values. Decimal cutoff values ​​obtained using the Youden index were rounded up to the nearest integer for clinical use. Sensitivity, specificity, and AUC were calculated for each cutoff value to evaluate the accuracy of identifying gingivitis. In addition, multivariate ROC analysis was performed using logistic regression analysis, with PMA values ​​as the outcome and hemoglobin, buffering capacity, and protein measurements as explanatory variables. The seven variables analyzed were: (1) hemoglobin alone, (2) buffering capacity alone, (3) protein alone, (4) hemoglobin + buffering capacity, (5) hemoglobin + protein, (6) buffering capacity + protein, and (7) hemoglobin + buffering capacity + protein. ROC curves were created using the predicted values ​​(linear predictors) in each logistic regression model, and the AUC was calculated.

Statistical analysis

Statistical analyses were conducted using GraphPad Prism 9 (GraphPad Software Inc., La Jolla, CA, USA). The results were presented as mean ± standard deviation or mean ± standard error. Statistical significance was determined using the Kruskal–Wallis test for nonparametric analysis, followed by the Dunn test for multiple comparisons. The correlation between PMA value and the scores of each SMT item was calculated using simple linear regression to obtain the intercept, slope, R squared value, and P value. The test for linear trend was performed using one-way ANOVA. ROC analysis was performed using SPSS for Windows version 22.0 (IBM Japan, Tokyo, Japan), and AUC comparison was performed using R version 4.3.1 (pROC package). Statistically significant differences were determined at P < 0.05.