Introduction

Frailty refers to an aging-associated state of decline in physical, mental, and social functions1. This condition includes a reduction in muscle strength, the deterioration of nutritional status, decreased activity levels, cognitive decline, and increased feelings of loneliness. Frailty is associated with adverse outcomes such as increased mortality and higher rates of hospitalization, disability, and falls2. It is a preventable stage that occurs before dependency or the need for advanced medical care, and appropriate interventions can slow its progression.

In cross-sectional and longitudinal studies, oral status has been reported to contribute potentially to the prevention of sarcopenia and frailty through nutritional status. In a cross-sectional study using structural equation modeling, Sawada et al.1,2 reported that tongue–lip motor function was positively associated with nutritional status, and that nutritional status was negatively associated with frailty. In a review, Moynihan et al.3 found that tooth loss was associated with an increased risk of sarcopenia, but the data were largely cross-sectional; thus, longitudinal data and intervention studies are needed to clarify whether the association between poor oral function and sarcopenia is causal or bidirectional.

Silva et al.4 reported that tongue strengthening exercises not only increase muscle strength and thickness and exert a positive effect on swallowing, but also prevent sarcopenia and avoid functional changes5,6. Kawamura et al.7 found that individuals who routinely chewed gum had better oral, cognitive, and physical functions, and thus may contribute to the prevention of frailty.

In another review, Morisaki8 discussed oral exercise methods for improving oral function based on 14 previous studies and reported that oral exercises should involve a combination of multiple movements and be performed frequently for more than 2 months. However, many questions remain regarding the optimal methods.

Oral function training has been reported to maintain and improve oral function; however, few intervention studies have reported associations between the maintenance and improvement of nutritional status and physical function. Sasajima et al.9 reported significant improvements in the one-leg standing time with eyes open and the Timed Up and Go tests after 3 months of physical and oral function training. However, they also stated that more detailed studies are needed to confirm the relationship between oral health and physical performance.

To our knowledge, very few intervention studies investigate the relationship between frailty and oral function training alone. In addition, reports discussing the frequency of oral function training feasible for social implementation are lacking. In the present study, we hypothesize that the implementation of oral exercises leads to improvements in nutritional status, sarcopenia, and frailty, and that a minimum frequency of oral exercises is required for these improvements to occur. Given this background, this randomized controlled trial aimed to examine associations with improvements in nutritional status, sarcopenia, and frailty in four groups with different oral exercise frequencies. The results show that oral exercises performed three times per day and three days per week improve frailty status and achieve the highest adherence. These findings suggest that a socially feasible oral exercise routine can contribute to reducing frailty in older adults without disabilities or cognitive impairments.

Methods

Study design

This study was planned and conducted as a prospective, parallel multi-arm randomized controlled trial following the CONSORT 2025 checklist. The intervention consisted of an identical oral exercise program across all groups, with only the frequency differing among the four groups. The four intervention groups were designed not only to examine the effect of oral exercises on pre-frailty or frailty improvement, but also to explore feasible implementation strategies for social application. Consequently, the study concentrated on the comparison of practical combinations of oral exercise frequency (per day and per week). Written informed consent was obtained from all participants. This trial is registered in the Japan Registry of Clinical Trials (jRCT; jRCT1062210063) on December 6, 2021 and is completed. The full trial protocol is publicly available in the Japan Registry of Clinical Trials under the registration number jRCT1062210063. There were no important changes in methodology after the trial commencement.

Participants

The study participants were recruited from participants aged ≥60 years who regularly visited the Department of Preventive Dentistry at Okayama University Hospital. The exclusion criteria included being unable to walk independently or respond to the questionnaire. Participants or members of the public were not involved in the design, conduct, or reporting of this trial.

Recruitment

The recruitment period was from October 11, 2022, to December 22, 2023. After a 3-month intervention period for each participant, reevaluations were conducted from January 13 to March 27, 2024. Follow-up for outcomes of benefits and harms lasted for at least 3 months after the intervention for all participants.

Participant flow

The details of the participant flow in the present study are shown in Fig. 1. In this study, 105 participants who provided consent were recruited. From the eligibility assessment for frailty, 60 individuals categorized as pre-frailty or frailty were selected and randomly assigned to the following four oral exercise groups: (1) three times a day, seven days a week (3 times/day & everyday; n = 15); (2) three times a day, three days a week (3 times/day & 3 days/week; n = 15); (3) once a day, seven days a week (once/day & everyday; n = 15); and (4) once a day, three days a week (once/day & 3 days/week; n = 15).

Fig. 1: Flow chart of oral exercise intervention program participants.
figure 1

This figure shows the participant flow through the study, including screening, randomization into four oral exercise frequency groups (3 times/day & everyday; 3 times/day & 3 days/week; once/day & everyday; once/day & 3 days/week), and inclusion in intention-to-treat (n = 58) and per-protocol analyses (n = 53).

Sample size

Sample size was calculated using STATA software (version 19.5; Stata Corp LLC, College Station, TX, USA) based on the primary outcome, namely improvement in frailty after oral exercise intervention. As there were no previous studies on oral exercise alone for frailty, reference values were obtained from studies reporting changes in tongue pressure following oral exercise10. Effect sizes were defined as Cohen’s d for change scores and converted to Cohen’s f for repeated-measures ANOVA. Specifically, the difference in change between the experimental and control groups was Δ = 14.6, with a pooled standard deviation of 4.5 (Cohen’s d ≈ 3.24; Cohen’s f ≈ 1.62). To be conservative, α was set at 0.01, power at 0.95, the within-subject correlation at r = 0.5, and the error variance (varerror) at 4.52 in Stata’s “power repeated” command, yielding a required sample size of 12 participants per group (total N = 48). Allowing for an anticipated dropout rate of 20%, a total of 60 participants were recruited. The same sample size calculation was not applied to secondary outcomes, which were not used for power calculations.

The criteria for discontinuing the research were as follows: (1) if the research participant requested to withdraw from the study or revoked consent; (2) if the entire research project was discontinued; (3) if the participant engaged in or had participated in oral exercise programs organized by local authorities during the study period; (4) if the participant started or had started new exercises or physical activities during the study period; and (5) for any other reason deemed appropriate by the research supervisor to terminate the study.

Randomization and implementation

Participants were randomly assigned to one of the four groups using simple randomization based on an Excel-generated random number table, with an allocation ratio of 1:1:1:1 and no restrictions (i.e., no stratification or blocking). The random sequence was created by N.T., and group assignment was performed by N.S. Allocation concealment was maintained by storing the randomization list in a secured binder to be consulted only at the time of assignment. The personnel who enrolled participants and those who assigned participants to interventions both had potential access to the random allocation sequence; however, access was restricted by protocol and controlled operationally.

Interventions

Participants received standardized oral exercise instructions after a baseline examination. A trained dental hygienist (I.S.) provided face-to-face guidance using individual leaflets (Fig. 2). Participants were instructed to record on a calendar whether they performed the oral exercises as instructed for the prescribed frequency; the calendar was collected during the reevaluation (Fig. 3). All participants were directed to engage in the oral exercises for a duration of 3 months. Participants were randomly assigned to one of four exercise frequencies: 3 times/day & everyday; once/day & everyday; 3 times/day & 3 days/week; and once/day & 3 days/week. A random number table was used for the assignments, ensuring a ratio of 1:1:1:1 for random allocation to one of the four groups.

Fig. 2: Oral exercises leaflet.
figure 2

This figure displays the leaflet used in the intervention, illustrating specific oral exercise methods and movements for the oral and surrounding areas. Adapted from free licensed materials provided by “Rihatsubame”, in Japanese (https://zaitaku-st.com/), used under an Open Access license.

Fig. 3: Calendar for checking the implementation of oral exercises.
figure 3

This figure shows the calendar used to record the implementation rate of the oral exercise program. The sheet includes a QR code linking to detailed exercise instructions.

Implementation was monitored using these calendars, and the implementation rate was calculated as the percentage of completed exercises relative to the prescribed number. Fidelity was maintained by providing identical instructions and materials across all groups, with the only difference being exercise frequency. No additional interventions beyond the assigned oral exercise program were provided.

Outcomes

The primary outcome was the change in the number of frailty criteria from baseline to follow-up, and the primary analysis was conducted in the intention-to-treat population. The demographic characteristics of the enrolled participants were age and sex. The secondary outcomes included body mass index (BMI), skeletal muscle mass, appendicular skeletal muscle mass, SMI, nutritional status, mental health status, social frailty, and oral status, including oral function.

Assessment of frailty

The frailty assessment utilized the revised Japanese version of the Cardiovascular Health Study criteria (revised J-CHS criteria)11 for defining frailty. Globally, the most commonly used diagnostic criteria are those proposed by Fried, known as the CHS criteria. For screening frailty, these criteria summarize the signs expressed because of aging-associated functional decline into a phenotypic model.

In the revised J-CHS criteria, frailty is defined by five diagnostic criteria: (1) weight loss, (2) fatigue, (3) decreased physical activity, (4) slowness (delayed walking speed), and (5) weakness (muscle strength reduction). If three or more criteria are met, the individual is classified as frailty; if one to two criteria are met, the individual is classified as pre-frailty; and if no criteria are met, the individual is considered robust. For the measurement of weakness (muscle strength reduction), grip strength was assessed using a handgrip dynamometer (TOEI LIGHT CO., LTD., Saitama, Japan).

Physical status

BMI

BMI was calculated using a body composition meter with a stature meter (BH-300A-N, TANITA, Tokyo, Japan) to measure height (m) and weight (kg), and is defined as the value obtained by dividing weight by the square of height.

Skeletal muscle mass, appendicular skeletal muscle mass, and SMI

Skeletal muscle mass, appendicular skeletal muscle mass, and SMI measurements were conducted using a body composition analyzer that employs two frequencies and an eight-electrode system with a grip (InnerScan DUAL RD-804L; TANITA, Tokyo, Japan). Skeletal muscle mass was measured for the whole body and five specific regions (left arm, right arm, left leg, right leg, and trunk). Appendicular skeletal muscle mass was defined as the sum of the muscle mass in the left arm, right arm, left leg, and right leg. SMI is an indicator of whether the muscle mass in the arms and legs is excessive relative to height, calculated by dividing appendicular skeletal muscle mass (kg) by the square of height (m).

Nutritional status

The Mini Nutritional Assessment (MNA)12 was used to evaluate nutritional status. The MNA is a tool designed to assess the nutritional status of older individuals aged ≥65 years. It is composed of 18 items, including six screening items (food intake, weight loss, mobility, acute illness within the last 3 months, psychological problems, and BMI) and 12 assessment items (independence in daily living, taking four or more medications, presence of pain or ulcers, number of meals per day, protein intake, daily consumption of at least two servings of fruits and vegetables, hydration, assistance with meals, self-assessment of nutrition, self-assessment of health status, upper arm circumference, and calf circumference), with a higher score indicating better nutritional status.

Mental health status

Mental health status was assessed using the Geriatric Depression Scale-15 Japanese version13 (GDS-15-J). The GDS-15-J is a validated scale for evaluating depression in older adults composed of 15 questions that can be answered with a simple “yes” or “no”. A total score of ≥ 6 indicates the presence of depressive tendencies13,14. The data for this study were based on the total scores.

Social frailty

Social frailty was evaluated based on the criteria established by Makizako et al.15. The evaluation includes five items: “Living alone (yes: 1 point)”, “Frequency of going out has decreased compared with last year (yes: 1 point)”, “Visiting friends’ homes (no: 1 point)”, “Feeling that I am helpful to family or friends (no: 1 point)”, and “Having daily conversations with someone (no: 1 point)”. A total score of ≥ 2 points is classified as social frailty. The data for this study were based on the total scores.

Oral status

Oral status was examined based on the present number of teeth, number of functional teeth, bacteria count on the dorsal surface of the tongue, oral moisture status, occlusal force, tongue–lip motor function, tongue pressure, masticatory function, and swallowing function. The oral status examination included both oral cavity and oral function assessments. The oral examination was assessed in regard to the number of present and functional teeth. Oral function was examined in terms of bacterial count on the dorsal surface of the tongue, oral moisture status, maximum occlusal force, tongue–lip motor function, maximum tongue pressure, masticatory function, and swallowing function. The oral examination was conducted by N.T., D.E., and M.M., while the oral function examination was performed by S.I and N.S.

Number of present and functional teeth

The number of present teeth included natural teeth, restored teeth, and untreated teeth, and did not include artificial teeth or retained roots. The number of teeth present was defined as the number of teeth erupting into the oral cavity, even partially. The number of functional teeth included natural and prosthetic teeth, dentures, and implants16. The remaining roots were excluded.

Bacterial count on the dorsal surface of the tongue

The bacterial count on the dorsal surface of the tongue was measured using a simple oral bacterial quantification device (Bacterial Counter; Panasonic Healthcare, Tokyo, Japan). A sterilized swab was attached to a constant-pressure specimen collection device, and the central part of the dorsal tongue was swabbed back and forth for ~1 cm in length three times to collect the sample. Subsequently, the swab, a dedicated sensor chip, and a disposable cup containing 5.0 mL of distilled water were inserted into the designated position of the bacterial counter device. Bacteria in the liquid were collected onto the electrodes through dielectrophoresis, and the change in impedance was measured to quantify the bacterial concentration (CFU/mL) in the sample, allowing for a quantitative assessment of the bacterial count17.

Oral moisture status

Oral moisture status was measured using an oral moisture meter (Mucus; Life, Koshigaya, Japan) placed at the center of the tongue18, and the average of two measurements was taken as the final value. Participants were instructed not to eat, drink, or rinse their mouths for at least 30 min prior to the measurement. A lower value indicates a lower oral moisture level.

Maximum occlusal force

The DENTAL PRESCALE II (GC Corporation, Tokyo, Japan), which consists of two layers of polyethylene terephthalate film and a color-developing material sandwiched with numerous microcapsules, was used to measure occlusal force. This device was utilized alongside a pressure-sensitive film and an occlusal force analysis device (GC Corporation). Subsequently, the films were scanned using occlusal force analysis software to measure the occlusal force. The unit of measurement for the occlusal force was N, and the pressure filter function of the occlusal force measuring device was employed. Measurements were taken with the dentures in place, and the average of two measurements was used for analysis.

Tongue-lip motor function

The evaluation of tongue-lip motor function was conducted using an automatic measurement device (Kenko-kun Handy; Takei Scientific Instruments Co., Ltd., Tokyo, Japan). The motor abilities of the lips, anterior tongue, and posterior tongue were assessed by recording the number of repetitions per second for the monosyllables /pa/, /ta/, and /ka/. Each syllable was pronounced for a total of 5 seconds, and the number of syllables produced per second was evaluated. Each measurement was conducted twice, and the average values were used for analysis.

Maximum tongue pressure

The evaluation of tongue pressure was conducted using the JMS Tongue Pressure Measurement Device (TPM-01; JMS, Hiroshima, Japan). This device objectively assesses tongue strength using maximum tongue pressure as an indicator. A balloon-type probe was used for measurement; after automatically pressurizing the balloon to a predetermined value outside the oral cavity, the balloon of the tongue pressure probe was placed on the tongue inside the oral cavity. The hard ring of the tongue pressure probe was fixed by biting down with the front teeth, and the balloon inside the oral cavity was compressed against the hard palate with the maximum force of the tongue to measure the maximum tongue pressure. Regarding data collection, practice was conducted until the above procedure could be performed smoothly, followed by two measurements, and the average value was taken as the tongue pressure value.

Masticatory function

Masticatory function was measured using the masticatory ability testing system (GLUCO SENSOR GS-2, GC Corporation). A cylindrical gummy jelly (GLUCOLUMN; GC Corporation) with a diameter of 14 mm, a height of 8 mm, and a weight of ~2 g was chewed on the habitual side for 20 s, and the glucose elution amount was calculated. To calculate the glucose elution amount, after chewing the gummy jelly, the participant held 10 mL of water in the mouth and spat it into a filter cup, where the glucose concentration in the filtrate was measured using a glucose meter. Measurements were performed twice, and the average value was used for analysis.

Swallowing function

Assessment of swallowing function was conducted using the Japanese version of the Eating Assessment Tool-10 (EAT10)19 and the repetitive saliva swallowing test (RSST). The EAT10 consists of a questionnaire with 10 items designed to screen for swallowing disorders. Each question is rated on a scale from 0 (no problem) to 4 (severe problem), resulting in a total score ranging from 0 to 40 points. A score of ≥3 is considered to indicate a “risk of aspiration”. In this study, the total score was used for analysis.

The RSST is a swallowing disorder screening method that measures how many times a person can perform dry swallowing in 30 s while palpating the thyroid cartilage. Being unable to perform three dry swallows is considered to indicate an increased risk for swallowing disorders20. This method does not require special equipment and poses no risk of aspiration during the test, and thus, it is widely used in many care facilities in Japan. In this study, the total score was used for analysis.

Oral hypofunction

The Japanese Society of Gerodontology proposed “oral hypofunction” as the stage at which recovery can be expected by performing dental treatment before oral dysfunction occurs. Therefore, we developed a hypothesis that oral frailty and hypofunction emerge during the process toward oral dysfunction among the various declines in ability. In addition, as a starting point for discussing this problem, we presented criteria for diagnosing oral hypofunction.

Seven conditions were set to diagnose poor oral function: poor oral hygiene, xerostomia, poor bite strength, poor lingual–labial motor function, poor tongue pressure, poor masticatory function, and poor swallowing function. Furthermore, based on the results of a study21 conducted at Fujita Health University Hospital, “oral hypofunction” was defined as a condition that meets three or more of these diagnostic criteria. The number of criteria met for these seven items was designated as the “number of criteria for oral hypofunction”.

Systemic disease

A questionnaire was used to record the participants’ medical history and number of medications taken. The questionnaire requested information on the following conditions that have been linked to frailty: stroke22, heart disease23, respiratory disease24, hypertention25, diabetes mellitus26, kidney disease27, knee osteoarthritis28, osteoporosis29, rheumatoid arthritis30, Parkinson’s disease30, anemia31, and hearing dysfunction24.

Implementation rate

The implementation rate was calculated by dividing the total number of actual oral exercises performed by the total number of exercises if all had been conducted, then multiplying by 100. The total number of exercises for each group was as follows: 252 times for the 3 times/day & everyday group; 108 times for the 3 times/day & 3 days/week group; 84 times for the once/day & everyday group; and 12 times for the once/day & 3 days/week group.

Blinding

This study utilized a single-blind design. Each assessor evaluated regardless of group. The evaluators were not informed of which group the subjects were assigned to, only one research team member maintained information about study group assignment and communication with subjects occurred only through that one person. The interventions were similar in content and instructions; the only difference was the frequency of oral exercise.

Statistics and reproducibility

Descriptive statistics were used to summarize the participants’ demographics, baseline characteristics, and study outcomes. The normality of data distribution was assessed using the Kolmogorov-Smirnov test. As most variables, except for age, skeletal muscle mass, and tongue pressure, were not normally distributed, nonparametric tests were used for analysis.

All data are presented as means and standard deviations. Comparisons among the four groups at baseline were conducted using the Kruskal-Wallis test. Comparisons between baseline and follow-up for each group were performed using the Wilcoxon signed-rank test. The primary analysis was based on the intention-to-treat (ITT) population; per-protocol (PP) analysis was performed as a sensitivity analysis. Confidence intervals for the mean difference between baseline and reassessment were calculated with 95% confidence intervals for the primary outcome and 99.8% confidence intervals for the other 22 items to control for multiplicity. MCID for each measure was calculated using half the standard deviation of the difference between baseline and reassessment (reassessment value minus baseline value)32.

We conducted a two-way repeated measures ANOVA to evaluate the effects of each group (3 times/day & everyday, 3 times/day & 3 days/week, once/day & everyday, and once/day & 3 days/week) and time (baseline and follow-up) on number of frailty criteria, SMI, number of present teeth, number of functional teeth, number of oral hypofunction criteria, MNA, Social frailty score, GDS15. Mean differences for the main outcome number of frailty criteria were calculated with 95% confidence intervals. The 99.3% confidence interval was calculated for the following seven secondary outcomes (SMI, number of present teeth, number of functional teeth, number of oral hypofunction criteria, MNA, social frailty score, and GDS15) to account for multiplicity using Bonferroni correction. Partial eta squared (ηp2) was utilized to assess the effect size (ES), with values of 0.01 <ηp2 < 0.06 indicating a small effect, 0.06 ≤ ηp2 ≤ 0.14 representing a medium effect, and ηp2 > 0.14 denoting a large effect. Planned pairwise comparisons were performed using the Bonferroni post hoc correction to identify differences. Cohen’s d was employed to calculate the ES for the post hoc comparisons, which was categorized as trivial (<0.20), small (0.20–0.49), moderate (0.50–0.79), or large ( ≥ 0.80). An additional analysis using generalized linear models (logistic regression) was conducted for categorical outcomes (frailty33, EAT-1034, social frailty15, and GDS-1535) to confirm the robustness of the findings. This analysis was performed post hoc.

The implementation rate was calculated by dividing the actual number of oral exercises performed by the number of times they should have been performed, and then multiplying by 100. The differences in implementation rates were analyzed using the Kruskal-Wallis test with Bonferroni correction.

All other statistical analyses were performed using SPSS (version 26.0; IBM, Armonk, NY, USA). The overall significance level was set at p < 0.05 for the primary outcome, and Bonferroni correction was applied for multiple comparisons to control the family-wise error rate. Although this adjustment was noted in the protocol or SAP, it was implemented post hoc to ensure rigorous control of type I error across multiple comparisons. For oral moisture status, maximum occlusal force, tongue–lip motor function, tongue pressure, and masticatory function, each measurement was performed twice at both baseline and follow-up, and the average of the two values was used for analysis. All other outcome measures were assessed once per participant at each time point. Each participant was considered as a single experimental unit, and no technical replicates beyond the repeated measurements were used. Missing data were handled as follows: For participants with two scheduled measurements, if one measurement was missing, the available measurement was used in the analysis rather than calculating an average. No imputation was performed.

Ethical considerations

This study was conducted in accordance with the tenets of the Declaration of Helsinki, and procedures involving human participants were approved by the ethics committee of Okayama University Graduate School of Medicine, Dentistry, and Pharmaceutical Sciences and Okayama University Hospital (approval No.: 1803-038).

Results

Number analyzed

A total of 58 participants who began the trial were included in the ITT analysis, and 53 who adhered to their assigned protocol were involved in the PP analysis.

Baseline characteristics

The characteristics of the participants at baseline, including demographic, clinical, physical, oral function, and nutritional variables, are shown in Table 1. The full version of this table, including all variables, is provided in Supplementary Table 1. No significant differences were observed among the four groups for all items.

Table 1 Participants’ characteristics (intention-to-treat analysis, Simplified Version)

Outcomes and estimation

There were no changes to trial outcomes after the trial commenced. Table 2 shows the baseline and follow-up outcomes, including the means ± standard deviations, within-group comparisons, and between-group comparisons. The confidence intervals (CI) for the mean differences (95% CI for the frailty criterion and 99.8% CI for the other 22 items to account for multiplicity) and the minimum clinically important difference (MCID) in each group are provided in Supplementary Table 2. There were no significant differences between the four groups at baseline.

Table 2 The baseline and follow-up outcomes in the each groups (intention-to-treat analysis, Simplified Version)

Primary outcome

A two-way repeated measures analysis of variance (ANOVA) was used to evaluate the impact of the four frequencies of oral exercise methods on frailty in older adults (Table 3). The results regarding the analysis of effect sizes and CI for the mean differences (95% CI for the frailty criterion and 99.3% CI for the other 7 items to account for multiplicity) are shown in Table 3. For the primary outcome (number of frailty criteria), the estimated mean differences from baseline to follow-up in the ITT population were −0.357 (95% CI, −0.787 to 0.073) in the group performing oral exercises three times per day every day (3 times/day & everyday group), −0.600 (95% CI, −1.255 to 0.055) in the 3 times/day & 3 days/week group, −0.571 (95% CI, −1.379 to 0.236) in the once/day & everyday group, and −0.600 (95% CI, −1.008 to −0.192) in the once/day & 3 days/week group. A two-way repeated measures ANOVA in the ITT analysis revealed a significant main effect of time for this outcome (F (1, 54) = 14.803, p < 0.001, partial η² = 0.215), indicating an overall reduction in frailty criteria after the intervention across all frequency groups. Bonferroni-adjusted pairwise comparisons showed no significant differences between groups (all p > 0.007).

Table 3 Results of two-way repeated measures ANOVA analysis in 4 groups (intention-to-treat analysis)

Secondary outcomes

A significant main effect of time was observed for the number of applicable items for oral hypofunction (F(1, 54) = 16.456, p < 0.001, partial η² = 0.234), suggesting improvement in oral function followings. The PP analysis showed similar results to the ITT analysis (see Supplementary Tables 35 for detailed results). An additional analysis using categorical data with a generalized linear model (logistic regression) was conducted on frailty, EAT-10, social frailty and GDS-15, and the results were consistent (see Supplementary Tables 6 and 7 for detailed ITT and PP results).

Implementation rates

The average implementation rates and 95%CI for the groups were as follows: 3 times/day & everyday group, 66.4 ± 35.0 (%), 95%CI (42.29, 87.56); 3 times/day & 3 days/week group, 93.8 ± 15.1, 95%CI (85.07, 102.50); once/day & everyday group, 57.2 ± 38.0, 95%CI (29.99, 84.41); and once/day & 3 days/week group, 69.1 ± 41.8, 95%CI (44.93, 93.17). The results of a Kruskal-Wallis test with Bonferroni correction indicated that the 3 times/day & 3 days/week group had a significantly higher implementation rate than the 3 times/day & everyday and once/day & everyday groups.

Possible harms

Harms were defined as any adverse events or discomfort related to the oral exercise program, such as pain, fatigue, or injury. These were assessed non-systematically: participants were instructed to report any adverse effects if they occurred during the intervention period. No adverse effects were reported throughout the conduct of the entire study.

Discussion

In this study, the 3-month intervention of oral exercises alone resulted in a significant improvement in frailty. The frailty assessment showed similar results when analyzed using the categories of robust, pre-frailty, and frailty (Supplementary Table 4). To the best of our knowledge, such findings have not been extensively reported in previous research.

Previous studies have reported that combining oral exercises with other interventions leads to enhanced physical abilities. Watanabe et al.36 reported improvements in physical function through a 12-week self-monitored comprehensive geriatric intervention program that consisted of four components: low-load resistance exercises using body weight and rubber bands, increased physical activity, oral function care, and nutritional guidance. Kito et al.37 reported that a group undergoing a combined program of physical and oral exercises showed significantly greater improvements in both oral and physical functions compared with a 12-week physical exercise intervention group, but did not discuss the relationship between improvements in oral function and physical function.

In the present study, no significant temporal main effects of nutritional status or skeletal muscle mass index (SMI) were observed, which suggests that improvements in physical function were not due to increases in skeletal muscle mass through nutritional status; other factors may have been involved. As possible other factors, examination itself or self-monitoring can be involved. The previous intervention studies suggest the effects of examination on disease improvements38 and reduction of worry39, and the effects of self-monitoring on improvement in metabolic syndrome40. Thus, in this study, baseline oral examination and/or self-monitoring of oral exercise might affect improvement in frailty.

Also in this study, the implementation of oral exercises alone did not improve nutritional status. In a systematic review, Smit et al.41 reported that only cross-sectional studies were available regarding the relationship between masticatory function and nutritional status, and the results were not very consistent. Ozsürekci et al.42 found no association between chewing ability and the Mini Nutritional Assessment Short Form (MNA-SF). Iwasaki et al.43 reported that a low number of current teeth was associated with malnutrition as evaluated by the MNA-SF or serum albumin levels. In the present study, only guidance on oral exercises was provided, without any nutritional counseling. Although objective improvements in oral function were observed, the individuals did not have self-awareness of these improvements and their dietary content may not have changed. This could be one of the factors that did not lead to an improvement in nutritional status.

It has been reported that providing nutrition and oral hygiene guidance to older adults and their caregivers leads to improvements in the nutritional status of older adults44. Zhu et al.45 reported that combining swallowing exercises with nutritional guidance for patients with laryngeal cancer leads to improvements in swallowing function and nutritional status. Kikutani et al.46 found that an intervention combining high-calorie, high-protein diets with oral function training for older individuals requiring care resulted in the maintenance of oral function and improvements in nutritional status. However, to our knowledge, no studies have indicated that interventions including oral exercises do not improve nutritional status. It is possible that a multifaceted intervention that includes not only oral exercises, but also other methods related to nutritional status, could lead to improvements in nutritional status.

In this study, the implementation of oral exercises alone did not lead to improvements in sarcopenia. In a systematic review, Smit et al.41 reported that only cross-sectional studies were available regarding the relationship between chewing function and sarcopenia, and the results were not very consistent. In a study involving community-dwelling older individuals, Yoshida et al.47 reported a correlation between chewing ability and sarcopenia. By contrast, Aquilanti et al.48 discussed the potential influence of a well-trained dietitian providing specialized dietary therapy to institutionalized older individuals, but reported no correlation between chewing ability and sarcopenia. Given that no improvement in nutritional status was observed in the present study, it is possible that there was also no impact on sarcopenia. Watanabe et al.36 reported that interventions involving exercise, oral function, and nutrition improved physical function and the thickness of the anterior thigh muscles, thereby suggesting a potential prevention strategy for sarcopenia. To the best of our knowledge, no studies have indicated that interventions including oral exercises do not improve sarcopenia. It is possible that a multifaceted intervention, which includes not only oral exercises, but also other methods related to muscles, could lead to improvements in sarcopenia.

In the present study, the frequency of three times a day & 3 days a week had the highest implementation rate. Shu et al.49 reported that the frequency and duration of exercises varied among the reviewed studies in their systematic review on oral exercises and the improvement of oral function. In the subgroup meta-analysis,49 they showed that both high- and low-intensity daily exercises were effective for the improvement of oral function; however, they did not discuss adherence rates. They49 also noted that the improved occlusal force after ceasing exercise is expected to be maintained for about 2–3 weeks, gradually decreasing to pre-intervention levels. Therefore, similar to other forms of physical exercise, regular oral exercises are necessary to sustain such effects50,51. Accordingly, the frequency of three times a day & 3 days a week may be effective for continuing long-term exercise. Adherence rates may be also influenced by factors other than the ease of frequency such as participants’ experiences, difficulties, and preferences. However, this study did not conduct qualitative assessments of these factors, leaving the impact on adherence rates unclear. To more accurately examine adherence rates, it is necessary to consider the characteristics of the participants. Additionally, during the 3-month study period, we examined adherence at a uniform frequency from start to finish, and found that a frequency of three times a day, three times a week, had the highest implementation rate. For example, a more flexible approach that starts with fewer sessions in the initial weeks and then transitions to daily implementation might lead to higher adherence rates in the long term.

In intervention research, it is essential to recognize that identifying effective intervention methods based on evidence is just the beginning; the real significance lies in their actual social implementation. The findings of the present study suggest that oral exercises not only contribute to frailty improvement, but also propose a feasible implementation frequency of three times a day & 3 days a week. This aspect aligns with implementation research52, bridging the gap between evidence-based interventions and their practical application in society.

The strength of this study lies in its examination of the effects of oral exercises solely on the improvement of frailty. Additionally, this study investigated feasible implementation frequencies and included aspects of implementation research, which is the next phase following intervention studies.

There are several limitations in this study. The first limitation is the lack of a control group in this study. In a previous study that evaluated the improvement of frailty using a mobile/web-based application and sensor kits, only 9.9% of the control group (pre-frailty, n = 44, average age 82.6 years) became robust after 3 months53. In another study that assessed the impact of an educational program on the frailty status, physical function, physical activity, sleep patterns, and nutritional status of older adults living in the community, none of the control group (pre-frailty and frailty, 83 participants with an average age of 74.9 years) became healthy after 6 months54. Additionally, a study focusing on older adults who were frailty or pre-frailty in the community evaluated the effects of an educational program on depressive symptoms, cognitive function, social support, quality of life, and physical frailty. The median number of frailty criteria for the control group (pre-frailty, n = 90, average age 75.7 years) in the intervention study remained unchanged at after 6 months55. Although simple comparisons cannot be made due to differences in community and background, it can be inferred that the frailty status is unlikely to improve significantly without any intervention. Second, the participants were restricted to patients regularly visiting a university hospital’s preventive dentistry department. It is possible that the participants had a higher health consciousness than the general population, which may have resulted in an adherence rate that was higher than what would typically be expected. They could have been proactive and cooperative regarding treatment. Therefore, a broader range of subjects needs to be investigated. Third, the reevaluation period of 3 months was relatively short. Kito et al.37 reported that a 12-week physical and oral exercise program effected oral and physical function in older individuals. Watanabe et al.36 reported that a month oral exercise effected 12 week physical and oral exercise increased in daily steps. Koponen et al.44 reported that the intervention group receiving individual nutritional counseling and oral health guidance for 6 months showed an improvement in nutritional status. To investigate whether oral exercises alone influence the improvement of frailty, it may have been necessary to establish an intervention period that accounts for the duration required for changes in nutritional status, sarcopenia status, and psychosocial conditions, which are intervening factors. A more comprehensive study would benefit from a longer research duration, such as 6 months. Fourth, the implementation rates for the two groups that performed oral exercises daily were low. This may have been due to participants feeling burdened by the daily requirement, leading to a decrease in implementation rates. Fifth the intervention methods do not include exercise or nutrition. While it has been reported that the combination of exercise and nutrition is beneficial for frailty prevention56, this study aimed to investigate the impact of oral exercises on frailty, and therefore, interventions related to exercise and nutrition could not be examined. Further investigation is needed.

Conclusion

After conducting oral exercises for 3 months on older adults with pre-frailty or frailty, improvements in frailty were observed within the aforementioned limitations. The implementation frequency that yielded the best results was three times/day, 3 days/week for older people without disabilities or cognitive impairments.