Automatic quantification of disgust reactions in mice using machine learning

Inaba, Shizuki; Uesaka, Naofumi; Tanaka, Daisuke H.

doi:10.1038/s41598-025-01244-3

Download PDF

Article
Open access
Published: 21 May 2025

Automatic quantification of disgust reactions in mice using machine learning

Shizuki Inaba¹,
Naofumi Uesaka¹ &
Daisuke H. Tanaka¹

Scientific Reports volume 15, Article number: 17573 (2025) Cite this article

3083 Accesses
1 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Disgust, a primary negative emotion, plays a vital role in protecting organisms from intoxication and infection. In rodents, this emotion has been quantified by measuring the specific reactions elicited by exposure to unpleasant tastes. These reactions were captured on video and manually analyzed, a process that required considerable time and effort. Here we developed a method to automatically count disgust reactions in mice by using machine learning. The disgust reactions were automatically tracked using DeepLabCut as the coordinates of the nose and both front and rear paws. The automated tracking data were split into test and training data, and the latter were combined with manually labeled data on whether a disgust reaction was present and, if so, which type of disgust reaction was present. Then, a random forest classifier was constructed, and the performance of the classifier was evaluated in the test dataset. The total number of disgust reactions estimated by the classifier highly correlated with those counted manually (Pearson’s r = 0.97). The present method will decrease the time and effort required to analyze disgust reactions, thus facilitating the implementation of the taste reactivity test in large-scale screening and long-term experiments that necessitate quantifying a substantial number of disgust reactions.

A neurofunctional signature of subjective disgust generalizes to oral distaste and socio-moral contexts

Article 19 April 2024

Generalization gradients for fear and disgust in human associative learning

Article Open access 09 July 2021

Efficiency and safety of automated label cleaning on multimodal retinal images

Article Open access 05 January 2025

Introduction

Disgust, a primary negative emotion recognized since Darwin¹, is elicited by a distaste for food and contact with feces. This emotion plays a crucial role in protecting animals from intoxication and infection by helping them avoid harmful substances and pathogens². Abnormal manifestations of disgust have been observed in patients receiving chemotherapy for cancer^3,4 and have been proposed in the pathogenesis of various intractable psychiatric disorders, including eating disorders and obsessive-compulsive disorder^5,6,7. Therefore, elucidating the neural mechanisms underlying disgust is essential for understanding the biological basis of emotion, reducing drug side effects, and effectively managing various psychiatric disorders.

Since disgust is a conscious experience and thus cannot be directly observed^8,9,10,11,12, an observable readout is required to elucidate its neural mechanisms. Taste reactivity, consisting of orofacial and somatic behavioral reactions in response to taste stimuli, expresses the hedonic evaluation of taste, but not taste quality^13,14,15. Several specific behaviors, such as gape, in taste reactivity have been established as reliable indicators of disgust¹⁵ and are referred to as disgust reactions^16,17,18,19.

The quantification of disgust reactions is so complex that manual analysis has been carried out and automated analysis methods have not been developed. In previous analyses of disgust reactions, orofacial and somatic behavioral reactions were video-recorded, and the relevant behaviors were subjectively distinguished while the video was played frame-by-frame. Thus, the quantified data of a particular test can be somewhat variable between the people analyzed, and a significant amount of time and effort is required for the analysis²⁰.

To overcome these limitations and reproducibly and quickly assess disgust reactions in mice, we constructed a classifier to automatically count them. To accomplish this, we utilized DeepLabCut as an estimation system for the position of the body part based on transfer learning with deep neural networks that track defined body parts of animals without physical markers²¹ and random forest as a supervised learning model with an ensemble learning method²².

Methods

Animals

Adult (6–7 weeks-old) male C57BL/6J mice (n = 27) were obtained from Japan SLC Inc. (Shizuoka, Japan). Upon arrival, mice were housed in clear plastic cages (18 × 26 × 13 cm) with wood tips (Soft tip: Japan SLC, Shizuoka, Japan) in groups of 2–4 males per cage and were then housed individually in smaller cages (14 × 21 × 12 cm) immediately before handling. Mice were maintained at 23 ± 1 °C under a 12 h light/dark cycles (lights on at 8:00 am) and given ad libitum access to food (Labo MR stock; Nosan Corp., Kanagawa, Japan) and tap water. The tap water was changed to Milli-Q water 3 days before beginning the reaction experiments. All behavioral experiments were performed during the light cycle. All the animal experiments were approved (No. 0150384 A, 0160057C2, 0170163 C, A2017-194 A, and A2018-138C4) by the Institutional Animal Care and Use Committee of the Institute of Science Tokyo, and was performed according to the ARRIVE and relevant guidelines and regulations.

Surgery for intraoral tubing

For implantation of an intraoral tube, mice were anesthetized by intraperitoneal (i.p.) injection of sodium pentobarbital (40 mg/kg BW; Nembutal; Abbott Laboratories) or a mixture of midazolam (4 mg/kg BW; Astellas Pharma Inc.), butorphanol (5 mg/kg BW; Meiji Seika Pharma Co., Ltd.), and medetomidine (0.3 mg/kg BW; Nippon Zenyaku Kogyo Co., Ltd.)²³. The depth of anesthesia was maintained at a sufficient level to prevent paw pinch reflex. An incision was made in the midline of the scalp. A curved needle attached to an intraoral polyethylene tube (SP-10; Natsume Seisakusho Co., Ltd., Tokyo, Japan) was inserted from the incision site and advanced subcutaneously posterior to the eye, to exit at a point lateral to the first maxillary molar on the right side of the mouth. The intraoral end of the tube was heat-flared to an approximate diameter of 1 mm to prevent it from drawing into the oral mucosa. Mice were mounted in a stereotaxic frame (David Kopf Instruments, Tujunga, CA, USA) using ear bars. Two alcohol-sterilized small screws (PN-04; LMS Co., Ltd., Tokyo, Japan) were anchored to the skull and a piece of plastic bar was fixed to the screws with dental cement (Unifast3; GC Corp., Tokyo, Japan). The end of the tube exiting the head incision was fixed to a plastic bar with dental cement. Mice received subcutaneous injections of the antibiotic chloramphenicol sodium succinate (60 mg/kg BW, Chloromycetin Succinate; Daiichi Sankyo Co., Ltd., Tokyo, Japan) for infection prevention and carprofen (5 mg/kg BW, Rimadyl; Zoetis, Tokyo, Japan) for pain relief. Three days after surgery, one end of a delivery tube (SP-10; Natsume Seisakusho Co., Ltd.) was connected to the end of the intraoral tube fixed to the plastic bar on the mouse head using a connector (KN-394, two directions (0.3 + 0.3); Natsume Seisakusho Co., Ltd.), and the other end was connected to a needle (30Gx13mm; ReactSystem Co., Osaka, Japan) attached to a 1-mL syringe, and ~ 20 µL of sterile MilliQ water was introduced into the mouth of the animal to test the patency of the tubes. The intraoral tube was flushed with ~ 20 µL sterile MilliQ water every 2–3 days to prevent occlusion. The mice were allowed 1–3 weeks to recover from surgery before the beginning of the behavioral experiments (Fig. 1).

Taste reactivity test

The taste reactivity test^14,15,18 was used to measure affective behavioral reactions in mice. The test chamber was composed of a glass floor and an acrylic cylinder (30 cm height, 10 cm outside diameter, and 3 mm thickness). A digital video camera (HDR-PJ800; Sony, Tokyo, Japan) was placed beneath the glass floor to record the ventral view of the mouse. One end of the delivery tube was connected to the end of the intraoral tube fixed to the plastic bar on the mouse head using a connector, and the other end was connected to a needle attached to a 1-mL syringe. The stimulant solution was filled into a syringe and infused into the mouth of the mice. Orofacial and somatic behaviors during the infusions were video-recorded in a bottom-up view at 30 frames per second (Fig. 2A).

To assess disgust reactions in the innate and learned experimental conditions, the mice were divided into an innate group (n = 13) and a learned group (n = 14) after surgery. As the experiments were conducted while optimizing the experimental conditions, the conditions differed from experiment to experiment. Three different schedules and experiments were performed for the innate groups and two slightly different schedules and experiments were used for the learned groups (Fig. 2B; Table 1). Each mouse was used in a single experiment. The number of mice used in each experiment was determined by unintentional factors such as the number of mice available at the time the experiment was performed. The experimental schedule for each experiment is as follows.

Table 1 Taste reactivity test schedule.

Full size table

Experiment 1 for the innate group: On Day 1, mice (n = 5) received intraoral infusions of 50 µL of 3 mM quinine solution twice for 1 min each with a 1 min interval in between to adapt to the test procedure. On Day 2 (test day), the mice received the same infusions as those administered on Day 1, and their behavior during the infusions was recorded (Fig. 2B; Table 1).

Experiment 2 for the innate group: On Day 1, mice (n = 2) continuously received infusions of 80 µL of 3 mM quinine solution for 2 min without an interval to adapt to the test procedure. On Day 2 (test day), the mice received the same infusions as those administered on Day 1, and their behaviors during the infusions were video-recorded.

Experiment 3 for the innate group: On Day 1, mice (n = 6) received intraoral infusions of 50 µL of 3 mM quinine solution twice for 1 min each with a 1 min interval in between. Shortly thereafter, the mice received an intraperitoneal (i.p.) injection of physiological saline (10 mL/kg body weight [BW]) as a control treatment in the learned group. The same procedure was repeated on Day 4. On Day 6 (test day), the mice received the same infusions as those administered on Day 1, and their behaviors during the infusions were video-recorded.

Experiment 4 for the learned group: On Day 1, mice (n = 11) received infusions of 50 µL of 5.4 mM saccharin solution twice for 1 min each with a 1 min interval in between. Shortly thereafter, the mice received an i.p. injection of LiCl solution (0.3 M, 10 mL/kg BW) (1st conditioning). The same procedure was repeated on Day 4 (2nd conditioning). On Day 6 (test day), the mice received the same infusions as those administered on Day 1, and their behaviors during the infusions were video-recorded.

Experiment 5 for the learned group: On Day 1, mice (n = 3) continuously received infusions of 80 µL of 5.4 mM saccharin solution for 2 min without an interval. Shortly thereafter, the mice received an i.p. injection of LiCl solution (0.3 M, 10 mL/kg BW) (1st conditioning). The same procedure was repeated on Day 4 (2nd conditioning). On Day 6 (test day), the mice received the same infusions as those administered on Day 1, and their behaviors during the infusions were video-recorded.

Thus, videos obtained from five different experiments were used for subsequent analysis. Among these videos, almost half (n = 14; innate = 7, learned = 7) were obtained in a previous study in which automated tracking was not performed¹⁸ (Table 2).

Table 2 Videos used in the present studies.

Full size table

Automated body part tracking

DeepLabCut2.2rc1 (DLC)²³ was used to develop a tracking framework with five mouse-body parts (Fig. 3A). Frame choice and training were performed using the default settings of the DLC. Initially, 20 frames were selected from each video for training purposes. The provisional coordinates of the mouse body part were estimated according to the temporarily created model, and 40 candidate frames for false predictions were selected for each video and manually annotated. This procedure was repeated until the loss was < 0.001 and the error was sufficiently small. After the training sessions were completed, the coordinates of the mouse body part in the other (not manually annotated) frames were estimated (Fig. 3A).

Disgust reaction labeling

All 27 videos, including those taken in a previous study¹⁸, were (re)analyzed by a blinded observer who was not involved in the previous study¹⁸. According to previously established rules^15,18, all video frames were labeled manually frame-by-frame (30 frames/sec) to determine whether they constituted disgust reactions and, if so, which of the five types of disgust reactions they were. The five types of disgust reactions were gapes (large opening of the mouth with retraction of the lower lip), headshakes (rapid lateral movement of the head), face washes (wipes over the face with the paws), forelimb flails (rapid waving of both forelimbs), and chin rubs (pushing the chin against the floor of the test chamber).

Random forest classification and model evaluation

We extracted 25 per-frame geometric features. They consisted of five displacements of each body part (i.e., nose, left front paw, right front paw, left rear paw, and right rear paw) from the previous frame, ten distances between each body part (e.g., the distance between the nose and left front paw), and ten amounts of changes of each distance from the previous frame. In addition, for each frame, the sum of all features from five consecutive frames, including the two frames before and after the frame, was considered, so that each frame had 125 features. The features created in this way were verified to determine whether they were appropriate for inclusion in the model in the training process for random forests, which is described later. Specifically, after confirming the correlation between the features, they underwent a feature-selection process based on the feature importance of random forests using the Boruta package²⁴ in R. The feature importance is the Z-score of the mean decrease accuracy measure (the default setting in the Boruta package). The importance of each feature is determined from the degree of degradation in the prediction performance when only that feature is randomly shuffled from the original data. This is based on the hypothesis that if the feature is very useful, the prediction using the data containing the shuffled feature should have a much worse performance than the prediction using the original data that was not shuffled. The feature selection algorithm also computed the feature importance for the original features shuffled by feature (“shadow” features), and the original features that were significantly (p = 0.01, as the default value in the package) higher than the shadow features were finally selected for inclusion in the model.

We then constructed a random forest classifier using the caret²⁵ and Rborist²⁶ packages in R. Before developing the classifier, the data were randomly divided into training (n = 19; innate = 9, learned = 10) and test (n = 8; innate = 4, learned = 4) datasets, with almost equal numbers of innate and learned groups (Table 2). The classifier was constructed by setting the classWeight argument in the train function of the caret package to ‘balance’ in view of the imbalanced presence of frames containing movements of interest, which allowed the classes to be weighted inversely proportional to their frequency of occurrence in the data. In the training sessions, the dataset was leave-one-subject-out (19-fold) cross-validated for each mouse to prevent overfitting (Fig. 3B). In addition, we performed hyperparameter tuning and selected the model with the highest accuracy as the final one using the tuneLength argument in the train function of the caret package. Although the number of ensembled decision trees was fixed at the default value of 500 during the model creation process, we confirmed that the number of ensembled decision trees was sufficient by separate training after the final model creation, with the same training parameters changing only nTree.

After developing the final model, we evaluated the classifier’s frame-by-frame performance in discriminating each type of disgust reaction by calculating the overall accuracy, as well as its positive predictive value (PPV) and sensitivity. PPV, also called precision, is expressed by the following equation:

$$\:\frac{true\:positive}{true\:positive\:+\:false\:positive}$$

Sensitivity, also called recall, is expressed by the following equation:

$$\:\frac{true\:positive}{true\:positive\:+\:false\:negative}$$

Disgust reaction scoring

For the five types of disgust reactions, we analyzed consecutive frames with the same label as one count. To ensure that each component of the disgust reactions contributed equally to the final scores, reactions that occurred in continuous counts were scored in time bins according to previously established rules^15,27. Components characterized by long-duration counts, such as chin rubs, were scored in five-second bins (successive repetitions within five seconds scored as one occurrence), and face wash was scored in two-second bins. The other three reactions that could occur as a single behavior were scored as separate occurrences (e.g., one forelimb flail equals one occurrence). The total disgust reaction scores were quantified as the sum of all five binned counts during two minutes of intraoral infusion of the test solutions.

Computer software and hardware

A desktop computer equipped with an Intel Core i7-10700 CPU, an NVIDIA GeForce GTX 1660 SUPER GPU, and 16 GB of RAM was used for mouse posture estimation in DLC and data processing in R.

Results

Manual analysis of disgust reactions

To determine which of the five types of behavior that make up the disgust reactions were included in each frame of the recorded videos used in this study, we manually labeled and quantified 97,200 frames from 27 videos based on established rules^15,18 (“Human observation” in Fig. 1). Consistent with a previous study¹⁸, disgust reactions were observed in both the innate and learned groups with a similar number of reactions for each type of disgust reaction (Fig. 4A). Regarding the number of counts for each type of disgust reaction, forelimb flails, face washes, and chin rubs were more common than were gapes and head shakes (Fig. 4B, Wilcoxon signed-rank sum test; gape vs. chin rub: p = 0.049, gape vs. face wash: p < 1e-6, gape vs. forelimb flail: p < 1e-7, head shake vs. chin rub: p = 0.031, head shake vs. face wash: p < 1e-7, head shake vs. forelimb flail: p < 1e-7).

This classical disgust reaction quantification method requires significant time and effort for analysis²⁰.

Automated body part tracking

To extract features from recorded videos for machine prediction of the disgust reaction by random forest in the following step, we first semi-automatically annotated the positions of the body parts of mice in the videos using DeepLabCut (DLC) (“Machine prediction” in Fig. 1). We automatically estimated the position of the body part to perform automatic quantification of video-recorded disgust reactions. Because the optimal number of body parts to be labeled for posture estimation is yet to be established, we referred to the occurrence rate of each type of disgust reaction (Fig. 4B), to determine the number of markers. The movements that can be tracked by focusing on the nose and the front and rear paws (i.e., chin rubs, face washes, forelimb flails, and head shakes) comprised 96.23% of the disgust reactions. In contrast, movements that required attention to mouth movements (i.e., gapes) were rarely observed (3.77%). Thus, we defined the five body parts of mice, the nose, and the front and rear paws (Fig. 5A) were sufficient to meet the objective of the present study, which estimated the total scores for disgust reactions.

In the DLC training process, the coordinate information of the five body parts was annotated in 2674 frames, which were automatically selected by DLC, from 27 single-trial video recordings through three training sessions with 500,000 iterations each (Fig. 3). The errors in the estimation of the body part position were evaluated after the model training procedure. The results showed that the mean error for each body part was less than 1 mm (Fig. 5B), which was sufficiently small compared to previous studies using DeepLabCut (1–8 mm)^28,29 to confirm that the training was complete.

Outlier frame detection and correction

Next, we corrected for mouse posture estimation because our posture estimation often failed in specific postures (e.g., standing up with its paws against a wall) and rapid movements that could not be captured on a recording. We introduced a rule-based correction procedure that refers to the likelihood of preventing incorrect estimations in such cases. Specifically, the following two corrections were made for all frames in the video posture-estimated by DLC when the estimated likelihood of a particular body part in a specific frame was less than a predetermined threshold value. (1) If the likelihood of the body part in the next frame exceeded the threshold value, the position of the body part in the frame was corrected in the middle of the frame before and after. (2) If the likelihood of the body part in the next frame was below the threshold, the body part’s position in the frame was corrected to the same position as that in the previous frame. In the present study, we set the threshold value to 0.6, which is the default value for an outlier correction method inside the DLC.

Random forest classifier

We developed a random forest classifier to automate the image classification task of determining whether each video frame contained the behaviors of interest. Before developing the classifier, the data were randomly and equally divided into training and test datasets between the innate and learned groups. There were no significant differences in the incidence of each type of disgust reaction between the datasets (Fig. 6; Table 3, Wilcoxon rank-sum test; Chin Rub, p = 0.466; Face Wash, p = 0.441; Forelimb Flail, p = 0.212; Gape, p = 0.547; Head Shake, p = 0.413; Others, p = 0.222).

Table 3 Observed behaviors.

Full size table

Next, we checked the possible correlations between the geometrical features. The results showed that the features created from the five consecutive frames of each of the distances between each body part had a high correlation (Pearson’s r = 0.82 ~ 0.99). This correlation was clearly higher than that of the other features (r = -0.30 ~ 0.75). Because the presence of feature pairs with very high correlations increases the complexity of the model and does not contribute to its prediction performance, all features generated as two frames before and after the frame of the distances between each body part (40 features in total) were not included in the model. This resulted in the employment of 85 feature candidates.

We developed three random forest models with different predFixed hyperparameters, that is, the number of randomly selected features when forming each split in a classification tree, of the Rborist package in R. Finally, we adopted a model with a parameter value of 85 and the highest cross-validated accuracy (Fig. 7). To confirm that 500 was sufficient for the number of decision trees ensembled in the final model, separate training was conducted with nTree varying from 10 to 1000 in steps. The results confirmed that there was no significant difference in Accuracy for nTree > 200.

After the random forest model was completed, the importance of the features was checked using the Boruta package²⁴ to find the features that are likely to contribute strongly to the performance of this model and to remove those that do not contribute sufficiently. During this process, the importance calculation was automatically repeated 18 times until all the features were adopted or rejected. The most important features were the distance between the two front paws and the distance between the nose and right/left front paws (Fig. 8), indicating that the positional relationship between the two front paws and nose of the mouse was particularly important for the prediction of this model. In addition, all 85 original features exceeded the importance of the randomly generated shadow features and none were deemed to be removed (Fig. 8).

In the test set, face washes (sensitivity = 0.65, positive predictive value (PPV) = 0.85) (Fig. 9A), and forelimb flails (sensitivity = 0.55 and PPV = 0.73) (Fig. 9B) were detected frame-by-frame with reasonable accuracy. In contrast, chin rubs (sensitivity = 0.16, PPV = 0.63) (Fig. 9C), gapes (sensitivity = 0), and head shakes (sensitivity = 0) were almost undetectable (Table 4). Others (i.e., movements other than the behaviors of interest) were detected with 0.99 sensitivity and 0.97 PPV. This prediction result was similar to that of the cross-validated prediction in the training set (Table 5).

Table 4 Observation and prediction of the number of frames that include each type of disgust reaction and other movement in the test set.

Full size table

Table 5 Observation and prediction of the percentual average counts overall cross validation that include each type of disgust reaction and other movement in the train set.

Full size table

Disgust reaction scoring

To evaluate the performance of the classifier for the disgust reaction score, predictions of the total number of frames that constitute individual disgust reactions were scored according to certain rules, as mentioned above, and the scored values were compared with the values manually counted and scored by a human. We found a high correlation between the scores calculated from the classifier’s prediction and human count (Pearson’s r = 0.97, Fig. 10, see Supplementary Movie).

Discussion

We constructed a classifier using DLC and random forest to automatically count innate and learned disgust reactions in the taste reactivity test. The total predicted values of the classifier showed a strong correlation with human observations. The present study is the first to automatically assess disgust reactions in taste reactivity tests.

Our classifier was constructed according to the published taste reactivity test protocols. Therefore, the scores analyzed by our classifier can be compared with manually counted scores. We found a very high correlation in this comparison (r = 0.97; Fig. 10). The accuracy of our classifier is comparable to that of previous automated methods: the pup retrieval behaviors in mice were automatically assessed with 0.51–0.90 correlations using DeepLabCut and machine learning²⁸, and the number of chemically induced scratching in mice was predicted with 0.98 correlations using a convolutional recurrent neural network³⁰.

The present study used DLC for marker-less estimation of the position of the body part because a previous study demonstrated excellent performance³¹. Nonetheless, alternative open-source software programs such as DeepPoseKit³² and SLEAP³³ allow for comparable estimations. Our present classifier may also assess disgust reactions even if these programs are used instead of DLC. The research field of markerless estimation of the position of body parts has made remarkable progress in recent years, and more accurate programs are likely to become available. Accordingly, automatic behavior evaluation procedures that estimate the position of the body part in advance, such as our present classifier, are expected to improve in accuracy because at least some of the failures of our classifier resulted from the failure to estimate the position of the body part (Fig. 9).

A recent study evaluated facial expressions in response to taste stimuli in mice using a machine-learning-based method³⁴. Facial expressions in response to bitter stimuli were automatically distinguishable from those in response to the other stimuli. Bitter-induced facial expressions were also induced by the intake of saccharin solution paired with intraperitoneal LiCl injection and by the intake of NaCl solution at high concentrations. However, it remains unclear what specific brain functions are reflected in the observed facial expressions because the taste stimuli used may evoke multiple brain functions, including disgust and motivation to avoid the stimulants. In addition, the facial expressions detected by this method in mice were challenging to interpret as relevant facial expressions in other species, including humans, making it difficult to interpret the mouse’s facial expressions as a readout of emotion. In contrast, all disgust reactions expressed by mice are shared by several different species, such as rats, monkeys, and human neonates¹⁵. This fact supports the idea that a mouse’s disgust reactions and human disgust may share a common neural basis. In support of this, taste reactivity may reflect different brain functions and neural activities from facial expressions because the number of disgust reactions and facial expressions do not always fit in mice³⁵. In addition to facial expressions, intake and licking behaviors have been widely examined in taste avoidance and evaluation studies, but the results from these behaviors do not always match those of taste reactivity^20,36. Thus, taste reactivity will continue to be a valuable readout for future research to understand the neural basis of disgust.

Our classifier can be built with minimal resources. In the video of the taste reactivity used in the present study, three of the five types of behavioral reactions that make up the disgust reaction, chin rub, face wash, and forelimb flail, accounted for 93.46% of the total disgust reactions (Fig. 4B). As these three reactions are characterized by the movement of the nose and the front and rear paws, tracking the five body parts of the mice, the nose, and the front and rear paws (Fig. 5A) seemed to be sufficient to track these reactions, and thus the disgust reactions observed in the present study. These five tracking points are generally smaller than those used in other studies using DeepLabCut (8–13 points/mouse)^29,31. Although the behaviors detected in other studies are different from those in the present study and cannot be directly compared, the relatively small number of tracking points in the present study achieved automated tracking with a relatively small burden of manual annotation and machine training costs. In addition, an inexpensive and commercially available handheld video camera with low frame rates (30 frames per second), which is equal to or less than that used in previous studies (30–60 frames/sec)^30,37, was sufficient to record mouse behavior for subsequent analysis using our classifier. Using videos with low frame rates saves time in estimating the position of the body part and reduces the storage size of the original videos, ultimately improving the model³⁸.

However, our classifier could hardly detect gapes and head shakes, which are the other two reactions constituting the disgust reaction (Tables 3 and 4). This may be because the tracking point for the gapes was not set at a sufficiently high level to detect movement around the oral cavity, which was assumed necessary for detection. For the head shake, the 30 fps record may have been insufficient because the movements were too fast. Although the inability to detect these reactions was not problematic in the present study because the frequency of these reactions was low in mice (Fig. 4), increasing the number of tracking points may be required to predict disgust reactions in rats, where gapes are more frequently observed³⁹.

In the present study, we developed an automated assessment method for mouse disgust reactions. The present method allows for the evaluation of disgust reactions independent of the skill level and bias of the human evaluator. In addition, the present method will facilitate the implementation of taste reactivity tests in large-scale screening and long-term experiments that require counting numerous disgust reactions, which are challenging to perform manually. Thus, the present method may help researchers implement taste reactivity tests in a broader range of studies than previously thought²⁰. The present method is expected to accelerate our understanding of the neural basis of disgust, contribute to understanding the pathophysiology of various psychiatric disorders, and advance the development of preventive and therapeutic interventions.

Data availability

The corresponding authors can provide the videos, code, and generated datasets used in the present study upon reasonable request.

References

Darwin, C. The Expression of the Emotions in Man and Animals, 3rd edition, Fontana Press, London UK (1872).
Rozin, P. & Fallon, A. E. A perspective on disgust. Psychol. Rev. 94, 23–41 (1987).
Article CAS PubMed Google Scholar
Bernstein, I. L. Learned taste aversions in children receiving chemotherapy. Science 200, 1302–1303 (1978).
Article ADS CAS PubMed Google Scholar
Jacobsen, P. B. et al. Formation of food aversions in cancer patients receiving repeated infusions of chemotherapy. Behav. Res. Ther. 31, 739–748 (1993).
Article CAS PubMed Google Scholar
Phillips, M. L., Senior, C., Fahy, T. & David, A. S. Disgust – the forgotten emotion of psychiatry. Br. J. Psychiatry. 172, 373–375 (1998).
Article CAS PubMed Google Scholar
Knowles, K. A., Jessup, S. C. & Olatunji, B. O. Disgust in anxiety and obsessive-compulsive disorders: recent findings and future directions. Curr. Psychiatry Rep. 20, 68 (2018).
Article PubMed PubMed Central Google Scholar
Anderson, L. M., Berg, H., Brown, T. A., Menzel, J. & Reilly, E. E. The role of disgust in eating disorders. Curr. Psychiatry Rep. 23, 4 (2021).
Article PubMed PubMed Central Google Scholar
Nagel, T. What is it like to be a Bat?? Philos. Rev. 4, 435–450 (1974).
Article Google Scholar
Chalmers, D. J. The Conscious Mind: in Search of a Fundamental Theory (Oxford University Press, 1996).
Chalmers, D. J. First-person methods in the science of consciousness. (1999). http://consc.net/papers/firstperson.html
Velmans, M. Heterophenomenology versus critical phenomenology. Phenomenol Cogn. Sci. 6, 221–230 (2007).
Article Google Scholar
Tanaka, D. H. & Tanabe, T. CHANging consciousness epistemically (CHANCE): An empirical method to convert the subjective content of consciousness into scientific data. J. Mind Behav. 40, 177–190 (2019).
Google Scholar
E Steiner, J. The gustofacial response: observation on normal and anencephalic newborn infants. Symp. Oral Sens. Percept. 4, 254–278 (1973).
Google Scholar
Grill, H. J. & Norgren, R. The taste reactivity test. I. Mimetic responses to gustatory stimuli in neurologically normal rats. Brain Res. 143, 263–279 (1978).
Article CAS PubMed Google Scholar
Berridge, K. Measuring hedonic impact in animals and infants: microstructure of affective taste reactivity patterns. Neurosci. Biobehav Rev. 24, 173–198 (2000).
Article CAS PubMed Google Scholar
Parker, L. A. Taste avoidance and taste aversion: evidence for two different processes. Learn. Behav. 31, 165–172 (2003).
Article PubMed Google Scholar
Ho, C. Y. & Berridge, K. C. Excessive disgust caused by brain lesions or temporary inactivations: mapping hotspots of the nucleus accumbens and ventral pallidum. Eur. J. Neurosci. 40, 3556–3572 (2014).
Article PubMed PubMed Central Google Scholar
Tanaka, D. H. et al. Genetic access to gustatory disgust-associated neurons in the interstitial nucleus of the posterior limb of the anterior commissure in male mice. Neurosci 413, 45–63 (2019).
Article CAS Google Scholar
Tanaka, D. H. et al. Genetic recombination in disgust-associated bitter taste-responsive neurons of the central nucleus of amygdala in male mice. Neurosci. Lett. 742, 135456 (2021).
Article CAS PubMed Google Scholar
Dwyer, D. M. Licking and liking: the assessment of hedonic responses in rodents. Q. J. Exp. Psychol. 65, 371–394 (2012).
Article Google Scholar
Mathis, A. et al. Deeplabcut: markerless pose Estimation of user-defined body parts with deep learning. Nat. Neurosci. 21, 1281–1289 (2018).
Article CAS PubMed Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Kawai, S., Takagi, Y., Kaneko, S. & Kurosawa, T. Effect of three types of mixed anesthetic agents alternate to ketamine in mice. Exp. Anim. 60, 481–487 (2011).
Article CAS PubMed Google Scholar
Miron, B. K. & Witold, R. R. Feature selection with the Boruta package. J. Stat. Softw. 36, 1–13 (2010).
Google Scholar
Kuhn, M. Classification and Regression Training. R package version 6.0–93. (2022). https://CRAN.R-project.org/package=caret
Seligman, M. & Rborist Extensible, Parallelizable Implementation of the Random Forest Algorithm. R package version 0.3-2. (2022). https://CRAN.R-project.org/package=Rborist
Koizumi, M., Cagniard, B. & Murphy, N. P. Endogenous nociception modulates diet preference independent of motivation and reward. Physiol. Behav. 97, 1–13 (2009).
Article CAS PubMed Google Scholar
Winters, C. et al. Automated procedure to assess pup retrieval in laboratory mice. Sci. Rep. 12, 1663 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Nilsson, S. R. O. et al. Simple Behavioral Analysis (SimBA) – an open source toolkit for computer classification of complex social behaviors in experimental animals. Preprint at https://www.biorxiv.org/content/ (2020). https://doi.org/10.1101/2020.04.19.049452v2
Kobayashi, K. et al. Automated detection of mouse scratching behaviour using convolutional recurrent neural network. Sci. Rep. 11, 658 (2021).
Article CAS PubMed PubMed Central Google Scholar
Sturman, O. et al. Deep learning-based behavioral analysis reaches human accuracy and is capable of outperforming commercial solutions. Neuropsychopharmacol 45, 1942–1952 (2020).
Article Google Scholar
Graving, J. M. et al. DeepPoseKit, a software toolkit for fast and robust animal pose Estimation using deep learning. eLife 8, e47994. https://doi.org/10.7554/eLife.47994 (2019).
Article CAS PubMed PubMed Central Google Scholar
Pereira, T. D. et al. A deep learning system for multi-animal pose tracking. Nat. Methods. 19, 486–495 (2022).
Article CAS PubMed PubMed Central Google Scholar
Dolensek, N., Gehrlach, D. A., Klein, A. S. & Gogolla, N. Facial expressions of emotion States and their neuronal correlates in mice. Science 368, 89–94 (2020).
Article ADS CAS PubMed Google Scholar
Kawai, H. et al. Median Raphe serotonergic neurons projecting to the interpeduncular nucleus control preference and aversion. Nat. Commun. 13, 7708 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Lin, J. Y., Arthurs, J. & Reilly, S. Conditioned taste aversion, drugs of abuse and palatability. Neurosci. Biobehav Rev. 45, 28–45 (2014).
Article PubMed Google Scholar
Jhuang, H. et al. Automated home-cage behavioural phenotyping of mice. Nat. Commun. 1, 68 (2010).
Article ADS PubMed Google Scholar
von Ziegler, L., Sturman, O. & Bohacek, J. Big behavior: challenges and opportunities in a new era of deep behavior profiling. Neuropsychopharmacol 46, 33–44 (2021).
Article Google Scholar
Berridge, K., Grill, H. J. & Norgren, R. Relation of consummatory responses and preabsorptive insulin release to palatability and learned taste aversions. J. Comp. Physiol. Psychol. 95, 363–382 (1981).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank Dr. Tsutomu Tanabe at TMDU for his helpful comments and discussion.

Author information

Authors and Affiliations

Department of Cognitive Neurobiology, Graduate School of Medical and Dental Sciences, Institute of Science Tokyo, 1-5-45 Yushima, Bunkyo-ku, Tokyo, 113-8519, Japan
Shizuki Inaba, Naofumi Uesaka & Daisuke H. Tanaka

Authors

Shizuki Inaba
View author publications
Search author on:PubMed Google Scholar
Naofumi Uesaka
View author publications
Search author on:PubMed Google Scholar
Daisuke H. Tanaka
View author publications
Search author on:PubMed Google Scholar

Contributions

N.U. supervised this project. S.I. and D.H.T. conceived the project and designed experimental and analytical strategies. D.H.T. conducted experiments. S.I. analyzed the data and wrote the code. S.I. drafted the manuscript, which was discussed and edited by all the co-authors.

Corresponding authors

Correspondence to Naofumi Uesaka or Daisuke H. Tanaka.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1 (download PDF )

Supplementary Material 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Inaba, S., Uesaka, N. & Tanaka, D.H. Automatic quantification of disgust reactions in mice using machine learning. Sci Rep 15, 17573 (2025). https://doi.org/10.1038/s41598-025-01244-3

Download citation

Received: 28 January 2025
Accepted: 05 May 2025
Published: 21 May 2025
Version of record: 21 May 2025
DOI: https://doi.org/10.1038/s41598-025-01244-3

Supplementary Material 2

Subjects

Abstract

Similar content being viewed by others

A neurofunctional signature of subjective disgust generalizes to oral distaste and socio-moral contexts

Generalization gradients for fear and disgust in human associative learning

Efficiency and safety of automated label cleaning on multimodal retinal images

Introduction

Methods

Animals

Surgery for intraoral tubing

Taste reactivity test

Automated body part tracking

Disgust reaction labeling

Random forest classification and model evaluation

Disgust reaction scoring

Computer software and hardware

Results

Manual analysis of disgust reactions

Automated body part tracking

Outlier frame detection and correction

Random forest classifier

Disgust reaction scoring

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher’s note

Electronic supplementary material

Supplementary Material 1 (download PDF )

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links