Abstract
Despite advances in gut health research, the variability of important gut markers within individuals over time remains underexplored. We investigated the intra-individual variation of various faecal gut health markers using an optimised processing protocol aimed at reducing variability. Faecal samples from ten healthy adults over three consecutive days demonstrated marker-specific intra-individual coefficients of variation (CV%), namely: stool consistency (16.5%), water content (5.7%), pH (3.9%), total SCFAs (17.2%), total BCFAs (27.4%), total bacteria and fungi copies (40.6% and 66.7%), calprotectin and myeloperoxidase (63.8% and 106.5%), and untargeted metabolites (on average 40%). For thirteen microbiota genera, including Bifidobacterium and Akkermansia, variability exceeded 30%, whereas microbiota diversity was less variable (Phylogenetic Diversity 3.3%, Inverse Simpson 17.2%). Mill-homogenisation of frozen faeces significantly reduced the replicates CV% for total SCFAs (20.4–7.5%) and total BCFAs (15.9–7.8%), and untargeted metabolites compared to faecal hammering only, without altering mean concentrations. Our results show the potential need for repeated sampling to accurately represent specific gut health markers. We also demonstrated the effectiveness of optimised preprocessing of human stool samples in reducing overall analytical variability.
Similar content being viewed by others
Introduction
Nowadays the impact of gut health on the maintenance of overall human health is commonly acknowledged1,2. So far, gut health research has strongly focused on faecal microbiota composition, because microbiota shifts have been correlated to many diseases3,4. Numerous factors like demographics5,6, diet7,8,9, physical activity10,11, medication use12,13, age14,15, health status16, gut transit time17,18, and other environmental factors19 have been reported to affect the microbiota. Interestingly, only a minor fraction of inter-individual microbiota differences can be explained by such exogenous and intrinsic host factors20.
To provide a comprehensive characterisation of an individual’s gut health status, it is of interest to measure a combination of gut health-related markers. For instance, the Bristol Stool Scale (BSS) can be used as proxy for faecal water content % (WC%) and intestinal transit time (ITT)17,21,22. Water content is however rarely measured23,24. In addition to relative microbiota abundance, absolute microbiota numbers have also been suggested to be of importance for the interpretation of microbiota data25,26,27. Moreover, important other kingdoms than bacteria, for instance fungi have been described28,29. Until recently, the importance of the intestinal fungal microbiome (“mycobiome”) has been largely unstudied and only constitute 3% of the total microbiome studies30. Importantly, while many studies focus on microbiota composition, the microbiota-derived metabolites might be the crucial functional regulators in health and disease31,32,33,34,35. One well-known example is the suggested health benefits of short-chain fatty acids (SCFAs)36. Furthermore, branched-chain fatty acids (BCFAs) are mainly produced by the microbial fermentation of branched-chain amino acids37, and receive attention in human studies more frequently, as potential negative health associations have come to light38,39. Faecal pH is an indicator of the intestinal environment, which could be of relevance given its proposed influence on bacterial metabolic processes, growth and composition39,40, and is associated with the presence of faecal SCFAs and BCFAs41,42. Despite being easily measured with a probe, pH is infrequently assessed in human studies in the context of gut health. Furthermore, faecal biomarkers that are (potentially) clinically relevant include the inflammatory markers calprotectin and myeloperoxidase (MPO)43,44. Both inflammatory markers are neutrophil proteins, with calprotectin commonly used as a biomarker in inflammatory bowel disease (IBD)45, while MPO has been highlighted in recent years, as a potential biomarker for inflammation46.
As yet, the intra- and inter-individual variation of most of the abovementioned gut health markers have not been well studied in literature. Data from healthy subjects47,48 and patients49 highlight the importance of repeated faecal sampling, with three to five consecutive samplings, to capture the intra-individual microbiota variation. Additionally, large day-to-day variations in the microbiota composition and diversity within individuals in consecutive faecal samples over a six-week period were found48. A longitudinal study suggested that faecal microbiota is generally stable over one year, but can undergo rapid and profound changes due to factors like traveling50. Even in controlled enclosed environments, the human faecal microbiota was found to be dynamic51.
Additionally, for faecal metabolite evaluation it might be relevant to include repeated faecal samples, because the three main SCFAs, acetate, propionate, and butyrate, showed considerable intra-individual variability in four human faecal samples collected within a week52. Also, the total SCFAs, and especially butyrate, varied over 12 weeks in free-living adults53. Moreover, stool consistency variability is high in both inflammatory bowel syndrome patients and healthy subjects49. The variability of other gut health markers in consecutive collected faecal samples is less well-described. More in-depth knowledge about intra-individual variation in gut health markers enhance understanding of the relevance of repeated faecal measurements to control for temporal variation. Furthermore, establishing more accurate baseline values of these markers within participants is crucial in assessing the actual intervention-induced effects, rather than studying day-to-day variation.
Limitations and differences in faecal sampling- and handling methodology may play a role in the large inter-individual differences that are reported54,55. Currently, there are no clearly defined standard practices for faecal sampling, handling, and analysis in human studies. Often only a single scoop of a few grams of faeces is collected56. Optimising gut health analysis using faeces might require adjusted sampling and processing procedures. For example, many microbiota-derived metabolites are volatile, sensitive to degradation, and require specific solvents54,55,57,58,59,60,61,62,63. To measure the microbiota, microbial metabolites, and other gut markers accurately and with minimal technical variation, strict faecal sample collection, storage, preparation, and analytical procedures are required54,64. These procedures are especially important to reduce variation due to sample handling, which could be falsely attributed to biological intra- and inter-individual differences, or give a false indication of the baseline versus post-intervention effects on gut health status56.
One important consideration in the sampling procedures is the collection of larger volumes of faeces by the participant by taking multiple scoops from different locations of the faeces, because it was previously shown that spot sampling rather than collection from the top, middle, bottom or edge positions of the faeces resulted in higher microbiota and metabolites variability60,65. Other examples of optimised procedures include keeping samples frozen during processing at all times, thus avoiding freeze-thaw cycles and temperature fluctuations, which may lead to metabolite degradation and microbial fermentation, and selecting appropriate storage conditions54,66. Furthermore, because of the heterogeneity of faeces60,65, applying specific devices to properly homogenise the samples becomes crucial. Blenders64 or other milling devices suitable of grinding deep-frozen materials into fine powders, such as an IKA mill, are often used in plant metabolomics and soil microbiome research67, while these are not often reported in the context of human faeces processing. Nevertheless, homogenising faeces may reduce the variation in bacteria abundances and SCFAs levels as compared to non-homogenised faeces65,68.
We investigated the variation of a selected panel of gut markers in healthy adults, with a focus on the intra-individual variation. Furthermore, the effect of an optimised sample pre-processing procedure, including mill-homogenisation in liquid nitrogen, on the analysis of gut health markers was evaluated. For this purpose, ten participants collected three faecal samples on consecutive days. The optimised faecal sampling, processing and storage protocol was applied. The measured gut health markers included BSS, pH, and WC%, microbiota composition and diversity, absolute abundance of the total bacteria and fungi, SCFAs, untargeted metabolite profiles and specific inflammatory biomarkers.
Results
Subjects and faecal sample collection
Ten healthy adults finished the study, of which 2 males and 8 females. Seven participants collected their stools on three consecutive days (Supplementary Table 1), while three participants collected on non-consecutive days.
Variation of a selected panel of gut health markers
The results presented in Fig. 1 reveal the intra-subject variation for a panel of different gut health markers. The BSS and pH demonstrated a low CV%intra of 16.5 ± 14.9 and 3.87 ± 1.74, with a moderate test-retest reliability of ICC 0.74 [0.43–0.92] and ICC 0.56 [0.16–0.85] (Table 1). Water content demonstrated a low CV%intra of 5.74 ± 3.20 with a low test-retest reliability of ICC 0.37 [-0.01-0.76]. Faecal SCFAs analysis revealed CV%intra of below 30% for total SCFAs (17.2 ± 13.8), total BCFAs (27.4 ± 15.2), acetic acid (16.0\(\:\pm\:\)11.7), propionic acid (17.8\(\:\pm\:\)12.4) and butyric acid (27.8 ±17.4). Isobutyric acid, valeric acid, and 4-methyl valeric acid had a CV%intra lower than 30%, while isovaleric acid, heptanoic acid, and hexanoic acid had a CV%intra higher than 30% (Table 1). Test-retest reliability scores showed moderate test-retest reliability for total SCFAs (ICC 0.65 [0.29–0.89]), acetic acid (ICC 0.73 [0.41–0.92]) and propionic acid (ICC 0.64 [0.28–0.88]) while butyric acid (ICC 0.40 [-0.01-0.77]) and total BCFAs (ICC 0.35 [-0.03-0.74]) showed poor test-retest reliability. To assess whether the genetic potential for fermentation and production of SCFAs was less variable compared to the metabolites, microbial pathways and enzymes related to fermentation and production of SCFAs in the predicted bacterial genomes were analysed (Supplementary Table 2). The CV%intra for pathways and enzymes related to acetic acid, propionic acid, and butyric acid production were 22.4 ± 22.3, 23.4 ± 13.3, and 23.1 ± 9.43, respectively. From the 26 pathways and enzymes, 14 demonstrated good or excellent test-retest reliability over time (ICCs 0.75-1.0), 10 demonstrated moderate test-retest reliability over time (ICCs 0.50–0.75), and two demonstrated poor test-retest reliability. Furthermore, high CV%intra were demonstrated by both total fungi (67.0 ± 35.8, corrected by wet weight) and total bacteria (40.6 ± 16.9, corrected by wet weight). Total fungi counts showed excellent test-retest reliability (ICC 0.90 [0.75–0.97], corrected by wet weight), while total bacteria showed poor test-retest reliability ICC 0.32 [-0.01-0.71] (corrected by wet weight). Corrections for faecal dry weight demonstrated similar results for both the CV%intra and ICC of the total fungi and total bacteria copies compared to corrections for faecal wet weight (Supplementary Table 3). Calprotectin showed a high CV%intra of 63.8 ± 69.0 with excellent test-retest reliability (ICC 0.92 [0.79–0.98]), and MPO demonstrated a high CV%intra of 106.5 ± 51.1 with poor test-retest reliability (ICC 0.37 [-0.03-0.77]) largely driven by variation in participant 10. All markers demonstrated an intra- and inter-assay technical variation of < 14% (Supplementary Table 4).
Intra-individual variation in a panel of gut health markers in faecal samples of healthy subjects collected over three consecutive days. The (A) Bristol Stool Scale, (B) water content, (C) pH, (D) acetic acid, (E) propionic acid, (F) butyric acid, (G) total fungi, (H) total bacteria, (I) total SCFAs, (J) total BCFAs, (K) calprotectin, and (L) MPO are presented as measured in the three faecal samples collected. Data from n = 10 participants are presented. For calprotectin, n = 8 subjects had at least one measurement above the detection limit of 11.92 µg/g wet weight, and for MPO, also n = 8 subjects had at least one measurement above the detection limit of 68.5 ng/g wet weight. Individual participant’s samples are shown by the three dots. Participants were instructed to choose only one scale point for the Bristol Stool Scale, but when two types were indicated by the participant, the highest of the two scores was included in the analysis.
The CV%intra of faecal water content demonstrated significant positive correlations with the CV%intra of butyric acid and calprotectin (Supplementary Fig. 1). The CV%intra of stool consistency, assessed by the BSS, demonstrated significant positive correlations with the CV%intra of acetic acid, propionic acid, butyric acid, hexanoic acid, and the total SCFAs. In summary, all markers except for total fungi, total bacteria, calprotectin and MPO, demonstrated a CV%intra of less than 30%. Overall, 8 out of 12 markers demonstrated moderate to excellent test-retest reliability, with poor test-retest reliability shown by water content, butyrate, MPO, total BCFAs, and total bacteria counts (both wet and dry weight corrections).
Variation in faecal microbiota composition and diversity
The relative microbiota composition was plotted to visualise changes of the top 25 genera at the three consecutive timepoints (Fig. 2A). Visualisation of the overall microbiota community revealed clusters based on individuals and not per sampling timepoint (Fig. 3A). From the top 25 genera, 13 genera demonstrated CV%intra >30%, with CV%intra ranging between 15.1 and 84.4. For all bacteria that were measured on genus level, the mean CV%intra was 56.5 ± 9.0, of which 43.1 ± 14.7% of the genera had a CV%intra lower than 30% (Supplementary Table 5). Furthermore, 17 bacteria were demonstrating good or excellent test-retest reliability over time (ICCs 0.75-1.0), 7 genera demonstrating moderate test-retest reliability over time (ICCs 0.50–0.75), and one genus, Akkermansia, demonstrating poor test-retest reliability (ICC 0.48 [0.12–0.81], Table 2). Also the faecal microbiota alpha-diversity was quantified (Fig. 2B), as calculated by the indexes Fisher Diversity, Inverse Simpson, Shannon Diversity, Phylogenetic Diversity, and the estimated number of species (Chao1) that had a CV%intra of 13.9 ± 8.19, 17.2 ± 10.1, 3.42 ± 1.68, 3.27 ± 2.77, and 11.4 ± 5.06, respectively. All demonstrated moderate to good test-retest reliability over time, namely ICC 0.55 [0.17–0.84], 0.77 [0.48–0.93], 0.81 [0.56–0.95], 0.87 [0.68–0.96], and 0.61 [0.25–0.87], respectively. Overall, the microbiota dissimilarity between individuals is much larger compared to the day-to-day microbiota variation within individual. The microbiota alpha-diversity per individual within one week of consecutive sampling remained stable, but the variability of the bacteria was highly genus-dependent.
The variation in microbiota composition and diversity in faecal samples of healthy subjects collected over three consecutive days. (A) The relative abundance of the faecal microbiota at three consecutive sampling timepoints in the ten individuals. The top 25 bacteria on genus level are shown. If genus was not classified, the family name (f_) is provided. (B) The microbiota alpha-diversity, measured by the indexes Fisher Diversity, Inverse Simpson, Shannon Diversity and the Phylogenetic diversity. Individual participant’s samples are shown by the three dots. Data from n = 10 participants are presented.
Variation of untargeted metabolites
In addition to targeted metabolite analyses, untargeted metabolomics was also applied to investigate the variation of all features, which are signals representing chemical compounds measured in the positive or negative ionization mode, in the faecal samples. The number of detected features, the pre-processing workflow, and final included features can be found in Supplementary Fig. 2. As shown in Fig. 3B, the differences in metabolites within individuals are smaller than differences between individuals. For the positive features, the mean CV%intra was 40.1 ± 4.44, of which 49.1 ± 6.8% of the features had a CV%intra lower than 30% (Table 3). 15.2% demonstrated good or excellent test-retest reliability over time (ICCs 0.75-1.0), 37.0% demonstrated moderate test-retest reliability over time (ICCs 0.50–0.75), and 47.8% demonstrated poor test-retest reliability (ICC < 0.50, Table 3). For the negative features, the mean CV%intra was 41.7 ± 5.43, of which 47.6 ± 7.6% of the features had a CV%intra lower than 30%. In comparison, the CV%inter was higher than the CV%intra, namely 76.9 ± 2.18 for the positive features, and 89.0 ± 3.52 for the negative features. 24.1% demonstrated good or excellent test-retest reliability over time (ICCs 0.75-1.0), 37.3% demonstrated moderate test-retest reliability over time (ICCs 0.50–0.75), and 38.7% demonstrated poor test-retest reliability (ICC < 0.50).
Variation in faecal microbiota and metabolite profiles in faecal samples of healthy subjects collected over three consecutive days. (A) The PCoA plot visualises the microbiota variation between the three consecutive timepoints per subject the beta-diversity was calculated using Weighted UniFrac. (B) The PCA scores plot based on all metabolites measured in both positive and negative ionization modes. The 95% confidence ellipses are shown, coloured according to the ten individuals. PCA: Principal Component Analysis, PCoA: Principal Coordinate Analysis.
The effect of faecal sample preparation on the gut-health related outcome measures
To determine the most appropriate pre-processing procedure per marker, the effect of faecal sample homogenisation on the analysis of gut-health markers was investigated using ten replicates from the same pool of faeces. Five of these replicates were processed by hammering only, while the other five replicates were mill-homogenised after hammering (Fig. 4A). Both pH and water content showed a slight reduction in CV% due to the additional mill-homogenisation: for pH from 0.43% (hammered-only) to 0.38% (mill-homogenised), and for water content from 1.63 to 1.46% (Fig. 4B; Table 4). For 10 out of the 13 markers, the mill-homogenisation step reduced the CV% between replicates, while for heptanoic acid, 4-methyl valeric acid and MPO, the CVs were increased. Mill-homogenising the faeces reduced the CV% for total SCFAs from 20.4 to 7.53%, and for total BCFAs from 15.9 to 7.82%. The CV% for calprotectin decreased from 15.5 to 6.43%, while MPO increased from 24.2 to 48.7%. For none of the markers a statistical difference in the mean concentration values between hammered-only samples and mill-homogenised samples was observed (P-values > 0.05; Table 4).
Comparison of variation of faecal pre-processing methods within replicate of the same pooled faecal sample. (A) Replicates (n = 10) of a pooled faecal sample, of which 5 were pre-processed by hammering-only, and 5 by both hammering and mill-homogenising. (B) The coefficient of variation percentage as calculated within the replicates of hammered-only samples (circles) and within replicates of mill-homogenised samples (triangles). (C,D) The untargeted metabolomics PCA scores plot based on features detected in positive ionization mode (C) and in negative ionization mode (D) of the hammered-only (circles) and mill-homogenised replicates (triangles).
We also investigated the effect of faecal sample homogenisation on untargeted metabolomics and the microbiota composition. The variation in metabolites in the faecal replicates after mill-homogenising was smaller compared to the variation in faecal replicates after hammering only, for both the features detected in positive ionization mode (Fig. 4C) and the negative ionization mode (Fig. 4D). For the positive mode, the CV%intra (mean ± SD) of all detected features within the faecal mill-homogenised replicates was 15.0 ± 18.0%, while the hammered-only replicates showed a CV%intra of 16.5 ± 18.3%.For the negative mode features, the CV%intra (mean ± SD) for all metabolites was 12.1 ± 13.1% within the faecal mill-homogenised replicates, while the hammered-only replicates had a CV%intra of 15.3 ± 19.3%. The overall variability within the hammered-only and mill-homogenized groups was not significantly different for either the positive features (P-value = 0.35) or negative features (P-value = 0.32). However, visual inspection showed that the distances to the centroid were smaller in the mill-homogenized group compared to the hammered-only group (Supplementary Fig. 3). To determine the effect of milling on microbiota analysis, first we evaluated the potential fragmentation of DNA, and based on agarose gel analyses it was observed that faecal sample mill-homogenising did not cause DNA fragmentation. The mean (± SD) correlation of the microbiota composition in the hammered-only and mill-homogenised faeces at timepoint one within individual was 0.92 ± 0.07 (Supplementary Table 6). In comparison, the technical replicates, which was the same microbial DNA included in all consecutive PCR- and sequencing steps, showed a comparable correlation of 0.96 ± 0.03. The overall microbiota composition between the mill-homogenised and hammered-only faeces was not significantly different (PERMANOVA P-value = 0.96), and visualisation of the overall microbiota community did not reveal clusters based on mill-homogenised or hammered-only faeces (Supplementary Fig. 4). Overall, we showed the added value of the mill-homogenising step for targeted metabolite analyses, since it reduced the variation of SCFAs and BCFAs and untargeted metabolomics in samples obtained from the same faecal pool, and the mill-homogenising did not impact the microbial DNA and composition.
Discussion
Variability in gut health markers is marker specific
The main aim of this study was to evaluate the intra-subject variation of selected faecal gut health markers in three faecal samples collected on consecutive days in healthy adults. This marker panel included BSS, pH, WC%, inflammatory biomarkers, total bacteria and fungi copies, microbiota composition and diversity, SCFAs, BCFAs, and untargeted metabolomics. Most markers had a CV%intra less than 30%. This included technical intra-assay variation, which ranged from 0.49 to 14%. We also reported ICC, but emphasized CV% for interpretation. ICC as a marker of test-retest reliability compares intra-individual variability to inter-individual variation of the sample population, providing a ratio of the variance components. This indicates the relative contributions of different sources of variability. Several markers showed a relatively high intra-individual variability, but nevertheless a good test-retest reliability, due to the ratio with inter-individual variation.
We found that stool consistency, as measured by BSS, had a low intra-individual variability of 17% (range 0–43%), comparable to previous findings of 0–58% over 9 days in healthy adults41. However, our moderate test-retest reliability score (ICC 0.74) contrasts with a study by Vork et al., showing lower test-retest reliability in healthy adults over 7 consecutive days (ICC 0.28)49. The BSS test-retest reliability may decrease at the boundaries of constipation (type 2/3) or diarrhoea (type 5/6)23,69, so the median stool form type 4 may have contributed to better test-retest reliability in our study. Faecal water content, a more objective measure of stool consistency, had a lower intra-individual variability of 6% (range 1–11%), contrasting with a slightly wider range of 2–24% CV%intra in 61 healthy individuals41. Despite moderate test-retest reliability (ICC 0.62) in previous literature over 7 consecutive days in healthy controls49, we found poor test-retest reliability (ICC 0.37) likely due to the relatively low inter-individual variation in comparison to the intra-individual variability. Overall, we confirmed low variation when using BSS scores in healthy participants with mostly stool consistency types 3 or 4. Additionally, a single measure of water content is sufficient, and offers less variability than BSS. Studies that wish to account for stool consistency may consider including faecal water content as objective and less variable measure. Furthermore, faecal pH, reflecting distal colonic pH39, showed a low intra-individual variability (1–7%), comparable to findings elsewhere (0–8%), showing that faecal pH was the most stable over 9 days compared to other gut markers41. We also found a that intra-individual variability in pH was positively correlated with intra-individual variability in some faecal SCFAs. We conclude that a single faecal pH measurement is sufficient due to minimal variation and overall stability.
There was moderate intra-individual variability in the concentrations of faecal acetate, propionate, and butyrate. Of these three SCFAs, butyrate showed the highest intra-individual variability (28%), followed by propionic acid (18%), and acetic acid (16%). These findings are consistent with previous findings in faeces collected over three days65. Heptanoic, hexanoic, and isovaleric acid had intra-individual variabilities above 30%. For heptanoic acid, the higher variability may be explained by the very low faecal concentrations. Others reported that faecal SCFAs and BCFAs concentrations fluctuated considerably from day-to-day (CVintra range 26–40%), with valerate varying the least and acetate the most41. We found a significant positive correlation between intra-individual variability of faecal water content and intra-individual variation of total SCFAs and butyric acid, highlighting the relevance of including faecal water content as a potential confounding factor in metabolite measurements, also demonstrated by others53,70. BCFAs showed more variable intra-individual responses than SCFAs (CV%intra of 27% versus 17%). A single faecal measurement likely suffices for acetate, propionate, butyrate, or total SCFAs and BCFAs, with water content correction potentially reducing butyrate variability.
Furthermore, we found high intra-individual (67%) and inter-individual (222%) variation in total fungi copies. Analysis of intra-individual variability in fungi is sparse, however, previous studies indicate that a large number of the faecal fungi is transient71,72,73, mainly derived from dietary consumption and passage from the oral cavity and skin71,72,74,75,76. While we did not investigate variability of specific fungi species, others indicated that the occurrence of the same fungi in faeces of healthy volunteers at two timepoints 13–16 weeks apart was less than 20%71. We similarly demonstrated high intra-individual variability of 40% (range 13–59%) in total bacteria copies, but with notably lower inter-individual variation (55%) compared to total fungi copies. Our results are similar to findings by Procházková et al., demonstrating an intra-individual variability range of 8–73% in total faecal bacterial load over 9 consecutive days, despite their use of a different quantification technique, namely flow cytometry41. The high variability may be explained by intestinal transit time that strongly correlates with bacterial mass77,78, but we could not confirm a correlation between variability of total bacteria and stool consistency. Additionally, correction for faecal dry weight compared to wet weight did not affect total bacteria and fungi counts variability. Overall, caution is warranted in interpreting single measurements of total bacteria and fungi abundances, and averaging total bacteria and fungi copies from multiple samples is recommended. Further investigation into specific bacteria and fungi abundances within- and between individuals is warranted.
Both inflammatory markers, calprotectin (64%) and MPO (107%), showed high intra-individual variability. Calprotectin’s variability aligns with previous studies in inflammatory bowel disease (IBD) patients (variabilities of 54% within four days, 39% within two days, and 36% within two days79,80,81) and healthy controls (38% in two non-consecutive samples 1–2 weeks apart45). Due to demonstration of diurnal variation by calprotectin, some studies recommend always sampling from the first stool sample of the day to reduce variation82. We did not account for the time of day in faecal sampling. However, stool frequency in our study was typically no more than one stool per day, and other studies have shown reliability of calprotectin concentrations regardless of sampling in the morning or evening80. Other factors that may affect calprotectin concentrations variability are stool volume and hydration79, and stool consistency82. In our study, we found a significant correlation between the variability of calprotectin and faecal water content, which suggests the importance of including stool consistency in explaining the day-to-day calprotectin variation, also according to international consensus83. We conclude that more than one sample of calprotectin is likely needed to accurately quantify calprotectin levels. Further investigation into correcting calprotectin concentrations by faecal water content to account for intra-individual variability is needed. The high intra-individual variability of MPO was consistent with previous variability (20–80%), likely due to difficulties with extracting tightly-bound MPO from the solid stool fraction84. In our healthy participants we expected low levels of intestinal inflammation, and indeed only one individual had detectable MPO concentrations in all faecal samples, while five participants had only detectible levels at two timepoints. Therefore, the reliability of the variability of MPO in healthy individuals might not be well illustrated in our small sample size. Despite this, our results indicate that gut inflammation as measured by MPO, is likely to be highly variable in individuals. Future studies incorporating MPO may require multiple faecal samples for accurate interpretation of gut inflammation.
Next, we analysed the variability in the microbiota composition and diversity, and in the overall metabolome using untargeted LCMS-based metabolomics. We demonstrated an overall mean CV%intra of 57% for all microbiota genera within one week. Previously, substantial intra-individual ecological variability longitudinally over 6 weeks to 6 years was seen48,85,86. In line with our results, most genera showed significant day-to-day variability within individuals, with 72% of all genera demonstrating more than a tenfold change in abundance between consecutive faecal samples48. We similarly demonstrated that intra-individual variability is genus-specific. Notably, genera with the highest relative abundance, Prevotella, Alistipes, Roseburia, Akkermansia, Parabacteroides, Bifidobacterium, Dialister, Clostridium sensus stricto 1, Holdemanella, UCG-002, Sutterrella, Banesiella demonstrated high intra-individual variability of above 30%. Clostridium_sensu_stricto_1 (Clostridium ss1)87,88,89 demonstrated the highest intra-individual variability (84%), followed by Akkermansia(82%). As yet, there are no publications describing the variation of Clostridium ss1 as a cluster. Recent interest in the probiotic potential of Clostridium butyricum, one species of this cluster, may however, lead to further investigation90,91. Akkermansia was the only genus in the top 25 genera with a low test-retest reliability measure (ICC 0.48). A. muciniphilia is frequently cited due to its association with beneficial health outcomes92,93. Despite the composition variability in our study, the overall microbial diversity remained stable, which is in line with previous findings demonstrating the stability of intra-individual alpha-diversity over one week49, but in contrast to substantial alpha-diversity Shannon index changes in individuals over six weeks (33%)48. We conclude that despite the relative stability in microbial community, large genus-specific variation is seen. In comparison to the high intra-individual microbiota composition variation, the untargeted metabolomics results showed lower variation, with mean CV%intra of 40% and 41% for features detected in positive and negative ionization mode, respectively. Less than half of all features had CV%intraof < 30%, indicating a relatively high day-to-day variation in faecal metabolite composition within individuals. The intrinsic dynamics of the microbiota, as well as external factors such as habitual diet, could introduce bias in clinical studies48,85,86. We demonstrated that this bias may be pronounced on a day-to-day basis when considering specific genera individually. Studies targeting Akkermansia as well as other genera with high intra-individual variability, such as Bifidobacterium, which are of interest in pre- and probiotic studies, should consider multiple timepoint sampling for accurate assessment. Additionally, future research should confirm intra-individual variability in absolute abundances of these genera, as it may differ from relative abundances48.
Overall, the intra-individual variability of most of the targeted gut-health markers within subjects was lower than the inter-individual variation between individuals, indicating that the inferences made from analysis of a single timepoint faecal sample should be reliable in the general population. Additionally, we propose that when considering single sampling versus multiple sampling at specific timepoints in studies, the specific gut-health outcomes of primary interest in a study should be taken into account. If genera or markers with high intra-individual CV% are the primary focus of a study, averaging multiple samples may provide a more accurate assessment of intervention effects, compared to relying on a single faecal sample only.
Mill-homogenisation during faecal pre-processing decreased variability of gut health markers in faecal replicates
We assessed the impact of an improved sample pre-processing procedure, including additional homogenisation of the frozen hammered faeces into a fine powder using a dedicated liquid nitrogen-cooled mill, on the outcome of measured gut health markers. Compared to sample hammering-only, the additional mill-homogenisation step reduced the CV% between replicates from the same faecal pool for most targeted gut health markers, except for heptanoic acid, 4-methyl valeric acid, and MPO. Our findings corroborate previous studies indicating that homogenisation can decrease the variability in microbiota composition65,68, and SCFA levels65. Importantly, there were no significant differences in mean marker concentrations between replicates processed by hammering-only and those subjected to mill-homogenisation. Previous methods, such as manual homogenisation with sterile plastic scoops65, hand homogenisation53, homogenisation in saline buffers using commercial grinders94, or preparing faecal slurries95 are likely less labour-intensive than our proposed mill-homogenisation method. However, in contrast to our method, which includes a mill that can be cooled with liquid-nitrogen, these other methods do not allow homogenisation of deep-frozen faeces and thus intrinsically incorporate potentially destructive freeze-thaw cycles. Additionally, another study found reduced variability by homogenising entire stool samples in liquid nitrogen and subsampling from the crushed powder, also allowing them to be kept frozen68. Hsieh et al. compared randomly sampled fresh and frozen stools that were either not milled at all or milled in a blender or homogenised with a pneumatic mixer, and concluded that homogenisation by any method reduced the microbiota variability64. So, for a reduction in measurement variability an adequate sample homogenisation step is preferred and both fresh and frozen stools can be used. However, to prevent microbiome and metabolome changes after sampling, stools should be frozen as soon as possible and then always kept frozen. Storage of homogenised fine faecal powders, rather than as complete stools or roughly hammered faecal pieces, allows multiple representative aliquoting of the same sample for complementary and repeated analyses. Avoiding thawing of stool samples before microbe or metabolite extraction is key to prevent any thawing-induced metabolic activity and related changes in the microbiome and metabolome of the specific sample. The currently used grinder (an IKA A11 Basic Analytical Mill) required thorough cleaning between faecal samples, which is time consuming. Alternative grinders like the IKA Tube Mill 100 control in combination with disposable plastic grinding chambers pre-filled with dry ice could streamline this process. Regarding the two protein markers, mill-homogenisation had no significant impact on their concentrations, while the CV% was decreased for calprotectin and increased for MPO, as compared to hammering-only. Previously, for other tissue matrixes (carcinoma tissues) ball milling was compared to classic tissue pulverization using mortar and pestle, and no differences in protein quality and concentrations were observed96. The differential effects of mill-homogenisation on the CVs for protein inflammation markers need further validation and verification by also analysing other proteins that are relevant for gut health, such as immunoglobulin A, lipocalin, and β-defensins97. Previously, it has been shown that the intra-stool CV% of calprotectin in three single stool punches ranged between 13 and 27% (median 17%), of which the authors concluded that a single-stool punch would be reliable for faecal calprotectin measurement81. We found a similar CV% for this protein for the hammered-only samples, but showed that mill-homogenising has the potential to further reduce the CV% to only 6%. Overall, mill-homogenisation added value, particularly in the analysis of faecal metabolites, by reducing variation of SCFAs, BCFAs, and untargeted metabolites within samples obtained from the same faecal pool. The proposed milling method can eliminate potential subsampling bias arising from heterogeneous distribution of some gut health markers in faeces. This finding also underscores the importance of applying adequate sample pre-processing techniques to avoid potential erroneous conclusions regarding study outcomes.
Limitations of the study
Study strengths included the comprehensive assessment of a series of gut health markers as well as both the microbiota and the global metabolome of human stools, which is particularly innovative within the context of intra-individual variability studies. More in-depth insights in the intra-individual stool variability is also key to studies focusing on inter-individual variability. A diverse range of gut health markers, both established and less frequently measured like total bacteria and fungi, provided a multifaceted understanding of gut health marker variability. The optimised faecal sampling procedure, from collection to pre-processing into frozen powders, ensured that the observed variability can be primarily attributed to individual differences rather than methodological and analytical inconsistencies. Furthermore, the proposed mill-homogenisation method was validated using pooled faecal samples, and offers standardization benefits for future research. However, limitations of the present study include a relatively small participant group with mainly female participants, thereby limiting the generalizability and highlighting the necessity for future studies to validate currently obtained results in larger gender-balanced cohorts. Another study limitation is that the exact time of faecal collection was not recorded, making it impossible to quantify the time interval between faecal sample collection and the potential effect of diurnal variation. Additionally, in healthy adults with minimal gut inflammation, markers such as calprotectin and MPO may have limited relevance. The current focus on sampling stools on consecutive days overlooks potential within-day variations, as shown previously by others65, limiting our understanding of even shorter-term variability. For the faecal SCFA analyses, we did not include freeze-drying of the faecal samples. However, recent literature suggests that lyophilisation may further minimize within-sample variation, and increased faecal SCFAs stability98. The lack of detailed background information on the volunteers, including dietary records and medication use, does not allow for explanation of potential causes of the observed variation in assessed gut health markers. Known factors influencing variability of microbial metabolites and other health markers include for instance metabolite absorption rates in the intestine, GI transit time, responses to diet, microbiota composition, colonic pH, and time since last meal37,38,39,71,78,99,100. In the present study we only included water content and stool consistency as proxies for transit time, where future studies should include direct measures101 to account for microbiota and metabolite variation. Future research aiming for a more comprehensive analysis should aim to incorporate these factors potentially influencing gut marker variation.
Conclusions
In conclusion, our study revealed that most of the selected gut-health markers in stools exhibited a low intra-individual variability, except for total fungi, total bacteria, calprotectin, MPO and certain microbiota genera. Notably, high intra-individual variability was observed in several commonly measured genera, including Prevotella_9, Bifidobacterium, Roseburia and Akkermansia. Microbiota alpha-diversity within individuals remained stable over a week of consecutive stool sampling, and microbiota variability was genus-dependent. Our findings help determine whether a single faecal sample or multiple samples are needed for accurate marker representation. Additionally, our optimised sample pre-processing, including mill-homogenisation of frozen stools, effectively reduced the variation in faecal metabolites. Therefore, we conclude that faecal mill-homogenisation mitigates the analytical variability in metabolites and eliminates subsampling bias that may result from a heterogeneous distribution of certain gut health markers in human faecal samples.
Methodology
Study participants and ethics approval
Ten healthy adults were recruited from Wageningen, the Netherlands. Exclusion criteria were antibiotic use within the three months before study start, and the presence of gastrointestinal disorders. The Medical Ethical Reviewing Committee of Wageningen University (METC-WU) evaluated this study, and concluded that no ethical approval was needed as this research does not fall within the remit of the Dutch Medical Research Involving Human Subjects Act. The subjects gave written informed consent for the use of their faeces. All specimens were used and coded anonymously. The human faecal samples were dealt with according to the Dutch code of conduct for health research.
Faecal sample collection
Participants collected faecal samples on three consecutive days, or alternatively at three timepoints as close as possible to each other within one week. The collection occasions can be found in Supplementary Table 1. Participants collected faeces at home using a faecal collection paper (Faecesvanger, Tag Hemi VOF, The Netherlands) and 55 × 44 mm polypropylene collection containers with a collection scoop integrated into the screw cap (Sarstedt, Nümbrecht, Germany). Participants were instructed to collect from three different topographical locations of the faeces, and to fill the container until two-thirds. Directly after collection, samples were stored at -20 °C at home, for maximally seven days. Afterwards, samples were transported under frozen conditions, in Styrofoam containers with ice packs, to the research facility and were immediately stored at -80 °C upon arrival. Participants indicated for each faecal sample the date and timepoint of sample collection, the number of stools passed per day, and the stool consistency on a Bristol Stool Scale (BSS). This scale describes seven types of stools ranging from 1: hard/lumpy to 7: watery without solid pieces.
Faecal sample processing
Faecal samples were processed under frozen conditions in two phases. First, frozen samples were crushed with a dead-blow hammer (Performance Tool Dead-Blow Hammer, Wilmar Corporation) into smaller particles of approximately 0.5 cm2. During the hammering, samples were always kept frozen using dry ice and liquid nitrogen. The hammered sample was stored at -80 °C, and 20 g of the hammered faeces was additionally mill-homogenised in a IKA A11 Basic Analytical Mill (Sigma-Aldrich, Darmstadt, Germany). Samples were always kept on dry ice. Liquid nitrogen was applied to the samples during both the hammering and milling steps to ensure they remained in a deeply-frozen state. Between homogenisation of samples, the mill was thoroughly cleaned using milli-q water and ethanol. Liquid nitrogen and dry ice were used to ensure the faeces always remained deeply frozen. Processed samples were stored at -80 °C. Subsequent aliquoting and weighing faeces for consecutive analyses were also performed under frozen conditions. Samples were thawed only if indicated for specific downstream analysis. For all analyses, except for the protein biomarker analyses, mill-homogenised samples were used. All faecal sample analyses from the same individual for all measurements were performed within the same batch.
Evaluation of the effect of mill-homogenisation using pooled faeces
To determine whether IKA milling affected outcome measures, 10 replicates of 1–2 gram faeces were made from a pooled sample of 15 gram faeces obtained from all ten participants. Afterwards, five replicates were then processed by hammering only, while five replicates were both hammered and mill-homogenised. For all markers of interest, comparison was made between the CV% within the five hammered-only replicates or the five mill-homogenised replicates. For the microbiota comparison only, the mill-homogenised samples and hammered-only samples of timepoint 1 of all participants were compared. To determine the effect of milling on DNA integrity, DNA was isolated from hammered-only and mill-homogenised samples. Afterwards, the DNA was checked on agarose gels (1%) using electrophoresis (100 V, 60 min) and ethidium bromide to check potential DNA degradation.
Water content and pH measurement
For faecal water content measurement 1 gram of mill-homogenised faecal sample was placed on an aluminium dish (VWR Aluminium smooth weigh dish 57 mm, VWR International, USA) and dried at 95 °C for a minimum of 48 h by vacuum drying (Heraeus D-6450 vacuum oven). Samples were weighed before and after drying. The water content percentage was determined as follows: 100-[dry weight (g)/wet weight (g)*100]. For faecal pH measurement 1 gram of mill-homogenised faecal sample was diluted in 10 mL milli-q water. Samples were homogenised for 15 min on a horizontal shaker (Universal shaker SM-30, Edmund Bühler GmbH, Bodelshausen, Germany) and centrifuged (5292 xg, 10 min, RT) before measurement. The pH of the supernatant was measured using a PCE-228-R meter (PCE Instruments, Enschede, The Netherlands) modified on the method described elsewhere102.
Protein biomarkers
Concentrations of faecal protein biomarkers were measured using enzyme-linked immunosorbent assay (ELISA) for MPO (Immundiagnostik, Bensheim, Germany), and calprotectin (Immundiagnostik) according to manufacturer’s instructions. For each assay, 60 ± 6 mg of hammered-only faeces was dissolved in extraction buffer (IDK Extract, Immundiagnostik) and further processed according to manufacturer’s instructions.
Short chain and branched chain fatty acids
The SCFAs (acetic acid, propionic acid, butyric acid, valeric acid, heptanoic acid, hexanoic acid) and BCFAs (isobutyric acid, isovaleric acid, 4-methylvaleric acid) levels were measured using gas chromatography with flame ionization detection (GC-FID) using 8690 GC system (Agilent, Amstelveen, the Netherlands) and a Restek Stabilwax-DA column (Restek Corporation, United States) as described by Lotti et al.103 with minor adjustments. In short, 40–50 mg of mill-homogenised faeces was weighed off in duplicate and 990 µL methyl tert-butyl ether, 10 µL 15% phosphoric acid in milli-q water and 10 µL 2-ethylbutyric acid in MTBE as the internal standard (IS) was added. Samples were homogenised for 30 min at 1300 rpm on an orbital shaker (IKA VXR basic Vibrax®, IKA®-Werke GmbH & Co. KG, Staufen, Germany) and centrifuged (3000xg, 10 min, RT). 200 µL of the organic phase layer was then transferred to an injection vial. Injection volume on the GC-FID was 2 µl with an additional split ratio of 1:10. A final volume of 0.1 µL was injected into the column. The GC run settings were as follows: (i) 1 min at 80 °C, (ii) raised by 10 °C /min to 200 °C, (iii) hold time of 5 min, (iv) raised by 25 °C/min, to a temperature of 230 °C, (v) 5 min hold time. The total run time was 24.2 min. The carrier gas was nitrogen at a flow rate of 1.32 ml/min. A 6-point calibration curve was prepared from standards in MTBE: acetic acid (2.47–600 mM), propionic acid (0.49–120 mM), iso-butyric acid (0.25-60 mM), butyric acid (0.99–240 mM), isovaleric acid (0.25-60 mM), valeric acid (0.25-60 mM), 4-methyl valeric acid (isohexanoic/isocaproic acid) (0.25-60 mM), hexanoic acid (0.25-60 mM), heptanoic acid (0.25-60 mM), and IS was set at 40 mM. The fatty acids were identified using a reference standard mix (article CRM46975, Sigma-Aldrich). This reference mix was also used to verify the accuracy of the calibration curve, which showed a deviation of < 5% for all compounds. A pooled faecal sample was analysed in duplicate in each analytical batch to monitor the quality of the analyses. The inter-batch CV% values ranged between 6 and 14% for all compounds. Concentrations of the SCFAs were calculated per mg of wet weight.
Untargeted metabolomics
Approximately 80 mg of freeze-dried mill-homogenised faeces was used for untargeted metabolomics analysis using liquid chromatography–mass spectrometry (LC-MS). UHPLC-Q-Orbitrap-MS/MS analysis was conducted using an Vanquish UHPLC (Thermo Fisher Scientific, Carlsbad, CA, USA) connected to a Q-Exactive Focus™ mass spectrometer (Thermo Scientific, Sunnyvale, CA, USA) with heated electrospray ionization (HESI-II; Thermo Fisher Scientific). The stool extracts were chromatographically separated on an Waters ACQUITY HSS T3 UPLC column (2.1 × 100 mm, 3.0 μm) using a mobile phase consisting of 0.1% formic acid/water (A) and acetonitrile (B). Gradient elution was performed as: 0–1 min, 5% B; 1–8 min, 5–95% B; 8–11 min, 95% B; 11–12, 95 − 5% B and 12–15 min, 5% B. The column was maintained at 40 °C, the flow rate was 0.4 mL/min and the injection volume was 2 µL. The mass spectrum data was acquired in polarity switching mode positive and negative ion mode through full MS. Representative samples were also acquired using data-dependent MS/MS. Control software used was Xcalibur, version 4.2.27. The scan parameters were set as follows: scan range, 80 to 1000 m/z; resolution, 70000; AGC target, 1 × 106; maximum inject time, 100 ms. The following parameters were used for the heated electrospray ionization source (HESI source): spray voltage, +/-3.5 kV for ESI − and ESI + in polarity switching mode; sheath gas flow rate, 40 arb; auxiliary gas flow rate, 10 arb; sweep gas flow rate, 0 arb; capillary temperature, 320 °C.
Six quality controls (QC), which consisted of a pool of the faecal samples from the participants, were analysed to ensure precision and stability of the LC-MS machine during the whole process of injection. QC samples were injected every after 10 sample injections and was used to assess chromatographic drift and instrumental errors. Features were extracted using the raw spectra as import in open-source software104. The detailed processing settings can be found in Supplementary Table 7. RANSAC (RANdom SAmple Consensus) alignment of features across all samples was performed. Gap filling was performed on the aligned feature list using the peak finder option. The raw data files including the ID, m/z, retention time, and peak area of the positive- and negative polarity mode features were exported for further pre-processing in MetaboAnalyst 5.0 (https://www.metaboanalyst.ca/). The features with more than 20% of missing values were removed. The remaining missing values were imputed with the limit of detection values using 1/5 of the minimum positive value for each variable. Features with a relative standard deviation (RSD = SD/mean) of over 20% in the quality control samples were removed. Additionally, 40% of the features was filtered using IQR filter. The data pre-processing flowchart of positive and negative polarity modes is shown in Supplementary Fig. 2. Data was normalized using normalization by sum, log transformation with base 10, and auto scaling. Principal Component Analysis (PCA) was performed on the normalized and scaled metabolomics data. To analyse the multivariate homogeneity of group dispersions (variances), the normalized data were subjected to the vegdist function to compute the Euclidean distance matrix, and the betadisper function to assess the variability of sample distances from their group centroids, both from the vegan package. An ANOVA like permutation test with 999 permutations was then performed to determine if significant differences in dispersion existed between the hammered-only and mill-homogenised groups.
Faecal DNA isolation
Faecal DNA was extracted using the DNeasy PowerSoil Pro Kit (Qiagen, Venlo, the Netherlands) according to the manufacturer’s instructions. Briefly, approximately 250 ± 20 mg of faeces was added to PowerBead Pro tubes, followed by the addition of the provided lysis buffer. The samples were subjected to mechanical disruption using a bead-beating step to ensure efficient cell lysis. Homogenisation was performed with use of the Qiagen TissueLyser II MM400 (Retsch GmbH, Haan, Germany). After centrifugation, the supernatant was processed through a series of purification steps, including inhibitor removal and DNA binding to a silica membrane. The DNA was then eluted in 50 µL elution buffer and stored at -20 °C until further analysis. DNA concentration and purity were assessed using a NanoDrop spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). Intra-individual CV% of the DNA concentration per gram of wet weight faeces over three days ranged between 3 and 38% (mean 11.0 ± 10.2%).
Total bacteria and total fungi quantification
The total copy number of the 16 S rRNA gene, indicative for the total bacteria, and the total copy number of the 18 S rRNA gene, indicative for the total fungi, were quantified using digital droplet PCR (ddPCR, QX200™ Droplet Digital™ PCR System, Bio-Rad Laboratories, CA). All materials and apparatus for the assay were obtained from Bio-Rad Laboratories. Before analysis, the DNA stocks were diluted 2000 times for fungi analyses, and 100,000 times for bacteria analyses. The primers and probes for the analysis of total bacteria106 and total fungi107 were used as described previously106,107. The PCR reaction mix contained 10 µL QX200™ ddPCR™ Supermix for Probes (no UTP), 1 µL forward primer (18 µM), 1 µL reverse primer (18 µM), 1 µL probe (5 µM), 2 µL nuclease free water, and 5 µL diluted DNA. The final volume of 20 µL was transferred to the DG8 cartridge. Droplet Generator Oil (70 µL) for EvaGreen were also added and the cartridge was covered with DG8 gasket, to generate droplets in the droplet generator (Bio-Rad). The generated droplets (40 µL) were transferred to the PCR plate. The plate was sealed at 170 °C for 5 s (PX1 PCR Plate Sealer, Bio-Rad). Then, the PCR reaction took place in the thermos T100 Thermal Cycler. The cycling protocol was as follows: 95 °C for 10 min; 50 cycles of 94 °C for 30 s and 58 °C for 2.5 min; 98 °C for 10 min; and 4 °C for 30 min. The plate was placed in the Droplet Reader (Bio-Rad). QuantaSoft Analysis Pro was used for the data analysis.
Microbiota composition and predicted functionality
The variable V4 region of the 16 S rRNA gene was amplified in a PCR reaction using barcoded primers, and afterwards PCR products were purified, quantified and pooled in equimolar concentrations as described in detail elsewhere108. The PCR products were subjected to paired-end sequencing using the Illumina Novaseq6000 platform (Novogene, Cambridge, UK). The sequencing data was processed using NG-Tax 2.0 using default settings109. Raw sequencing data were demultiplexed by trimming barcodes and primer sequences. The SILVA reference database version 138 was used to assign taxonomy. R version 4.2.1 was used for the microbiota analyses. Mitochondrial filtering was applied. In total, 1391 amplicon sequencing variants (ASVs) were detected, and 216 genera. The median number of reads per sample was 194,292 (IQR 152,269, 219,321). Raw counts were transformed to relative abundance for intra- and inter-individual variability analyses. The alpha-diversity, the within-sample diversity, was calculated on the read counts using the microbiome package using the indexes Inverse Simpson (considers number of species and evenness), Shannon (considers abundance and evenness), and Fisher (associated with the number of species). Also the Faith phylogenetic diversity, which is based on branch length connecting taxa in those samples and the root node of the phylogenetic tree, was calculated using the picante package. Beta-diversity, the between-sample diversity, using proportion normalized abundances of all amplicon sequence variants using weighted UniFrac distance as implemented in the phyloseq package to determine overall microbiota differences, and visualised using Principal Coordinate Analysis (PCoA). Permutational multivariate analysis of variance (PERMANOVA) as implemented in the vegan package was used to determine differences in the overall microbiota community between milled and non-milled faeces. Principal Coordinate Analysis (PCoA) with Weighted UniFrac using the microbiota data on ASV level was performed using the R package phyloseq. The microbiota functional pathways related to SCFA production were predicted based on the 16 S rRNA gene sequences using the phylogenetic investigation of communities by reconstruction of unobserved states algorithm version 2.5.0 (PICRUSt2) with default settings110, but with an adjusted minimum alignment of 50%. The predicted pathways and enzymes were transformed to relative abundance. Pathways classes and enzymes related to SCFA production were selected from the MetaCyc database using Pathway Tools version 26.0111.
Statistical analysis
Statistical analyses were performed in R version 4.2.1. Test-retest reliability of markers over the three timepoints were assessed using intra-class correlation coefficients (ICC). The ICC evaluates the variance as the variance within individuals in relation to the total variance, where total variance is the variance within and between individuals. For our analysis, the ICCs were computed using R package irr with a two-way mixed effects model using absolute agreement between timepoints based on single measurements112; formula = \(\:\frac{MSsubjects-MSresidual,}{MSsubject\:\:\:\:}\) where the MSsubjects indicates the mean squares due to the subjects as is calculated by comparing the variance of subject means to the grand mean, and MSresidual indicates the residual mean square reflecting variance (represents the within-subject variability). ICC values indicated test-retest reliability as follows: < 0.5 poor, 0.5 to 0.75 moderate, 0.75 to 0.9 good, and > 0.90 excellent112. Pairwise Pearson’s or Spearman’s correlations, depending on the normality of distribution, were calculated to compare all markers between the three timepoints within individual, between the milled and non-milled faeces, and between technical replicates. Coefficient of variation (CV%, formula: [(σ / µ) × 100%]) were calculated between consecutive timepoints within individuals (CV%intra) and between individuals (CV%inter) per timepoint, and within and between replicates from the pooled faeces, either hammered-only or mill-homogenised, quality controls. A cut-off of 30% was considered as high biological variation. Statistical significance was accepted as P < 0.05.
Data availability
Data analysed during this study is provided within the manuscript and the supporting supplementary data files (supplementary material 2, supplementary material 3, supplementary material 4, supplementary material 5). Raw 16 S rRNA gene sequences files were deposited in public repository NCBI under BioProject accession number PRJNA1162476.
References
Ahlawat, S., Asha & Sharma, K. K. Gut-organ axis: A microbial outreach and networking. Lett. Appl. Microbiol. 72(6), 636–668 (2021).
Lama Tamang, R. et al. The diet-microbiota axis: A key regulator of intestinal permeability in human health and disease. Tissue Barriers. 2077069 (2022).
Ghosh, T. S. et al. Adjusting for age improves identification of gut microbiome alterations in multiple diseases. Elife 9(2020).
Durack, J. & Lynch, S. V. The gut microbiome: Relationships with disease and opportunities for therapy. J. Exp. Med. 216(1), 20–40 (2019).
He, Y. et al. Regional variation limits applications of healthy gut microbiome reference ranges and disease models. Nat. Med. 24(10), 1532–1535 (2018).
Yatsunenko, T. et al. Human gut microbiome viewed across age and geography. Nature. 486(7402), 222–227 (2012).
Asnicar, F. et al. Microbiome connections with host metabolism and habitual diet from 1,098 deeply phenotyped individuals. Nat. Med. 27(2), 321–332 (2021).
Campaniello, D. et al. How diet and physical activity modulate gut microbiota: Evidence, and perspectives. Nutrients 14(12). (2022).
Huang, X. et al. Dietary variety relates to gut microbiota diversity and abundance in humans. Eur. J. Nutr.(2022).
Aya, V. et al. Association between physical activity and changes in intestinal microbiota composition: A systematic review. PLoS One. 16(2), e0247039 (2021).
Dziewiecka, H. et al. Physical activity induced alterations of gut microbiota in humans: A systematic review. BMC Sports Sci. Med. Rehabil. 14(1), 122 (2022).
Vich Vila, A. et al. Impact of commonly used drugs on the composition and metabolic function of the gut microbiota. Nat. Commun. 11(1), 362 (2020).
Walsh, J. et al. Drug-gut microbiota interactions: Implications for neuropharmacology. Br. J. Pharmacol. 175(24), 4415–4429 (2018).
Wilmanski, T. et al. Gut microbiome pattern reflects healthy ageing and predicts survival in humans. Nat. Metab. 3(2), 274–286 (2021).
Wu, Y. L. et al. Gut microbiota alterations and health status in aging adults: From correlation to causation. Aging Med. (Milton). 4(3), 206–213 (2021).
Manor, O. et al. Health and disease markers correlate with gut microbiome composition across thousands of people. Nat. Commun. 11(1), 5206 (2020).
Procházková, N. et al. Advancing human gut microbiota research by considering gut transit time. Gut(2022).
Vandeputte, D. et al. Stool consistency is strongly associated with gut microbiota richness and composition, enterotypes and bacterial growth rates. Gut. 65(1), 57–62 (2016).
Tropini, C. How the physical environment shapes the microbiota. mSystems. 6(4), e0067521 (2021).
Zhernakova, A. et al. Population-based metagenomics analysis reveals markers for gut microbiome composition and diversity. Science. 352(6285), 565–569 (2016).
Hammer, J. & Phillips, S. F. Fluid loading of the human colon: Effects on segmental transit and stool composition. Gastroenterology. 105(4), 988–998 (1993).
Lewis, S. J. & Heaton, K. W. Stool form scale as a useful guide to intestinal transit time. Scand. J. Gastroenterol. 32(9), 920–924 (1997).
Blake, M. R., Raker, J. M. & Whelan, K. Validity and reliability of the Bristol Stool Form Scale in healthy adults and patients with diarrhoea-predominant irritable bowel syndrome. Aliment. Pharmacol. Ther. 44(7), 693–703 (2016).
Nordin, E. et al. Modest conformity between self-reporting of Bristol Stool Form and fecal consistency measured by Stool Water Content in Irritable Bowel Syndrome and a FODMAP and Gluten Trial. Am. J. Gastroenterol. 117(10), 1668–1674 (2022).
Galazzo, G. et al. How to count our microbes? The effect of different quantitative microbiome profiling approaches. Front. Cell. Infect. Microbiol. 10, 403 (2020).
Knight, R. et al. Best practices for analysing microbiomes. Nat. Rev. Microbiol. 16(7), 410–422 (2018).
Vandeputte, D. et al. Quantitative microbiome profiling links gut community variation to microbial load. Nature. 551(7681), 507–511 (2017).
Gutierrez, M. W. & Arrieta, M. C. The intestinal mycobiome as a determinant of host immune and metabolic health. Curr. Opin. Microbiol. 62, 8–13 (2021).
Shuai, M. et al. Mapping the human gut mycobiome in middle-aged and elderly adults: multiomics insights and implications for host metabolic health. Gut(2022).
Zhang, L. et al. The role of gut mycobiome in health and diseases. Ther. Adv. Gastroenterol. 14, 17562848211047130 (2021).
Chen, H. et al. A Forward Chemical genetic screen reveals gut microbiota metabolites that modulate host physiology. Cell. 177(5), 1217–1231e18 (2019).
Krautkramer, K. A., Fan, J. & Bäckhed, F. Gut microbial metabolites as multi-kingdom intermediates. Nat. Rev. Microbiol. 19(2), 77–94 (2021).
Schroeder, B. O. & Bäckhed, F. Signals from the gut microbiota to distant organs in physiology and disease. Nat. Med. 22(10), 1079–1089 (2016).
Visconti, A. et al. Interplay between the human gut microbiome and host metabolism. Nat. Commun. 10(1), 4505 (2019).
Zheng, X., Cai, X. & Hao, H. Emerging targetome and signalome landscape of gut microbial metabolites. Cell. Metab. 34(1), 35–58 (2022).
Blaak, E. et al. Short chain fatty acids in human gut and metabolic health. Benef. Microbes. 11(5), 411–455 (2020).
Rios-Covian, D. et al. An overview on fecal branched short-chain fatty acids along human life and as related with body mass index: Associated dietary and anthropometric factors. Front. Microbiol. 11, 973 (2020).
Ramos Meyers, G., Samouda, H. & Bohn, T. Short chain fatty acid metabolism in relation to gut microbiota and genetic variability. Nutrients 14(24) (2022).
Yamamura, R. et al. Intestinal and fecal pH in human health. Front. Microbiomes 2(2023).
Firrman, J. et al. The impact of environmental pH on the gut microbiota community structure and short chain fatty acid production. FEMS Microbiol. Ecol. 98(5). (2022).
Procházková, N. et al. Gut environmental factors explain variations in the gut microbiome composition and metabolism within and between healthy adults. bioRxiv 2024.01.23.574598. (2024).
LaBouyer, M. et al. Higher total faecal short-chain fatty acid concentrations correlate with increasing proportions of butyrate and decreasing proportions of branched-chain fatty acids across multiple human studies. Gut Microbiome 3(2022).
Prata, M. M. G. et al. Comparisons between myeloperoxidase, lactoferrin, calprotectin and lipocalin-2, as fecal biomarkers of intestinal inflammation in malnourished children. J. Transl. Sci. 2(2), 134 (2016).
Gubatan, J. et al. Antimicrobial peptides and the gut microbiome in inflammatory bowel disease. World J. Gastroenterol. 27(43), 7402 (2021).
Padoan, A. et al. Improving IBD diagnosis and monitoring by understanding preanalytical, analytical and biological fecal calprotectin variability. Clin. Chem. Lab. Med. 56(11), 1926–1935 (2018).
Hansberry, D. R. et al. Fecal myeloperoxidase as a biomarker for inflammatory bowel disease. Cureus. 9(1), e1004 (2017).
Johnson, A. J. et al. Daily sampling reveals personalized diet-microbiome associations in humans. Cell. Host Microbe. 25(6), 789–802e5 (2019).
Vandeputte, D. et al. Temporal variability in quantitative human gut microbiome profiles and implications for clinical research. Nat. Commun. 12(1), 6740 (2021).
Vork, L. et al. Does day-to-day variability in Stool consistency link to the fecal microbiota composition? Front. Cell. Infect. Microbiol. 11, 639667 (2021).
David, L. A. et al. Host lifestyle affects human microbiota on daily timescales. Genome Biol. 15(7), R89 (2014).
Turroni, S. et al. Temporal dynamics of the gut microbiota in people sharing a confined environment, a 520-day ground-based space simulation, MARS500. Microbiome. 5(1), 39 (2017).
Venkataraman, A. et al. Variable responses of human microbiomes to dietary supplementation with resistant starch. Microbiome. 4(1), 33 (2016).
McOrist, A. L. et al. Bacterial population dynamics and faecal short-chain fatty acid (SCFA) concentrations in healthy humans. Br. J. Nutr. 100(1), 138–146 (2008).
Karu, N. et al. A review on human fecal metabolomics: Methods, applications and the human fecal metabolome database. Anal. Chim. Acta. 1030, 1–24 (2018).
Hosseinkhani, F. et al. Towards standards for human fecal sample preparation in targeted and untargeted LC-HRMS studies. Metabolites 11(6). (2021).
Johnson, A. J. et al. A guide to diet-microbiome study design. Front. Nutr. 7, 79 (2020).
De Saedeleer, B. et al. Systematic characterization of human gut microbiome-secreted molecules by integrated multi-omics. ISME Commun. 1, 82 (2021).
Deda, O. et al. Sample preparation optimization in fecal metabolic profiling. J. Chromatogr. B Anal. Technol. Biomed. Life Sci. 1047, 115–123 (2017).
Hassani, E. M. E. Optimized sample preparation for fecal volatile organic compound analysis by gas chromatography-mass spectrometry. Metabolomics. 16(10), 112 (2020).
Gratton, J. et al. Optimized sample handling strategy for metabolic profiling of human feces. Anal. Chem. 88(9), 4661–4668 (2016).
Lamichhane, S. et al. Gut metabolome meets microbiome: A methodological perspective to understand the relationship between host and microbe. Methods. 149, 3–12 (2018).
Liang, Y. et al. Systematic analysis of impact of sampling regions and Storage methods on fecal gut microbiome and metabolome profiles. mSphere 5(1). (2020).
Trošt, K. et al. Describing the fecal metabolome in cryogenically collected samples from healthy participants. Sci. Rep. 10(1), 885 (2020).
Hsieh, Y. H. et al. Impact of different fecal Processing methods on assessments of bacterial diversity in the human intestine. Front. Microbiol. 7, 1643 (2016).
Jones, J. et al. Fecal sample collection methods and time of day impact microbiome composition and short chain fatty acid concentrations. Sci. Rep. 11(1), 13964 (2021).
Wu, W. K. et al. Optimization of fecal sample processing for microbiome study - The journey from bathroom to bench. J. Formos. Med. Assoc. 118(2), 545–555 (2019).
Bruce, K. et al. A Practical Guide to DNA-based Methods for Biodiversity Assessment(Pensoft Advanced Books, 2021).
Gorzelak, M. A. et al. Methods for improving human gut microbiome data by reducing variability through sample processing and storage of stool. PLOS ONE. 10(8), e0134802 (2015).
Chumpitazi, B. P. et al. Bristol Stool Form Scale reliability and agreement decreases when determining Rome III stool form designations. Neurogastroenterol.Motil. 28(3), 443–448 (2016).
Siigur, U. et al. Concentrations and correlations of faecal short-chain fatty acids and faecal water content in man. Microb. Ecol. Health Dis. 7(6), 287–294 (1994).
Hallen-Adams, H. E. Fungi inhabiting the healthy human gastrointestinal tract: A diverse and dynamic community. Fungal Ecol. 15, 9–17 (2015).
Hallen-Adams, H. E. & Suhr, M. J. Fungi in the healthy human gastrointestinal tract. Virulence. 8(3), 352–358 (2017).
Raimondi, S. et al. Longitudinal survey of Fungi in the human gut: ITS profiling, phenotyping, and colonization. Front. Microbiol. 10, 1575 (2019).
Auchtung, T. A. et al. Investigating colonization of the healthy adult gastrointestinal tract by fungi. mSphere. 3(2) (2018).
Underhill, D. M. & Iliev, I. D. The mycobiota: Interactions between commensal fungi and the host immune system. Nat. Rev. Immunol. 14(6), 405–416 (2014).
David, L. A. et al. Diet rapidly and reproducibly alters the human gut microbiome. Nature. 505(7484), 559–563 (2014).
Stephen, A. M., Wiggins, H. S. & Cummings, J. H. Effect of changing transit time on colonic microbial metabolism in man. Gut. 28(5), 601–609 (1987).
Daniel, H. Diet and gut microbiome and the chicken or egg problem. Front. Nutr. 8(2022).
Tibble, J. et al. A simple method for assessing intestinal inflammation in Crohn’s disease. Gut. 47(4), 506–513 (2000).
Kristensen, V. et al. Clinical importance of faecal calprotectin variability in inflammatory bowel disease: Intra-individual variability and standardisation of sampling procedure. Scand. J. Gastroenterol. 51(5), 548–555 (2016).
Cremer, A. et al. Variability of faecal calprotectin in inflammatory bowel disease patients: An observational case-control study. J. Crohn’s Colitis. 13(11), 1372–1379 (2019).
Lasson, A. et al. The intra-individual variability of faecal calprotectin: A prospective study in patients with active ulcerative colitis. J. Crohn’s Colitis. 9(1), 26–32 (2014).
D’Amico, F. et al. International consensus on methodological issues in standardization of fecal calprotectin measurement in inflammatory bowel diseases. United Eur. Gastroenterol. J. 9(4), 451–460 (2021).
Swaminathan, A. et al. Faecal myeloperoxidase as a biomarker of endoscopic activity in inflammatory bowel disease. J. Crohn’s Colitis. 16(12), 1862–1873 (2022).
Olsson, L. M. et al. Dynamics of the normal gut microbiota: A longitudinal one-year population study in Sweden. Cell. Host Microbe. 30(5), 726–739e3 (2022).
Zhou, X. et al. Longitudinal profiling of the microbiome at four body sites reveals core stability and individualized dynamics during health and disease. Cell. Host Microbe. 32(4), 506–526e9 (2024).
Gupta, R. S. & Gao, B. Phylogenomic analyses of clostridia and identification of novel protein signatures that are specific to the genus Clostridiumsensu stricto (cluster I). Int. J. Syst. Evol. MicroBiol. 59(2), 285–294 (2009).
Li, C. J. et al. Comparative genomic analysis and proposal of Clostridium yunnanense sp. nov., Clostridium rhizosphaerae sp. nov., and Clostridium paridis sp. nov., three novel Clostridium sensu stricto endophytes with diverse capabilities of acetic acid and ethanol production. Anaerobe. 79, 102686 (2023).
Wiegel, J., Tanner, R. & Rainey, F. A. The Prokaryotes Volume 4. Bacteria: Firmicutes, Cyanobacteria. An Introduction to the Family Clostridiaceae.(ed. Dworkin, M.). Vol. 4. (Springer)
Liu, L. et al. Clostridium butyricum potentially improves immunity and nutrition through alteration of the microbiota and metabolism of elderly people with malnutrition in long-term care. Nutrients 14(17). (2022).
Sun, Y. Y. et al. The effect of Clostridium butyricum on symptoms and fecal microbiota in diarrhea-dominant irritable bowel syndrome: A randomized, double-blind, placebo-controlled trial. Sci. Rep. 8(1), 2964 (2018).
Geerlings, S. Y. et al. Akkermansia muciniphila in the human gastrointestinal tract: When, where, and how? Microorganisms 6(3). (2018).
Zhang, T. et al. Akkermansia muciniphila is a promising probiotic. Microb. Biotechnol. 12(6), 1109–1125 (2019).
Ezzy, A. C. et al. Storage and handling of human faecal samples affect the gut microbiome composition: a feasibility study. J. Microbiol. Methods. 164, 105668 (2019).
Gangadoo, S. et al. The multiomics analyses of fecal matrix and its significance to coeliac disease gut profiling. Int. J. Mol. Sci. 22(4) (2021).
Ostapowicz, J. et al. Comparison of Tumour Tissue Homogenisation Methods: Mortar and Pestle Versus Ball Mill.(2023).
Siddiqui, I., Majid, H. & Abid, S. Update on clinical and research application of fecal biomarkers for gastrointestinal diseases. World J. Gastrointest. Pharmacol. Ther. 8(1), 39–46 (2017).
Ueyama, J. et al. Freeze-drying enables homogeneous and stable sample preparation for determination of fecal short-chain fatty acids. Anal. Biochem. 589, 113508 (2020).
McOrist, A. L. et al. Fecal butyrate levels vary widely among individuals but are usually increased by a diet high in resistant starch. J. Nutr. 141(5), 883–889 (2011).
Roager, H. M. & Dragsted, L. O. Diet-derived microbial metabolites in health and disease. Nutr. Bull. 44(3), 216–227 (2019).
Asnicar, F. et al. Blue poo: Impact of gut transit time on the gut microbiome using a novel marker. Gut. 70(9), 1665–1674 (2021).
Million, M. et al. Increased gut Redox and depletion of anaerobic and methanogenic prokaryotes in severe acute malnutrition. Sci. Rep. 6, 26051 (2016).
Lotti, C. et al. Development of a fast and cost-effective gas chromatography–mass spectrometry method for the quantification of short-chain and medium-chain fatty acids in human biofluids. Anal. Bioanal. Chem. 409(23), 5555–5567 (2017).
Tsugawa, H. et al. A lipidome atlas in MS-DIAL 4. Nat. Biotechnol. 38(10), 1159–1163 (2020).
Schmid, R. et al. Integrative analysis of multimodal mass spectrometry data in MZmine 3. Nat. Biotechnol. 41(4), 447–449 (2023).
Liu, C. M. et al. BactQuant: an enhanced broad-coverage bacterial quantitative real-time PCR assay. BMC Microbiol. 12(1), 56 (2012).
Liu, C. M. et al. FungiQuant: a broad-coverage fungal quantitative real-time PCR assay. BMC Microbiol. 12(1), 255 (2012).
Rios-Morales, M. et al. A toolbox for the comprehensive analysis of small volume human intestinal samples that can be used with gastrointestinal sampling capsules. Sci. Rep. 11(1), 1–14 (2021).
Poncheewin, W. et al. NG-Tax 2.0: a semantic framework for high-throughput amplicon analysis. Front. Genet. 10, 1366 (2020).
Langille, M. G. et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat. Biotechnol. 31(9), 814–821 (2013).
Caspi, R. et al. The MetaCyc database of metabolic pathways and enzymes. Nucleic Acids Res. 46(D1), D633–D639 (2017).
Koo, T. K. & Li, M. Y. A Guideline of selecting and reporting Intraclass correlation coefficients for Reliability Research. J. Chiropr. Med. 15(2), 155–163 (2016).
Acknowledgements
First, we thank all volunteers who donated stool samples for this study. We would like to thank the Gut Health Club colleagues Anna Malinowska and Marie-Luise Puhlmann for the input on the design of the study. Also we thank Soumya Kar for the input on the set-up of the study. We would like to thank Geert Althuizen, Yueyi Zhong, and Panos Fourmouzis for assisting with the laboratory and data analysis. We would also like to thank Maria-João Paulo for her advice on statistical analysis. But above all, we would like to thank our beloved mentor Wilma Steegenga, who passed away before this manuscript was finalised. She initiated and set-up this study, which she was so passionate about, and she inspired us all with her expertise, scientific rigor, and ambition. She is immensely missed. This project was partially funded by the Horizon 2020 Framework Programme of the European Union (ITN SmartAge, H2020-MSCA-ITN-2019-859890). The contents of this publication are the sole responsibility of the authors and do not necessarily reflect the opinion of the European Union. K.K. and Y.M were funded by the Horizon 2020 Framework Programme of the European Union (ITN SmartAge, H2020-MSCA-ITN-2019-859890). M.P.H.v.T. was funded by a Crossover grant (MOCIA 17611) of the Dutch Research Council (NWO). The MOCIA programme is a public-private partnership (see https://mocia.nl/scientific/).
Author information
Authors and Affiliations
Contributions
Conceptualization, W.T.S, D.E.K, G.J.H, R.C.H.d.V, G.B.G. and M.P.H.v.T.; Methodology and Investigation, K.K., Y.M., N.v.d.W., S.K., R.C.H.d.V, M.G.B., K.M., M.D.d.B, M.G.J.B., N.L., G.B.G, W.T.S., and M.P.H.v.T.; Funding Acquisition, L.C.P.G.M.d.G. and W.T.S.; Supervision, L.C.P.G.M.d.G. and Y.V.; Writing original draft, K.K., M.P.H.v.T., and W.T.S.; Writing- Review and editing - All authors. All authors, except W.T.S. who passed away before finalization of the manuscript, approved the final version.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Kruger, K., Myeonghyun, Y., van der Wielen, N. et al. Evaluation of inter- and intra-variability in gut health markers in healthy adults using an optimised faecal sampling and processing method. Sci Rep 14, 24580 (2024). https://doi.org/10.1038/s41598-024-75477-z
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-024-75477-z
This article is cited by
-
Multiomics: the intersection of personalized nutrition in cardiometabolic diseases
Journal of Translational Medicine (2026)
-
Effects of Short-Chain Fatty Acid Combinations Relevant to the Healthy and Dysbiotic Gut upon Candida albicans
Current Microbiology (2025)






