Introduction

Heart rate variability (HRV) – the fluctuation in the time intervals between consecutive heartbeats – is an indirect measure of autonomic nervous system (ANS) regulation. Its significance in both clinical and non-clinical areas has grown considerably since its early application in fetal monitoring and post-myocardial infarction care in the 70s and 80s1,2. Since then, HRV has been explored in diverse areas from mental health and sleep disorders to sports performance, as it provides a valuable window into an individual’s physiological and psychological states, including stress levels, recovery capacity, and overall health. In clinical settings, it serves as a sensitive biomarker in conditions such as sleep disorders3,4 and stress-related illnesses, depression, and anxiety5,6. Outside of strictly clinical applications, HRV has gained traction as a tool for monitoring training load, recovery, and readiness in athletes7,8. Recent research further underscores HRV’s pivotal role in capturing the dynamic interplay between the brain and heart during cognitive and emotional processes: error-related cardiac deceleration, or the transient slowing of the heart following an error, illuminates the tight coupling between performance monitoring and autonomic reactivity9, while fear-induced bradycardia has emerged as a psychophysiological measure of defensive responding in fear conditioning paradigms, offering fresh insight into stress-related and psychopathological conditions10,11. Together, these findings suggest that HRV—and more broadly cardiac variability—transcends a simple risk marker, instead serving as a key index of brain–body integration. Disruptions in this integration may underlie various psychiatric, physiologic and neurological disorders12,13.

Despite its wide utility, HRV analysis is not without challenges. One significant obstacle is that HRV can vary considerably depending on a host of psychological, physiological, methodological, and environmental factors. Over the past few decades, researchers have attempted to establish consensus guidelines for HRV data collection and interpretation—most notably through the foundational Task Force report in 1996 and more recently by Quigley and al. —but the field still lacks a universally accepted standard14,15. For instance, there remains debate over issues like the optimal timing of data collection (e.g., upon awakening, during nighttime, or pre/post-exercise or intervention), the duration of recordings (e.g. 5-minute “short-term” vs. 24-hour monitoring), and the body position to be used (e.g. supine, standing, or seated)15,16,17. Each of these factors can influence HRV metrics, leading to inconsistencies and difficulties in comparing results across studies18,19,20,21. Hence, reliable and feasible protocols are urgently needed to enhance the robustness and reliability of HRV research.

One promising strategy is the use of short-term HRV measures – often ≤ 5 min – that balance practical feasibility with sufficient diagnostic and research value22. A shorter recording period reduces participant burden, is easier to replicate in both clinical and athletic contexts, and may be less prone to artifacts introduced by movement or posture changes14,23. As portable technologies advance, short-term HRV measurements have become increasingly accessible, allowing assessments to be performed even in non-traditional settings such as homes, workplaces, or sports field. However, while short-term assessments are convenient, they still necessitate careful consideration of methodological variables to ensure consistency.

In this regard, a dual-position HRV protocol has gained particular attention24,25. This approach captures HRV in both supine and standing positions to account for the physiological challenges associated with each posture26. In the supine position, the cardiovascular system experiences minimal gravitational stress, generally resulting in higher parasympathetic activity. Conversely, standing requires orthostatic adaptation and typically increases sympathetic outflow to maintain blood pressure26. Monitoring HRV in both positions thus provides a more comprehensive snapshot of ANS function, capturing a balance between parasympathetic and sympathetic responses. Studies have suggested that this dual-position method is particularly useful for differentiating types of fatigue, as it allows researchers to observe changes in low-frequency (LF) and high-frequency (HF) bands under two contrasting autonomic loads24,27,28. If administered with rigorous standardization—ideally at consistent times, such as upon awakening—the dual-position protocol has the potential to yield reliable and context-rich data that inform tailored interventions in both healthcare and sport.

Nevertheless, questions remain about the stability and generalizability of these measurements across different environments and populations. Laboratory-based assessments offer controlled conditions, but the resulting data may not fully reflect an individual’s daily life, particularly when measurement timing varies29. Conversely, home-based assessments provide ecological validity and convenience, but introduce potential uncontrolled variables, such as room temperature. Additional factors including participant familiarity with the measuring device and adherence to pre-measurement guidelines (e.g., abstaining from caffeine) can further influence results12,30,31. These considerations highlight the importance of reliability studies designed to examine how HRV measures vary under different conditions and over time, particularly when applying the dual-position approach.

Against this backdrop, the present study aims to address a gap in the literature regarding the reliability and interpretative value of a standardized morning dual-position HRV protocol. Building on recent efforts to streamline short-term HRV assessments, our work investigates the performance of these measurements in healthy, physically active individuals across both at-home and laboratory settings. We hypothesize that this dual-position protocol will provide reliable indicators of autonomic function, regardless of the data collection environment, while also revealing trivial day-to-day variability. Specifically, our study focused on four main objectives: (1) assessing overall short-term HRV reliability over a 24-hour period; (2) analyzing day-to-day variance; (3) comparing at-home and in-laboratory measurements performed on the same day; and (4) evaluating short-term variance in lab conditions. By measuring HRV across multiple time points and settings, we aimed to assess whether the protocol reliably captures shifts in HRV between supine and standing positions, while accounting for potential effects of environmental stressors and participant comfort. If proven valid and reliable, this dual-position method could guide individualized interventions in healthcare and sports, enabling clinicians to track cardiac rehabilitation progress and coaches to optimize training and recovery. Moreover, establishing HRV reliability in a young, active population also establishes a basis the way for future investigations in clinical cohorts, with impaired autonomic regulation. This research aims to examine the reliability and applicability of this method in realistic scenarios. The findings will contribute to an evolving dialogue on HRV measurement best practices, potentially paving the way for broader adoption of dual-position assessments in routine clinical practice and athletic monitoring.

Results

Participants

Out of the initial 43 participants recruited for this study, 34 successfully completed five HRV measurements (measure A, B, C, D, E) under the designated conditions and were included in the final analysis. Nine participants were excluded due to non-compliance with the experimental protocol, incomplete measurements, symptomatic responses (orthostatism), or technical issues that affected data integrity. Additionally, three standing measurements from the included 34 participants were excluded due to similar reasons. Detailed justifications for these exclusions are provided in the supplementary materials. Figure 1 presents the study design. All measurements were conducted during two consecutive mornings, both at home and in the laboratory, with an average interval of 1h01 ± 15 min on the first day and 1h03 ± 19 min on the second day. Table 1 outlines the characteristics of the participants and the timing of the measurements. Importantly, none of the participants exhibited significant fatigue scores on the Profile of Mood States (POMS) questionnaire, ensuring the homogeneity of the fatigue state of the population studied.

Table 1 Gender repartition, age, weight, and height of participants with POMS results and time of measurement. Data are presented as mean ± SD. TMD, total mood disturbance; POMS, profile of mood state.
Fig. 1
figure 1

Study design.

Descriptive statistics

Absolute and ln-transformed data are presented in Tables 2 and 3. As expected, all absolute variables showed a skewed distribution. Only the standard deviation of normal-to-normal (NN) intervals (SDNN) standing was normally distributed. Of note, the ln-transformation aimed at normalizing the distribution failed for several variables (Total power (TP), low frequency (LF) normalized units (LFnu), high frequency (HF) normalized units (HFnu) supine, and heart rate (HR), very low frequency (VLF), ratio between LF and HF (LF/HF), LFnu standing). Logically, distinct physiological responses were observed between supine and standing positions suggesting substantial postural influences on autonomic balance26. Our HRV values are consistent with established norms for healthy, active individuals32. Environmental differences between at-home and in-lab settings also impacted HRV readings, suggesting that the measurement environment plays a substantial role in autonomic activity (see repeated measures analysis below).

Table 2 Descriptive statistics for absolute HRV metrics across different measurement points and conditions. Data are presented either in mean ± standard deviations or median [25th;75th percentile]. SDNN (standard deviation of NN intervals), HR (heart rate), RMSSD (root mean square of successive differencess), PNN50% (percentage of successive NN intervals that Ddffer by more than 50 ms), VLF (very low frequency), LF l(ow frequency), HF (high frequency), TP (total power), LFnu (low frequency normalized units), HFnu (high frequency normalized units).
Table 3 Descriptive statistics for ln-transformed HRV metrics across different measurement points and conditions. Data are presented either in mean ± standard deviations or median [25th;75th percentile]. SDNN (standard deviation of NN intervals), HR (heart rate), RMSSD (root mean square of successive differences), PNN50% (percentage of successive NN intervals that differ by more than 50 ms), VLF (very low frequency), LF (low frequency), HF (high frequency), TP (total power), LFnu (low frequencyy normalized units), HFnu (high frequency normalized units).

Repeated measures analysis

For absolute data supine, only VLF showed significant differences between sessions (A vs. B (p = 0.012) and D vs. E (p = 0.022)). Standing, HR, LF, LF/HF, LF + HF, TP, LF/HR, LFnu and HFnu showed significant differences exclusively on home vs. lab analysis (p = 0.016 ,0.0001, 0.0005, 0.019, 0.035, 0.003, 0.0006, 0.0006, respectively). No differences were observed between daily successive measurements (24-h) performed at-home (A vs. D) and in-lab (B vs. E). For ln-transformed data supine, LF, LF + HF and LF + HF/HR (p = 0.041, 0.037, 0.032, respectively) showed significant interaction effects but multiple comparisons analysis did not allow for decisive separation between sessions, with only trends observed for B vs. E (in-lab 24 h, p = 0.0113, 0.09 and 0.078, respectively). Standing, HR, LF/HF, LFnu showed post-hoc significant differences with HR being lower in-lab (A vs. B, p = 0.016) and LF/HF and LFnu being lower in-lab (p = 0.0005 and 0.001 for A vs. B and D vs. E). To evaluate short-term physiological responses within the lab, paired comparisons between measurements B and C showed significant difference for absolute HR supine (+ 2.7 bpm for B; p < 0.0001). Supine root mean square of successive differences (RMSSD) was higher for C (p = 0.041). Standing, SDNN, RMSSD and VLF were higher for C (p = 0.012, 0.021, 0.034, respectively). ANOVA results are available in supplementary materials.

Relative reliability analysis

The exhaustive results of the relative reliability analysis are presented in Tables 4 and 5. The intraclass correlation coefficients (ICCs) across the five different sessions demonstrated varying degrees of reliability. Both absolute and ln-transformed data showed comparable reliability, with slightly higher values for ln-transformed data in the frequency-domain metrics.

Table 4 Intraclass correlation coefficients (ICCs) with 95% confidence intervals for heart rate variability (HRV) absolute metrics across various measurement conditions. SDNN (standard deviation of NN intervals), HR (heart rate), RMSSD (root mean square of successive differences), PNN50% (percentage of successive NN intervals that differ by more than 50 ms), VLF (very low frequency), LF (low frequency), HF (high frequency), TP (total power), LFnu (low frequency normalized units), HFnu (high frequency normalized units). Symbols indicate whether the variable differs significantly between the compared sessions. * (p < 0.05), ** (p < 0.01), *** (p < 0.001).
Table 5 Intraclass correlation coefficients (ICCs) with 95% confidence intervals for heart rate variability (HRV) ln-transformed metrics across various measurement conditions. SDNN (standard deviation of NN intervals), HR (heart rate), RMSSD (root mean square of successive differences), PNN50% (percentage of successive NN intervals that differ by more than 50 ms), VLF (very low frequency), LF (low frequency), HF (high frequency), TP (total power), LFnu (low frequency normalized units), HFnu (high frequency normalized units). Symbols indicate whether the variable differs significantly between the compared sessions. * (p < 0.05), ** (p < 0.01), *** (p < 0.001).

In the supine position, time-domain variables such as HR, SDNN and RMSSD exhibited consistently good ICCs (> 0.75), indicative of robust autonomic signal detection, irrespective of transformation. Conversely, frequency-domain metrics displayed moderate reliability (ICCs < 0.75), suggesting some sensitivity to testing conditions, except for LF which showed good reliability (ICCs > 0.75). In the standing position, time-domain metrics and frequency-domain metrics LF and HF demonstrated moderate reliability (ICCs < 0.75), generally lower than in the supine position. Only HR, lnHR, and lnLF/HR reached good reliability, emphasizing the impact of postural changes on measurement stability.

Day-to-day analysis revealed variability in reliability. Supine absolute HR, RMSSD, LF, LF + HF, LF + HF/HR demonstrated good reliability across at-home measurements, while ln-transformed data showed lower ICCs in comparison. In-lab reliability (B vs. E) was significantly lower, especially for frequency-domain analysis. Supine RMSSD consistently exhibited good to excellent reliability across all testing conditions. In contrast, the LF/HF ratio exhibited moderate to poor reliability in certain conditions. For standing measurements, HR exhibited excellent reliability for in-lab comparisons (B vs. C). Most metrics in the standing position demonstrated moderate reliability (ICCs < 0.75), indicating reduced reliability compared to other metrics and conditions. This finding underscores the potential challenges of capturing reliable frequency-domain data in upright postures over extended intervals. Short-term in-lab (A vs. B) measurements showed good to excellent reliability in most of the variables used for clinical interpretation25 (e.g. HR, LF, HF), with ln-transformation trending to increase reliability.

Absolute reliability analysis

Table 6 presents absolute reliability data for all situations from absolute variables mainly used for clinical interpretation. In the supine position, Median Absolute Deviation (MAD) for HR ranged from 1.6 bpm to 2.4 bpm, indicating measurements with minor fluctuations. For SDNN and RMSSD, MAD values varied from 6.3 to 7.9 ms, reflecting modest variance. HF exhibited higher fluctuation with MADs up to 519.6 ms2, emphasizing the sensitivity of frequency-domain parasympathetic activity markers to measurement conditions. In the standing position, the MAD for HR demonstrated a range from 1.3 bpm to 3.3 bpm. The LF component showed noteworthy variance with MAD extending to 573 ms2. Relative Standard Error of Measurement (SEM) and Minimal Detectable Change (MDC) were significantly higher for frequency-domain variables, with ranges from 1.2 to 9.1% for time-domain and from 16.8 to 34.4% for frequency-domain. Conditions with the lowest SEM were at-home (A vs. D) and short-term in-lab (B vs. C).

Table 6 Absolute reliability variables according to each situation for main absolute variables of clinical interest. SEM (standard error of measurement), CV (coefficient of variation), MDC (minimally detectable change).

Performing the measurement under strictly controlled laboratory conditions (A vs. B and D vs. E) revealed no significant improvement except in supine VLF and standing HR, LF/HF, and LFnu metrics. Conversely, relative reliability was generally superior and SEM inferior when data were collected at home. Our results validate the reliability of short-term heart rate variability measurements under various conditions among healthy, physically active individuals. They confirmed that HRV is a robust and stable biomarker, showing high consistency across different environments—both at home and in laboratory settings—and under varying postural conditions. The observed variations between different postures reinforce the sensitivity of HRV to physiological changes, supporting its broader application in health assessments and performance monitoring.

Discussion

HRV is a cornerstone measure for evaluating ANS function. Its non-invasive nature and ability to capture subtle fluctuations in cardiac autonomic regulation have spurred significant interest across diverse fields, including sports medicine, telehealth, and chronic disease management. However, debates persist regarding how best to measure and interpret HRV in real-world scenarios. Methodological inconsistencies such as posture, timing, environmental variations challenge the reliability of HRV findings18,19. To address these issues, this study aimed to assess the clinical reliability of short-term HRV measurements in supine and standing positions, comparing at-home and in-laboratory settings in healthy active individuals.

Main findings

Our study contributes to the growing field of HRV research by showing that HRV remains a reliable indicator of autonomic status in young, healthy, active adults, even under varied conditions — including supine and standing postures and at-home and laboratory environments. Indeed, our findings show that short-term HRV measurements, when performed under standardized conditions with rigorous protocols, exhibit good-to-excellent reliability for most of the relevant variables across these conditions. Day-to-day variability was minimal, underscoring the consistency of HRV measurements for monitoring autonomic function in healthy, active individuals. These results reinforce previous work highlighting the robustness of short-term HRV data under carefully controlled conditions33,34. Time-domain HRV metrics such as RMSSD and HR were particularly robust across all conditions, with ICCs often exceeding 0.75. Overall, these findings align with the existing literature reporting the reliability of HRV measurements across a variety of physiological states and settings, with intra-class correlation coefficients ranging from 0.69 to 0.90 across multiple sessions35,36. Particularly notable are the obtained ICCs for RMSSD (0.917) and for HR (0.88), which are similar or even better to those reported in adolescent athletes (0.71 and 0.88, respectively)37. Frequency-domain metrics, while moderately reliable, were more sensitive to variations in posture and environment, in line with previous reports indicating larger standard errors of measurement in certain conditions where controlling all confounding factors is more challenging, such as non-laboratory environments, short sampling intervals, or varying postures33,34. The comparison of our data with those obtained in elite athletes – who showed low variance and good reliability of HRV – further corroborates the stability of HRV measurements in strictly controlled conditions38. This underscores that HRV’s “signal” can be distinguished from methodological “noise,” contingent on consistent timing, posture control, and careful data handling. Additionally, our findings showed good-to-excellent in-lab short-term reliability particularly in the supine position (B vs. C), confirming previous findings performed across four consecutive 5-minute measurements39. This short-term reliability of in-laboratory HRV provides strong support for its use in detecting true changes in cardiac autonomic regulation in pre/post intervention designs40. In contrast, the LF/HF ratio demonstrated moderate to poor reliability in certain conditions, underscoring its sensitivity to variations in environmental factors or measurement protocols. This sensitivity may limit its usefulness as a robust autonomic marker, which has already been discussed in the literature, especially in scenarios where consistent data collection across diverse settings is required41,42.

Overall, our results highlight the robustness of HRV as a tool for assessing autonomic nervous system function under different conditions and support the use of HRV as a stable biomarker for health and performance monitoring while underscoring the need for rigorous standardization to enhance interpretative accuracy.

Environmental influence: Home vs. Laboratory

Despite showing no drastic differences between at-home and in-lab environments, home-based measurements showed finer absolute reliability in the supine position. This modest, yet potentially impactful advantage highlights the potential of at-home HRV data collection to provide more consistent conditions compared to in-lab recordings. Measuring HRV upon awakening in the familiar home environment may reduce the influence of incidental physical activity or other confounding factors that may occur when measuring in clinical settings. Indeed, psychological factors such as anxiety and fear may be more likely to arise in unfamiliar environments like laboratories or hospitals11,43. These findings align with the growing adoption of remote health monitoring technologies, where wearable sensors and smartphone apps enable continuous physiological tracking without requiring frequent clinical visits44,45, benefiting those who cannot easily attend laboratory or clinical appointments—be it due to geographic limitations, mobility constraints, or busy schedules. Notwithstanding this, by carefully controlling confounding factors—such as fasting status and the absence of prior physical activity, we showed that measurements taken in a one-hour slot post-awakening in a laboratory setting can also achieve substantial reliability and offer a robust way of assessing ANS function. This highlights the potential for both approaches to be utilized interchangeably, depending on practical considerations and participant accessibility, without compromising data quality.

Importance of standardization and remaining challenges

One of the enduring challenges in HRV research is controlling for the myriad of variables that can influence measurements, including circadian rhythms, recent physical activity, emotional stress, and environmental factors such as temperature12,29,46,47. In our study, we attempted to minimize these confounders by instructing participants to collect data upon morning awakening, enforcing fasting, and ensuring minimal prior physical activation. These measures aimed to capture a more “baseline” state of autonomic function. Additionally, we rigorously followed the standardized protocols recommended by Schmitt et al. (2015) focusing on a dual-position analysis upon morning awakening to reduce potential confounding factors and maximize consistency25. This dual-position protocol also enabled us to explore the influence of posture on HRV, an aspect often overlooked in less standardized protocols26. Our approach thus builds on recommended clinically implemented protocols and resonates with broader efforts to unify HRV measurement protocols15,16,24,27. Moreover, we applied rigorous standards to our statistical analysis, including adjustments for skewed distributions48,49,50, ensuring that the reliability estimates accurately captured true autonomic function rather than artifacts.

Despite this rigorous design, we acknowledge that several challenges persist. Indeed, variability in personal routines, differences in participants’ sleep patterns, and individual sensitivity to posture shifts may still introduce noise. Moreover, while short-term and day-to-day measurements exhibit good reliability, as indicated by our findings, the literature suggests that long-term HRV tracking (e.g. over months) is more susceptible to within-subject variability21,51. Indeed, while our study demonstrated good reliability of HRV measurements over a 24-hour period, longer-term HRV monitoring still faces challenges. Notably, two studies reported substantially lower ICCs for HRV over intervals spanning several months, reflecting considerable within-subject variability and highlighting the limitations of extended HRV monitoring21,51. Additionally, factors like aging, lifestyle modifications, or the onset of subclinical conditions may substantially alter baseline autonomic function12,29, which highlights the influence of dynamic physiological factors on autonomic regulation over time and underscores the importance of context when interpreting HRV trends over extended timeframes.

Perspectives – technological advancements

Refining these methodological standards enables future studies to make confident cross-population comparisons and pre-/post-intervention HRV comparisons29,40, expanding HRV applications in clinical and non-clinical contexts. Our findings provide a foundation for further investigations in diverse cohorts, from patients with autonomic dysfunction to those aiming to optimize health and performance. Advances in wearable devices and photoplethysmography (PPG) sensor technology are making large-scale, real-time HRV monitoring more feasible, and artifact-correction algorithms promise more convenient HRV data collection in everyday environments. However, while PPG offers convenience, it remains less precise than ECG due to motion artifacts and external factors, with mixed validation results. Time-domain variables like RMSSD show acceptable agreement with ECG52, but frequency-domain parameters exhibit lower reliability53,54,55,56. Thus, while PPG can be suitable for tracking certain isolated metrics or trends, ECG remains the preferred method for clinical applications and research scenarios where high precision is essential. Emerging analytical techniques, including non-linear measures57 and artificial intelligence (AI)-driven models, offer new avenues for capturing complex cardiovascular dynamics. AI integration could establish individualized HRV baselines by incorporating various factors (e.g. age, training load, sleep quality, psychosocial stress), and flag deviations to guide training or clinical decisions, thus enhancing personalized healthcare and performance monitoring58. Building on our robust reliability data, future studies could leverage AI to deliver real-time feedback, guiding individualized training adjustments or clinical decision-making and thus broadening the impact of HRV monitoring in personalized healthcare.

Perspectives – clinical applications

The robust HRV reliability shown in our study reinforces the value of HRV as a practical tool in healthcare. The consistency of our measures across multiple days and under varying postural conditions, suggest that clinicians and researchers can use HRV to track autonomic changes in clinical populations with reasonable confidence. In chronic disease management – especially for conditions associated with autonomic dysregulation like hypertension, heart failure, or diabetes – regular HRV assessments could flag early signs of deterioration. Because our results support the feasibility of remote HRV measurements, healthcare providers may integrate home-based data collection to encourage patient engagement and timely intervention. In athletic and sports medicine contexts, the reliable day-to-day readings evidenced in our study suggest that HRV can serve as a trustworthy gauge of training load, recovery status, and overtraining risk. By adapting training programs according to systematic HRV evaluations, coaches and practitioners may optimize athletes’ performance while mitigating injury or burnout25,48. Moreover, our findings of stable HRV metrics across different environments support its growing recognition as a psychosomatic indicator – one that could be included in a broader psychophysiological framework for mental health management. The ability to measure HRV consistently in everyday settings lends itself to tracking stress management interventions, therapeutic progress, and overall well-being in populations dealing with anxiety and depression.

Conclusion

In conclusion, our study confirms HRV as a highly reliable measure of autonomic function in healthy, active adults when assessed under standardized conditions. We observed consistent reliability across postures and environments, highlighting HRV’s adaptability in real-world scenarios. While at-home measurements showed modest yet potentially meaningful advantages for certain metrics, our broader findings underscore HRV’s utility as a stable biomarker for cardiovascular health and training adaptations. Future research integrating wearable sensors and AI analytics could further enhance its diagnostic potential, supporting personalized, data-driven interventions. Moreover, our findings reinforce HRV’s significance as a critical measure in both clinical and athletic settings7,14,23. By demonstrating the robustness of HRV measurements, this study underscores the importance of precise and adaptable assessment protocols, ensuring HRV remains a valuable tool in both clinical and athletic contexts.

Methods

Study design

This observational study employed a rigorous design to examine the clinical reliability of HRV measurements, notably by contrasting at-home measurements upon awakening with those conducted in a controlled laboratory environment in individuals without any health complaints. After an initial screening visit, participants underwent five supine/standing HRV measurements within a 24-hour period under standardized conditions, as shown in Fig. 1. The first measurement (A) was taken at home upon awakening, followed later that day by two laboratory measurements: one upon arrival (B) and a second after a 15-minute passive sitting break (C). The process was repeated the next morning with another at-home measurement (D) upon awakening and a final lab measurement (E). The study aimed to: (1) assess short-term HRV reliability across these five time points to evaluate consistency; (2) analyze day-to-day variance by comparing HRV measurements taken in different environments across 24-h; (3) determine whether at-home measurements could reliably substitute or complement lab-based assessments; and (4) evaluate short-term reproducibility of lab-based measurements through repeated testing in a controlled setting.

Participants

Forty-three participants voluntarily took part in the study. Inclusion criteria required participants to be generally healthy, physically active (engaging in more than 150 min of moderate physical activity per week), aged between 18 and 70, free from acute injury or inflammatory disease, and with fatigue level ≤ 14 on the Profile of Mood State questionnaire. Exclusion criteria included pregnancy, current sickness or injury or acute musculoskeletal pathology, current treatment impacting cardiovascular function, current cardiac or pulmonary pathology, recent episodes of loss of balance or unexplained vagal symptoms, loss consciousness within the past year, and/or susceptibility to orthostatic intolerance. This population was chosen due to their stable physiological profiles and the relevance to sports and exercise medicine applications, where accurate monitoring of ANS activity is critical for optimizing health, performance, and recovery. The study was approved by the Canton de Vaud ethics committee (CER-VD, protocol #2020-00071) and conducted in accordance with the ethical standards of the Declaration of Helsinki. All participants signed an informed consent form prior to data collection.

Heart rate variability

Upon completion of the screening process during the first visit, participants were provided with a heart rate monitor (Polar H10, Polar, Finland) to perform all measurements. They were thoroughly familiarized with the smartphone application (inCORPUS® app, Be.Care, Lausanne, Switzerland) to conduct the HRV measurements at home independently. The principle of the measurement was to record HR at rest in supine and standing positions for 5 min each, as previously described24. Given the sensitivity of HRV to various internal and external factors, rigorous instructions were given. External factors (location, time, room temperature, air humidity, noise, devices used, persons present during the measurements) were kept as stable as possible. For all measurements, participants were also instructed to control for internal confounding factors by: (1) avoiding intense physical efforts 48 h before the measurements, (2) abstaining from alcohol 12 h before the measurements, (3) abstaining from nicotine or caffeine after awakening on the day of measurement (4) performing the measurements in a fasting state, free from muscle soreness, illness, and on an empty bladder. Additionally, participants were instructed to relax throughout the duration of the measurement, without focusing their thoughts on anything specific. During the standing phase, participants were asked to keep their arms along the body and remain motionless, without moving their feet, or transferring their weight from one foot to the other. Any interference (movement, coughing, yawning, spasm, etc.) occurring during the recordings had to be reported to the investigators. Laboratory measurements were performed exclusively between 7 and 9 am to minimize the influence of circadian variations, digestion, and other daily stressors. The conditions of measurement were kept identical between visits. Confounding factors were checked at every visit, and measurements that did not meet the required conditions were excluded.

Data processing and analysis

Analyses were performed on raw RR intervals of the supine and standing phases extracted from the app dashboard. Each file was visually inspected for artifacts and ectopic beats, which were automatically corrected with a dedicated software (Kubios HRV Premium, Kuopio, Finland; automatic beat correction and medium automatic noise detection59). Standard time- and frequency-domain variables were calculated automatically by the software on recording periods of 240 s after excluding the first minute22,60. Time- and frequency-domain variables retained for analysis were SDNN, reflecting overall HRV14, HR, RMSSD, indicative of parasympathetic influence22, percentage of successive NN intervals differing by more than 50 ms (pNN50%), VLF, LF, which highlights increased sympathetic activity and baroreflex sensitivity upon standing61, HF, which assesses parasympathetic activity and is valuable for evaluating vagal tone under resting conditions, the LF/HF ratio, LF + HF (sum of low and high frequency power), TP, and normalized units such as LFnu and HFnu. Supine, High frequencies and the sum of HF + LF were also calculated relative to HR to provide a comprehensive and normalized view of total autonomic balance, proving to be suitable for clinical assessments as well as for evaluating physiological responses in physically active individuals independent from naturally low breathing rate which could happen in this population42. LF in standing position was also transformed to assess the sympathetic nervous system response to orthostatic stress, adjusted for individual heart rate differences.

We performed our analysis on both absolute and ln-transformed HRV data in the same study to both provide direct biological interpretability and meet parametric test assumptions. Normality was tested with Shapiro-Wilk test. For normally distributed variables, parametric tests were used and descriptive statistics are presented as mean ± standard deviation (SD). If normality failed, data are presented as median [25th ;75th percentile].

To investigate significant differences between measurements, we performed a two-way repeated measures (RM) ANOVA focusing on measures A, B, D and E, which allowed the assessment of main and interaction effects of time (Day 1 vs. Day 2) and environment (at-home vs. in-lab). Measure C was excluded from this analysis as it introduced within-day variability that could confound the effects of the environment (at-home vs. in-lab) and across-day changes. For normally distributed data, a RM ANOVA with post-hoc Fisher’s Least Significant Difference test was used to explore specific pairwise comparisons between groups under different conditions. For non-normally distributed data, the Friedman test with post-hoc Dunn’s multiple comparison test was applied.

To evaluate short-term physiological responses within the lab, paired comparisons between measurements B and C were analyzed using paired t-tests or Wilcoxon signed-rank test, depending on data normality.

To assess relative reliability, we computed single measures intraclass correlation coefficients with a two-way random model, single measures, and absolute agreement. Data are presented with 95% confidence. ICCs were calculated over all 5 data points (measurement A to E) as well as between the following recordings: (1) At-home Day 1 (Measure A) vs. In-lab Day 1 (Measure B), (2) At-home Day 1 (Measure A) vs. At-home Day 2 (Measure D), (3) In-lab Day 1 (Measure B) vs. In-lab Day 1 (Measure C), (4) In-lab Day 1 (Measure B) vs. In-lab Day 2 (Measure E). ICCs were interpreted as < 0.5 = “poor”, 0.5–0.75 = “moderate”, 0.75–0.9 = “good”, > 0.9 = “excellent” reliability62.

To assess absolute reliability and clinical interpretations, we restricted our analysis to the absolute variables most used in clinical settings: SDNN, HR, RMSSD, HF; LF + HF/HR in the supine phase, and HR; LF, LF/HR in the standing phase. For each of those variables, SEM was calculated using the mean of standard deviations between each paired measurements, multiplied by √(1-ICC)50. SEM was then presented in absolute and relative values (SEM/mean values of the measurements). Coefficient of variation (CV) was calculated by dividing standard deviation by the mean of each configuration (e.g. A vs. B). The 95% limits of agreement (LOA) to determine MDC) were calculated as ± 1.96×√2×SEM49 for each SEM, and presented both in absolute and relative value (percentage of the medians). Considering the skewed distribution we also utilized the median absolute deviation (MAD) and Relative Deviation Index (RDI). Absolute deviation from the median was calculated from each measure and MAD was calculated by taking the mean of absolute deviations of the medians. The use of MAD offers then a robust assessment of variability less affected by outlier and skewed distribution. The relative deviation index was calculated by expressing this value in function of the median. The use of MAD and RDI ensures that the analyses are both statistically robust and clinically relevant. All analysis were performed on each situation (all 5 measurements, A vs. B, A vs. D, B vs. C and B vs. E.)

Statistical analyses were conducted using several software tools to ensure comprehensive data evaluation: intraclass correlation coefficients were calculated using Jamovi 2.3.28 (The Jamovi Project, Sydney, Australia); normality tests and comparisons analysis were performed with GraphPad Prism 10.1.2 (324) (GraphPad Software, San Diego, CA, USA); standard error of measurement and limits of agreement analysis were facilitated by Microsoft Excel (Microsoft Corporation, Redmond, WA, USA, Version 2402). Statistical significance was established at p < 0.05.