Abstract
Cardiac dysfunction plays a critical role in clinical diagnostics and treatment. Although traditional methods like echocardiography and blood biomarkers are effective, their limitations highlight the need for noninvasive and continuous monitoring solutions. Ballistocardiography (BCG), which captures subtle body vibrations generated by cardiac mechanical activity, has emerged as a promising tool for remote cardiovascular monitoring. This study presents a multi-pathology BCG dataset comprising recordings from healthy participants, patients with heart failure (HF), and those with arrhythmias such as atrial fibrillation (AF), premature ventricular contractions (PVCs), and premature atrial contractions (PACs). Synchronized electrocardiogram (ECG) and M-mode echocardiography recordings are also included, providing a comprehensive overview of cardiac function under diverse physiological and pathological conditions. The dataset aims to support the development of advanced algorithms and promote clinical validation of BCG as a tool for noninvasive cardiovascular monitoring.
Similar content being viewed by others
Background & Summary
Accurate monitoring and early detection of cardiac dysfunction are critical for effective clinical practice, as they directly affect diagnoses, prognostic evaluations, and treatment decisions1,2. Although echocardiography and blood biomarkers are widely accepted clinical tools for evaluating cardiac function, their dependence on specialized equipment and professional operation may limit their practicality for remote or home-based monitoring. Therefore, these limitations underscore the need for novel approaches that enable continuous, user-friendly, and noninvasive assessment of cardiac function, particularly in out-of-hospital settings.
Ballistocardiography (BCG) has emerged as a promising noninvasive technique for monitoring hemodynamic and cardiac cycle events. By detecting subtle vibrations on the body’s surface, BCG provides valuable information on cardiac function without the need for invasive procedures. Unlike traditional electrocardiogram (ECG)-based methods that require attaching electrodes to the body—even in portable or wearable forms—BCG is entirely unobtrusive, offering greater comfort and ease of use. In addition, BCG captures both cardiac and respiratory components, enabling the detection of subtle hemodynamic changes associated with inhalation, exhalation, and blood ejection3,4. For example, during inhalation, negative intrathoracic pressure increases venous return, resulting in higher wave amplitudes, while exhalation reduces cardiac output and wave amplitudes3. These characteristics make the BCG especially useful for measuring key indicators such as heart rate, respiratory rate, and stroke volume(SV)5,6,7,8,9,10.
Over the past two decades, significant advancements have been made in BCG-based vital sign monitoring and cardiovascular evaluation. Initially, research focused on improving sensor technologies, advancing from basic weighing or pressure sensors to electromechanical film sensors (EMFI)11,12,13,14,–15, which provide more sensitive detection of micro-vibrations related to cardiac cycle events. Furthermore, various analytical techniques, including wavelet transforms4, empirical mode decomposition16,17, filtering18, and machine learning19, have been extensively explored to extract key physiological metrics such as heart rate, heart rate variability, and respiratory patterns. Moreover, BCG has been combined with complementary modalities such as ECG and photoplethysmography (PPG) to derive parameters such as pulse arrival time (PAT) and pulse transit time (PTT)20,21,22,23,–24, helping to estimate blood pressure and providing a more comprehensive hemodynamic analysis. In addition, the feasibility of using BCG to measure advanced parameters such as cardiac output(CO)25,26,27,–28 and SV29 has been validated, with high correlations reported with reference methods25. These developments suggest that BCG is likely to become a practical, noninvasive tool for long-term cardiovascular monitoring in both clinical and home settings.
In addition to its capability for quantifying hemodynamic parameters, BCG has shown increasing promise in the detection and monitoring of a variety of cardiovascular diseases, including coronary heart disease30,31, arrhythmias32,33, hypertension34,35, and heart failure(HF)36,37,–38. These studies demonstrate that BCG signals can capture disease-relevant physiological changes, supporting their potential clinical utility. However, in certain areas such as valvular disease, most prior research including foundational studies from the mid-20th century39,40, was limited by the sensing and analysis technologies of that time. Modern validation studies employing advanced sensors and synchronized imaging are still rare, primarily due to the lack of comprehensive, publicly available datasets. Meanwhile, recent work has explored the use of BCG for sleep monitoring and broader cardiovascular risk prediction41,42, yet these datasets are often restricted to healthy participants or lack paired clinical imaging, which limits their usefulness in diagnostic model development.
To address these challenges, we present a comprehensive multi-pathology BCG dataset, which includes data from healthy participants as well as those diagnosed with HF, atrial fibrillation (AF), premature ventricular contractions (PVCs), and premature atrial contractions (PACs). The dataset also contains synchronized ECG and M-mode echocardiographic recordings to provide an integrated representation of cardiac function under diverse physiological and pathological conditions. M-mode echocardiography captures the motion of cardiac structures along a single ultrasound line over time, providing high temporal resolution for evaluating wall motion and chamber dimensions. It is widely used to assess left ventricular function and estimate ejection fraction(EF), offering key clinical indicators for diagnosing systolic dysfunction. Furthermore, our dataset includes left ventricular SV and EF values for all participants, measured via echocardiography at the time of signal acquisition. The SV values, combined with recorded heart rates, enable CO estimation and provide ground truth for future regression modeling. Although this study does not include direct CO analysis, the dataset is designed to support such research. In addition, EF values can be used to assess over all cardiac function.
This dataset is intended to promote research into innovative methods for assessing cardiac function, particularly in arrhythmic conditions, and to help create more accurate, cost-effective, and accessible monitoring methods for both clinical and home settings.
Method
Participants and ethical approval
This study enrolled 85 participants, including individuals with sinus rhythm, HF, and cardiac arrhythmias such as AF, PVCs, and PACs. Recruitment was conducted at Wuhan University of Technology and the Department of Cardiology at Zhongnan Hospital of Wuhan University between July and August 2022. Posters were displayed to invite healthy participants and those with diagnosed arrhythmias or HF. The study was approved by the Institutional Review Board of Zhongnan Hospital of Wuhan University (Ethics Approval Number: 2022075). All procedures complied with ethical guidelines, and written informed consent was obtained from all participants.
The demographic and clinical characteristics of the participants are presented in Table 1. The study population consisted of 67 participants with sinus rhythm, 15 with arrhythmias, and 7 with HF. Of these, 4 participants had both arrhythmias and HF, resulting in a total cohort of 85 unique participants. Overall, 72% of the total participants being male. The mean age of the sinus rhythm group was 42 years, whereas the arrhythmia and HF groups were older, with mean ages of 63 years. The HF group exhibited the lowest EF(%), with a mean of 39%, significantly lower than that of the sinus rhythm group (64%) and arrhythmia group (57%). Additional details, including heart rate (HR), body mass index (BMI), height (cm), and weight (kg), are provided in Table 1.
Inclusion and exclusion criteria
Eligible participants were screened by licensed cardiologists using echocardiography and ECG to confirm their status and ensure they met the inclusion criteria. To maintain the quality of the BCG signal, participants with severe respiratory conditions or diseases that cause involuntary body movements (e.g., tremor or muscle spasms) were excluded, as such conditions could introduce motion artifacts. Although BCG were collected in a supine position, this posture sometimes compromised echocardiographic image quality. Therefore, cases with poor quality ultrasound images were excluded to ensure the final dataset contained only clinically reliable measurements.
Data collection requirements
During data collection, participants were asked to remain calm and lie flat on an optical fiber sensor mat43. ECG electrodes were placed on their wrist and near the collarbones to continuously monitor their heart rate as reference signals (Fig. 1a). Simultaneously, a licensed cardiologist performed M-mode echocardiographic imaging using a Mindray M7 Expert ultrasound system with a P4-2S probe, operating at a frequency of 3.2 MHz. This ensured that image acquisition adhered to clinical standards and anatomical accuracy. This integrated protocol enabled synchronized acquisition of BCG, ECG, and echocardiographic signals under well-controlled conditions, enabling accurate correlation and validation across modalities.
Schematic diagram of the ballistocardiogram (BCG) sensing system and sensor structure. (a) System setup. BCG and electrocardiogram (ECG) signals are recorded simultaneously using a sensing mattress with an embedded sensor. (b) Sensor flexibility. (i–iv) Demonstrations of the sensor’s flexibility and its integrated module. (c) Structural layers. The mattress consists of a sensitization layer (including two parts: the top is a specially hardened polyvinyl chloride material, and the second is a high-density sponge; both components are designed to enhance sensing sensitivity), a sensing layer, a support layer, and a buffer layer. (d) Working principle. The sensitized optical fiber deforms in response to cardiac-induced vibrations. (e) Signal transmission. Mechanical input modulates the optical signal, producing the processed BCG waveform.
Data acquisition
BCG recording
An optical fiber sensor system43 (Fig. 1a), operating at 100 Hz, was used to record BCG signals. The sensor, composed of multiple flexible layers, was placed beneath the participant to ensure comfort and adaptability (Fig. 1b,c). Its soft and pliable material allows it to bend and conform to body contours, making it ideal for long-term monitoring.
The BCG acquisition process relies on modulation and demodulation techniques within a unified system (Fig. 1e)43,44.When a participant lies on the sensor, heartbeat-induced micro-movements cause slight variations in the pressure exerted on the optical fiber (Fig. 1d). These variations modulate the optical signal, which is then converted into an electrical signal through demodulation. The electrical output undergoes hardware filtering and analog amplification, generating an original BCG signal that contains inherent noise. Simultaneously, the system extracts the ECG signal from the same demodulation device, ensuring precise synchronization using a shared timing clock.
ECG recording
ECG signals were recorded simultaneously using a dedicated AD8232 ECG module, which provides high-gain, low-noise signal acquisition for real-time heart rate monitoring. The ECG module operated at a sampling rate of 100 Hz, ensuring synchronization with the BCG signal (Fig. 1a). To maintain accurate and stable R-wave detection, standard ECG electrodes were placed on the participant’s chest, neck, and wrists to optimize signal acquisition. The ECG module was integrated into the same demodulation system as BCG, ensuring precise timing alignment under a unified clock reference.
M-mode echocardiograph
To further assess cardiac mechanical activity, M-mode echocardiograms were acquired during BCG and ECG monitoring using the Mindray M7 Expert ultrasound system with a P4-2S probe. All scans were performed by a licensed cardiologist to ensure operator consistency. Since BCG acquisition requires the supine position, all echocardiographic recordings were also conducted in the supine position, rather than the conventional left lateral decubitus position, to maintain alignment between BCG and ultrasound imaging. Depending on the study protocol, ultrasound images were obtained from one of three cardiac sites: the mitral valve (referred to as EJ), the aortic valve (referred to as ZJ), or the left ventricle (referred to as XJ). The echocardiographic sequences were directly saved in AVI format, ensuring compatibility with subsequent signal analysis and multi-modal integration with the BCG and ECG recordings.
Enhanced Singular Value Thresholding (ESVT)-based Denoising Algorithm
To enhance the quality of BCG signals, we employed the Enhanced Singular Value Thresholding (ESVT) algorithm, which improves upon conventional wavelet-based denoising techniques44,45. The workflow of the ESVT algorithm is illustrated in Fig. 2 and consists of four main steps.
Workflow of the Enhanced Singular Value Thresholding (ESVT) algorithm. (a) Thresholding & iteration selection. A simulated BCG signal with Gaussian white noise undergoes a grid search to determine the optimal threshold and iteration count, based on signal-to-noise ratios(SNR) analysis. (b) Enhanced singular value thresholding. The noisy BCG signal is converted into a Hankel matrix, then iteratively filtered for iterations using ESVT with τ, progressively reducing noise. (c) Signal reconstruction and output. The denoised matrix is transformed back into the time domain, yielding a refined BCG signal with preserved physiological features.
Simulation-guided parameter selection
To determine the optimal denoising parameters, we conducted a grid search over a range of threshold values τ and iteration counts i.
To evaluate the performance of the proposed denoising algorithm, synthetic BCG signals were generated using a sum-of-sinusoids model informed by empirical frequency analysis. Real BCG recordings from healthy subjects were analyzed in the frequency domain, revealing five to six dominant frequency components ranging from 1.5 Hz to 9 Hz. The synthetic BCG signal was constructed as six frequency components (1.5 Hz, 3 Hz, 4.5 Hz, 6 Hz, 7.5 Hz, 9 Hz), which were combined with a low-frequency respiratory component (0.2 Hz) to simulate physiologically realistic fluctuations. The complete waveform was defined as follows:
Where fk are the dominant cardiac frequency components, and the amplitude Ak set as is {0.1, 0.2, 0.1, 0.4, 0.3, 0.6}. A respiratory component at fr = 0.2 Hz with an amplitude of 2. Notably, in the actual sensing system, the analog signal is passed through a 1–30 Hz band-pass filter and then amplified by a factor of 4 before digitization. Despite this hardware filtering, residual respiratory components remain present in the recorded signals. Therefore, it is essential to include the respiratory term in the synthetic BCG waveform to accurately reflect the physiological characteristics observed in real recordings. To simulate realistic noise conditions, Gaussian white noise was added to the synthetic signal with signal-to-noise ratios (SNR) ranging from 0 to 15 dB. This choice was based on baseline noise measurements from seven unloaded sensing systems, which demonstrated that the system’s intrinsic noise followed a Gaussian distribution.
Then, a grid search was performed across different values of threshold τ and iteration count i. Finally, the performance of each parameter combination was evaluated using SNR. The results were visualized as a heatmap, where the optimal parameters (τopt, iopt) were identified from the region with the highest SNR (deep red in Fig. 2).
Matrix Embedding and Singular Value Decomposition (SVD)
Once the optimal parameters were determined, the real BCG signals were processed using the following steps:
Step 1: The raw BCG signal x(n) of length N was embedded into a Hankel matrix X using a window length L (200 points, that is 2 seconds in this study).
Step 2: Singular Value Decomposition (SVD) was applied to decompose the matrix:
where Σ is the diagonal matrix containing singular values σ1 ≥ σ2 ≥ … .
Thresholding process
To remove noise while preserving the primary signal components, the singular values were iteratively thresholded.
A soft-thresholding operation was applied:
which selectively suppresses noise while retaining dominant periodic components.
This process was repeated for iopt iterations, ensuring progressive removal of noise components and convergence to an optimal denoised solution.
Signal reconstruction
The modified singular values were used to reconstruct the denoised matrix:
Then, the inverse embedding process was applied to obtain the final denoised BCG time series x*(n). Compared to the original signal, x*(n) exhibits enhanced waveform fidelity with significantly reduced noise and motion artifacts.
By iteratively refining singular value selection and thresholding, the ESVT algorithm effectively suppresses noise while preserving physiologically relevant BCG waveforms. The use of grid search-based parameter optimization ensures that the filtering process is adapted to the specific characteristics of the recorded signals, leading to improved signal quality for subsequent analysis. This structured denoising approach enhances the robustness of BCG signal processing, making it well-suited for real-world physiological monitoring applications.
Data Records
The dataset is available at Figshare (https://doi.org/10.6084/m9.figshare.28416896)46. It is organized into two main folders: Information and Data. The data folder consists of 153 subfolders, derived from 85 participants (Fig. 3). Each participant typically contributes 2–3 subfolders, depending on the completeness and quality of their M-mode echocardiography (UCG) recordings. There are 152 subfolder named following the format: hdata + SubjectID + EJ/XJ/ZJ to denote the specific cardiac region captured in the ultrasound data: EJ denotes M-mode imaging of the mitral valve, XJ denotes M-mode imaging of the left ventricle, and ZJ denotes M-mode imaging of the aortic valve. In addition, the Data folder contains a subfolder named EF_ALLsubject, which stores EF measurement images for all 85 participants. Each image corresponds to a single participant and is named using the format hdata + SubjectID + EF. All files are saved in.JPG format, and each image shows the M-mode echocardiographic trace used for EF assessment at the time of data collection.
Structure of the dataset. The dataset is organized into two primary directories: Data and Information. The Data folder contains 153 subfolders derived from 85 participants. Among them, 152 subfolders correspond to synchronized BCG, ECG, and M-mode ultrasound recordings. Each subfolder is named using the format hdata + SubjectID + EJ/XJ/ZJ, where EJ, XJ, and ZJ indicate the echocardiographic view of the mitral valve, left ventricle, and aortic valve, respectively. The remaining subfolder, EF_ALLsubject, stores M-mode echocardiographic images used for ejection fraction (EF) assessment from all 85 participants. Each image file is named hdata + SubjectID + EF and saved in.JPG format. The Information folder contains demographic metadata and session-level annotations, summarized in a single master file named Subject_Info.xlsx, which provides subjectID, sex, age, weight, height, heart rate, EF, and available imaging types for each participant.
For instance, a participant with identifier “001” may have subfolders named hdata1EJ, hdata1XJ, and/or hdata1ZJ, corresponding to each available M-mode echocardiographic segment. Each subfolder contains five distinct files, described in detail below.
BCG J-peak file
-
File name: hdata + subjectID + EJ/XJ/ZJ_BCG.csv
-
Content: J-peak positions in the BCG signal, presented in two columns.
-
The first column provides the raw data point index.
-
The second column specifies the corresponding time (in seconds) for each J-peak.
ECG R-peak file
-
File name: hdata + subjectID + EJ/XJ/ZJ_ECG.csv
-
Content: R-peak positions in the ECG signal, also in two columns.
-
The first column provides the raw data point index.
-
The second column specifies the corresponding time (in seconds) for each R-peak.
M-mode echocardiography (UCG) video
-
File name: hdata + subjectID + EJ/XJ/ZJ_UCG.AVI
-
Content: An AVI-format video of the simultaneously acquired M-mode echocardiogram. The suffix EJ, XJ, or ZJ indicates whether the imaging targeted the mitral valve, left ventricle, or aortic valve, respectively.
Signal data
-
File name: signal.csv
-
Content: Three columns of time-series data sampled at 100 Hz. Raw BCG signal (Column 1).ECG data (Lead V2 or another designated lead) (Column 2). Denoised BCG signal (Column 3), derived using the ESVT algorithm.
Signal visualization
-
File name: signal.pdf
-
Content: A graphical representation of the signals from signal.csv. This file facilitates quick inspection of waveform alignment and overall signal quality.
In addition to the data directory, an Information folder provides demographic and clinical details of the participants. All metadata, including date, subject IDs, demographic information, and available the type of imaging files(i.e., EJ, XJ, ZJ) with arrhythmic cases such as AF, PVCs, and PACs annotated in parentheses following the imaging type are summarized in a single Excel file named Subject_Info.xlsx, which applies to the entire dataset. Each row in subject_info.xlsx corresponds to an individual participant, and lists their ID, sex, weight, height, age, heart rate, and EF (%). These parameters establish an informative link between each participant’s anthropometric profile, cardiac function metrics, and the corresponding BCG, ECG, and ultrasound data.
Technical Validation
Parameter selection and denoising performance
To optimize the performance of the ESVT algorithm, we conducted a grid search over a range of threshold values (τ) and iteration counts (i). The optimal parameters were determined by evaluating the SNR, with the results visualized in a heatmap (Fig. 4a). The optimal threshold τopt and iteration count iopt were selected from the region with the highest SNR values, highlighted in deep red in Fig. 4a.
Threshold selection and denoising performance of the ESVT algorithm. (a) Heatmap of SNR versus threshold and iteration count. A grid search was used to identify the optimal threshold (τopt) and iteration count (iopt), with the highest SNR values highlighted in deep red. (b) Denoising results of one example from a sinus cardiac cycle signal. The original noisy BCG signal (top) is processed using the optimal parameters, producing a denoised signal (middle) with improved clarity. The bottom plot shows the corresponding ECG signal for reference. (c) Comparison of SNR improvement between the proposed ESVT method and a conventional wavelet-based(wave) denoising technique across synthetic BCG signals. Input SNR were applied from 0 to 15 dB using Gaussian white noise. Due to the inclusion of respiratory components, the actual measured SNR ranged from approximately –6.12 dB to 4.13 dB. ESVT demonstrates superior noise suppression, particularly under low-SNR conditions.
Using these optimized parameters, all BCG signals in the dataset were processed. An example of a denoised BCG signal is shown in Fig. 4b, where the ESVT algorithm effectively removes high-frequency noise and low-frequency respiratory components, while preserving the primary cardiac waveform. This demonstrates that denoised BCG signals retain their physiological integrity, with minimal signal contamination, ensuring high-quality data for further analysis.
To benchmark the performance of ESVT, we compared the SNR improvement achieved by ESVT with the results from our previously uesed wavelet-based(wave) denoising approach44. The results (Fig. 4c) show that ESVT yields consistently higher SNR gains across all tested levels, particularly under low SNR conditions.
Consistency between BCG-Derived and ECG-Derived Heart Rate
To assess the reliability of heart rate estimation from the BCG signals of this dataset, we compared the heart rate derived from BCG with the reference heart rate from ECG in the dataset. Figure 5a presents a scatter plot of heart rates obtained from both methods, showing a strong correlation (Pearson correlation coefficient R = 0.98) with a low root mean square error (RMSE) of 1.86 bpm and a mean absolute error (MAE) of 1.11 bpm. The Bland-Altman plot in Fig. 5b further confirms agreement, with a mean bias of 0.71 bpm and limits of agreement (LOA) ranging from −2.67 to 4.09 bpm. These results indicate that BCG-derived heart rate measurements fall within clinically acceptable limits, aligning with established medical diagnostic standards for heart rate monitoring. This supports the feasibility of using BCG as a noninvasive alternative to ECG for clinical and ambulatory cardiac evaluations.
Validation of heart rate estimation from BCG signals. (a) Scatter plot of BCG-derived versus ECG-derived heart rates. The heart rate extracted from BCG signals is plotted against the ECG reference, with color density representing distribution. (b) Bland-Altman plot of heart rate differences. The differences between BCG and ECG-derived heart rates are visualized to assess agreement, with dashed lines indicating the limits of agreement (LOA).
Qualitative analysis of arrhythmic events
A qualitative analysis was conducted on two participants(003 and 036) with cardiac arrhythmias to investigate the relationship between BCG signals, ECG, and M-mode ultrasound data (Fig. 6). In the case of the first patient with PVCs (Fig. 6a), an abnormality was observed in the ECG signal, but no corresponding mechanical response was detected in the BCG signal. Ultrasound imaging revealed that the mitral valve did not show significant movement during the PVCs event, indicating a lack of mechanical response to the abnormal electrical signal.
BCG and ECG signals with synchronized M-mode echocardiography in arrhythmic patients. (a) Mitral valve imaging (Subject ID: 003, hdata3EJ). BCG and ECG signals are shown in the upper panel, with a magnified segment highlighting an arrhythmic event. The lower panel presents the corresponding M-mode echocardiography of the mitral valve. (b) Aortic valve imaging (Subject ID: 036, hdata36ZJ). Similar to a, the upper panel displays BCG and ECG signals, while the lower panel shows the M-mode echocardiography of the aortic valve, synchronized with the arrhythmic episode.
In contrast, for the second participant with premature atrial contractions (PACs, Fig. 6b), the BCG signal remained present and showed a slightly earlier peak than expected, consistent with the prolonged RR interval in ECG. The M-mode trace of the aortic valve captured this timing variation as well. These observations suggest that certain arrhythmic events manifest as visible changes across multiple modalities, including ECG, BCG, and ultrasound, demonstrating both temporal and morphological consistency.
To further support this, we include two additional examples (081 and 025) in the Supplementary Fig. S1 and Fig. S2, which illustrate multimodal signal alignment in additional cases of PVCs and AF, respectively. While ECG provides critical information about the heart’s electrical conduction, it does not directly reflect mechanical performance. The findings here indicate that BCG may also capture mechanical consequences of arrhythmic electrical activity, although further validation in larger cohorts is warranted.
Dataset limitations
While the dataset includes a substantial number of high-quality BCG recordings, it currently contains a higher proportion of healthy samples compared to pathological cases. This imbalance may limit the applicability of certain machine learning models for rare or complex cardiac conditions, such as advanced heart failure or specific arrhythmia subtypes.We acknowledge this limitation and are actively working to expand the dataset by collecting and annotating more pathological BCG signals that span a broader range of clinical presentations.While BCG is not currently used as a standalone diagnostic tool in clinical settings, it holds significant potential for long-term, noninvasive monitoring of cardiac function. Future dataset releases will aim to provide a more balanced representation of subject types, thereby improving the dataset’s value for diagnostic algorithm development and validation.
In addition, we acknowledge that the left ventricular ejection fraction (LVEF) values in this dataset were measured using M-mode echocardiography. While M-mode offers high temporal resolution and ease of bedside acquisition, it has certain limitations in patients with heart failure. Specifically, M-mode relies on a single ultrasound line, and in HF cases with irregular wall motion or dilated ventricles, the positioning of the ultrasound beam may lead to underestimation of the true LVEF. In this study, we have cross-validated the M-mode EF values with the participants’ clinical diagnoses to ensure relative accuracy. Nonetheless, in future data collection efforts, we plan to incorporate B-mode echocardiographic imaging synchronized with BCG recordings to provide more comprehensive structural and functional references.
Furthermore, during comparative analysis between ECG-derived RR intervals and BCG-derived JJ intervals, we observed greater discrepancies at elevated heart rates, particularly in signals from patients with arrhythmias or heart failure. Although over 90% of cardiac cycles still exhibit clear one-to-one correspondence between ECG and BCG signals, some premature beats such as those in PVCs result in weak or absent BCG responses, thereby increasing RR–JJ divergence. We plan to investigate this phenomenon in more detail in future studies.
Usage Notes
The dataset is available at Figshare (https://doi.org/10.6084/m9.figshare.28416896)46. All files are provided in.csv,.AVI, and.pdf formats, and the data processing code is included in a separate folder labeled “code”. The data processing and analysis were performed using Python.
Code availability
All code for data processing and technical validation are available at Figshare (https://doi.org/10.6084/m9.figshare.28416896)46, and is also accessible via GitHub at: https://github.com/jingzhan1988/ESVT-algorithm-for-BCG.
References
Ouyang, D. et al. Video-based AI for beat-to-beat assessment of cardiac function. Nature 580, 252–256, https://doi.org/10.1038/s41586-020-2145-8 (2020).
Castiglione, V. et al. Biomarkers for the diagnosis and management of heart failure. Heart failure reviews 1–19 https://doi.org/10.1007/s10741-021-10105-w (2022).
Kim, C.-S. et al. Ballistocardiogram: Mechanism and Potential for Unobtrusive Cardiovascular Health Monitoring. Sci Rep 6, 31297, https://doi.org/10.1038/srep31297 (2016).
Ben Nasr, M. C., Ben Jebara, S., Otis, S., Abdulrazak, B. & Mezghani, N. A Spectral-Based Approach for BCG Signal Content Classification. Sensors 21, 1020, https://doi.org/10.3390/s21031020 (2021).
Ashouri, H., Orlandic, L. & Inan, O. T. Unobtrusive estimation of cardiac contractility and stroke volume changes using ballistocardiogram measurements on a high bandwidth force plate. Sensors 16, 787, https://doi.org/10.3390/s16060787 (2016).
Le, T. et al. Continuous non-invasive blood pressure monitoring: a methodological review on measurement techniques. IEEE Access 8, 212478–212498, https://doi.org/10.1109/ACCESS.2020.3040257 (2020).
Escobedo, P. et al. Bed-based ballistocardiography system using flexible rfid sensors for non-invasive single-and dual- subject vital signs monitoring. IEEE Transactions on Instrumentation Meas. https://doi.org/10.1109/TIM.2024.3369134 (2024).
Lin, J. et al. Wearable sensors and devices for real-time cardiovascular disease monitoring. Cell reports physical science 2 https://doi.org/10.1016/j.xcrp.2012.100541 (2021).
Linschmann, O., Leonhardt, S., Vehkaoja, A. & Hoog Antink, C. Estimation of the respiratory rate from ballistocardiograms using the hilbert transform. BioMedical Eng. OnLine 21, 54, https://doi.org/10.1186/s12938-022-01024-4 (2022).
Ahmedov, S. & Amirjanov, A. Genetic-fuzzy logic model for a non-invasive measurement of a stroke volume. Comput. Methods Programs Biomed. 203, 106046, https://doi.org/10.1016/j.cmpb.2021.106046 (2021).
Usman, A., Mukhtar, M. & Umer, R. M. Mems based wireless energy harvesting mechanism for ecg and bcg application. In 2014 International Conference on Emerging Technologies (ICET), 142–146 https://doi.org/10.1109/ICET.2014.7021033 (IEEE, 2014).
Junnila, S., Akhbardeh, A. & Värri, A. An electromechanical film sensor based wireless ballistocardiographic chair: Implementation and performance. J. Signal Process. Syst. 57, 305–320, https://doi.org/10.1007/s11265-008-0307-2 (2009).
Alametsä, J., Värri, A., Viik, J., Hyttinen, J. & Palomäki, A. Ballistocardiogaphic studies with acceleration and electromechanical film sensors. Med. engineering & physics 31, 1154–1165, https://doi.org/10.1016/j.medengphy.2009.07.020 (2009).
Alametsä, J., Palomäki, A. & Viik, J. Short and longer term repeatability of ballistocardiography in a sitting position with emfi sensor. Med. & biological engineering & computing 49, 881–889, https://doi.org/10.1007/s11517-011-0746-y (2011).
Hyun, E., Noh, S., Yoon, C. & Kim, H. C. Patch type integrated sensor system for measuring electrical and mechanical cardiac activities. In 2014 IEEE Sensors Applications Symposium (SAS), 94–96, https://doi.org/10.1109/SAS.2014.6798924 (IEEE, 2014).
Martinek, R. et al. Ballistocardiography signal processing by wavelet transform. In 2018 International Conference on Intelligent Systems (IS), 45–49, https://doi.org/10.1109/IS.2018.8710478 (IEEE, 2018).
Pinheiro, E., Postolache, O. & Girão, P. Assessment of empirical mode decomposition implementation in cardiovascular signals. Proc. 17th IMEKO TC 4 (2010).
Pinheiro, E., Postolache, O. & Girão, P. Empirical mode decomposition and principal component analysis implementation in processing non-invasive cardiovascular signals. Measurement 45, 175–181, https://doi.org/10.1016/j.measurement.2011.03.022 (2012).
Enayati, M., Farahani, N. Z. & Skubic, M. Machine learning approach for motion artifact detection in ballistocardiogram signals. In Proceedings of the 14th EAI International Conference on Pervasive Computing Technologies for Healthcare, 406–410,https://doi.org/10.1145/3421937.3421970 (2020).
Perez-Macias, J. M., Värri, A., Himanen, S.-L., Tenhunen, M. & Viik, J. Heart pulse demodulation from emfit mattress sensor using spectral and source separation techniques. In 2022 Computing in Cardiology (CinC), vol. 498, 1-4, https://doi.org/10.1145/3421937.3421970 (IEEE, 2022).
Winokur, E. S. Single-site, noninvasive, blood pressure measurements at the ear using ballistocardiogram (BCG), and photoplethysmogram (PPG), and a low-power, reflectance-mode PPG SoC. Ph.D. thesis, Massachusetts Institute of Technology, https://doi.org/10.22489/CinC.2022.304 (2014).
Kim, C.-S., Carek, A. M., Mukkamala, R., Inan, O. T. & Hahn, J.-O. Ballistocardiogram as proximal timing reference for pulse transit time measurement: Potential for cuffless blood pressure monitoring. IEEE Transactions on Biomed. Eng. 62, 2657–2664 (2015).
Yousefian, P. et al. Pulse transit time-pulse wave analysis fusion based on wearable wrist ballistocardiogram for cuff-less blood pressure trend tracking. IEEE Access 8, 138077–138087, https://doi.org/10.1109/TBME.2015.2440291 (2020).
Yaghoobi, N., Narimani, M. & Park, E. J. Integration of ballistocardiogram with ppg and ecg using a cnn-lstm model for cuff-less blood pressure estimation. In 2024 IEEE SENSORS, 1–4, https://doi.org/10.1109/SENSORS60989.2024.10785134 (IEEE, 2024).
Inan, O., Etemadi, M., Paloma, A., Giovangrandi, L. & Kovacs, G. Non-invasive cardiac output trending during exercise recovery on a bathroom-scale-based ballistocardiograph. Physiol. measurement 30, 261, https://doi.org/10.1088/0967-3334/30/3/003 (2009).
Quesada, O. et al. A pilot study of a modified bathroom scale to monitor cardiovascular hemodynamic in pregnancy. J. Am. Coll. Cardiol. 67, 1455–1455, https://doi.org/10.1088/0967-3334/30/3/003 (2016).
Foulger, J. H., Smith, P. E. Jr & Fleming, A. J. Cardiac vibrational intensity and cardiac output. Am. Hear. J. 35, 953–961, https://doi.org/10.1016/0002-8703(48)90593-6 (1948).
Kurumaddali, B. et al. Cardiac output measurement using ballistocardiogram. In The 15th International Conference on Biomedical Engineering: ICBME 2013, 4th to 7th December 2013, Singapore, 861–864 (Springer, 2014).
Yazdi, D. et al. Novel cardiac scale with ballistocardiography, electrocardiography, and impedance plethysmography sensors compared to the direct fick method in measuring stroke volume and cardiac output. In European Journal of Heart Failure, vol. 23, 299–299 (WILEY 111 RIVER ST, HOBOKEN 07030-5774, NJ USA, 2021).
Howell, J. D. Coronary Heart Disease and Heart Attacks, 1912–2010. Med. Hist. 55, 307–312, https://doi.org/10.1017/S0025727300005317 (2011).
Feng, J. et al. Non-invasive monitoring of cardiac function through Ballistocardiogram: an algorithm integrating short-time Fourier transform and ensemble empirical mode decomposition. Front. Physiol. 14, 1201722, https://doi.org/10.3389/fphys.2023.1201722 (2023).
Su, Q. et al. Multi-scale attention convolutional neural network for noncontact atrial fibrillation detection using BCG. Biomedical Signal Processing and Control 92, 106041, https://doi.org/10.1016/j.bspc.2024.106041 (2024).
Jiang, F., Xu, B., Zhu, Z. & Zhang, B. Topological Data Analysis Approach to Extract the Persistent Homology Features of Ballistocardiogram Signal in Unobstructive Atrial Fibrillation Detection. IEEE Sensors J. 22, 6920–6930, https://doi.org/10.1109/JSEN.2022.3153647 (2022).
Henry, B. et al. Cuffless Blood Pressure in clinical practice: challenges, opportunities and current limits. Blood Pressure 33, 2304190, https://doi.org/10.1080/08037051.2024.2304190 (2024).
Si, S. et al. Unconstrained blood pressure monitoring based on a neural network–assisted multistage pressure textile sensor. Nano Energy 115, 108730, https://doi.org/10.1016/0002-8703(48)90593-6 (2023).
Fudim, M. et al. Use of a Cardiac Scale to Predict Heart Failure Events: Design of SCALE-HF 1. Circ: Heart Failure 16, https://doi.org/10.1161/CIRCHEARTFAILURE.122.010012 (2023).
Al Younis, S. M., Hadjileontiadis, L. J., Stefanini, C. & Khandoker, A. H. Non-invasive technologies for heart failure, systolic and diastolic dysfunction modeling: a scoping review. Front. Bioeng. Biotechnol. 11, 1261022, https://doi.org/10.3389/fbioe.2023.1261022 (2023).
Xu, A., Chen, E. & Chen, J. A Low-Cost Non-Invasive Continuous Cardiovascular Monitoring System Based on Oil-Filled Rubber Tubing. IEEE Sensors J. 22, 4644–4650, https://doi.org/10.1109/JSEN.2022.3144961 (2022).
Soames, R. & Atha, J. Three-dimensional ballistocardiographic responses to changes of posture. Clin. Phys. Physiol. Meas. 3, 169 (1982).
Eddleman, E. Jr, Yoe, R. H., Tucker, W. T., Knowles, J. L. & Willis, K. The dynamics of ventricular contraction and relaxation in patients with mitral stenosis as studied by the kinetocardiogram and ballistocardiogram. Circulation 11, 774–783 (1955).
Carlson, C. et al. Bed-based ballistocardiography: Dataset and ability to track cardiovascular parameters. Sensors 21, 156 (2020).
Li, Y.-X. et al. A ballistocardiogram dataset with reference sensor signals in long-term natural sleep environments. Sci. Data 11, 1091 (2024).
Zhao, T. et al. Noncontact monitoring of heart rate variability using a fiber optic sensor. IEEE Internet Things J. 10, 14988–14994, https://doi.org/10.1109/JIOT.2023.3262634 (2023).
Zhan, J. et al. Non-contact assessment of cardiac physiology using fo-mvss-based ballistocardiography: a promising approach for heart failure evaluation. Sci. Reports 14, 3269, https://doi.org/10.1038/s41598-024-53464-8 (2024).
Karthikeyan, P., Murugappan, M. & Yaacob, S. Ecg signal denoising using wavelet thresholding techniques in human stress assessment. Int. J. on Electr. Eng. Informatics 4, 306 (2012).
Zhan, J. et al. A multi-pathology ballistocardiogram dataset for cardiac function monitoring and arrhythmia assessment. Figshare https://doi.org/10.6084/m9.figshare.28416896 (2025).
Acknowledgements
The authors would like to thank all participants in this study. This study has been supported by the National Natural Science Foundation of China (61735013 and 82070425).
Author information
Authors and Affiliations
Contributions
Zhengying Li, Jing Zhan, and Xiaoyan Wu designed the experimental protocol. Jing Zhan, Xiaoyan Wu, and Chao Zhang were responsible for data collection and the compilation and organization of the dataset. Tao Zhao and Kewei Chen were in charge of implementing sensors and hardware circuits. Jing Zhan and Zhengying Li wrote and reviewed the manuscript. Jing Zhan processed and analyzed the data. Zhengying Li defined the overall research direction and provided funding support. Zhibing Lu supervised the entire study.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhan, J., Li, Z., Wu, X. et al. A Multi-Pathology Ballistocardiogram Dataset for Cardiac Function Monitoring and Arrhythmia Assessment. Sci Data 12, 963 (2025). https://doi.org/10.1038/s41597-025-05287-z
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41597-025-05287-z








