Advances in digital health exclude Hispanic/Latinx populations with limited English language proficiency, despite their high cardiovascular risk and smartphone usage. Large language models (LLMs) offer promising English-to-Spanish translational solutions, with comparable accuracy to professional medical translators. In this work, we present data and propose a hybrid workflow that combines automated LLM translation with professional review to reduce costs and improve Hispanic/Latinx inclusion in cardiovascular digital health research.
The digital transformation and language barriers
Cardiovascular medicine is undergoing a digital transformation, with increasing reliance on technologies such as telehealth, home-based cardiac rehabilitation, wearable devices for remote monitoring, and smartphone applications to deliver patient care1. However, the evidence base supporting these digital innovations is limited: most trials and studies are conducted exclusively in English2.
This language barrier has significant implications for healthcare delivery and outcomes in the diverse population of the United States. As of 2020, over 67 million United States residents speak a language other than English, with 62% identifying Spanish as their primary language3. Within this Spanish-speaking population, more than 15.9 million individuals have limited English language proficiency3, making them dependent on Spanish translation for daily life and medical care4. As a result, emergency visit times are frequently prolonged and more frequent, post-discharge care is suboptimal, and there are higher readmission rates for individuals with limited English language proficiency5. Unfortunately, nearly one-third of healthcare systems fail to provide translation services or linguistically tailored health information6. Even when such services exist, professional translators are often unable to meet high volume and rapid turnarounds common in clinical practice7. Moreover, Hispanic/Latinx patients with limited English language proficiency often report dissatisfaction with both care quality and interpretation services received8.
Cardiovascular health differences and digital engagement in Hispanic/Latinx communities
These language barriers are particularly concerning given the cardiovascular health differences affecting Hispanic/Latinx communities. This population experiences higher rates of cardiovascular risk factors9, with recent data showing worsening in population-level prevalence and incidence of hypertension and diabetes10. Paradoxically, this is also a population that could greatly benefit from digital health innovations: over 91% of Hispanic/Latinx adults in the United States own smartphones, and compared to other racial/ethnic groups, they are more likely to use these devices to access health information11.
Despite this technological engagement, Hispanic/Latinx individuals with limited English language proficiency remain largely excluded from digital health research2. A recent analysis of digital health trials registered on ClinicalTrials.gov found that 51% explicitly required English language proficiency for participation, which is likely an underestimate2. This systematic exclusion may worsen the digital divide and prevent populations who might benefit the most from cardiovascular technological innovations from participating in the research that validates these tools and informs clinical decision-making12.
Large language models as translation tools
Large language models (LLMs) offer a promising solution to improve Hispanic/Latinx representation in digital health trials. While machine translation tools like Google Translate have existed for years, their medical applications have been limited by accuracy and usability/readability13. In contrast, newer LLMs such as ChatGPT, GEMINI, and DeepSeek—trained on vastly larger datasets and able to efficiently batch process large volumes of materials14—are positioned as potentially superior tools for English-to-Spanish translation in future digital health trials.
Emerging evidence supports the efficacy of LLMs in healthcare translation. Studies have demonstrated that LLM-translated materials (from English to Spanish) perform comparably to professional medical translators in medical accuracy and Spanish fluency for discharge instructions15 and post-operative care guidelines16. Additionally, LLMs have successfully translated kidney transplantation information from English to Spanish, achieving high scores for both medical accuracy and cultural sensitivity appropriate for Hispanic/Latinx populations17. These data suggest that LLMs could serve as valuable supplements to professional medical translators for health-related information, while also offering the unique ability of LLMs to provide conversational, reading-level appropriate, and culturally resonant translations18.
Limitations of large language model-mediated translation
Some limitations of LLMs should be considered. First, the risk of inaccuracy and “hallucination” (or the generation of false or misleading information) remains a valid concern with LLM use14. This risk can be substantially mitigated by restricting LLM use to direct English to Spanish translation tasks rather than content generation, leveraging their linguistic capabilities while avoiding their generative uncertainties. With this restriction, studies focusing specifically on health-related English to Spanish translations have highlighted the high accuracy rates (ranging in accuracy of translation from 4.70 to 4.94 out of 515,17 and scoring 100% for accuracy in 6/7 procedure-related discharge summaries16). Second, while translations may be technically accurate, they might lack clinical relevance or cultural appropriateness. However, recent iterations of LLMs pre-trained on a larger corpus of non-English language data appear to have produced more sophisticated LLMs that are linguistically adept and more in tune with cultural and contextual aspects of Spanish language17. Third, gender bias in LLMs is an ongoing challenge19, with analyses finding that females were frequently described with body-related adjectives and males with behavior-related descriptors20. Some investigators have addressed these concerns through careful prompt engineering17, though this remains an area requiring continued attention. Fourth, current LLMs lack the nuance to accommodate regional Spanish dialects (e.g., Argentinian versus Mexican Spanish), potentially limiting their effectiveness across diverse Hispanic/Latinx communities. Finally, the absence of regulatory oversight poses a practical barrier. Unlike approved medical devices, LLMs operate without formal healthcare regulation13. This regulatory gap creates a disconnect: while patients report greater satisfaction with LLM translations, clinicians remain hesitant to rely on them in health-related settings13.
A proposed hybrid workflow
Balancing the advantages and limitations of current LLM technology, we propose a hybrid workflow to increase Hispanic/Latinx participation in digital health trials while maintaining translation quality and cultural sensitivity (see Fig. 1). Our proposed workflow operates in two phases: first, LLMs would automatically translate all text-based trial materials—including consent forms, procedural instructions, and other patient information—from English to Spanish. Second, professional medical translators would review these automated translations for accuracy, cultural nuance, and clinical appropriateness. This step of human oversight is essential, as LLMs are not approved or regulated by a regulatory body and hence require further supervision at this stage and time. This hybrid approach strategically leverages the strength of LLMs (rapid, high-accuracy initial translations), while human experts ensure quality control. The workflow addresses key limitations of standalone LLM translation—potential inaccuracies, cultural blind spots, and regulatory concerns21—through professional oversight, while dramatically reducing the time and cost barriers that currently prevent many digital health trials from including Spanish-language arms. The efficiency gains would allow professional translators to focus on reviewing and refining existing translations (rather than translating materials from scratch), significantly reducing both project timelines and costs. It should be noted that this hybrid workflow is dependent on the availability of trained medical interpreters, who may not be available in all healthcare-related settings6. Moreover, given regional variations in Spanish (i.e., differences in vocabulary, idioms, etc.) and current limitations in LLMs, this workflow will be reliant on trained medical interpreters familiar with these language nuances to ensure that there is cultural alignment with the diverse population of Spanish-preferred-speaking individuals. However, with potential remote/fully-digital implementation, this approach could make Spanish language trial participation feasible for a broader range of research studies and potentially expand access to cutting-edge cardiovascular digital health innovations for Hispanic/Latinx communities who have been largely excluded from such trials and studies to date2.
This figure highlights the proposed hybrid workflow, leveraging both large language models and human expertise, which we believe will increase participation of non-English language fluent individuals in digital health trials. We also highlight emerging capabilities of large language models for future implementation in digital health trials. LLM – large language model. Created in BioRender. Kim, D. (2025) https://BioRender.com/ybmpmh3.
Future applications of large language models in digital health trials
Beyond their translation abilities, LLMs offer several emerging functionalities that could enhance digital health trials for Hispanic/Latinx (and other non-English language fluent) populations (see Fig. 1). First, LLMs can potentially be used to create audio content via text-to-speech (e.g., verbally conveying digital trial materials or information)22. This could make digital trial materials accessible to participants with low literacy or limited written language proficiency. Additionally, the generative properties of LLMs could enable the creation of visual aids and simplified representations of complex documents, such as informed consent forms, potentially improving patient comprehension across educational backgrounds. However, such use of LLMs would have to be closely monitored and regulated given the potential for misuse and hallucination. The reverse capability of LLMs (i.e., speech-to-text)23 could also be used to allow for real-time patient-reported outcome collection in patient’s native languages, preserving important linguistic nuance that may be lost in traditional translation processes. This capability could enable more authentic and comprehensive data collection from diverse populations. Finally, looking toward more distant future applications (and requiring much more generation of evidence at the foundational level), agentic AI systems could function as virtual health navigators24, providing motivational support in both a culturally and linguistically sensitive fashion, potentially improving both the enrollment and retention of patients in digital health trials.
Conclusions
In summary, the digital transformation of cardiovascular medicine presents both an unprecedented opportunity and critical challenge for addressing the digital divide. While technological innovations hold immense promise for improving cardiovascular care, their benefits risk being unequally distributed if Hispanic/Latinx populations with limited English language proficiency continue to be excluded from research that validates these tools. Our proposed LLM-assisted workflow offers a practical pathway forward, leveraging technological advances to overcome traditional barriers to inclusive research while maintaining the quality and cultural sensitivity essential for meaningful participation. Moreover, emerging applications of LLMs have potential to further improve recruitment and retention in future digital health trials. Additionally, while this Comment has focused on the Hispanic/Latinx population with limited English language proficiency living within the United States, this hybrid workflow can be applied to nearly any racial or ethnic minority group via thorough pre-training and/or fine-tuning of an LLM in the intricacies of languages and dialects. By implementing such approaches while being mindful of weaknesses of LLMs, we can ensure that the digital revolution serves to bridge, rather than widen, existing differences in health-related outcomes for all communities.
Data availability
No datasets were generated or analysed during the current study.
References
Kim, D. S., Eltahir, A. A., Ngo, S. & Rodriguez, F. Bridging the gap: how accounting for social determinants of health can improve digital health equity in cardiovascular medicine. Curr. Atheroscler. Rep. 27, 9 (2024).
Schütz, N. et al. Speaking the language of inclusion examining English language requirements in cardiovascular digital health trials. JACC Adv. 4, 102123 (2025).
Census Bureau Releases 2020 Demographic Analysis Estimates https://www.census.gov/newsroom/press-releases/2020/2020-demographic-analysis-estimates.html (2025).
Flower, K. B. et al. Spanish-speaking parents’ experiences accessing academic medical center care: barriers, facilitators and technology use. Acad. Pediatr. 21, 793–801 (2021).
Diamond, L., Izquierdo, K., Canfield, D., Matsoukas, K. & Gany, F. A systematic review of the impact of patient–physician non-English language concordance on quality of care and outcomes. J. Gen. Intern. Med. 34, 1591–1606 (2019).
Schiaffino, M. K., Nara, A. & Mao, L. Language services in hospitals vary by ownership and location. Heal Aff. 35, 1399–1403 (2017).
Khoong, E. C. & Fernandez, A. Addressing gaps in interpreter use: time for implementation science informed multi-level interventions. J. Gen. Intern. Med. 36, 3532–3536 (2021).
Rodriguez, J. A. et al. Telehealth experience among patients with limited english proficiency. JAMA Netw. Open 7, e2410691 (2024).
Gomez, S., Blumer, V. & Rodriguez, F. Unique cardiovascular disease risk factors in hispanic individuals. Curr. Cardiovasc. Risk Rep. 16, 53–61 (2022).
Odlum, M. et al. Trends in poor health indicators among black and hispanic middle-aged and older adults in the United States, 1999-2018. JAMA Netw. Open 3, e2025134 (2020).
Center P. R. Demographics of mobile device ownership and adoption in the United States. https://www.pewresearch.org/internet/fact-sheet/mobile/
Sieck, C. J. et al. Digital inclusion as a social determinant of health. npj Digit. Med. 4, 52 (2021).
Genovese, A. et al. Artificial intelligence in clinical settings: a systematic review of its role in language translation and interpretation. Ann. Transl. Med 12, 117–117 (2024).
Thirunavukarasu, A. J. et al. Large language models in medicine. Nat. Med. 29, 1930–1940 (2023).
Brewster, R. C. L. et al. Performance of ChatGPT and Google translate for pediatric discharge instruction translation. Pediatrics 154, e2023065573 (2024).
Ayoub, N. F., Lee, Y.-J., Grimm, D. & Balakrishnan, K. Comparison between ChatGPT and Google search as sources of postoperative patient instructions. JAMA Otolaryngol. Head Neck Surg. 149, 556–558 (2023).
Valencia, O. A. G. et al. AI-driven translations for kidney transplant equity in Hispanic populations. Sci. Rep. 14, 8511 (2024).
Mantena, S. et al. Fine-tuning LLMs in behavioral psychology for scalable health coaching. npj Cardiovasc. Health 2, 48 (2025).
Omar, M. et al. Evaluating and addressing demographic disparities in medical large language models: a systematic review. Int. J. Equity Health 24, 57 (2025).
Garrido-Muñoz, I., Martínez-Santiago, F. & Montejo-Ráez, A. MarIA and BETO are sexist: evaluating gender bias in large language models for Spanish. Lang. Resour. Eval. 58, 1387–1417 (2024).
Pérez-Guerrero, E. et al. Performance of large language model-generated spanish discharge material. J. Gen. Intern. Med. https://doi.org/10.1007/s11606-025-09758-2 (2025). 1–3.
Spechbach, H. et al. A speech-enabled fixed-phrase translator for emergency settings: crossover study. JMIR Méd. Inf. 7, e13167 (2019).
Gállego, G. I., Pareras, O., Cortada Garcia, M., Takanori, L. & Hernando, J. Speech-to-Text Translation with Phoneme-Augmented CoT: Enhancing Cross-Lingual Transfer in Low-Resource Scenarios. In Proc. Interspeech 2025, 31–35, https://doi.org/10.21437/Interspeech.2025-1954 (2025).
Karunanayake, N. Next-generation agentic AI for transforming healthcare. Inf. Health 2, 73–83 (2025).
Acknowledgements
The funders had no role in the interpretation of data or publication. D.S.K. is supported by the Wu-Tsai Human Performance Alliance as a Clinician-Scientist Fellow, the Stanford Center for Digital Health as a Digital Health Scholar, the Pilot Grant from the Stanford Center for Digital Health, NIH 1L30HL170306 and 9L30DK144879, the Robert A. Winn Excellence in Clinical Trials Career Development Award, the American Heart Association (AHA) Career Development Award (AHA 25CDA1436622), and the American Diabetes Association (ADA) Pathway to Stop Diabetes Initiator Award (7-25-INI-11). N.S. is supported by the Wu Tsai Human Performance Alliance Postdoctoral Fellowship and the Swiss National Science Foundation under the Postdoc. Mobility Fellowship 210803. F.R. is supported by grants from the NIH National Heart, Lung, and Blood Institute (R01HL168188; R01HL167974, R01HL169345), the American Heart Association/Harold Amos Medical Faculty Development Program, and the Doris Duke Foundation (Grant #2022051).
Author information
Authors and Affiliations
Contributions
D.S.K. and F.R. wrote the main manuscript text. DSK and FR revised the manuscript. A.E., S.M., R.D., and C.M. provided literature review. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
D.S.K. reports grant support from Amgen and the Bristol Myers Squibb Foundation (via the Robert A. Winn Excellence in Clinical Trials Career Development Award), outside the submitted work. F.R. reports equity from Carta Healthcare and HealthPals, and consulting fees from HealthPals, Novartis, Novo Nordisk, Esperion Therapeutics, Movano Health, Kento Health, Inclusive Health, Edwards, Arrowhead Pharmaceuticals, HeartFlow, and iRhythm outside the submitted work. The remainder of the authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Kim, D.S., Eltahir, A., Mantena, S. et al. Leveraging large language models to bridge the digital divide in cardiovascular health research. npj Cardiovasc Health 2, 60 (2025). https://doi.org/10.1038/s44325-025-00095-1
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s44325-025-00095-1
