Abstract
Palliative care is known to improve quality of life in advanced cancer. Natural language processing offers insights to how documentation around palliative care in relation to metastatic cancer has changed. We analyzed inpatient clinical notes using unsupervised language models that learn how words related to metastatic cancer (e.g. “mets”, “metastases”) and palliative care (e.g. “palliative care”, “pal care”) appear relationally and change over time. We included any note from adults hospitalized at the University of California, San Francisco system. The primary outcome was how similarly terms related to metastatic cancer and palliative care appeared in notes using a mathematical approach (cosine similarity). We used word2vec to model language numerically as vectors. Relational data between vectors was captured using cosine similarity. We performed linear regression to identify changes in these relationships of terms over time. As a sensitivity analysis, we performed the same analysis per year restricted only to patients with an ICD-9/10 diagnosis code for metastatic cancer. Metastatic cancer and palliative care terms appeared in similar contexts in clinical notes each year, suggesting a close relationship in documentation. However, over time, this relationship weakened, with these terms becoming less commonly used together as measured by cosine similarities. We found similar trends when we retrained models just on patients with a diagnosis code for metastatic cancer. Text in clinical notes offers unique insights into how medical providers document palliative care in patients with advanced malignancies and how these documentation practices evolve over time.
Similar content being viewed by others
Introduction
Specialty palliative care (PC) for patients with advanced cancer leads to improved quality of life and reduced symptom intensity1. Nationally, metastatic cancer is the most common diagnosis of patients who receive specialty PC2. However, despite guidance emphasizing early PC integration in metastatic cancer3, specialty PC remains underutilized4. This may reflect changing makeup of PC referrals in hospitalized settings5. If specialty PC is a constrained resource, then increasing referrals within one specialty may come at the expense of another, like advanced cancer. PC is also delivered in different ways in different contexts. Given, the resource constraints associated with specialty PC, much PC is provided directly by the primary clinical services or teams (so-called “primary PC”)6. Evolving practices and cultures of how PC is delivered are dictated by resource constraints, patient needs, hospital and clinical cultures and clinician comfort around these interventions. Thus, identifying quality PC processes is challenging.
Studies to evaluate PC processes are limited given the poor sensitivity of administrative data for ascertaining specialty PC7. Natural language processing (NLP) allows notes to be leveraged for novel questions about how clinicians document PC. Incorporation of note text improves PC ascertainment when structured data is lacking, allowing for improved evaluation of PC quality metrics8. Nonetheless, documentation of PC and quality measures around PC needs are often incomplete9,10 and PC ascertainment using NLP requires additional approaches beyond presence or absence of “palliative care” being mentioned in notes. Notes require certain conscious or unconscious decisions by note authors during the writing process. To the extent that language, and thus medical notes, represents thought in accordance with cognitive linguistics, the inclusion or exclusion of certain language in notes could represent whether concepts are being considered by clinicians. In patients who may stand to benefit specialty PC, like those with metastatic cancers, documentation and methods that identify language similar to PC could be used to better address gaps in care and provide insights into how PC as a concept related to metastatic cancer may be evolving.
NLP converts unstructured text data into numerical data that can be inputted into statistical models. Word embeddings represent the generation of numerical representations of words relative to other words in multidimensional space, thus maintaining important contextual relationships11. While these concepts may be foreign to many clinicians, they represent foundational principles in large language models (like ChatGPT) that are increasingly being used in healthcare settings. If every word in a document (or note) is converted into a unique number, then this would create a substantially large amount of features that would overwhelm many statistical modelling tasks. Word embeddings help retain contextual relationships between words and minimize the number of features before inputting them into additional models for targeted statistical tasks (like clinical risk prediction). Relational distances of these contextual numerical representations of words (word embeddings) represent ways of identifying relationships of words in clinical notes, such as how commonly they appear together or how contextually related they may be. When applied over time, embeddings also allow for the study of how language changes and evolves. For instance, word embeddings trained on New York Times articles by decade capture the temporal changes in attitudes toward women and ethnic minorities12. Temporal interactions of words like “palliative care” with other important concepts (e.g. metastatic disease) within notes may be captured through these contextual mappings.
We applied previously described methods11 from our group, using temporal word embeddings, to study semantic change over time. We traced the temporal development of concepts within notes to generate insights into the evolution of clinical documentation related to PC through language analysis. We trained neural network models on inpatient notes, one year at a time, and queried terms related to metastatic cancer and PC. We hypothesized that terms related to metastatic cancer were closely related to PC terms and this relationship increased or strengthened over time. This study was not a mixed methods study, whereby clinician authors were interviewed about their use of language and thus, is limited to documentation practices only. Nuances within documentation that represent lexical relationships beyond the simple presence or absence of a word, can inform future interventions and studies around clinical documentation practices and potentially behaviors, such as who best to target for PC interventions or alerts.
Methods
We conducted a retrospective cohort study using de-identified notes of adults ≥ 18 years old at University of California, San Francisco (UCSF)13, from 2013–2020 (includes COVID-19). The primary goal of this study was to identify lexical changes within documentation at UCSF beyond presence or absence of particular terms. This study was approved by UCSF’s institutional review board (IRB 20-30590). All research was carried out in accordance with UCSF guidelines and regulations and all protocols were approved by the UCSF IRB above. Due to the retrospective nature of this study and that it was performed on de-identified data, informed consent was waived by the UCSF IRB. See Supplement for details on UCSF’s de-identified notes. We include EQUATOR guidelines to facilitate transparent reporting (STROBE as a separate file).
Some contextual information of the EHR, notes and PC services during the study period merit further discussion. Epic was formally adopted at UCSF (in its current form) in 2012–2013. Our group has noted that 2012 EHR data during the rollout of Epic can be challenging to interpret given missingness and phasing of EHR integrations and thus was not included in this study period. Inpatient PC services were available throughout this study period at UCSF but makeup of the team (e.g. inclusion of a social worker vs. chaplain) did change during the study period. Through Epic’s MyChart, UCSF notes were largely available to patients during the study period. Specific requirements around speed by which patient medical records are made available to patients online were mandated by the 21st Century CURES Act which was implemented at UCSF formally in March, 2021 which occurred after our study period. Nonetheless, many of the notes used for this study were available to patients at least within days of their writing. There were no changes to hospitals within UCSF included or excluded during the study period. Finally, patients with COVID-19 presented to UCSF as early as 2/2020 and shelter-in-place orders began the following month with important impacts on specialty PC delivery in our study population14.
We adopted previously described methods11 from our group to identify relationships of words or groups of words across clinical documents. This was assessed using a concept called “cosine similarities” which represents the “distance” of words (metastatic cancer terms and palliative care terms), not within an individual document but across all documents. Specifically, this represents the distance between the vector representations of words (word embeddings) that capture contextual elements of the words themselves. While this concept is difficult to capture compared to the presence or absence of words in general, they represent “learned” relationships by neural network models using context across a large set of documents. Stated another way, they can also be considered how likely or not one word may be replaceable by another based on the context of that word. Frequency of words present, how close they exist within notes and whether the presence of one indicates the presence or absence of another all impact the relational distances learned by neural networks.
To accomplish this, we trained word2vec models using a continuous bag of words architecture15, on processed words/phrases for each year. This yielded a separate NLP model for each year, allowing us to compare the same word relationships per year. See Supplement for further details on note preprocessing and word2vec models.
To develop a metastatic and PC term lexicon, we initially chose “metastatic”, “mets”, “palliative care” and “palliative”. We identified the top 50 most contextually similar words (highest cosine similarities—see below) for each term across w2v models during the study period. This identified contextually similar terms which were further refined by the authors (JC, HYY). Each added term was cross-referenced again from w2v models to further expand the lexicon. Our final lexicon is shown in Table 1. We included terms present ≥ 5 times per year (eTable 1).
The primary outcome was contextual similarity or the geometric distance between the group of metastatic terms and individual target PC terms. We calculated this using cosine similarity, a commonly used metric for word embeddings, and represents the co-occurrence of words in space16. Outputs range from − 1 (contextually dissimilar) to + 1 (contextually similar). For instance, “SCC” and “squamous cell carcinoma” would be highly contextually similar (e.g. cosine similarity close to + 1), while words like “SCC” and an unrelated object like a “syringe” would be highly contextually dissimilar (e.g. cosine similarity close to − 1). We calculated point estimates for average cosine similarities using means and precision-weighted averages11. We used bootstrapping with resampling to evaluate uncertainty of point estimates17. Linear regression was used to determine whether base-target relationships changed over the study period.
Since running models on all inpatients would not distinguish between those with and without an actual metastatic cancer diagnosis, we also retrained models on a subset of patients with a diagnosis of metastatic cancer. This served as a sensitivity analysis and we isolated the subsets of patients with metastatic cancer International Classification of Diseases (ICD) codes-9 and -10 administrative codes. We adopted ICD-9 and ICD-10 codes for metastatic cancer from the literature. However, because our study spanned the ICD-9 to ICD-10 transition (October 2015), we used algorithms to map ICD-9 codes to ICD-10 codes using Generalized Equivalence Mapping crosswalks published by the Centers for Medicare and Medicaid Services18. Only codes that represented disseminated or secondary or metastatic cancers were included in the final list. See Supplement for further details on the final list of included ICD diagnosis codes (eTable 2). All w2v and linear regression models were repeated across years for this sample. All w2v models and primary and sensitivity analyses were performed using Python (version 3.8).
Results
Across 28,600,649 inpatient adult notes from any discipline (e.g. nurses, physicians, etc.), we identified PC terms were utilized across 190,778 patients and 971,085 notes. Counts of metastatic and PC terms used annually are presented in Supplement (eTable 3). In general, counts for each term increased over time. Cosine similarities for all terms were positive (with 95% confidence intervals above zero) across each year meaning terms were more likely than chance to be co-associated with one another by note authors. The terms “metastatic” and “palliation” exhibited the highest similarity scores, with cosine similarity ranging from 0.13–0.19.
Cosine similarities for metastatic terms and unspecified PC terms showed a decreasing trend over time, while those for the metastatic group and “pall” remained relatively stable (Fig. 1a–d). Only metastatic terms and “palliation” showed a statistically significant decrease in cosine similarity over time. Notably, for the primary and sensitivity analyses, metastatic terms and “pall” had the smallest cosine similarities for most of the study period (these rose from 2017 to 2020), suggesting the weakest relationship among these terms. Individual base-target relationships for one example metastatic base term, are shown eFigure 2.
(a–d) Similarity scores of Metastatic Group and Different Palliative Terms from 2013 to 2020. Similarity scores are measured by cosine similarity (cosine angle of the mean point estimate of “metastatic” word vectors and individual palliative care target vectors). Blue lines represent the cosine similarity per year. 95% Confidence intervals for each cosine similarity value using bootstrapping with resampling, is shown vertically (and is very small for each point). Linear regression trendlines were performed for each study period. Point-wise estimates were performed to allow for the grouping of all possible “metastatic” terms. Cosine similarity ranges from − 1 (contextually dissimilar) to + 1 (contextually similar). (a Palliative; b Palliate; c Palliation; d Pall).
Patient, encounter and note counts for the patients with an ICD-code for metastatic or disseminated cancer are shown in eTable 4. The most common diagnoses included for ICD-9 were for “malignant neoplasms of ill-defined sites in the thorax” (439,824 notes across 5,833 patients), “secondary neoplasm of bone and bone marrow” (155,121 notes across 2472 patients) and “secondary malignant neoplasm of brain and spinal cord” (97,096 notes across 1258 patients). For ICD-10, the most common diagnoses included were for “secondary malignant neoplasm of other specified sites” (220,902 notes across 2,536 patients), “secondary malignant neoplasm of liver and intrahepatic bile duct” (171,148 notes across 2124 patients) and “secondary malignant neoplasm of unspecified site” (161,900 notes across 1782 patients). Cosine similarities for metastatic terms and unspecified PC terms when restricted to patients with a diagnosis code for metastatic cancer globally showed a decreasing trend over time, except for those for the metastatic group and “pall” which increased (Fig. 2a–d) but none were statistically significant (p > 0.2). Unlike the primary analyses, cosine similarities in the metastatic cancer subgroup increased in 2020 for “palliative” and “palliate” and slightly for “palliation”. Given the smaller corpus size (fewer notes in the metastatic cancer subgroup), confidence intervals are wider than the primary analysis and variation per year is larger.
(a–d) Similarity scores of Metastatic Group and Different Palliative Terms in the Subgroup of Patients with a Metastatic Cancer Diagnosis Code from 2013 to 2020. These figures represent models only trained on notes from patients with a diagnosis code of metastatic cancer using International Classifications of Diseases codes. Similarity scores are measured by cosine similarity (cosine angle of the mean point estimate of “metastatic” word vectors and individual palliative care target vectors). Blue lines represent the cosine similarity per year. 95% Confidence intervals for each cosine similarity value using bootstrapping with resampling, is shown vertically (and is very small for each point). Linear regression trendlines were performed for each study period. Point-wise estimates were performed to allow for the grouping of all possible “metastatic” terms. Cosine similarity ranges from − 1 (contextually dissimilar) to + 1 (contextually similar). All p-values were > 0.05. (a Palliative; b Palliate; c Palliation; d Pall)
Discussion
We present a novel method to study the evolution of how medical providers document PC utilization in patients with advanced cancer over time. While describing the presence or absence of words for metastatic cancer and palliative care could capture broad frequencies and data about their presence in notes, they do not capture more nuanced relationships such as co-occurrences, distance between these words within notes and commonality of their contexts. To capture these ideas at a higher level (a population- or corpus-level), we leveraged neural network models using NLP. Opposite to our original hypothesis, we observed broad decreases in the relational characteristics of metastatic and PC terms, suggesting a decline in how clinicians are documenting PC generally. However, only metastatic terms and “palliation” had a statistically significant decrease in this relationship. When trained on only notes from patients with a diagnosis for metastatic cancer, the relationships became less linear and more varied but also with larger confidence intervals. Importantly, our conclusions are limited to documentation trends and lexical change and do not necessarily mean that clinicians are thinking or considering PC interventions differently over the study period. A study that explores whether clinicians are truly considering PC requires mixed methods involving interviews with note authors as they are actively thinking about patients, which is out of the scope of this study. Instead, we sought to understand relational elements of metastatic cancer and PC within notes to generate insights into how documentation itself may have evolved.
Our interest in lexical relationships is predicated on an important assumption adopted from the field of cognitive linguistics, whereby language documented by authors could represent the cognitive frameworks of those authors and thus provide insights into author behaviors19. Various groups have described the use of stigmatizing language within medical notes, including pejorative nouns about individuals with substance use disorders20 (i.e. “substance abusers”), stigmatizing adjectives21 and other linguistic approaches like questioning patient credibility21. When text is inputted into language models in an unsupervised way, language models have been shown to transmit human biases22. Our group has also shown that these transmissions also occur when language models are trained from medical texts instead of text from the internet11. Language within notes may reflect many of the conscious and unconscious subjective perceptions of the author but this study should be considered as hypothesis-generating. Future studies are required to better understand if documentation trends reflect actual clinician behaviors and considerations of PC interventions.
We acknowledge that our study results should be limited to linguistic associations and while practice patterns should not be inferred from these results, linguistic phenomena may be affected by local PC utilization and considerations. During our study period, evolving inpatient PC utilization has been reported, including increased specialty PC in patients with advanced non-cancer diagnoses relative to cancer diagnoses5. The decreased linguistic associations (only significant in the primary analysis for “palliation”) represents evolving documentation of how and when PC is mentioned in reference to patients with metastatic disease. Notably, any increased outpatient PC integration or adoption of primary PC delivery (PC delivered by primary medical services instead of specialty PC teams) could impact linguistic relationships if we undercaptured PC mentions in other settings for the same patients23,24. Future studies should pursue even more granular clinical and linguistic data (e.g. coupled to the weekly level) in order to better understand the interactions between note language and practice patterns.
Interestingly, our primary analysis demonstrated that nominalized “palliation” decreased significantly while other parts of speech like the verbalized “palliate” or adjectivized “palliative” did not. While these are closely related etymologically, the nominalized construction may reflect PC as a goal in and of itself (over say, curative therapies)25. While studies comparing perceptions around different variations of PC terms (e.g. palliate vs. palliative vs. palliation) have not been well-explored, stigma when the word “palliative” is used in cancer care exists26. It is possible that nuances between these terms could convey or represent different stigmas, perceptions, considerations or suggestions by authors25. Our observation that the more active constructions (e.g. “palliate”) were stable and the nominalized construction (“palliation”) decreased, could reflect such changes. However, additional research using is required to better understand how individual note authors and readers may be using and perceiving these terms differentially and how they relate to patient care.
Our study period coincides with the growth in cellular- and immunotherapies and global mortality improvements in patients with metastatic disease27. These could lead to changing palliative needs and hospitalization characteristics (e.g. symptoms associated with drug side effects) of such, and in turn alter how inpatient PC involvement was documented. For instance, if patients with metastatic disease were more commonly hospitalized for therapeutic side effects instead of critical illness secondary to their underlying malignancy, then language around PC needs and perceptions around when to involve specialty PC teams may have changed as well. Additionally, they could reflect shifts in culture around how note authors describe and document PC delivery and interventions more globally (e.g. PC as active interventions rather than as a goal unto itself or a subliminal shift to more outpatient PC interventions). Notably, our lexicon includes any mention of PC and did not differentiate primary vs. specialty PC which also could lead to changes in parts of speech and how the PC was actively delivered by primary vs. specialty teams. The variations in relationships between metastatic cancer and PC terms increased when restricted to training only on patients with a metastatic cancer diagnosis, suggesting a more complex relationship, meriting further study. While it is possible that further restricting training samples on particular cancers (e.g. metastatic lung cancer vs. metastatic melanoma) may reveal even more patterns unique to when and how PC is documented and considered for specific malignancies, we also anticipate wider confidence intervals. Note author interviews and qualitative methods are required to better understand how parts of speech and uses of particular PC terms may have been chosen (consciously or not) by individual authors.
Our sensitivity analyses, which retrained w2v models on only patients with a diagnosis of metastatic cancer demonstrated notable differences compared to the larger corpus of all inpatient notes. While trends in models across all patient notes largely decreased in terms of contextual similarity of metastatic cancer and PC, there was more variation and much wider CIs in the metastatic cancer only analysis. Notably, the tail end of the data in 2020 showed stronger relationships between metastatic cancer and PC. While this could be due to variations across a smaller cohort, it is also notable that COVID-19 did alter PC delivery within the UCSF system in the final year of our study period14. It is possible that given COVID-19-related mortality was higher in patients with metastatic cancers28, more patients with metastatic cancer who were also infected with COVID-19 may have had more exposure to PC services or documentation of PC by inpatient teams. While PC resources for other non-COVID-19 diseases (e.g. cancer) during the COVID-19 pandemic changed in important ways29, the ways clinicians documented PC during this time are largely unknown. One study showed increased documentation of early decision-making regarding resuscitation during the COVID pandemic30 but other studies around PC documentation are limited. Further studies are needed to identify how COVID-19 may have impacted documentation of PC and metastatic cancer and/or whether trends around their co-associations may have perpetuated after 2020.
Our linguistic results can help inform future studies in documentation of PC-based concepts and global considerations when using neural networks for modeling language over time. W2v, the model used for this study, learns a single fixed vector representation for each word regardless of context. Newer transformer-based31 models, like newer Generative Pre-trained Transformers (GPT)32, account for more contextual elements within a set of documents allowing for richer context-aware linguistic representations. However, using more static representations (w2v or others, like GloVe33 or fastText34) but trained yearly, could provide more opportunity to evaluate subtler “semantic drift” or changes in meaning of words or concepts longitudinally over time35. Notably, these approaches are somewhat more intuitive, computationally efficient and adaptable compared to transformers. Some have argued that transformers may be less useful when capturing gradual semantic changes over time compared to w2v36 but it remains unclear precisely which time frames would benefit from using one model versus another. Given recent digitization efforts of electronic health records (e.g. Medical Information Mart for Intensive Care [MIMIC]37), there are increasing opportunities to explore changes in concepts and semantics over time across different datasets (with their own unique cultures, geographies and time periods). Our group has explored differences in biases in clinical notes across different temporal and geographic datasets using these approaches11 and evolving considerations around concepts that are not well-defined by explicit terms, like concepts within palliative care, could be particularly notable use cases for further explorations in semantic drift. Evaluating changes in meanings or documentation over time could help describe how behaviors and practice patterns may be evolving. Importantly, when language models are trained on a large multi-year corpus of data and time is not incorporated into the architectural frameworks of the model, then relational changes between terms that do occur over time (e.g. metastatic terms and “palliation”) may be lost or oversimplified depending on the outcome of interest.
A fundamental limitation to our study is that neural networks are difficult to understand, given their non-intuitiveness. However, these techniques help us understand texts’ contextual nature and allow for linguistic analysis beyond counting words. Another important limitation is that we did not interview note authors to understand their reasons for including or excluding PC mentions nor their rationale for why they mentioned PC and metastatic cancer terms in the ways they did. We sought to leverage NLP to capture more nuanced lexical relationships at scale given that qualitative studies around note author language decisions are infeasible. Other limitations include possible incompleteness of our lexicon and being limited to a single hospital system. Our group has previously shown that individual hospital systems have unique cultures around language and lexical relationships are dictated by geographic place and time11. Hence, other note datasets are required from other systems over the same time period to determine whether broader cultural and lexical conclusions can be made. Importantly, copy-and-paste was not addressed which could confound linguistic patterns38. Given that the presence of mentions of metastatic cancer does not differentiate patients that actually do have metastatic cancer, we performed a sensitivity analysis on the subset of patients with an ICD code for this disease. However, ICD codes are known to have their own limitations regarding sensitivity39 and different metastatic cancers have very different prognoses, associated morbidities and palliative needs. Hence, given heterogeneity in our corpus, cohorts and ICD-subsets, our analyses should be interpreted broadly. Our study period included all of 2020 (through the initial COVID-19 pandemic surges) and language around care along with care delivery were undoubtedly affected by the pandemic. Results from 2020 should be taken with some caution as a result. Finally, our study was limited to the inpatient setting and future models on outpatient notes could provide further insight into changing PC practices.
Conclusion
NLP to study changes of language within medical notes represents a novel method to better understand how authors document important interventions, like PC and how subtle changes in documentation could change over time. Unsupervised neural network models provide opportunities to leverage text features within notes to study novel questions otherwise difficult to study with structured data. Future research should determine to what extent what is documented related to PC represents how clinicians perceive, consider and use PC interventions.
Data availability
Where available data is provided within the manuscript or supplementary information files. Due to privacy concerns, UCSF’s source dataset cannot be made publicly available. However, the replication code used in the analysis is available upon request (Julien Cobert; Julien.cobert@ucsf.edu).
References
Temel, J. S. et al. Effects of early integrated palliative care in patients with lung and GI cancer: A randomized clinical trial. J. Clin. Oncol. 35(8), 834–841. https://doi.org/10.1200/JCO.2016.70.5046 (2017).
Meggyesy, A. M. et al. Utilization of palliative care resource remains low, consuming potentially avoidable hospital admissions in stage IV non-small cell lung cancer: A community-based retrospective review. Support Care Cancer Off. J. Multinatl. Assoc. Support Care Cancer 30(12), 10117–10126. https://doi.org/10.1007/s00520-022-07364-0 (2022).
Sanders, J. J. et al. Palliative care for patients with cancer: ASCO guideline update. J. Clin. Oncol. Off. J. Am. Soc. Clin. Oncol. 42(19), 2336–2357. https://doi.org/10.1200/JCO.24.00542 (2024).
Gebel, C. et al. Utilization and quality of palliative care in patients with hematological and solid cancers: A population-based study. J. Cancer Res. Clin. Oncol. 150(4), 191. https://doi.org/10.1007/s00432-024-05721-6 (2024).
Chapman, A. C. et al. Utilization and delivery of specialty palliative care in the ICU: Insights from the palliative care quality network. J. Pain Symptom Manage. 63(6), e611–e619. https://doi.org/10.1016/j.jpainsymman.2022.03.011 (2022).
Ernecoff, N. C. et al. Comparing specialty and primary palliative care interventions: Analysis of a systematic review. J. Palliat. Med. 23(3), 389–396. https://doi.org/10.1089/jpm.2019.0349 (2020).
Hua, M., Li, G., Clancy, C., Morrison, R. S. & Wunsch, H. Validation of the V66.7 code for palliative care consultation in a single academic medical center. J. Palliat. Med. 20(4), 372–377. https://doi.org/10.1089/jpm.2016.0363 (2017).
Lee, R. Y. et al. Identifying goals of care conversations in the electronic health record using natural language processing and machine learning. J. Pain Symptom Manage. 61(1), 136-142.e2. https://doi.org/10.1016/j.jpainsymman.2020.08.024 (2021).
Ernecoff, N. C. et al. How well do documented goals-of-care discussions for patients with stage IV cancer reflect communication best practices?. BMC Palliat. Care 20(1), 41. https://doi.org/10.1186/s12904-021-00733-2 (2021).
Ma, J. E. et al. Do goals of care documentation reflect the conversation?: Evaluating conversation-documentation accuracy. J. Am. Geriatr. Soc. 72(8), 2500–2507. https://doi.org/10.1111/jgs.18913 (2024).
Cobert, J. et al. Measuring implicit bias in ICU notes using word-embedding neural network models. Chest https://doi.org/10.1016/j.chest.2023.12.031 (2024).
Garg, N., Schiebinger, L., Jurafsky, D. & Zou, J. Word embeddings quantify 100 years of gender and ethnic stereotypes. Proc. Natl. Acad. Sci. https://doi.org/10.1073/pnas.1720347115 (2018).
Radhakrishnan, L. et al. A certified de-identification system for all clinical text documents for information extraction at scale. JAMIA Open 6(3), ooad045. https://doi.org/10.1093/jamiaopen/ooad045 (2023).
Schoenherr, L. A. et al. Proactive identification of palliative care needs among patients with COVID-19 in the ICU. J. Pain Symptom Manage. 60(3), e17–e21. https://doi.org/10.1016/j.jpainsymman.2020.06.008 (2020).
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. & Dean, J. Distributed representations of words and phrases and their compositionality. Published online October 16, 2013. Accessed November 17, 2023. http://arxiv.org/abs/1310.4546
Garcia-Rudolph, A., Saurí, J., Cegarra, B. & Bernabeu, G. M. Discovering the context of people with disabilities: Semantic categorization test and environmental factors mapping of word embeddings from reddit. JMIR Med. Inform. 8(11), e17903. https://doi.org/10.2196/17903 (2020).
James, G. et al. (eds) An Introduction to Statistical Learning: With Applications in R (Springer, 2013).
Archer, A., Campbell, A., D’Amato, C., McLeod, M. & Rugg, D. Putting the ICD-10-CM/PCS GEMs into practice (Updated). J AHIMA. 87(1), 48–53 (2016).
Thierry, G. Neurolinguistic relativity: How language flexes human perception and cognition. Lang. Learn. 66(3), 690–713. https://doi.org/10.1111/lang.12186 (2016).
Broyles, L. M. et al. Confronting inadvertent stigma and pejorative language in addiction scholarship: A recognition and response. Subst. Abuse 35(3), 217–221. https://doi.org/10.1080/08897077.2014.930372 (2014).
Park, J., Saha, S., Chee, B., Taylor, J. & Beach, M. C. Physician use of stigmatizing language in patient medical records. JAMA Netw. Open 4(7), e2117052. https://doi.org/10.1001/jamanetworkopen.2021.17052 (2021).
Caliskan, A., Bryson, J. J. & Narayanan, A. Semantics derived automatically from language corpora contain human-like biases. Science 356(6334), 183–186. https://doi.org/10.1126/science.aal4230 (2017).
Maetens, A., Deliens, L., Van den Block, L., Beernaert, K. & Cohen, J. Are we evolving toward greater and earlier use of palliative home care support? A trend analysis using population-level data From 2010 to 2015. J. Pain Symptom Manage. 58(1), 19-28.e10. https://doi.org/10.1016/j.jpainsymman.2019.04.008 (2019).
Kamal, A. H. et al. Standards, guidelines, and quality measures for successful specialty palliative care integration into oncology: Current approaches and future directions. J. Clin. Oncol. 38(9), 987–994. https://doi.org/10.1200/JCO.18.02440 (2020).
Applequist, H. & Daly, B. J. Palliation: A concept analysis. Res. Theory Nurs. Pract. 29(4), 297–305. https://doi.org/10.1891/1541-6577.29.4.297 (2015).
Formagini, T., Poague, C., O’Neal, A. & Brooks, J. V. “When i heard the word palliative”: Obscuring and clarifying factors affecting the stigma around palliative care referral in oncology. JCO Oncol. Pract. 18(1), e72–e79. https://doi.org/10.1200/OP.21.00088 (2022).
Howlader, N. et al. The effect of advances in lung-cancer treatment on population mortality. N. Engl. J. Med. 383(7), 640–649. https://doi.org/10.1056/NEJMoa1916623 (2020).
Castellano, C. A. et al. The impact of cancer metastases on COVID-19 outcomes: A COVID-19 and Cancer Consortium registry-based retrospective cohort study. Cancer 130(12), 2191–2204. https://doi.org/10.1002/cncr.35247 (2024).
Iqbal, J. et al. Socioeconomic status, palliative care, and death at home among patients with cancer before and during COVID-19. JAMA Netw. Open. 7(2), e240503. https://doi.org/10.1001/jamanetworkopen.2024.0503 (2024).
Connellan, D. et al. Documentation of do-not-attempt-cardiopulmonary-resuscitation orders amid the COVID-19 pandemic. Age Ageing. 50(4), 1048–1051. https://doi.org/10.1093/ageing/afab075 (2021).
Vaswani, A., Shazeer, N., Parmar, N., et al. Attention is all you need. Published online August 1, 2023. Accessed November 17, 2023. http://arxiv.org/abs/1706.03762
OpenAI, Achiam, J., Adler, S., et al. GPT-4 Technical Report. Published online 2023. Preprint at https://doi.org/10.48550/ARXIV.2303.08774
Pennington, J., Socher, R. & Manning, C. Glove: Global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics; 2014:1532–1543. https://doi.org/10.3115/v1/D14-1162
Bojanowski, P., Grave, E., Joulin, A. & Mikolov, T. Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146. https://doi.org/10.1162/tacl_a_00051 (2017).
Hamilton, W. L., Leskovec, J. & Jurafsky, D. Diachronic word embeddings reveal statistical laws of semantic change. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics; 2016:1489–1501. https://doi.org/10.18653/v1/P16-1141 (2016).
Zhang, R., Nie, L., Zhao, C., Chen, Q. Achieving semantic consistency: Contextualized word representations for political text analysis. Published online 2024. Preprint at https://doi.org/10.48550/ARXIV.2412.04505 (2024)
Johnson, A. E. W. et al. MIMIC-III, a freely accessible critical care database. Sci. Data 3(1), 160035. https://doi.org/10.1038/sdata.2016.35 (2016).
Liu, J., Capurro, D., Nguyen, A. & Verspoor, K. “Note Bloat” impacts deep learning-based NLP models for clinical prediction tasks. J. Biomed. Inform. 133, 104149. https://doi.org/10.1016/j.jbi.2022.104149 (2022).
Liede, A. et al. Validation of International Classification of Diseases coding for bone metastases in electronic health records using technology-enabled abstraction. Clin. Epidemiol. 7, 441–448. https://doi.org/10.2147/CLEP.S92209 (2015).
Funding
Dr. Cobert was supported by the UCSF Noyce Initiative for Digital Transformation in Computational Biology & Health, the Hellman Fellows Foundation, UCSF Anesthesia Department Seed Grant and by the UCSF Claude D. Pepper Older Americans Independence Center funded by NIA (P30 AG044281). Dr. Pantilat is supported by grants from the Hellman Foundation.
Author information
Authors and Affiliations
Contributions
JC, HYY, KR, EW, SZP, AKS, SL, ACC, TB, TAH, HM, AL, JC provided substantial contributions to the conception, design, analysis, interpretation of data, drafting, final approval of work. HYY, HM, AL, EE, JB, were involved in development of the data pipeline, analysis, programming, statistical methods and drafting of the manuscript. JC agrees to be accountable for all aspects of the work.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
This study was approved by UCSF’s institutional review board (IRB 20-30590). All data used from this study was from UCSF de-identified datasets or de-identified PCQC data and no consent was required.
Disclosures
Dr. Cobert—supported by the UCSF Noyce Initiative for Digital Transformation in Computational Biology & Health, the Hellman Fellows Foundation, UCSF Anesthesia Department Seed Grant and by the UCSF Claude D. Pepper Older Americans Independence Center funded by NIA (P30 AG044281). Dr. Pantilat—supported by grants from the Hellman Foundation. Has advisory roles in the Cambia Sojourns Scholar Leadership Program and the Palliative Care Quality Collaborative Advisory Board.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yang, H.Y., Raghunathan, K., Widera, E. et al. Lexical associations can characterize clinical documentation trends related to palliative care and metastatic cancer. Sci Rep 15, 17245 (2025). https://doi.org/10.1038/s41598-025-01828-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-01828-z