Abstract
Introduction: Opioid use disorders (OUDs) constitute a major public health issue, and we urgently need alternative methods for characterizing risk for OUD. Electronic health records (EHRs) are useful tools for understanding complex medical phenotypes but have been underutilized for OUD because of challenges related to underdiagnosis, binary diagnostic frameworks, and minimally characterized reference groups. As a first step in addressing these challenges, a new paradigm is warranted that characterizes risk for opioid prescription misuse on a continuous scale of severity, i.e., as a continuum. Methods: Across sites within the PsycheMERGE network, we extracted prescription opioid data and diagnoses that co-occur with OUD (including psychiatric and substance use disorders, pain-related diagnoses, HIV, and hepatitis C) for over 2.6 million patients across three health registries (Vanderbilt University Medical Center, Mass General Brigham, Geisinger) between 2005 and 2018. We defined three groups based on levels of opioid exposure: no prescriptions, minimal exposure, and chronic exposure and then compared the comorbidity profiles of these groups to the full registries and to those with OUD diagnostic codes. Results: Our results confirm that EHR data reflects known higher prevalence of substance use disorders, psychiatric disorders, medical, and pain diagnoses in patients with OUD diagnoses and chronic opioid use. Comorbidity profiles that distinguish opioid exposure are strikingly consistent across large health systems, indicating the phenotypes described in this new quantitative framework are robust to health systems differences. Conclusion: This work indicates that EHR prescription opioid data can serve as a platform to characterize complex risk markers for OUD using existing data.
Introduction
The opioid epidemic is a significant public health challenge in the USA, with continued high rates of hospitalizations and mortality as a result of misuse of prescription and illicit opioids [1, 2]. We urgently need studies that characterize risk for development of opioid use disorders (OUDs), but characterizing OUD is extremely challenging. Notably, OUD evolves in a chronological fashion, starting with exposure to opioids and continuing through intermittent and regular use, development of physical dependence, misuse, and relapse [3, 4]. Prevalence estimates for these phenotypes vary widely, in part due to variation in ascertaining and defining them [5]. One key challenge to defining opioid use and misuse is the need to differentiate individuals across this spectrum of overlapping features or stages. Another challenge is the need to acquire large enough sample sizes to study these phenotypes.
Electronic health records (EHRs) offer novel solutions for capturing opioid use behaviors in real-world healthcare settings as they contain medical data relevant to OUD from large cohorts of patients. While illicit opioid use is difficult to capture in a medical setting, EHR provides rich data related to prescription opioid use, as well as other comorbidities, that can help further dissect the risk for developing OUD. However, characterizing OUD in EHR settings is not trivial. First, most OUD case definitions used to date rely on diagnostic codes, but this approach is problematic because OUD tends to be underdiagnosed [6]. Second, by focusing on a binary case:control framework, we may be missing the spectrum of severity associated with various opioid use behaviors, which can have direct implications for treatment, diagnosis, and prevention mechanisms. Third, most reference/control groups fail to incorporate prior opioid exposure, which can result in biases [7].
To address these challenges, we shift our attention to a new paradigm that characterizes risk for opioid prescription misuse on a continuous scale of severity, i.e., as a continuum. Several opioid phenotype definitions have been developed to date that extend beyond diagnostic codes to include other sources of data, including prescription data available in the EHR (see online suppl. material at www.karger.com/doi/10.1159/000525313; [8-27]). However, little is known about patterns and correlates associated with different levels of prescription opioid exposure, including which factors distinguish patients across clinically distinct categories of exposure and whether these patterns are consistent across different health systems. In this project, we used millions of EHRs from the PsycheMERGE network to further our understanding of the progression from prescription to dependence and misuse. PsycheMERGE (www.psychemerge.com), an extension of the Electronic Medical Records and Genomics (eMERGE) network [28], leverages EHR and genomic data for mental health research, including substance use disorders (SUDs). Prior work from this network has demonstrated the value of using EHR to characterize the risk of developing various psychiatric disorders [29]. This study is one of the first to use large-scale EHR opioid prescription data across large health systems to characterize risk for opioid use and misuse.
In the present study, we defined three opioid risk groups based on patterns of prescription opioid use and a fourth group based on International Classification of Diseases (ICD) diagnostic codes for OUD. Using data from three large health systems, we sought to (1) evaluate and compare demographics and psychiatric and medical comorbidities across the four groups; (2) assess how the four groups differ in comparison to patients with no prescription data and the general population of patients from each system; and (3) compare consistencies and differences in results across the three healthcare systems. A better understanding of opioid use phenotypes and comorbidities across different levels of opioid exposure is beneficial in various data-driven studies, including clinical prediction, treatment outcomes, diagnosis, prevention, epidemiology, and genomics.
Materials and Methods
Data Sources
Our data sources included registries from three health systems: Vanderbilt University Medical Center (VUMC), Mass General Brigham (MGB), and Geisinger Health System. Details of each registry, including demographics and data sources, are listed in the Appendix in the online supplement. We acquired Institutional Review Board approval (VUMC: 201767, MGB: 2018P002642); consent was not required for review of deidentified medical records. The Geisinger Institutional Review Board deemed this research exempt because all variables were extracted and summarized using an approved data broker. This work was deemed nonhuman subjects research by the VUMC IRB (IRB# 160650).
Patients were included in the analyses if they had at least 3 years of medical history available between 2005 and 2018 and were 18 years of age or older on Decemeber 31, 2018. A minimum of 3 years of medical history was chosen to increase the likelihood that patients had enough prescription data to detect opioid prescription patterns. Patients younger than 18 years were excluded to reduce the likelihood of including individuals who had not yet developed OUD. We excluded patients with a cancer diagnosis (online suppl. Table 1) due to potential for long-term analgesia for cancer-related pain. There were 627,396 patients from VUMC, 1,272,880 patients from MGB, and 733,637 patients from Geisinger who met the inclusion and exclusion criteria. We then extracted relevant ordered (VUMC, MGB, Geisinger) or filled (Geisinger) opioid prescriptions using a list of commonly prescribed opioids (online suppl. Table 2). A preprint version of this article is available on medRxiv [30].
Prescription Opioid Phenotyping and Group Definitions
Five groups were included in the study (Table 1). First, we identified all patients from each health system who met inclusion/exclusion criteria (described above), which we refer to as the “Unscreened” group (aka the overall study sample). From this group, we next defined three subgroups using inpatient and outpatient medication records based on prescription opioid exposure levels, derived from previously published work [9, 10]. Patients in the “No Prescription” group had no documented opioid prescriptions during the period of observation. Patients in the “Minimal Exposure” group received two opioid prescriptions within 90 days at least once and no additional prescriptions within 9 months but did not have 3 or more prescriptions no more than 90 days apart at any point during the period of observation [31]. Patients in the “Chronic Exposure” group received 10 or more opioid prescriptions within a 12-month period [9, 10]. The final “OUD” group included patients with at least one ICD code for OUD (online suppl. Table 3). The definition of this group did not incorporate prescription data. Therefore, patients in this group could overlap with the three prescription-based groups (code used to determine group membership available here: https://github.com/sanchezroigelab/OUD_spectrum_PsycheMERGE). The No Prescription, Minimal Exposure, Chronic Exposure, and OUD groups cover only a subset of the patients in the Unscreened group.
Outcome Measures
Within each group, we characterized the length and density of EHR, patient demographics, opioid use patterns, and OUD diagnoses (online suppl. material). Density was defined as the total number of nonunique ICD codes a patient received between 2005 and 2018 divided by the number of years included in the analysis. We also identified diagnoses previously identified as comorbid with OUD, including other SUDs [32], psychiatric disorders [32], and other medical conditions, including human immunodeficiency virus, hepatitis C, and pain-related diagnoses [33] (relevant ICD codes in online suppl. Tables 4–16).
To further examine patterns of prescription opioid use, we defined periods of exposure or “bouts” as at least two opioid prescriptions that occurred no more than 90 days apart. A bout was considered to end when there were more than 90 days between opioid prescriptions for a particular patient. We calculated the average number and length (in days) of bouts.
Statistical Analyses
As this is the first application of this continuous framework across three health systems, we focused more on descriptive statistics in order to establish comorbidity profiles, which can be used in future experiments for hypothesis testing. Descriptive statistics (frequency, percent, mean [M], standard deviation) were used to describe and compare the different groups (online suppl. Table 17). Throughout the text, we present ranges (e.g., in percentages) across the three registries. Demographic characteristics and outcomes across the three prescription-based groups were compared using χ2 tests for categorical outcomes and independent t tests or Kruskal-Wallis tests for continuous outcomes. We regarded both a p < 0.05 and a 5% difference in prevalence between any of the three prescription opioid groups as clinically important. Given the large sample sizes, with only a few exceptions, the differences between outcomes were statistically significant across groups and therefore only qualitative descriptions are provided in the Results section. Full statistical results are described in online supplementary Table 18.
Results
Demographics
The demographic composition for age and sex was similar across the Unscreened groups from different health system registries (online suppl. Table 17). Average age at the time of the analysis was 50.8–53.3 years, with 23.1–27.0% of the patients under age 35, and 41.3–44.6% male patients. Race/ethnicity varied within the registries, reflecting the geographic regions from which each health system draws. Yet, the majority of the patients were identified in the EHR as White (74.1–95.4%), while only 3.2–12.6% were identified as Black or African American, 2.3–7.5% as Hispanic, 0.6–4.5% as Asian, and 0.4–9.0% as other race/ethnicity.
A higher proportion of patients in the OUD group were male (44.8–60.7%) compared to the prescription-based groups (30.9–46.2%; online suppl. Table 17). Patients in the OUD group were 5–8 years younger, on average, than the Unscreened group and 12–15 years younger than patients in the Chronic Exposure group. Only 5.7–7.6% of the patients in the Chronic Exposure group were under the age of 35, compared to 15.3–33.7% for the OUD, No Prescription, and Minimal Exposure groups. The Chronic Exposure and OUD groups were predominantly composed of patients identified as White (81.7–96.8%) and Black/African American (2.3–13.9%), with lower proportions of patients identified as Hispanic (1.0–5.0%) and Asian (0.1–0.9%) in comparison to the other groups (71.1–96.5% White, 2.7–16.0% Black/African American, 0.3–7.4% Hispanic, and 0.1–1.1% Asian).
Length of EHR was greater for the Minimal Exposure (11.0–13.9 years), Chronic Exposure (11.4–14.6 years), and OUD (11.7–13.0 years) groups compared to patients in the Unscreened (9.9–11.9 years) and No Prescription (9.3–11.0 years) groups. Density of EHR (visits/year) was highest for the Chronic Exposure group (24.9–59.3), followed by the OUD group (15.5–33.1).
Prevalence of OUD Diagnoses
The overall prevalence of an OUD diagnosis in the Unscreened group was 1.2–1.7% (Fig. 1); this number fluctuated during the study period (online suppl. Fig. 1), with a steeper increase around 2013 in VUMC and MGB and a later peak (2017) in Geisinger. OUD diagnoses were more common in patients identified as non-Hispanic White and increased with exposure to prescription opioids – from 0.3 to 0.6% for No Prescription, to 1.6–2.7% for Minimal Exposure, and 9.0–24.4% for Chronic Exposure. These patterns were consistent across health systems. In the OUD group, 82.1–96.8% of the patients were non-Hispanic White, which was higher than other racial/ethnic groups.
Opioid Prescription Patterns
Average age at first opioid prescription ranged from 45.7 to 46.2 years in the Unscreened group across health systems, with the youngest average age of first prescription observed in the OUD group (34.8–39.0) and oldest among the Chronic Exposure group (46.8–52.5). The duration and number of periods of opioid exposure (“bouts”) increased as exposure to prescription opioids increased, with the Chronic Exposure group having the highest number (3.3–4.1) and length (224.1–566.5 days) of bouts. The OUD group had the second highest number (1.4–3.0) and length of bouts (135.8–185.0 days). In the Unscreened and Minimal Exposure groups, patterns of use diverged. For example, in the Unscreened group, the average number of bouts ranged from 0.4 to 1.8, and the length of bouts ranged from 12.6 to 97.0 days. In contrast, the Minimal Exposure group had a relatively higher number of bouts (1.5–1.8), but the length of use was shorter (10.0–15.9 days).
Substance Use Disorders
The prevalence of SUDs was highest in the OUD group (alcohol: 15.2–28.8%, tobacco: 47.0–68.6%, cannabis: 9.1–13.0%; Fig. 1). In contrast, the No Prescription group showed dramatically lower rates of SUDs (alcohol: 0.9–1.5%, tobacco: 2.3–6.8%, cannabis: 0.2–0.5%) than any other group, including the Minimal Exposure (alcohol: 3.1–4.1%, tobacco: 8.6–28.1%, cannabis: 0.8–1.2%) and the Chronic Exposure (alcohol: 6.1–13.2%, tobacco: 26.6–49.2%, cannabis: 1.9–5.1%) groups.
Psychiatric Disorders
Prevalence of psychiatric disorders was highest in the Chronic Exposure and OUD groups (Fig. 2). For example, the prevalence of anxiety, one of the most common psychiatric disorders observed, was 19.3–37.2% and 25.7–36.6%, respectively, for the Chronic Exposure and OUD groups, compared to 8.0–17.3% for the Minimal Exposure group, 4.2–7.2% for the No Prescription group, and 6.5–11.5% for the Unscreened group. Depression prevalence was higher (36.4–50.4%) in the Chronic Exposure and OUD groups compared to the other groups (6.3–20.0%).
Bipolar disorder prevalence was highest in the OUD group (13.1–21.1%), higher than the Chronic Exposure group (5.2–8.0%), and dramatically higher than the Minimal Exposure (2.0–2.6%), No Prescription (1.0–1.3%), and Unscreened (1.6–1.9%) groups. Schizophrenia prevalence in the OUD group was slightly elevated (1.2–2.4%) compared to all other groups (0.23–1.29%). Suicidal behavior prevalence was higher in the OUD group (8.0–11.5%), compared to all other groups (0.2–3.1%).
Pain and Other Medical Conditions
The prevalence of pain ICD codes was highest in the Chronic Exposure group (92.2–96.6%), higher than the Minimal Exposure (63.9–76.0%) and OUD groups (69.1–71.7%), and dramatically higher than the No Prescription or Unscreened groups (30.8–33.9% and 48.8–51.6%, respectively; Fig. 3). The prevalence of human immunodeficiency virus and hepatitis C was highest in the Chronic Exposure and OUD groups, particularly for hepatitis C (2.6–6.7% and 9.7–18.6%, respectively), compared to the other groups (0.1–1.2%).
Discussion
EHR represents a cost-effective strategy to study OUD using existing data from thousands to millions of patients. However, detecting OUD in EHR analyses is notoriously challenging. Diagnostic codes can be insufficient because OUD tends to be underdiagnosed [34, 35]. Furthermore, models often focus on extreme opioid use in service of predicting case:control classifications and thus miss the full spectrum of opioid exposures and use behaviors [36]. Our approach overcame these limitations by examining a continuum of opioid use behaviors based on levels of prescription opioid use. This approach is particularly important for defining opioid use phenotypes because behaviors range from appropriate use of prescribed opioids to nonmedical use of prescription (and illegal) opioids, and studies centered on examining each of the different transitions can have implications for prevention, diagnosis, and treatment.
This study systematically identified patterns and correlates of short- and long-term prescription opioid use and OUD across health systems, which facilitated important observations. First, we found that the OUD group had unique characteristics compared to the other groups, the most salient of which included comorbid psychiatric (anxiety, depression) and SUDs, particularly tobacco use disorders, in line with previous clinical studies [25, 27, 33, 37]. Prior studies estimated that 45–57% of individuals with OUD had at least one psychiatric disorder and reported that polysubstance abuse was exceedingly common [5], comparable to our findings. In addition, average age at first opioid prescription among the OUD group was approximately 10 years younger than that of other groups, highlighting the importance of age at first exposure to prescribed opioids and onset of OUD [38]. The demographic factor most noticeably associated with the OUD group was EHR identification as non-Hispanic White. This finding is consistent with the characterization of the opioid epidemic in the USA as primarily affecting rural and suburban individuals who identify as non-Hispanic White [39, 40].
Second, our findings emphasize the value of including opioid prescriptions in assessing risk for OUD [41, 42]. For example, with the exception of age at first opioid prescription, patients in the Chronic Exposure group most closely resembled the OUD group across most of the characteristics evaluated and may therefore represent the group with the highest risk of having or developing OUD [33, 43]. Consistent with this finding, the Chronic Exposure group also had a higher rate of OUD diagnosis than the No Prescription or Minimal Exposure groups. In addition, OUD diagnoses increased from the No Prescription to the Minimal Exposure group, providing further evidence for the relevance of assessing opioid prescriptions when determining risk for OUD [44].
Although findings were generally consistent across sites, we did observe some heterogeneity. This may have been due, in part, to differences in underlying patient populations, as evidenced by the differences in race/ethnicity proportions in the Unscreened groups; prior studies have documented varying correlates of prescription opioid misuse by race/ethnicity [45]. Differences across sites can also be due to a more rural population at Geisinger compared to MGB and VUMC [46] and the challenges of access to behavioral health services and treatment in rural populations. Nonetheless, the overall consistency in our findings suggests that correlates of opioid use phenotypes are shared among healthcare systems despite differences in data recording and patient populations and that opioid prescription EHR-based studies from different systems can be compared to one another, if using similar definitions.
Our findings have relevance for future EHR-based OUD research. First, our work re-emphasizes the importance of incorporating opioid prescriptions when defining the spectrum of opioid misuse and OUD. Prior algorithms have incorporated opioid prescriptions to identify opioid misuse (e.g., [10, 19, 47]), but additional efforts could place patients on a spectrum of opioid use behaviors. For example, to identify individuals at risk for developing OUD, phenotype risk scores [48, 49] could be constructed by agnostically training and testing a risk model using diagnosis codes (for OUD and other relevant predictors) and opioid prescriptions. Such a strategy would not only acknowledge OUD risk in the absence of an OUD diagnosis but could also allow for modeling of OUD risk trajectories over time [50]. Second, our descriptive data confirm the relevance of several predictors of opioid misuse (e.g., age, substance and psychiatric comorbidities) that have been previously incorporated into algorithms identifying opioid misuse but not necessarily validated [19, 51] into algorithms identifying opioid misuse. Third, the depth and breadth of data in EHR registries can be leveraged to clarify the phenotypic structure of OUD phenotypes. Future studies could use data-driven methods to integrate additional EHR components (e.g., prescriptions for other controlled substances, types of pain diagnoses, opioid dosages and types) and implement cluster-based methods such as latent profile analysis, k-means clustering, or principal component analysis to explore OUD sub-phenotypes [52].
Lastly, our work has implications for genetic studies, in particular informing selection of individuals for genetic analyses. A major roadblock in conducting genome-wide association studies of opioid use phenotypes is the lack of controls with characterized opioid use histories, resulting in the frequent use of unscreened individuals as controls. Consistent with previous work showing that using unscreened controls can introduce biases in genetic analyses [7], our work demonstrates that a Minimal Exposure group has a different set of clinical characteristics than an Unscreened or No Prescription group.
This study is subject to several limitations. We did not have complete information on opioid dosages [53] (and therefore morphine milligram equivalents) or information on the use of illicit opioids, and we were not able to differentiate between opioid types, which would have helped identify additional misuse phenotypes such as rapid dose escalation trajectories [41]. Further, we relied on opioid prescriptions to index opioid use, but this is likely an imperfect proxy for actual use. Reliance on OUD diagnosis for the OUD group could also have led to misclassification given the underdiagnosis of OUD [36]; this may have been particularly problematic in the data from the earlier years included in the study. Furthermore, the associations observed represent correlations and not causation; future studies may include sequences of events to disentangle potential trajectories of effect between the groups and the outcomes. Lastly, it is not known whether these results are generalizable to other populations, but the external validity of our findings is supported by consistencies observed across the three health systems, which serve diverse patient populations.
Conclusion
This work informs the selection of cases and controls for epidemiologic and genetic studies, demonstrates the utility of using levels of prescription opioid use in elucidating different aspects of OUD pathophysiology, and supports the appropriateness of combining EHR data across these health systems for future meta-analyses, despite known differences in geographical location, racial diversity, and potential differences in diagnostic practices across the systems.
Statement of Ethics
We acquired Institutional Review Board approval (VUMC: 201767, MGB: 2018P002642); consent was not required for review of deidentified medical records. The Geisinger Institutional Review Board deemed this research exempt because all variables were extracted and summarized using an approved data broker. This work was deemed nonhuman subjects research by the VUMC IRB (IRB# 160650).
Conflict of Interest Statement
Dr. Urman reports unrelated funding and/or fees from the NIH, AHRQ, NSF, Medtronic, Merck, AcelRx, Heron, and Pfizer. Dr. Smoller is a member of the Leon Levy Foundation Neuroscience Advisory Board and has received honoraria for an internal seminar at Biogen, Inc., and Tempus Labs. He is PI of a collaborative study of the genetics of depression and bipolar disorder sponsored by 23andMe for which 23andMe provides analysis time as in-kind support but no payments. He is also a member of the Scientific Advisory Board of Sensorium Therapeutics.
Funding Sources
Mariela V. Jennings, Sevim B. Bianchi, and Sandra Sanchez-Roige are supported by funds from the California Tobacco-Related Disease Research Program (TRDRP; Grant No. T29KT0526); Sevim B. Bianchi is also supported by P50DA037844; and Sandra Sanchez-Roige is also supported by NIDA DP1DA054394. Hyunjoon Lee is supported by funds from NIH 1R01MH118233-01. Brandon J. Coombes is supported by funds from NIMH R01 MH121924. Annika B. Faucon is supported by the Training Program On Genetic Variation And Human Phenotypes Training Grant (T32GM080178). Melissa N. Poulsen is supported by funds from NIDA K01DA04993. Rachel L. Kember is supported by funds from NIAAA (K01AA028292). Travis T. Mallard is supported by funds from NIH T32HG010464. Colin G. Walsh is supported by funds from NIMH R01 MH118233, R01 MH121455, R01 MH120122, R01 MH116269, DoD MSRC W81XWH-10-2-0181, and the FDA WO2006. Richard D. Urman is supported by NIH/NIDA – 1R34DA048268-01A1. Vanessa Troiani and Richard C. Crist are supported by funds from NIDA R01DA044015 and a grant from the Pennsylvania Department of Health. The PsycheMERGE consortium is supported by NIMH R01 MH118233 (J.W.S., L.K.D.). The project using VUMC data is supported by CTSA award No. UL1TR000445 from the National Center for Advancing Translational Sciences. The contents are solely the responsibility of the authors and do not necessarily represent official views of the National Center for Advancing Translational Sciences or the National Institutes of Health. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Author Contributions
Sandra Sanchez-Roige conceived the idea. Vanessa Troiani, Mariela V. Jennings, Hyunjoon Lee, Daniel B. Rocha, Brandon J. Coombes, Richard C. Crist, Yirui Hu, Maria Niarchou, Melissa N. Poulsen, and Richard D. Urman contributed to the design and implementation of the research. Mariela V. Jennings, Hyunjoon Lee, and Daniel B. Rocha led the main analyses. Lea K. Davis, Jordan W. Smoller, and Peter Straub contributed data. All authors contributed to the writing of the manuscript.
Data Availability Statement
Descriptive statistics are provided as online supplementary Tables. The code used to determine group membership is available here: https://github.com/sanchezroigelab/OUD_spectrum_PsycheMERGE.