Introduction: Metabolomics could offer novel prognostic biomarkers and elucidate mechanisms of diabetic kidney disease (DKD) progression. Via metabolomic analysis of urine samples from 995 CRIC participants with diabetes and state-of-the-art statistical modeling, we aimed to identify metabolites prognostic to DKD progression. Methods: Urine samples (N = 995) were assayed for relative metabolite abundance by untargeted flow-injection mass spectrometry, and stringent statistical criteria were used to eliminate noisy compounds, resulting in 698 annotated metabolite ions. Utilizing the 698 metabolites’ ion abundance along with clinical data (demographics, blood pressure, HbA1c, eGFR, and albuminuria), we developed univariate and multivariate models for the eGFR slope using penalized (lasso) and random forest models. Final models were tested on time-to-ESKD (end-stage kidney disease) via cross-validated C-statistics. We also conducted pathway enrichment analysis and a targeted analysis of a subset of metabolites. Results: Six eGFR slope models selected 9–30 variables. In the adjusted ESKD model with highest C-statistic, valine (or betaine) and 3-(4-methyl-3-pentenyl)thiophene were associated (p < 0.05) with 44% and 65% higher hazard of ESKD per doubling of metabolite abundance, respectively. Also, 13 (of 15) prognostic amino acids, including valine and betaine, were confirmed in the targeted analysis. Enrichment analysis revealed pathways implicated in kidney and cardiometabolic disease. Conclusions: Using the diverse CRIC sample, a high-throughput untargeted assay, followed by targeted analysis, and rigorous statistical analysis to reduce false discovery, we identified several novel metabolites implicated in DKD progression. If replicated in independent cohorts, our findings could inform risk stratification and treatment strategies for patients with DKD.

In the USA, 25% of diabetic patients have chronic kidney disease, a major precursor to kidney failure [1]. Nearly half of the patients with kidney failure have diabetes [2]. Thus, it is important to identify DKD progression risk factors, so that kidney disease can be detected and treated early. While clinical factors, such as microalbuminuria, are generally prognostic for DKD, there is still large variability in DKD risk for patients with similar risk profiles [3-5], and furthermore, clinical risk factors do not offer molecular insights into disease pathology. Thus, there is a need to explore new biomarkers which may add prognostic power and more importantly identify biological mechanisms of DKD [6].

Metabolites, low-weight intermediates and end products of cellular pathways, could uncover physiological or pathological changes in chronic diseases [7]. Recent research emphasizes the importance of and need for further research on the human urine metabolome in kidney disease [8, 9].

Previous urine metabolome in DKD studies has largely been cross-sectional and aimed at identifying metabolites distinguishing DKD from healthy or suitable comparison controls [10-12]. Studies examining metabolomics for DKD progression contained several limitations: evaluating specific compounds, serum/plasma only assays, restriction to type I diabetes or specific subpopulations, small sample size, or short follow-up [13-17]. Highlighting recent work, the branched-chain amino acid pathway in type 1 diabetes [18] and lipogenesis in type 2 diabetes among American Indians [13] have been linked to DKD progression. Herein, we aimed to expand on prior works by using an extensive panel of urine metabolites in a large, diverse longitudinal cohort of 995 patients with DKD and had a median 8 years of follow-up. Previously [19], we evaluated 13 a priori metabolites and their association with DKD progression. Here, using a high-throughput untargeted platform, we examined relative abundance of 698 annotated metabolite ions and implemented machine learning and rigorous statistical approaches to identify markers of DKD progression, rate of annual eGFR change, and ESKD. Further, we conducted a targeted metabolomics analysis on a subset of 15 amino acids. By implementing a robust multifaceted statistical approach, two metabolomic assays, and two clinically important outcomes, we aimed to identify potentially multiple sets of novel features prognostic for DKD progression and/or elucidate biological pathways of DKD.

Study Cohort

We used a metabolomics substudy of the Chronic Renal Insufficiency Cohort (CRIC). The parent CRIC study recruited (from 2003 on) a racially diverse group aged 21–74 years, ∼50% diabetic, and with a broad range of kidney function [20]. Informed consent was obtained from participants; protocols were approved by the IRBs and Scientific and Data Coordinating Center (approval # 807882). The current study analyzed the urine metabolome at study entry (baseline) of 995 randomly selected CRIC participants with diabetes across CKD stages 3a, 3b, and 4, eGFR 45–60, 30–45, and 20–30 mL/min/1.73 m2, respectively.

Disease Outcomes

We evaluated two outcomes: annual rate of eGFR change (eGFR slope) and time-to-ESKD. The eGFR slope was estimated via mixed models using serial eGFR measures as described previously [19]. Time-to-ESKD was the time from entry to the CRIC study to incident kidney failure with need for renal replacement therapy or kidney transplantation; drop-out or death before kidney failure was considered censoring events.

We used the CKD-EPI equation to calculate eGFR [21]. Results were consistent (data not shown) when we repeated analyses using the CRIC-eGFR equations [22], so we report findings using the widely used CKD-EPI equation.

Statistical Analysis

Overview of Analytic Steps (Fig. 1)

Details of the assay, sample processing, and feature extraction are given in online supplementary materials (for all online suppl. material, see www.karger.com/doi/10.1159/000521940). We implemented stringent filtering to exclude metabolite ions with high noise or low biological variability. We then tested single metabolite associations with DKD outcomes, correcting for multiple comparisons, with and without clinical variables adjustment. Next, we developed multivariate metabolite models, using statistical and machine learning methods to identify marker signatures associated with eGFR slope, and further tested these models on time-to-ESKD. A unique aspect of our approach is that we did not train models on the ESKD outcome; thus, ESKD results serve as “internal replication” for a long-term clinical outcome. Finally, we conducted pathway enrichment analyses to investigate biological underpinnings of selected metabolites and conducted a targeted analysis of a subset of metabolites.

Fig. 1.

Flow diagram for analysis steps and brief overview of results.

Fig. 1.

Flow diagram for analysis steps and brief overview of results.

Close modal

Filtering Metabolite Ions

Leveraging technical replicates, we used QC and CRIC samples to eliminate ions with poor reliability in the untargeted analysis (online suppl. materials). Of 1,899 annotated metabolite ions, the 698 which passed filtering criteria are the final metabolite ion set for all subsequent analyses. A single ion could annotate multiple metabolites; we will clarify these resulting ambiguities as appropriate.

Associations of Single Metabolite with Outcomes

We tested associations between each of the 698 log2-transformed metabolite ions and outcomes, with and without adjustment for clinical variables: age, gender, race, smoking, baseline BMI, mean arterial pressure, HbA1c, eGFR, and albuminuria [19]. For the eGFR slope, we calculated Pearson correlations and fit linear models; for ESKD, we fit Cox models. We used the Benjamini-Hochberg false discovery rate to correct for multiple comparisons [23].

Multivariate Models for the eGFR Slope

Using eGFR slopes as outcome, penalized regression (via lasso) and machine learning (via random forest) models were developed to elicit multivariate prognostic metabolomic signatures. The lasso reduces overfitting by imposing a penalty (λ) [24]. We considered two λ values chosen by 10-fold cross-validation: λ. min, the value yielding the lowest prediction error; and λ.1se, the value within one SD of lowest prediction error. Four lasso models were fit; each included 698 ions and 9 clinical variables as covariates. Two models forced all 9 clinical variables to be included, utilizing either λ.min or λ.1se. The other two models did not force the clinical variables to be included. For random forests, we used percent increase of mean squared error to order variable importance [24]; for comparability, we selected the same number of variables as the corresponding lasso models (without forcing clinical variables). Thus, we fit six multiple metabolite ion models: four lasso and two random forest models, each of which selected an optimal predictor set. As a final sensitivity analysis, we also fit four elastic net models [24], which can select groups of correlated features and may better mimic biological pathways.

Internal Replication on ESKD Outcome

To evaluate the models on time-to-ESKD, we fit six Cox models, in which predictors were variables selected in the six eGFR slope outcome analyses. We used likelihood ratio tests to compare each Cox model to its corresponding model of only clinical variables. To quantify model performance, we used 5-fold cross validation repeated 100 times to estimate mean and 95% CI of the C-statistic. Predictors used in these six Cox models were the predictors selected in the corresponding eGFR slope models; no tuning or variable selection was used in the Cox models. We intentionally avoided further training on ESKD to assess if predictors of the eGFR slope were also predictive of long-term outcomes (i.e., ESKD).

Enrichment Analysis

Definitions of well-known biological pathways by their respective involved metabolites were obtained from HMDB [25]. We considered 743 pathways and performed a hypergeometric test for each pathway definition via

graphic

where M = # of measured metabolite ions, K = # of measured compounds in the pathway definition, N = # of “hits,” i.e., selected/prognostic metabolite ions, and x = # of “hits” that mapped to the pathway. The p values were corrected for multiple testing using the Benjamini-Hochberg method.

Assay Validation

Using a targeted assay, we validated selection and annotation of 15 metabolites detected by untargeted analysis, which were prognostic in the ESKD analysis. Again, we fit Cox models, adjusted for 9 clinical variables. To compare untargeted and targeted analyses, we present standardized hazard ratios (HRs).

Participant Characteristics

At entry (Table 1), participants (n = 995) were mean 59.9 years, 56% male, 44% white, and 42% black; on average (mean [SD]), they were obese (BMI 34.2 [7.9]), had poor diabetes control (HbA1c 7.6 [1.5]%), and had moderate-to-poor kidney function (eGFR 40.6 [11.2] mL/min/1.73 m2). The eGFR decline (slope) averaged 1.8 (SD = 1.9) mL/min/1.73 m2/year; 36% (N = 360) had ESKD during the 10-year study (range: 2–10 years). We excluded subjects with missing clinical variables (<2%).

Table 1.

Baseline clinical characteristics of 995 metabolomics substudy participants with diabetes in the chronic renal insufficiency cohort (CRIC) study

Baseline clinical characteristics of 995 metabolomics substudy participants with diabetes in the chronic renal insufficiency cohort (CRIC) study
Baseline clinical characteristics of 995 metabolomics substudy participants with diabetes in the chronic renal insufficiency cohort (CRIC) study

Associations between Single Metabolites and Disease Outcomes

Of 698 ions, 89 were significantly correlated with eGFR slopes without adjustment for clinical variables (false discovery rate corrected p < 0.05). In adjusted analyses, 6 ions remained significant with β-coefficients from −0.45 to 0.3 (online suppl. Table S1). Also, 123 (unadjusted) and 99 (adjusted) ions were significantly associated with ESKD in Cox regression. After adjustment, HRs ranged from 1.12 to 1.84 (online suppl. Table S2). The prognostic ion set contained several amino acids (valine, isoleucine, and tryptophan) and other compounds, e.g., hydroxybutanoic acid (online suppl. Table S2). Only one ion, annotated as 3-(4-methyl-3-pentenyl)thiophene (Ion Index 1098, online suppl. Tables S1, S2), was associated with both eGFR slope and ESKD in adjusted models with coefficients (95% CI) of −0.44 (−0.68, −0.21) for the eGFR slope and HR (95% CI) 1.84 (1.45, 2.32), indicating that higher abundance of this compound might be associated with worse DKD progression. Complete results are in online supplementary Tables S1, S2. Henceforth, the 99 ESKD-associated metabolites will be denoted the 99-ESKD-associated set.

Multivariable Prognostic Metabolites for eGFR Slope Outcome

Each of the lasso or random forest models (online suppl. Table S3) selected 9–30 variables resulting in 49 (out of 698) ions across the 6 prognostic models (online suppl. Table S4), denoting the 49-eGFR-associated set. Baseline albuminuria, blood pressure, and HbA1c were selected in all 6 models, and unsurprisingly, higher levels of these clinical markers were associated with steeper eGFR decline; race was also selected in all 6 models; 3,4-dicaffeoyl-1,5-quinolactone was selected in all models except model 1 (clinical only model). Nine ions, annotated as 3,4-dicaffeoyl-1,5-quinolactone, butynal, 3-(4-methyl-3-pentenyl)thiophene, C10:3, zalcitibine, asparaginyl-hydroxyproline, valine (or betaine), argynil-glutamine, and pentose (or di-isopropyl disulfide), were selected in at least 3 models; an additional 13 ions were selected in at least 2 models (online suppl. Table S4). Elastic net models had similar C-statistics (data not shown), so the more parsimonious lasso models were retained.

Internal Replication

The final selected features from each of the 6 eGFR slope models (online suppl. Table S3) were tested on the ESKD outcome in Cox models. The likelihood ratio test p values were < 0.0001 when each of the 6 models was compared to the corresponding model with only clinical features, i.e., adding metabolite ions improved model fit significantly. Several ESKD models had similar median C-statistics (online suppl. Fig. S1), ranging from 0.82 to 0.85. The best (model 2, online suppl. Table S3) with 29 variables had a cross-validated median (95% CI) C-statistic of 0.85 (0.85, 0.86). This model selected 20 metabolite ions via lasso; 14 were significantly associated with the eGFR slope as evidenced by their bootstrap 95% CIs which excluded 0 (Table 2). Of greater interest are the five ions significantly (5% level) associated with time-to-ESKD. Higher abundance of valine (or betaine) and 3-(4-methyl-3-pentenyl)thiophene was each associated with increased risk of ESKD (HR 1.44 and 1.65, respectively); higher asparaginyl-hydroxyproline abundance was associated with lower ESKD risk (HR = 0.7). Two other significantly associated compounds were pipazethate and aminophylline. Importantly, since our lasso model was not trained on ESKD outcome, we expect the Cox model results to be valid and not influenced by model selection. We also ran sensitivity analysis including ACE inhibitor or ARB use in Cox regression for model 2. There were negligible changes in the Cox model coefficients for the 29 variables in model 2; the HR (95% CI) for ACE inhibitor or ARB use was 1.22 (0.92, 1.63, p = 0.17), and this was not statistically significant at 5% significance level.

Table 2.

Associations with DKD outcomes for the model with the largest C-statistic (model 2, online suppl. Table S3)

Associations with DKD outcomes for the model with the largest C-statistic (model 2, online suppl. Table S3)
Associations with DKD outcomes for the model with the largest C-statistic (model 2, online suppl. Table S3)

Enrichment Analysis

We conducted pathway enrichment based on the 49-eGFR-associated (online suppl. Tables S3, S4), the 99-ESKD-associated (online suppl. Table S2), and the combined 49- and 99-sets (=131 ions). Thus, in equation (1), M = 698; N = 49 or 99 or 131; K varied by pathway; and x = # of ions in a pathway and also in the prognostic 49-, 99-, or 131-set.

Pathways (online suppl. Fig. S2; Table S5) consistently and significantly enriched across both eGFR slope and ESKD models were enzyme deficiencies, acidurias and acidemias, and those related to amino acid metabolism. Of interest, the valine-leucine-isoleucine degradation pathway involved in insulin resistance, cardiometabolic risk, cardiomyopathy, and CKD [26, 27]; the 2-aminoadipic 2-oxoadipic aciduria and lysine degradation pathways implicated in diabetes and kidney disease [28, 29]; and the tryptophan pathway [30-32] was enriched in the 99-ESKD and combined 131-ion sets (online suppl. Fig. S2; Table S5).

Validation by Targeted CE-MS Approach

In the targeted validation analysis, 13 of 15 metabolites were significantly associated with ESKD (Fig. 2). HR (95% CI) in the untargeted and targeted analysis was comparable.

Fig. 2.

Forest plot (hazard ratios with 95% CIs) of ESKD risk of 15 untargeted and targeted metabolites. Above: untargeted assay; Below: targeted assay. The metabolite abundances were log2 transformed and then standardized by dividing by their SDs (standard deviation). Cox regression models are fitted using time to ESKD as outcome, each metabolite entered in the models separately with adjustment for 9 clinical variables. Note: In the untargeted assay (above), valine and betaine represent the same ion, due to ambiguities in annotation assignments. In the targeted quantitative analysis (below), these two compounds have separate annotations.

Fig. 2.

Forest plot (hazard ratios with 95% CIs) of ESKD risk of 15 untargeted and targeted metabolites. Above: untargeted assay; Below: targeted assay. The metabolite abundances were log2 transformed and then standardized by dividing by their SDs (standard deviation). Cox regression models are fitted using time to ESKD as outcome, each metabolite entered in the models separately with adjustment for 9 clinical variables. Note: In the untargeted assay (above), valine and betaine represent the same ion, due to ambiguities in annotation assignments. In the targeted quantitative analysis (below), these two compounds have separate annotations.

Close modal

High-throughput metabolomics offers great potential, yet the scale and measurement errors raise analytic and inferential challenges. To address these challenges, we implemented two steps: (1) identifying a sparse prognostic set of metabolites after adjusting for known clinical factors and (2) using pathway analysis to elicit biological meaning. First, using an untargeted assay, we tested single metabolite associations with eGFR slope and ESKD. After adjusting for clinical variables, 6 and 99 (online suppl. Tables S1, S2) metabolite ions were found to exhibit non-null associations with eGFR slope and ESKD, respectively. An interesting finding is that for the eGFR slope, adjusting for clinical factors greatly reduced the number of non-null associations (from 89 to 6 ions), whereas for ESKD, the number of prognostic metabolites was relatively similar with and without adjustment (123 unadjusted to 99 adjusted), suggesting that clinical factors may be more critical for proximal (e.g., eGFR slope) versus long-term (e.g., ESKD) outcomes. Of note, differences in prognostic factors for these outcomes have also been previously reported [33].

As a next step, we conducted a quantitative targeted assay of 15 amino acid metabolism metabolites from the 99 (untargeted) ions and validated the prognostic value of 13 (of 15). Amino acid metabolism is known to be associated with DKD in cross-sectional analysis [12], and in a few prospective studies [7, 11, 34], but these studies have generally had small sample sizes (<100) and/or evaluated specific subgroups (e.g., type I diabetes). Thus, our finding of the prognostic value of urine amino acids in the large CRIC study even after adjustment for clinical factors adds to the literature and suggests that additional amino aciduria, independent of albuminuria, may be a risk factor for DKD progression.

In multivariable analysis, we fit 6 multiple metabolite models (adjusted for clinical factors) using robust penalized regression and machine learning. These modeling approaches are ideally suited to BigData with a large number of possibly correlated predictors. We identified sets of 6–25 prognostic metabolite ions (online suppl. Tables 2, S3). Nine unique ions were selected in ≥3 models. Two of these ions, asparginyl-hydroxyproline, implicated in iron deficiency and Crohn’s disease [35], and arginyl glutamine, a dipeptide shown to be helpful for treating hypoxia-induced small intestine injury [36], are likely novel findings in DKD research. In addition, we identified valine, a glucogenic branched chain amino acid, and C10:3, an acyl-carnitine whose blood levels were associated with heart failure and maternal and newborn metabolic health [37, 38]. The rest were food or drug derivatives [39, 40], e.g., 3-(4-methyl-3-pentenyl)thiophene found in alcoholic beverages. For ESKD, the 6 models had similar C-statistics (range 0.82–0.85), comparable to the 0.84 C-statistic of the clinical variables-only model, i.e., adding metabolites did not improve model discrimination. The fact that multiple models had similar discrimination is not surprising given the complexity of DKD. It is likely that there are many biological pathways in DKD, and thus potentially multiple “equally” prognostic models encompassing metabolite subsets from each of these pathways. Importantly, in the multiple adjusted Cox model with highest C-statistic, we identified several interesting prognostic (p < 0.05) metabolite ions. Valine (or betaine) and 3-(4-methyl-3-pentenyl)thiophene were associated with 44% and 65% higher hazard of ESKD, respectively, per doubling of ion abundance. Importantly, the Cox model analysis, albeit on the same cohort, did not involve any training or variable selection, and hence HRs are likely less biased.

To systematically leverage the biological content of prognostic metabolite ions, we conducted enrichment analysis, using pathway, disease, and function definitions from the HMDB. We focused on metabolic pathway enrichment, since pathway definitions are less prone to curation anomalies, more finite, and well defined compared to disease and function definitions. We identified branched-chain amino acid pathways, namely, valine-leucine-isoleucine degradation, isovaleric acidemia/aciduria, propionic acidemia, and methylmalonic acidemia, all implicated in CKD [18, 26-29]. The tryptophan pathway was previously investigated in DKD primarily in serum/plasma samples [30-32] though findings were mixed. Our study found that higher levels of tryptophan were associated with higher ESKD risk in both untargeted and targeted analyses, and the tryptophan pathway was significantly enriched in our set of ESKD-related metabolites. To our knowledge, these findings are novel, and further investigation of urine tryptophan is warranted.

Our study fills several gaps in current DKD research. Previous urine metabolomics studies were cross-sectional, had small sample sizes, examined few single compounds, or have been restricted to subpopulations. In this work, we addressed these shortcomings. Our CRIC DKD sample is one of the largest in the USA, with comprehensive clinical data and long-term follow-up with annual assessments of kidney function. We conducted untargeted high-throughput metabolomics which identified 1,899 annotated ions, used stringent filtering to reduce to a reliable subset of 698 metabolite ions, and used rigorous paradigms to build cross-validated models to select metabolite ions prognostic for eGFR decline. The selected ions were then tested for association with ESKD without further training. While it could be argued that prognostic predictor sets may vary between the eGFR slope and ESKD outcomes [33], our approach of replicating findings on the ESKD outcome, rather than fitting separate ESKD models, is more statistically stringent. Of note, models were not trained on the ESKD outcome, which should reduce overfitting. Also, identifying predictors of short-term outcomes which also foretell long-term outcomes could be more impactful in the clinic for planning long-term disease management. However, we acknowledge that focusing feature selection solely on the ESKD outcome could be of value, if results could be validated, and we aim to pursue this line in future work in independent cohorts. Finally, we used a targeted CE-MS assay to validate and quantify a subset of “hits” identified in our untargeted analysis, illustrating the value of an untargeted high-throughput metabolomics approach despite the sample complexity of human urine.

There are limitations to our work. First, although we adjusted for known clinical factors, residual confounding and measurement error could still impact our findings. Results may also vary by clinical subgroups; we leave such investigation to future studies. Second, while our enrichment analysis revealed potential DKD-related pathways, our selected metabolites were also enriched for deficiencies associated with other diseases. Thus, the systematic pathway enrichment may not be specific to DKD, a limitation of pathway analyses when a given pathway could overlap multiple diseases. Finally, while we followed strict protocols for biospecimen storage, processing, and quality control, the age of our archived samples might also introduce errors. Also, while we validated a subset of metabolites with existing targeted approaches, it is necessary to develop quantitative assays to compare with our untargeted findings [41]. We are developing targeted assays for other prognostic markers for future validation. In addition, confirming (or refuting) our findings in independent cohorts using different assays will enhance generalizability.

In summary, we conducted large-scale untargeted and targeted metabolomics analysis using a diverse cohort and identified several metabolites implicated in DKD. We used robust statistical models incorporating rigorous techniques to reduce overfitting: cross-validation for feature selection, replicating results on a long-term clinical outcome, and validation using a targeted assay. While we highlighted and validated a few prognostic metabolites and related pathways, the most important aspect of our work is that we identified 131 potentially prognostic metabolites in our untargeted analysis, several of which are not addressed before. We hope that providing this list will spur further investigation of these promising metabolites and enhance understanding of DKD, eventually leading to better treatments and disease management.

We thank Lisa E. Wesby, MS, for assistance as the CRIC project manager in managing and facilitating the manuscript among all the authors. We thank the remaining CRIC co-investigators for providing the current data set and their continued efforts with this important study: Lawrence J. Appel, MD, MPH; Jing Chen, MD, MMSc, MSc; Alan S. Go, MD; James P. Lash, MD; Robert G. Nelson, MD, PhD, MS; Mahboob Rahman, MD; Vallabh O Shah, PhD, MS; Raymond R. Townsend, MD; Mark L. Unruh, MD, MS.

This study protocol was reviewed and approved by the IRBs and Scientific and Data Coordinating Center (approval # 807882). Informed consent was obtained from participants.

Uwe Sauer is a Cofounder of Metabolic Concepts Ltd., Boston, MA, USA. Kumar Sharma has a startup company, SygnaMap, that analyzes data generated from mass spec imaging of tissue sections for understanding solid organ diseases. Dr. Feldman reports consulting for Kyowa Hakko Kirin Co., Ltd., serving as the Editor-in-Chief of the American Journal of Kidney Diseases (AJKD), honoraria for speaking from InMed Physicians, and consulting with DLA Piper LLP. All other authors have nothing to disclose.

Funding for the CRIC study was obtained under a cooperative agreement from the National Institute of Diabetes and Digestive and Kidney Diseases (U01DK060990, U01DK060984, U01DK061022, U01DK061021, U01DK061028, U01DK060980, U01DK060963, U01DK060902, and U24DK060990). In addition, this work was supported in part by the Perelman School of Medicine at the University of Pennsylvania Clinical and Translational Science Award NIH/NCATS UL1TR000003, Johns Hopkins University UL1 TR-000424, University of Maryland GCRC M01 RR-16500, Clinical and Translational Science Collaborative of Cleveland, UL1TR000439 from the National Center for Advancing Translational Sciences (NCATS) component of the National Institutes of Health and NIH roadmap for Medical Research, Michigan Institute for Clinical and Health Research (MICHR) UL1TR000433, University of Illinois at Chicago CTSA UL1RR029879, Tulane COBRE for Clinical and Translational Research in Cardiometabolic Diseases P20 GM109036, Kaiser Permanente NIH/NCRR UCSF-CTSI UL1 RR-024131, Department of Internal Medicine, University of New Mexico School of Medicine Albuquerque, NM R01DK119199. Mr. Kwan, Dr. Fuhrer, Ms. Zhang, and Drs. Darshi, Montemayor, Sharma, and Natarajan were partially supported by NIDDK 1R01DK110541-01A1. Ms. Zhang and Drs. Darshi, Montemayor, Natarajan, and Sharma were also partially supported by DP3DK094352. Dr. Afshinnia is funded by K08-DK106523 and R03-DK121941. Dr. Anderson is supported by P20GM109036, U01DK060963, R01DK104730, and R01DK107566. Mr. Kwan was also supported by the National Science Foundation Graduate Research Fellowship Program under Grant No. DGE-1650112. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

J.Z., L.N., K.S., T.F., D.M., H.P.Y, M.D., and B.K. conceptualized the study design. T.F., D.M., J.Z., and B.K. contributed to data curation. T.F., D.M., U.S., and K.S. contributed to data acquisition. J.Z. conducted study analyses. J.Z. and L.N. drafted the manuscript. L.N. and K.S. contributed to funding acquisition and provided oversight to the study analyses. All authors contributed to data interpretation, critically reviewed and edited the manuscript for important intellectual content, and approved the final version. J.Z. is the guarantor of this work, as such, had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

The data that support the findings of this study are available on request from the CRIC study at cristudy.org upon reasonable request.

1.
Zelnick
LR
,
Weiss
NS
,
Kestenbaum
BR
,
Robinson-Cohen
C
,
Heagerty
PJ
,
Tuttle
K
,
Diabetes and CKD in the United States population, 2009–2014
.
Clin J Am Soc Nephrol
.
2017
;
12
(
12
):
1984
90
.
2.
Centers for Disease Control and Prevention
.
Prevalence of CKD in the diagnosed diabetes, undiagnosed diabetes, and prediabetes adult population
;
2020
. Available from: https://adr.usrds.org/2020/chronic-kidney-disease/1-ckd-in-the-general-population.
3.
Leong
A
,
Ekinci
EI
,
Nguyen
C
,
Milne
M
,
Hachem
M
,
Dobson
M
,
Long-term intra-individual variability of albuminuria in type 2 diabetes mellitus: implications for categorization of albumin excretion rate
.
BMC Nephrol
.
2017
;
18
(
1
):
355
.
4.
MacIsaac
RJ
,
Ekinci
EI
,
Jerums
G
.
Progressive diabetic nephropathy. How useful is microalbuminuria?: contra
.
Kidney Int
.
2014
;
86
(
1
):
50
7
.
5.
Krolewski
AS
,
Niewczas
MA
,
Skupien
J
,
Gohda
T
,
Smiles
A
,
Eckfeldt
JH
,
Early progressive renal decline precedes the onset of microalbuminuria and its progression to macroalbuminuria
.
Diabetes Care
.
2014
;
37
(
1
):
226
34
.
6.
Thornton Snider
J
,
Sullivan
J
,
van Eijndhoven
E
,
Hansen
MK
,
Bellosillo
N
,
Neslusan
C
,
Lifetime benefits of early detection and treatment of diabetic kidney disease
.
PLoS One
.
2019
;
14
(
5
):
e0217487
.
7.
Pena
MJ
,
Heinzel
A
,
Heinze
G
,
Alkhalaf
A
,
Bakker
SJ
,
Nguyen
TQ
,
A panel of novel biomarkers representing different disease pathways improves prediction of renal function decline in type 2 diabetes
.
PLoS One
.
2015
;
10
(
5
):
e0120995
.
8.
Darshi
M
,
Van Espen
B
,
Sharma
K
.
Metabolomics in diabetic kidney disease: unraveling the biochemistry of a silent killer
.
Am J Nephrol
.
2016
;
44
(
2
):
92
103
. .
9.
Lecamwasam
A
,
Ekinci
EI
,
Saffery
R
,
Dwyer
KM
.
Potential for novel biomarkers in diabetes-associated chronic kidney disease: epigenome, metabolome, and gut microbiome
.
Biomedicines
.
2020
;
8
(
9
):
341
. .
10.
Chen
CJ
,
Liao
WL
,
Chang
CT
,
Lin
YN
,
Tsai
FJ
.
Identification of urinary metabolite biomarkers of type 2 diabetes nephropathy using an untargeted metabolomic approach
.
J Proteome Res
.
2018
;
17
(
11
):
3997
4007
. .
11.
Duranton
F
,
Lundin
U
,
Gayrard
N
,
Mischak
H
,
Aparicio
M
,
Mourad
G
,
Plasma and urinary amino acid metabolomic profiling in patients with different levels of kidney function
.
Clin J Am Soc Nephrol
.
2014
;
9
(
1
):
37
45
.
12.
Sharma
K
,
Karl
B
,
Mathew
AV
,
Gangoiti
JA
,
Wassel
CL
,
Saito
R
,
Metabolomics reveals signature of mitochondrial dysfunction in diabetic kidney disease
.
J Am Soc Nephrol
.
2013
;
24
(
11
):
1901
12
.
13.
Afshinnia
F
,
Nair
V
,
Lin
J
,
Rajendiran
TM
,
Soni
T
,
Byun
J
,
Increased lipogenesis and impaired β-oxidation predict type 2 diabetic kidney disease progression in American Indians
.
JCI Insight
.
2019
;
4
(
21
):
e130317
.
14.
Colombo
M
,
Looker
HC
,
Farran
B
,
Hess
S
,
Groop
L
,
Palmer
CNA
,
Serum kidney injury molecule 1 and β(2)-microglobulin perform as well as larger biomarker panels for prediction of rapid decline in renal function in type 2 diabetes
.
Diabetologia
.
2019
;
62
(
1
):
156
68
.
15.
Colombo
M
,
Valo
E
,
McGurnaghan
SJ
,
Sandholm
N
,
Blackbourn
LAK
,
Dalton
RN
,
Biomarker panels associated with progression of renal disease in type 1 diabetes
.
Diabetologia
.
2019
;
62
(
9
):
1616
27
.
16.
Niewczas
MA
,
Mathew
AV
,
Croall
S
,
Byun
J
,
Major
M
,
Sabisetti
VS
,
Circulating modified metabolites and a risk of ESRD in patients with type 1 diabetes and chronic kidney disease
.
Diabetes Care
.
2017
;
40
(
3
):
383
90
.
17.
Pongrac Barlovic
D
,
Harjutsalo
V
,
Sandholm
N
,
Forsblom
C
,
Groop
PH
.
Sphingomyelin and progression of renal and coronary heart disease in individuals with type 1 diabetes
.
Diabetologia
.
2020
;
63
(
9
):
1847
56
. .
18.
Mutter
SVE
,
Aittomäki
V
,
Nybo
K
,
Raivonen
L
,
Thorn
LM
,
Forsblom
C
,
Urinary metabolite profiling and risk of progression of diabetic nephropathy in 2670 individuals with type 1 diabetes
.
Diabetologia
.
2021
;
65
(
1
):
140
.
19.
Kwan
B
,
Fuhrer
T
,
Zhang
J
,
Darshi
M
,
Van Espen
B
,
Montemayor
D
,
Metabolomic markers of kidney function decline in patients with diabetes: evidence from the chronic renal insufficiency cohort (CRIC) study
.
Am J Kidney Dis
.
2020
;
76
(
4
):
511
20
.
20.
Feldman
HI
,
Appel
LJ
,
Chertow
GM
,
Cifelli
D
,
Cizman
B
,
Daugirdas
J
,
The chronic renal insufficiency cohort (CRIC) study: design and methods
.
J Am Soc Nephrol
.
2003
;
14
(
7 Suppl 2
):
S148
53
.
21.
Levey
AS
,
Stevens
LA
,
Schmid
CH
,
Zhang
YL
,
Castro
AF
 3rd
,
Feldman
HI
,
A new equation to estimate glomerular filtration rate
.
Ann Intern Med
.
2009
;
150
(
9
):
604
12
.
22.
Anderson
AH
,
Yang
W
,
Hsu
CY
,
Joffe
MM
,
Leonard
MB
,
Xie
D
,
Estimating GFR among participants in the chronic renal insufficiency cohort (CRIC) study
.
Am J Kidney Dis
.
2012
;
60
(
2
):
250
61
.
23.
Benjamini
Y
,
Hochberg
Y
.
Controlling the false discovery rate: a practical and powerful approach to multiple testing
.
J Royal Stat Soc
.
1995
;
57
(
1
):
289
300
. .
24.
Hastie
T
,
Tibshirani
R
,
Friedman
J
.
The elements of statistical learning: data mining, inference, and prediction
. 2nd ed.
NewYork
:
Springer
;
2009
.
25.
Wishart
DS
,
Feunang
YD
,
Marcu
A
,
Guo
AC
,
Liang
K
,
Vázquez-Fresno
R
,
HMDB 4.0: the human metabolome database for 2018
.
Nucleic Acids Res
.
2018
;
46
(
D1
):
D608
d17
.
26.
Manoli
I
,
Venditti
CP
.
Disorders of branched chain amino acid metabolism
.
Transl Sci Rare Dis
.
2016
;
1
(
2
):
91
110
. .
27.
White
PJ
,
Newgard
CB
.
Branched-chain amino acids in disease
.
Science
.
2019
;
363
(
6427
):
582
3
. .
28.
Sell
DR
,
Strauch
CM
,
Shen
W
,
Monnier
VM
.
Aging, diabetes, and renal failure catalyze the oxidation of lysyl residues to 2-aminoadipic acid in human skin collagen: evidence for metal-catalyzed oxidation mediated by alpha-dicarbonyls
.
Ann N Y Acad Sci
.
2008
;
1126
:
205
9
. .
29.
Verzola
D
,
Famà
A
,
Villaggio
B
,
Di Rocco
M
,
Simonato
A
,
D’Amato
E
,
Lysine triggers apoptosis through a NADPH oxidase-dependent mechanism in human renal tubular cells
.
J Inherit Metab Dis
.
2012
;
35
(
6
):
1011
9
.
30.
Cheng
Y
,
Li
Y
,
Benkowitz
P
,
Lamina
C
,
Köttgen
A
,
Sekula
P
.
The relationship between blood metabolites of the tryptophan pathway and kidney function: a bidirectional Mendelian randomization analysis
.
Sci Rep
.
2020
;
10
(
1
):
12675
. .
31.
de Vries
LV
,
Minović
I
,
Franssen
CFM
,
van Faassen
M
,
Sanders
JS
,
Berger
SP
,
The tryptophan/kynurenine pathway, systemic inflammation, and long-term outcome after kidney transplantation
.
Am J Physiol Renal Physiol
.
2017
;
313
(
2
):
F475
f86
.
32.
Debnath
S
,
Velagapudi
C
,
Redus
L
,
Thameem
F
,
Kasinath
B
,
Hura
CE
,
Tryptophan metabolism in patients with chronic kidney disease secondary to type 2 diabetes: relationship to inflammatory markers
.
Int J Tryptophan Res
.
2017
;
10
:
1178646917694600
.
33.
Anderson
AH
,
Xie
D
,
Wang
X
,
Baudier
RL
,
Orlandi
P
,
Appel
LJ
,
Novel risk factors for progression of diabetic and nondiabetic CKD: findings from the chronic renal insufficiency cohort (CRIC) study
.
Am J Kidney Dis
.
2021
;
77
(
1
):
56
73
.
34.
van der Kloet
FM
,
Tempels
FW
,
Ismail
N
,
van der Heijden
R
,
Kasper
PT
,
Rojas-Cherto
M
,
Discovery of early-stage biomarkers for diabetic kidney disease using ms-based metabolomics (FinnDiane study)
.
Metabolomics
.
2012
;
8
(
1
):
109
19
.
35.
National Center for Biotechnology Information
.
pubchem compound summary for CID 61157688, asparaginyl-hydroxyproline
;
2021
. Available from: https://pubchem.ncbi.nlm.nih.gov/compound/Asparaginyl-Hydroxyproline.
36.
Ma
L
,
Li
N
,
Neu
J
.
Small intestinal hypoxic injury and use of arginyl-glutamine dipeptide: applications to pediatrics
. In:
Rajendram
R
,
Preedy
VR
,
Patel
VB
, editors.
Glutamine in clinical nutrition
.
New York, NY
:
Springer New York
;
2015
. p.
369
82
. .
37.
Hunter
WG
,
Kelly
JP
,
McGarrah
RW
 3rd
,
Khouri
MG
,
Craig
D
,
Haynes
C
,
Metabolomic profiling identifies novel circulating biomarkers of mitochondrial dysfunction differentially elevated in heart failure with preserved versus reduced ejection fraction: evidence for shared metabolic impairments in clinical heart failure
.
J Am Heart Assoc
.
2016
;
5
(
8
):
e003190
.
38.
Jacob
S
,
Nodzenski
M
,
Reisetter
AC
,
Bain
JR
,
Muehlbauer
MJ
,
Stevens
RD
,
Targeted metabolomics demonstrates distinct and overlapping maternal metabolites associated with BMI, glucose, and insulin sensitivity during pregnancy across four ancestry groups
.
Diabetes Care
.
2017
;
40
(
7
):
911
9
.
39.
de Paulis
T
,
Commers
P
,
Farah
A
,
Zhao
J
,
McDonald
MP
,
Galici
R
,
4-Caffeoyl-1,5-quinide in roasted coffee inhibits [3H]naloxone binding and reverses anti-nociceptive effects of morphine in mice
.
Psychopharmacology
.
2004
;
176
(
2
):
146
53
.
40.
Izzedine
H
,
Launay-Vacher
V
,
Baumelou
A
,
Deray
G
.
An appraisal of antiretroviral drugs in hemodialysis
.
Kidney Int
.
2001
;
60
(
3
):
821
30
. .
41.
Sekula
P
,
Dettmer
K
,
Vogl
FC
,
Gronwald
W
,
Ellmann
L
,
Mohney
RP
,
From discovery to translation: characterization of C-mannosyltryptophan and pseudouridine as markers of kidney function
.
Sci Rep
.
2017
;
7
(
1
):
17400
.

Additional information

Sharma and Natarajan are co-senior authors.

Open Access License / Drug Dosage / Disclaimer
This article is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC). Usage and distribution for commercial purposes requires written permission. Drug Dosage: The authors and the publisher have exerted every effort to ensure that drug selection and dosage set forth in this text are in accord with current recommendations and practice at the time of publication. However, in view of ongoing research, changes in government regulations, and the constant flow of information relating to drug therapy and drug reactions, the reader is urged to check the package insert for each drug for any changes in indications and dosage and for added warnings and precautions. This is particularly important when the recommended agent is a new and/or infrequently employed drug. Disclaimer: The statements, opinions and data contained in this publication are solely those of the individual authors and contributors and not of the publishers and the editor(s). The appearance of advertisements or/and product references in the publication is not a warranty, endorsement, or approval of the products or services advertised or of their effectiveness, quality or safety. The publisher and the editor(s) disclaim responsibility for any injury to persons or property resulting from any ideas, methods, instructions or products referred to in the content or advertisements.