Abstract
Background/Aims: Fetal macrosomia and its associated complications are the most frequent and serious morbidities for infants associated with gestational diabetes mellitus (GDM). The aim of this study was to evaluate the lncRNAs involvement in GDM, especially for the prediction of risk for fetal overgrowth. Methods: The peripheral blood obtained from four group including healthy control (NC), healthy volunteers with pregnancy (NC-P), GDM patients with and without macrosomia were screened by lncRNA microarray and validated by quantitative real-time PCR (RT-qPCR) arranged in the training and a two-stage validation sets. The positive and negative prediction ability for candidate lncRNAs were analyzed by risk score analysis. Results: A multiple venny analysis was performed revealed five candidate lncRNA including XLOC_014172, RP11-230G5.2, PCBP1-AS1, LOC149086 and RP11-160H22.5 which was consistence with the following parameter: i, increased in GDM patients with macrosomia (GDM-M) comparing with patients without macrosomia; ii, increased in GDM-M comparing with NC-P group; iii, increased in GDM-M comparing with NC. Further validation found XLOC_014172 and RP11-230G5.2 was final consistence with these parameter in 150 samples each group. Further receiver operating characteristic curve (ROC) analysis, with the combined two stably expressed lncRNAs indicated a high diagnostic ability an area under ROC curve value (AUC) of 0.955 and 0.962 in training set and validation set respectively. Conclusions: Circulating XLOC_014172 and RP11-230G5.2 may act as novel biomarkers in GDM patients as fingerprint for the risk of macrosomia outcome.
Introduction
Gestational diabetes mellitus (GDM), defined as any degree of abnormal glucose tolerance with onset or first recognition during pregnancy, is one of the most common medical complications of pregnancy [1, 2]. Babies born to mothers with GDM are typically at increased risk of excessive birth weight or macrosomia which defined as weighing more than 4 kg [3, 4]. Macrosomia is closely associated with higher rates of cesarean section and sequential use of vacuum and obstetric forceps for vaginal delivery for the mother, and clavicle fracture, intracranial hemorrhage and respiratory distress syndrome for the newborn [5, 6].
Recently, genetic studies have focused on non-coding RNAs. Over the past decade, many studies have reported that non-coding RNAs have important regulatory potentials, both transcriptionally and post transcriptionally [7-9]. LncRNAs, tentatively defined as non-coding RNAs more than 200 nucleotides, are usually divided into exonic lncRNAs, intronic lncRNAs, intergenic lncRNAs and overlapping lncRNAs in accordance with their location relative to the protein-coding transcripts [10, 11]. LncRNAs are known to play important roles during cellular development and differentiation, and a large range of biological processes, such as modulation of tumor proliferation and invasiveness, and reprogramming of induced pluripotent stemcells [12, 13]. However, to our knowledge to date, little study has been performed regarding the circulating lncRNAs for dynamic monitoring of GDM patients as well as the risk of macrosomia outcome.
In this study, we conducted our experiment based on the identified lncRNAs which was different expression in the GDM patients with or without macrosomia outcome. Five lncRNAs (XLOC_014172, RP11-230G5.2, PCBP1-AS1, LOC149086 and RP11-160H22.5) obtained as candidate fingerprints. We aimed to explore the potential lncRNA acting as fingerprint for the early and dynamic identification of macrosomia outcome in GDM patients.
Materials and Methods
Samples and Screening phase
This study was approved by an institutional review board, blood samples were collected after informed consent. Women with GDM, diagnosed by 75g oral glucose tolerance test (OGTT) during their 24th to 28th week of gestation at the Obstetrics and Gynecology Department in Wuxi Maternity and Child Health Hospital in 2011-2016 were enrolled in this study. According to the IADPSG (The International Association of Diabetes and Pregnancy Study Groups) and ADA (American Diabetes Association) recommendation. GDM was diagnosed if any one glucose level was equal to or greater than fasting 5.1 mmol/L, one hour post load 10.0 mmol/L or two hours post load 8.5 mmol/L. Pregnant women with GDM were treated with a strict diet control to maintain the fasting plasma glucose level at less than 5.3mmol/L and 2h postprandial level at less than 6.7mmol/L. Insulin treatment was administrated when dietary control could not achieve this goal. The babies of GDM group were diagnosed as macrosomia when their birth-weight was equal to or exceeded 4000 g [14, 15]. Controls were from healthy volunteers without any health problems during their health checkups at the at the Obstetrics and Gynecology Department in Wuxi Maternity and Child Health Hospital. The research protocol was approved by the Institutional Ethics Committee of the Wuxi Maternity and Child Health Hospital. Written informed consent was obtained from every participant. The peripheral blood samples of patients were collected in the early pregnancy during the antenatal examination period. Blood samples were collected in a separate vacuum cube, followed by centrifugation at 3, 000 rpm for 10 min. All samples were stored at –80°C until further analysis. The relevant clinical data of all patients were available. The detailed information of patients and healthy controls was summarized in Table 1.
The screening phase was divided into training set and validation set. 20 samples each group were enrolled while the validation set contained two cohorts. Groups were named as healthy controls (NC), pregnant volunteer without DM (NC-P), GDM patients without macrosomia (GDM) and GDM patients with macrosomia (GDM-M). The detailed workflow was presented in Fig. 1. The sample number in training set was 20 participants in each group while 130 participants in each group pf validation set.
Training set
All candidates were tested in an independent cohort of 20 plasma samples obtained from patients. The expression levels of these candidates were analyzed in all samples, and comparative 2-ΔΔCt method algorithms were used to analyze the difference between different groups.
Validation set
A case-control study was designed to validate the obvious difference of relative expression levels of the selected potential biomarkers candidates in another independent cohort including 130 samples each group.
Quantitative real-time PCR (qRT-PCR)
Total RNAs were obtained from plasmic samples using TRIzol reagent as described by the manufacturer (Invitrogen Life Technologies Co, Carlsbad, CA, USA). For mRNA detection, total RNAs (500ng) were reverse transcribed using the reverse transcription kit (Takara, Tokyo, Japan). The quality of RNA samples was assessed by a UV spectrophotometer (Bio-Rad, Hercules, CA, USA), and the 260/280nm absorbance ratio of samples were limited to 1.8-2.0. QRT-PCR was performed by using ABI Prism 7900HT (Applied Biosystems, CA, USA).
Risk score analysis
Risk score analysis was performed to evaluate the associations between the concentrations of the plasma lncRNA expression levels. The upper 95% reference interval of each lncRNA value in controls or non-metastasis group was set as the threshold to code the expression level of the corresponding lncRNA for each sample as 0 and 1 in the training set. A risk score function (RSF) to predict NSCLC group was defined according to a linear combination of the expression level for each lncRNA. For example, the RSF for sample i using information from three lncRNAs was: rsfi=∑3j-1Wj.sij. In the above equation, sij is the risk score for lncRNA j on sample i, and Wj is the weight of the risk score of lncRNA j. The risk score of three lncRNAs was calculated using the weight by the regression coefficient that was derived from the univariate logistic regression analysis of each lncRNAs. Samples were ranked according to their RSF and then divided into a high-risk group, representing the GDM-M group, and a low-risk group, representing the predicted control individuals. Frequency tables and ROC curves were then used to evaluate the diagnostic effects of the profiling and to find the appropriate cutoff point, and to validate the procedure and cutoffs in the next validation sample set.
Statistical analysis
Data were presented as mean (S.E.M.) If there is no special case. The Student’s t-test and Mann-Whitney unpaired test analysis of variance were used to evaluate statistical differences between patients and controls. Analysis of area under the ROC curve (AUC) was used to estimate the effectiveness of lncRNAs for prediction. Statistical analysis was performed using softwares of STATA 10.0 and GraphPad prism. In all cases, P< 0.05 was considered to be significant. All P values were two-sided.
Results
Subject characteristics
The clinical characteristics of the GDM patients and other groups were presented in Table 1. GDM patients presented with higher plasma glucose both during fasting and during OGTT, and higher HbA1C and triglycerides than control pregnant women (P< 0.001). HDL cholesterol was significantly lower in GDM subjects (P< 0.05). No difference were obtained in GDM-M group comparing with the GDM group (Table 1).
The circulating lncRNA expression landscape in GDM patients
Human LncRNA Array v3.0 was used to detect the lncRNA derived from plasma of 12 sample including three peripheral blood samples from healthy volunteer, three from pregnant volunteer without DM (NC-P), and three from GDM patients without macrosomia and three from GDM patients with macrosomia. Aberrant expression of lncRNA underwent hierarchical clustering using a heat map, resulting in a profile of the differential expression of lncRNA in the different groups (Fig. 2A). Among these, we identified 284 increased lncRNAs in pregnant volunteer without DM comparing with healthy control without pregnancy; 197 increased lncRNAs in GDM patients without macrosomia comparing with pregnant volunteer without DM and 147 increased lncRNAs in GDM patients with macrosomia comparing with GDM patients without macrosomia according to a fold-change cut-off of 4/0.25. To further screened the potential fingerprints for GDM as predicting the macrosomia out in gravidae, we next using the following parameter for screening: i, increased in GDM patients with macrosomia (GDM-M) comparing with patients without macrosomia; ii, increased in GDM-M comparing with NC-P group; iii, increased in GDM-M comparing with NC. Finally, we obtained five candidate lncRNAs which was consistence with all the parameters mentioned above including XLOC_014172, RP11-230G5.2, PCBP1-AS1, LOC149086 and RP11-160H22.5 (Fig. 2B).
Validation of candidate lncRNAs in large sample size
We next examined their expression levels through a larger individual samples (150 samples each group). As shown in Fig. 2C, we found that only XLOC_014172 and RP11-230G5.2 were consisted with the training set while PCBP1-AS1, LOC149086 and RP11-160H22.5 indicated no significant difference (Fig. 3).
To validate the accuracy and specificity of the two lncRNAs as a GDM with macrosomia outcome potential signature, the risk score formula was applied to assess the diagnostic value of the two lncRNAs profiling system. Firstly, the risk score of each plasma sample in the training set was calculated (20 samples random selection each group), as the basis of their risk scores and a set cut off, plasma samples were then divided into a high-risk group, representing the possible GDM-M group, and a low-risk group, representing the GDM group. At the optimal cutoff value (Value=9.889) with the value of sensitivity + specificity considered to be maximal, the positive predictive value (PPV) and negative predictive value (NPV) was 70% and 80% in the training set, respectively. Similarly, when the same cutoff value was applied to calculate the risk score of samples in the larger validation sets, the PPV and NPV were 85%, 86%, respectively (Table 2).
ROC curve analysis was used to evaluate the predicting diagnosis value of lncRNAs for GDM-M. Areas under the ROC curves (AUC) of the two lncRNAs signature in training set were 0.900 and 0.885, respectively while the combination of the two factors possessed a moderate ability for discrimination between GDM-M and GDM with an area under the ROC curve of 0.955 (Fig. 4A). Additional, in the validation set, AUC of the two lncRNAs signature in training set were 0.904 and 0.849, respectively while the combination of the two factors possessed a moderate ability for discrimination between GDM-M and GDM with an area under the ROC curve of 0.962 (Fig. 4B).
Discussion
Gestational diabetes mellitus is a complex disease induced by a combination of genetic factors and environmental exposure, with genetic components being the major determinant of disease [16]. To identify specific lncRNA signature and its biological significance in the pathobiology of GDM, we performed lncRNA profiling in plasma samples from healthy volunteer, pregnant volunteer without DM, GDM patients without macrosomia and GDM patients with macrosomia outcome. In our study, we performed a case-control study through the high throughput lncRNA microarray, and discovered two novel lncRNA, XLOC_014172 and RP11-230G5.2, acting as the biomarker for predicting the outcome of macrosomia in GDM patients. The risk score analysis including a multistage validation was employed to evaluate the association between GDM-M group and the lncRNAs expression levels.
It has been widely reported that disease-associated lncRNAs are detectable in blood, sputum, urine, and other biological fluids of patients [17, 18]. Besides, miRNA such as miR-508-3p has been identified as signature in gestational diabetes mellitus and was associated with risk of macrosomia [19]. For lncRNA, researcher has reported the deregulated MALAT1 expression in patients with gestational diabetes mellitus; however, no detailed analysis was applied for predicting the GDM or the outcome of macrosomia in GDM patients [20]. Therefore, lncRNA might be investigated as potential biomarkers for the dynamic monitoring of GDM. The function of lncRNA has been focused by other project, for example, Shi Z, et al, reported a microarray expression profile analysis of lncRNA in umbilical cord plasma samples of GDM patients, they revealed a landscape of lncRNA in patients suffering from GDM-induced macrosomia [21]; however, no mechanism study or diagnosis analysis was performed. Moreover, alteration in expression and methylation of lncRNA igf2/h19 in placenta and umbilical cord blood were also associated with macrosomia exposed to intrauterine hyperglycemia [22].
From a clinical point of view, the lncRNA biomarker selected in our study for the monitoring of GDM-induced macrosomia might be useful for indicating the clinical typing of GDM which is a great help to decide the way of the treatment. Furthermore, for as a single diagnostic method, the biomarker in our study presented a high sensitivity and specificity, nevertheless, given that the express fluctuations of plasma lncRNA analysis with a panel of multiple diagnosis methods might be a solution to the problem of GDM detection.
In conclusion, we have identified the unique lncRNA biomarker for early screening of GDM-induced macrosomia in GDM patients, which may serve as a novel non-invasive approach for diagnosis and dynamic monitoring of GDM-induced macrosomia.
Disclosure Statement
The authors declare to have no competing interests.