Abstract
Background/Aims: Autosomal dominant polycystic kidney disease (ADPKD) is one of the most common inherited cystic kidney diseases caused by mutations in two large multi-exon genes, PKD1 and PKD2. High allelic heterogeneity and duplication of PKD1 exons 1–32 as six pseudo genes on chromosome 16 complicate molecular analysis of this disease. Methods: We applied targeted next-generation sequencing (NGS) in 9 non-consanguineous unrelated Iranian families with ADPKD to identify the genes hosting disease-causing mutations. This approach was confirmed by Sanger sequencing. Results: Here, we determined three different novel frameshift mutations and four previously reported nonsense mutations in the PKD1 gene encoding polycystin1 in heterozygotes. Conclusion: This study demonstrates the effectiveness of NGS in significantly reducing the cost and time for simultaneous sequence analysis of PKD1 and PKD2, simplifying the genetic diagnostics of ADPKD. Although a probable correlation between the mutation types and phenotypic outcome is possible, however for more extensive studies in future, the consideration of renal hypouricemia (RHUC) and PKD1 coexistence may be helpful. The novel frameshift mutations reported by this study are p. Q1997X, P. D73X and p. V336X.
Introduction
Autosomal dominant polycystic kidney disease (ADPKD; OMIMs #173900; 613095) is the most common inherited kidney disease, ultimately leading to end-stage renal disease (ESRD) with an incidence of 1 in 400 to 1 in 1000 [1]. It is characterized by the development and progressive enlargement of cysts in the kidney, typically in late middle age. About 8% of patients on dialysis or undergoing kidney transplantation have ADPKD. It is a systemic disorder where cysts develop in the kidneys and other areas of the body lead to many clinical manifestations [2].
ADPKD is caused by mutations in two genes, PKD1 and PKD2. PKD1 (MIM# 601313, 16p13.3) accounts for 85% of cases and mutation in PKD2 (MIM#173910, 4q21-22) is responsible for the remainder of the cases (15%). The phenotype of PKD2 is milder than PKD1 [3]. Both PKD1 and PKD2 exhibit marked allelic heterogeneity, with a high level of gene variations [4]. PKD1 is a complex gene with 46 exons extending to 52 kb of the genomic DNA that encodes a GC-rich transcript of approximately 14 kb length, [5] 70% of its length (exons 1–33) shows six-fold duplication on chromosome 16p within six pseudo-genes and shares a 97.7% sequence identity with the un-mutated gene [4, 6-8]. Up to now, more than 2300 mutations of PKD1 have been identified [9]. PKD2 gene consists of 15 exons which encode a 3 kb open reading frame [10]. More than 300 mutations of PKD2 have been identified. The PKD1 and PKD2 genes encode respectively polycystin-1 and polycystin-2 proteins [11]. Since ADPKD is a dominant disease with high penetrance, the children of affected parents have on average a 50% chance of developing the disease [12].
ADPKD also shows high level of phenotypic variability in the onset of ESRD with respect to age [13].Three important issues could be considered in ADPKD genotype-phenotype correlation studies: first, ESRD occurs averagely about 20 years earlier in ADPKD patients with PKD1 than those with PKD2 [13]. Second, the position of the mutation in PKD1 gene is associated with ESRD onset with respect to age [14]. Third, mutation type in PKD1 gene, regardless to mutation position, correlate with renal survival [13].
There is a general consensus that ADPKD could routinely be diagnosed by renal imaging, using age-dependent cyst number criteria determined by ultrasound [15]. Computed tomography (CT) and magnetic resonance imaging (MRI) can also quantify renal cystic disease, they are not very effective at diagnosing ADPKD in young individuals and in patients with mild symptoms caused by PKD2 mutations. ADPKD is not clinically detectable throughout the first few decades of life (age, <30 years) [16]. The high allelic heterogeneity of ADPKD genes with six pseudo-genes at the 5′ end of the PKD1 gene, elongated in two-thirds of the gene, make molecular diagnostics problematic [17].
The requirement for a relatively large family and multiple diagnosed members are limiting factors for genetic diagnosis of ADPKD by linkage analysis [18]. The high cost and time required for PKD1 and PKD2 mutation analysis using Sanger sequencing technique limit its extensive utilization in the clinical setting. It requires laborious long-range amplifications [12]. Next generation sequencing (NGS) of PKD1 and PKD2 genes have been tried for ADPKD diagnosis [19-21] and using this technique routine genetic diagnostics of ADPKD and a significant reduction in the time and cost could be achieved for simultaneous sequence analysis of ADPKD genes [22]. Despite the abundance of families with ADPKD in the Asian population [23],there are very few published reports in the literature. In this communication, we report the study of PKD1 and PKD2 in nine Iranian unrelated ADPKD families using targeted NGS.
Materials and Methods
Ethical compliance
The study was approved by the local ethics committees, in adherence to thedeclaration of Helsinki, and informed consent was obtained from all the patients concerned before their participation in this study.
Subject data
Nine unrelated ADPKD probands (seven males and two females) showing classical dominant pattern of inheritance were recruited for the study from dialysis centers and nephrology clinics in Tehran, Iran. The diagnosis of ADPKD in the selected probands was established by Pei’s ultrasound diagnostic criteria [24]. The probands had one affected parent with ADPKD and all of them had at least one member with a more serious manifestation of the disease (ESRD before 50 years of age) or they had already had developed ESRD. All the probands selected had not gone through mutational screening. Asymptomatic at-risk individuals were examined by ultrasound. Pedigrees were generated using the family information from probands. Total genomic DNA from available family members was extracted from blood samples using a standard phenol–chloroform procedure. The quality and quantification of DNA samples were appraised using spectrophotometry and 1% agarose gel electrophoresis, respectively. Some clinical characteristics of the 9 ADPKD patients studied are summarized in Table 1.
Targeted Next Generation Sequencing data analysis
Targeted NGS experiments were performed by BGI. The designed custom solution-based capture array hybridization (NimbleGen, Roche) was applied for capture. Image analysis, error estimation and base calling were performed via Illumina Pipeline for raw data generation by BGI. Indexed primers were used to determine the reads from different samples. The 90 bp clean reads obtained were then aligned using Burrows Wheeler Aligner (BWA) software with human reference genome from the NCBI database (Build 37). Here, the applied NCBI RefSeq sequences were NG_008617.1 and NM_001009944.2 for PKD1 and NG_008604 and NM_000297.3 for PKD2. SNPs and insertion-deletions (indels) were identified with SOAPsnp software and the Genome Analysis Toolkit (GATK) [21].
Sanger sequencing
To validate the variations identified by NGS, the corresponding gene regions surrounding the variations were amplified by long-range polymerase chain reaction (LR-PCR) according to previously reported PKD1-specific primers [25]. PCR products were purified from agarose gel slices with the Fermentas kit (ThermoFisher Scientific, Rockford, IL, USA) and sequenced by Sanger sequencing using appropriate reported internal primers and the manufacturer’s protocol (Table 2).
Bioinformatics analysis and mutation identification and classification
For confirmation of candidate mutations against electronic database information and computational analysis, the detected sequence variations within our study were checked with the currently published list of PKD gene variants in the Human Gene Mutation Database (HGMD) [26] and Autosomal Dominant Polycystic Kidney Disease Mutation Database (PKDB) [27]. To evaluate the effects of novel variations, they were analyzed by a web-based computational pathogenicity prediction tool which included Mutation Taster [28].
Segregation analysis
To confirm the association between mutation and the pathogenicity of the disease, careful analysis of other available patients in the pedigree was carried out by direct sequencing. Meanwhile, unaffected family members and normal controls from the Iranian normal population database were also checked for the mutation to confirm the prediction.
Results
We performed mutational analysis of ADPKD genes in the nine unrelated Iranian ADPKD families. Using targeted NGS, three frameshift deletion mutations and four nonsense mutations in nine families were detected in our set of ADPKD probands (Table 3). All were heterozygotes. We did not detect any aberrant variants in PKD2 gene. Detected frameshift mutations had not been reported previously in any published database including exome variant server (publicly available; http://evs.gs.washington.edu/EVS/), ADPKD mutation (PKDB; http://pkdb.mayo.edu) and NCBI dbSNP databases. They have been determined as novel and validated using classic Sanger sequencing.
Mutation taster software was used to assess the pathogenicity of novel variations, and it recognized all of them as disease-causing mutations. Each candidate mutation was assessed for its co-segregation with disease in the relevant available family members, at least one normal (aged, >30 years individuals) and one available affected member. The mutations were identified in all affected members and were absent in asymptomatic family members and in 400 normal members (Iranian population).This provides evidence for causative pathogenic mutations.
Mutations detection
All detected variants were catagorised as frameshift and nonsense mutations, predicting an early stop codon (Table 3). The frameshift mutations p. Q1997X, P. D73X and p. V336X were detected in patients PKD06, PKD 011 and PKD013, respectively (Fig. 1).The nonsense mutation p.Q3475X in exon 34 of the PKD1 gene was found in two unrelated Iranian patients (PKD015 and PKD016) and the nonsense mutation p.R1436X in exon 15 was found in another two unrelated Iranian patients (PKD01 and PKD03) .Mutations p.Q2136X in exon 15 and p.Q533X in exon 7 were detected in patients PKD04 and PKD09, respectively.
The 3 novel frameshift and previously reported nonsense mutations in PKD1 gene were identified by DNA sequencing and the partial sequence at the related location in the affected member and wild-type gene. (a) Transition of a C>T at nucleotide 4306 in exon 15 of the PKD1 gene resulted in a premature stop codon, which might stop the transcription and lead to truncation of polycystin-1 in families PKD01 and PKD03; (b) transition of a C>T in exon 15 at nucleotide 6406 leading to a stop codon in family PKD04; (c) deletion of G at nucleotide 5991 in exon 15 causing a frameshift of the reading frame in family PKD06; (d) transition of a C>T at nucleotide 1597 in exon 7 leading to a stop codon in family PKD09; (e)2 bp deletion at nucleotide 217-218 in exon 2 causing a frameshift of the reading frame in family PKD011; (f) a 14 bp deletion from nucleotide 1007 to 1020 in exon 5 causing a frameshift of the reading frame in family PKD013; (g) transition of a C>T in exon 34 at nucleotide 10420 leading to a stop codon in families PKD015 and PKD016.
The 3 novel frameshift and previously reported nonsense mutations in PKD1 gene were identified by DNA sequencing and the partial sequence at the related location in the affected member and wild-type gene. (a) Transition of a C>T at nucleotide 4306 in exon 15 of the PKD1 gene resulted in a premature stop codon, which might stop the transcription and lead to truncation of polycystin-1 in families PKD01 and PKD03; (b) transition of a C>T in exon 15 at nucleotide 6406 leading to a stop codon in family PKD04; (c) deletion of G at nucleotide 5991 in exon 15 causing a frameshift of the reading frame in family PKD06; (d) transition of a C>T at nucleotide 1597 in exon 7 leading to a stop codon in family PKD09; (e)2 bp deletion at nucleotide 217-218 in exon 2 causing a frameshift of the reading frame in family PKD011; (f) a 14 bp deletion from nucleotide 1007 to 1020 in exon 5 causing a frameshift of the reading frame in family PKD013; (g) transition of a C>T in exon 34 at nucleotide 10420 leading to a stop codon in families PKD015 and PKD016.
Discussion
In Iran, ADPKD is the most common inherited kidney disease, affecting more than 200,000 Iranians. The study of PKD1 and PKD2 gene mutations in Iranian patients contribute to the data of genetic diversity between diverse ethnic groups and further upgrade the mutation database of the world population. A plethora of mutations in PKD1 and PKD2 genes have been reported, including frameshift, nonsense, splice-site, missense, small insertion-deletion and large deletion/duplication rearrangements. In spite of the wide range of existing disease-causing variants in ADPKD genes, the investigations of Iranian patients with ADPKD are rather limited and very few reported mutations have been found in the studies of Iranian ADPKD families. To the best of our knowledge, this is the first significant study of ADPKD gene mutations in the Iranian ADPKD families by targeted NGS as efficient method for detection of ADPKD. The NGS method used in this study has the potential to greatly improve genetic testing for ADPKD genes by allowing simultaneous detection of copy number variations and point mutations via a single test. Accordingly, this method may increase the rate of mutation detection and hence significantly decrease testing turnaround time and costs.
In this study, we studied nine unrelated Iranian families with ADPKD. We identified three novel frameshift mutations and four previously reported nonsense mutations in PKD1 gene in all selected ADPKD families (Fig. 1). The total mutation detection rate was 100%. The mutations in all patients were well distributed over the length of the replicated region. The mutations spread throughout the 5´ region of the PKD1 gene. Two nonsense mutations, p.R1436X and p.Q3474X, each were present in two unrelated Iranian patients with similar clinical course (families 01, 03 and 015, 016, respectively), which may be related to having a common ancestor. All of detected variants were novel deletion frameshift and previously reported nonsense mutations in PKD1 gene and predicted to truncate the coding sequence by introducing premature termination codons. The functional impact of the detected variants was predicted as deleterious by Mutation Taster software. The variants were co-segregated with ADPKD in the available family members. They were absent in unaffected family members and 400 normal Iranians. The identified frameshift mutations were absent in any published database including HGMD, ADPKD and NCBI dbSNP databases. They have been determined as novel. The overall evidences supported the detected mutations as a pathogenic mutation for ADPKD. The bioinformatics analysis concluded the variants as definitely pathogenic mutations.
ADPKD shows a high degree of phenotypic variability between patients due to genetic and allelic heterogeneity. Truncating PKD1 mutations are associated with ESRD onset at younger age than nontruncating mutations [29]. Consistent with these studies, the probands in our study had at least one family member with early-onset ESRD or they were already diagnosed with ESRD and were younger than 50 years of age. All mutations found within our study were truncated nonsense and frameshift mutations. This cohort confirms a probable correlation between the mutation types and phenotypic outcome is possible in some way. Therefore, the age of onset of end-stage renal failure can be influenced by the causal mutation.
In contrast to our study, a previous study reported that its the position and not the type of PKD1 mutation that correlates with the severity of renal disease in ADPKD patients. They found no significant differences among patients with missense, in-frame changes and truncating mutations in PKD1 gene [14]. Thus further studies in larger Iranian ADPKD patient groups and follow-up in other young affected family members are necessary to verify the reported association.
Conclusion
It can be concluded that, although this study added 3 novel mutations p. Q1997X, P. D73X and p. V336X to the spectrum of PKD1 gene mutations, the majority of mutation types of the PKD1 gene in the Iranian population may as yet be undiscovered. Also, coexistence of RHUC type I and ADPKD have been reported [32], there isn’t any study about RHUC in Iranian population. This study may be useful to design more extensive investigations with consideration of the coexistence of RHUC and PKD1 and help the improvement of molecular and functional analyses of ADPKD in Iranian patients by enriching our knowledge of the genotype–phenotype correlations of ADPKD. Finally, the use of NGS method in the molecular diagnosis of ADPKD will increase the number of studied families and expand the mutation database of ADPKD.
Acknowledgements
We are deeply grateful to the families who took part in this research and we profoundly appreciate their collaboration which made this study possible. This work was supported by the Royan Institute and the Urology and Nephrology Research Center (UNRC) of Shahid Beheshti University of Medical Sciences.
Disclosure Statement
The authors declared that they have no conflict of interest.