In medical genetics, clinically relevant mutations are usually thought to damage a gene or to alter its copy number. Changes in sequence, or structural genome variations, not affecting the coding sequence of a gene are generally considered not to be clinically relevant. Nevertheless, chromosomal breakpoints, copy number variations (CNVs), or single nucleotide variations (SNVs) outside the coding sequence of plausible candidate genes have been observed in some patients with dominant developmental disorders [Krebs et al., 1997; Klopocki et al., 2011; Liskova et al., 2018]. Such variations do not alter the functional properties of the encoded proteins. It is conceivable, however, that they may provoke clinical phenotypes by either changing the level of gene expression or by causing ectopic gene expression. Exactly how this may come about is poorly understood. By combining genotypic and phenotypic information on patients with detailed studies of altered development in model organisms into which such variations have been introduced, a few gaps in our understanding have recently been filled.
In patients with acrocallosal syndrome or craniosynostosis of the Philadelphia type, duplications at the IHH locus were detected [Yuksel-Apak et al., 2012; Paumard-Hernández et al., 2015]. Since the patient with craniosynostosis with the duplication encompassing the IHH regulatory region, the IHH encoding gene was thought to be involved in cranial and skeletal development [Paumard-Hernández et al., 2015]. In mice, the duplications affected conserved noncoding elements, which drove reporter gene expression according to a pattern similar to that of wild-type IHH expression [Klopocki et al., 2011]. The authors concluded that such duplications provoke misexpression and/or overexpression of the IHH gene and thus affect regulatory signaling networks during digit and cranial development [Klopocki et al., 2011]. In mice, Ihh positively regulates intramembranous ossification, and loss of Ihh reduced Bmp expression within the cranial bones [Lenton et al., 2011]. Ihh is mainly expressed in prehypertrophic and hypertrophic chondrocytes of developing endochondral bones, where it synchronizes chondrogenesis and osteogenesis during endochondral ossification by controlling chondrocyte proliferation and differentiation as well as specification of bone-forming osteoblasts [Ohba, 2016]. IHH directly promotes chondrocyte propagation and forms a negative feedback loop with parathyroid hormone-related protein (PTHrP) to maintain the growth plate length. In addition, IHH activity is required for the specification of progenitors into osteoblast precursors [Ohba, 2016]. At the epiphysis, the thyroid hormone fine tunes levels of IHH and thereby controls the transit of proliferating immature chondrocytes into mature hypertrophic chondrocytes to eventually become osteoblasts [Aghajanian et al., 2017]. While RUNX2 is the master regulatory transcription factor controlling osteogenesis, it is itself regulated by IHH through concomitant inhibition of GLI3-repressor formation and activation of downstream targets [Veistinen et al., 2017]. This suggests that IHH is a key element in a regulatory feedback circuit controlling skull osteogenesis and suture patency [Veistinen et al., 2017]. Together with other genes, IHH thus forms a functional network that regulates ossification [Poot, 2018]. Perturbation of IHH function within this gene network may result in skeletal dysmorphologies and craniosynostosis.
Recently, transgenic reporter and genome-editing experiments in mice showed that IHH expression is regulated by a cluster of conserved noncoding elements consisting of at least 9 enhancers, each with individual tissue specificity in the digit anlagen, growth plates, cranial sutures, and fingertips [Will et al., 2017]. These enhancers function additively as engineered consecutive deletions revealed distinct patterns of growth defects of the skull and the long bones. IHH duplications, in contrast, not only caused dose-dependent upregulation, but also misexpression of the IHH gene. The latter led to abnormal phalanges, fusion of sutures, and syndactyly. These findings demonstrate that precise spatiotemporal control of developmental gene expression in the skeletal and the cranial bones is achieved through complex multipartite enhancer ensembles. Alterations in the composition of such clusters by structural variation may provoke gene misexpression and eventually dysmorphic disorders.
Also a specific form of autosomal-dominant corneal endothelial dystrophy, known as posterior polymorphous corneal dystrophy (PPCD), may be due to ectopic gene expression [Liskova et al., 2018]. The clinical course of this disorder is highly variable, ranging from asymptomatic changes of the corneal endothelium, up to blurred and even complete loss of vision. In the latter case, vision can only be restored by corneal transplantation. In a large 9-generation family with 18 living and 19 deceased PPCD patients, neither Sanger sequencing of established PPCD-associated genes, including OVOL2, nor whole exome sequencing revealed any likely disease-associated SNVs. Linkage analysis of 9 patients and 7 unaffected siblings showed linkage for a locus spanning 19 Mbp within region 8q22.3q24.12, with a maximum LOD score of 4.38. Whole genome sequencing of 4 distantly related patients and filtering SNVs for this locus turned up 2 intergenic SNVs and 1 cosegregating variant within intron 1 of the GRHL2 gene. This SNV was also found in 3 other unrelated PPCD families and 2 more SNVs in the same intron of GRHL2 in 2 other unrelated PPCD families. In human embryonic stem cells, adult dermal fibroblasts, and normal human epidermal keratinocytes, these SNVs in intron 1 of GRHL2 showed histone 3 methylation and acetylation marks and DNase I hypersensitivity consistent with open and transcriptionally active chromatin. According to MatInspector and AliBaba 2.1, all PPCD-associated GRHL2 SNVs were predicted to result in a gain or loss of binding of at least 1 transcription factor expressed in the corneal endothelium [Grabe, 2002; Cartharius et al., 2005]. Each of the 3 GRHL2 PPCD-associated SNVs showed significantly increased promoter activity in promoter-less firefly luciferase reporter vectors transfected into HEK293 cells. Thus, these PPCD-associated SNVs of intron 1 of GRHL2 enhanced the transcription of their target gene.
Previously, mutations leading to loss of 1 allele of GRHL2 have been found in autosomal-dominant nonsyndromic hearing loss (DNFA28; MIM:608641) and homozygous missense mutations have been associated with autosomal recessive ectodermal dysplasia with hearing loss (ECTDS; MIM:616029) [Van Laer et al., 2008; Vona et al., 2013; Petrof et al., 2014; Kim et al., 2015]. While mutations within the coding sequence or loss of 1 allele of GRHL2 are linked to hearing loss, upregulation and ectoptic expression of GRHL2 are associated with PPCD without hearing impairment [Liskova et al., 2018]. Hence, a specific and novel mechanism for noncoding GRHL2 SNVs in causing PPCD has to be conceived.
ZEB1 maintains human corneal endothelial cells in the normal state by promoting their epithelial-to-mesenchymal transition. GRHL2 and OVOL2 are transcriptional repressors of ZEB1, but ZEB1 can also repress the expression of both of these genes. This mechanism maintains corneal endothelial cells in the healthy cellular state. In PPCD, mutations involving loss of 1 allele or causing reduced expression of ZEB1, and promoter mutations that drive ectopic expression of GRHL2 or OVOL2 in corneal endothelial cells may result in cell state change through the mesenchymal-to-epithelial transition. GRHL2 activates OVOL2 expression such that ectopic expression of GRHL2 will also result in ectopic OVOL2 expression. OVOL2 in turn regulates the transcriptional program of human corneal epithelium cells by repressing expression of mesenchymal genes such as ZEB1 [Frisch et al., 2017]. This will drive corneal endothelium cells to become epithelial-like, stratified, and irregularly shaped cells. By transactivating expression of OVOL2, a gene associated with PPCD, GRHL2 in turn activates expression of E-cadherin, claudin 4 (epidermal tight junctions), and Rab25 (apical trafficking) [Boivin and Schmidt-Ott, 2017].
Disturbances of the delicate balance between ZEB1, OVOL2, and GRHL2 levels will thus result in either of 3 forms of PPCD [Liskova et al., 2018]. OVOL2 ectopic expression in corneal endothelial cells leads to high OVOL2/low ZEB1 PPCD1. ZEB1 haploinsufficiency in corneal endothelial cells will produce low ZEB1/high OVOL2/high GRHL2 PPCD3. Finally, GRHL2 ectopic expression will result in low ZEB1/high GRHL2 PPCD4. In view of the current information on human PPCD patients, this is a plausible model to explain 3 forms of PPCD, which merits testing in model organisms as has been done with the involvement of the IHH gene in relation to disturbed development of cranial and long bones.
These publications demonstrate the importance of CNVs and SNVs in noncoding genome segments and reemphasize the limitations of targeted sequencing and whole exome sequencing. Thus far, mutations in noncoding regions have been underestimated or even neglected in medical genetics. Intronic losses or duplications in developmental genes, such as ARHGEF39, BTN2A1, CNTNAP2, CENPJ, MTMR3, TRIP10, WWOX, and ZNF701, which may affect transcription and microRNA binding sites, or intragenic enhancers, prove to be more widespread and to have a larger impact than as yet appreciated [Cinghu et al., 2017; Poot, 2017; Devanna et al., 2018]. The publications discussed above also suggest that precisely balanced gene networks, governing the transition from one cell type to another, may be paramount for proper development. Ectopic gene expression, due to noncoding mutations and interfering with such regulatory networks, may become a novel paradigm in developmental genetics.
Martin Poot