Abstract
The interphase cell nucleus is extraordinarily complex, ordered, and dynamic. In the last decade, remarkable progress has been made in deciphering the functional organisation of the cell nucleus, and intricate relationships between genome functions (transcription, DNA repair, or replication) and various nuclear compartments have been revealed. In this review, we describe the architecture of the Arabidopsis thaliana interphase cell nucleus and discuss the dynamic nature of its organisation. We underline the need for further developments in quantitative and modelling approaches to nuclear organization.
The linear dimension of eukaryotic genomes can be readily analysed using various high-throughput techniques. Thus, we are now able to decipher the evolution of genome sequences, and an increasing number of studies have reported dynamic epigenomes. This progress has given rise to new challenges, namely to approach the genome in its three-dimensional nuclear framework (3D), to examine the interplay between the main functions of the genome and the architecture of the interphase cell nucleus, and to decipher the relationships between nuclear structure and function. Thus, there is a renewed interest in nuclear compartments, some of which were described more than 1 century ago, and in 3D nuclear architecture. The complexity of the interphase cell nucleus, its ordered structure, and the dynamics of this organelle at different scales are thus being investigated in both animal and plant cells. Much has been learnt about the composition and fine structure of the nucleus and the mechanism of formation and dynamics of its various functional compartments. A better understanding of the structural and functional interplay between chromatin and the other nuclear compartments is emerging. These studies have been accompanied by the development of specific 3D approaches and tools, such as 3D imaging and modeling, and methods that capture chromosome conformation. Numerous reviews have been published on diverse aspects of nuclear organisation [Delgado et al., 2010; Rajapakse and Groudine, 2011; de Wit and de Laat, 2012; Taddei and Gasser, 2012; Dekker et al., 2013; Dion and Gasser, 2013; Towbin et al., 2013]. However, much remains to be learnt about chromatin dynamics in plants. In this review, we summarize our current knowledge of nuclear compartments of the interphase nucleus in the model plant Arabidopsis thaliana with a special emphasis on heterochromatin. Indeed, whereas little is known about euchromatin dynamics at the scale of the nucleus, heterochromatin is highly plastic, exhibits large-scale reorganizations, and participates to genome organisation. We also discuss 3D modeling and quantitative techniques for analyzing the architecture of interphase nuclei in A. thaliana.
Components of Plant Heterochromatin
In 1928, Emil Heitz classified chromatin into 2 types: heterochromatin and euchromatin. Whereas the former remains highly condensed throughout the cell cycle, the latter decondenses during interphase. This binary classification system, which was originally based on cytological observations in mosses, is still widely used to describe chromatin in all eukaryotes. However, it has evolved tremendously in the past 15 years, and central dogmas, such as the inertness and transcriptional inactivity of heterochromatin, have been challenged. The classification system has been expanded to include molecular and biochemical characteristics, such as symmetric or asymmetric DNA methylation, post-translational histone modifications, nucleosome composition and arrangement, and transcriptional status, as determined by specialized polymerases. However, chromatin states at the scale of the nucleus are difficult to determine due to limitations in resolution, and only the relatively large-scale heterochromatin compartments of interphase nuclei have been analysed using cytological approaches.
The main heterochromatic regions of A. thaliana, which are visible by microscopy after DNA counterstaining, occur at the centromeres, pericentromeric regions, telomeres, and nucleolar organizer regions (NORs) (fig. 1, 2). These regions are referred to as constitutive heterochromatin, whereas chromatin that occasionally acquires heterochromatin characteristics and is dispersed throughout the genome is known as facultative heterochromatin. The cytological appearance of plant heterochromatin varies depending on the genome size (ranging from ∼63-149,000 Mb) [Heslop-Harrison and Schwarzacher, 2011] and chromosomal organisation (ranging in dicotyledonous species from 2n = 4, such as in Haplopappus gracilis, to 2n = ∼640 in Sedum suaveolens; http://www.tropicos.org/Project/IPCN). Plant heterochromatin is either located in discrete and well-defined subnuclear regions that exhibit intense labeling with DNA stain, also called chromocenters (CCs) in some species, e.g. A. thaliana and Oryza sativa (rice), or it is distributed throughout the genome in less defined substructures as, for instance, in Zea mays (maize). The heterochromatin fraction of A. thaliana is estimated to account for 7.1% of the total chromosome length at pachytene (∼330 µm) based on a cytological approach [Fransz et al., 1998], for 10-15% of the genome based on the genome sequence [Arabidopsis Genome Initiative, 2000], and for 16% of the genome (22 Mb out of the ∼135 Mb of the genome) based on DNA accessibility analysed by DNase I chip [Shu et al., 2012]. The relative heterochromatin fraction (RHF), defined as the area and fluorescence intensity of CCs in relation to the area and fluorescence intensity of the entire nucleus in DAPI counterstaining, is estimated to be ∼15% [Soppe et al., 2002; Schonrock et al., 2006], with variations depending on cell type and developmental and environmental cues. Heterochromatin is rich in repetitive DNA sequences and transposable elements, has few genes, and exhibits little or no transcriptional activity. Furthermore, it exhibits distinct molecular and biochemical variations according to localization and function.
Centromeres are the primary constrictions along mitotic/meiotic chromosomes. The relative location of the centromere differs for each type of chromosome (fig. 1) [for a detailed review, see Ma et al., 2007]. The centromere directs the assembly of the proteinaceous kinetochore which interacts with spindle microtubules and facilitates the segregation of sister chromatids during mitosis [Delgado et al., 2010]. Centromeres of A. thaliana are composed of arrays of a 178-bp satellite repeat, ranging from 0.4 to 1.4 Mb in different chromosomes [Fransz et al., 1998; Copenhaver et al., 1999; Heslop-Harrison et al., 2003; Ma et al., 2007; Zhang et al., 2008]. Substantial variation in the copy number of the centromeric repeat was reported in different ecotypes [Hall et al., 2006]. The DNA sequences of centromeric satellite repeats differ markedly even among closely related species [Heslop-Harrison et al., 1999; Lysák, 2009; Heslop-Harrison and Schwarzacher, 2011]. Also, the centromeric chromatin region shows low levels of DNA methylation and of the H3K9me2 epigenetic mark [Zhang et al., 2008]. Independent of the DNA sequence, the location of the centromere is epigenetically specified by the presence of a histone H3 variant, CENH3 (also named HTR12 in A. thaliana). Despite an essential role in mitosis and meiosis, CENH3 is rapidly evolving and participates in the formation of centromeric nucleosomes with unique properties, thereby allowing the centromere to fulfill essential roles in kinetochore formation and genome partitioning [Lermontova et al., 2011; Tachiwana and Kurumizaka, 2011; Tachiwana et al., 2011]. In A. thaliana, the rapidly evolving N-terminal domain of CENH3 is specifically required for centromeric loading in meiotic nuclei, suggesting that different CENH3 loading mechanisms exist in mitosis and meiosis [Ravi and Chan, 2010; Lermontova et al., 2006, 2011; Ravi et al., 2011]. These data have key functional implications for genome maintenance. A recent study reported that KINETOCHORE NULL2 (KNL2) participates in loading CENH3 at centromeres [Lermontova et al., 2013]. KNL2 contains a SANT-associated (SANTA) domain which is present in chromatin remodeling proteins and is associated with centromeres during all phases of the mitotic cell cycle, except from metaphase to mid-anaphase. KNL2 inactivation affects both mitotic and meiotic division without abolishing CENH3 assembly at centromeres [Lermontova et al., 2013]. In the A. thaliana cell nucleus, centromeres can be visualized by fluorescent in situ hybridization (FISH) by using a centromeric satellite repeat probe [Fransz et al., 1998], immunocytochemistry by using an antibody against HTR12 [Talbert et al., 2002] or MSI12, which colocalizes with HTR12 [Sato et al., 2005], or by live cell imaging using fluorescently tagged HTR12 [Fang and Spector, 2005]. These approaches have facilitated studies of the dynamics of the centromeric sub-compartment (see below).
The flanking pericentromeric heterochromatin is enriched in repetitive DNA sequences and transposable elements (TEs) [Copenhaver et al., 1999]. It is also characterized by a high level of 5-cytosine DNA methylation, with some strand-specific bias [Luo and Preuss, 2003], and an increased methylation of lysine 9 of histone H3 (H3K9me) and of the histone H3.1 variant [Stroud et al., 2012]. Recently, pericentromeric heterochromatin was shown to be the least accessible chromatin to DNase I, and blocks of accessible chromatin are progressively more abundant with increasing distance from the centromere [Shu et al., 2012]. Thus, there is not a sharp boundary between pericentromeric heterochromatin and euchromatin but rather a gradual transition to chromatin with an increased protein-coding gene density and a decreased TE density [Shu et al., 2012]. The assembly of H3K9me2-marked heterochromatin requires the transcription of repetitive sequences by specific RNA polymerases and RNA-directed DNA methylation [Verdel et al., 2009; Beisel and Paro, 2011; Zhang and Zhu, 2011; Castel and Martienssen, 2013].
Telomeres are protective nucleoprotein structures at the extremities of linear chromosomes that stabilize chromosome termini and prevent chromosome fusion and degradation by exonucleases [Lamb et al., 2007; Zellinger and Riha, 2007; Watson and Riha, 2010]. They consist of relatively short tandem repeat arrays (2-5 kb in A. thaliana) of a conserved short motif (TTTAGGG in most plant species) and associated telomere proteins. Interestingly, non-functional telomere-like repeats have been identified in chromosomal regions [named interstitial telomeric repeats (ITRs)], some of which are in close proximity to centromeres [Uchida et al., 2002], whereas other short interstitial telomere motifs (named telo box) were preferentially observed in the 5′-flanking region of genes [Regad et al., 1994; Gaspin et al., 2010]. The existence of these interstitial telomeric repeats strongly challenges the structural and functional analysis of terminal telomeric repeats but also provides interesting perspectives on regulatory mechanisms. The length of telomeres, which is related to life span, is under genetic control and varies among species. Although telomeres were originally thought to consist of heterochromatin, a recent molecular analysis of epigenetic marks in A. thaliana telomeres revealed that telomeric chromatin has some unexpected and unique features that are characteristic of intermediate heterochromatin [Vrbsky et al., 2010] or even euchromatin [Vaquero-Sedas and Vega-Palas, 2013]. Indeed, Arabidopsis telomeres are enriched in H3K9me2 and H3K27me1 heterochromatic marks but still retain the euchromatic H3K4me3 mark [Vrbsky et al., 2010; Vaquero-Sedas et al., 2012]. Furthermore, the A. thaliana telomeres are also relatively enriched in the H3.3 histone variant (which is usually associated with transcriptionally active regions) in comparison to centromeres that are enriched in H3.1 in comparison to telomeres [Vaquero-Sedas and Vega-Palas, 2013]. Interestingly, subtelomeres and ITRs are heterochromatic [Vaquero-Sedas et al., 2012]. The telomeric enrichment in the histone H3.3 variant is probably related to the unique juxtapositioning of telomeres and transcriptionally active genes in A. thaliana. Furthermore, as in other eukaryotes, A. thaliana telomeres are transcribed into non-coding telomeric repeat-containing RNAs (named TERRA) [Vrbsky et al., 2010]. Telomeric repeats are methylated at asymmetric cytosine sites via an RNA-dependent DNA methylation pathway, and small RNAs are derived from TERRAs produced from telomeres and from ITRs located near centromeres [Vrbsky et al., 2010]. Telomeric methylation was shown to be dependent on the activity of DNA methyltransferase MET1 and the remodeling factor DDM1 [Ogrocka et al., 2013].
Interstitial blocks of heterochromatin, called knobs, occur on the chromosome arms of maize and related species [Poggio et al., 2005; Albert et al., 2010]. Knobs vary in size and composition but are highly enriched in DNA repeats and TEs. Some knobs are visible throughout the cell cycle and are used as cytological markers [Ghaffari et al., 2013]. Knobs are associated with low gene density and low transcriptional and recombinational activity. Some A. thaliana accessions, such as Wassilewskija and Columbia-0 (Col-0), bear a ∼0.7-Mb knob on the short arm of chromosome 4 (hk4S) which does not colocalize with the CC of chromosome 4 (CC4) and is not visible during interphase (fig. 1) [Fransz et al., 1998, 2000; Koornneef et al., 2003]. The knob hk4S is enriched in centromeric and pericentromeric repeats and has few expressed genes. It was proposed to originate from an inversion event that moved DNA sequences from the pericentromeric outer domain to a distal euchromatin region of the 4S chromosome arm [Fransz et al., 2000]. Maize knobs do not have such a pericentromeric origin.
The nucleolar organizer region [McClintock, 1934] consists of tandem arrays of 45S rRNA-encoding DNA (rDNA) and is another major functional genomic region with heterochromatic characteristics. A. thaliana contains 2 NORs of similar size (each spanning 3.5-4.0 Mb of tandem repeat arrays), located at the subtelomeric regions of the acrocentric chromosomes 2 and 4 [Copenhaver and Pikaard, 1996] (fig. 1). The 5S rDNA loci are also organized in tandem arrays (of ∼1,000 copies) which span 0.1-0.3 Mb and are located at pericentromeric regions of chromosomes 3, 4, and 5 in the Col-0 accession [Campell et al., 1992; Murata et al., 1997]. The presence, location, and size of the 5S rDNA cluster on chromosome 3 are accession specific (fig. 1) with some possible intra-accession polymorphisms such as in the Cape Verde Islands (Cvi) accession [Fransz et al., 1998; Sanchez-Moran et al., 2002; Lopez et al., 2012]. It was shown that in the Col-0 accession, only the 5S rDNA clusters, located on chromosomes 4 and 5, contribute to the 5S RNA pool [Cloix et al., 2002].
Arabidopsis Model of Chromosome Organisation Centered on Heterochromatin
In A. thaliana, CCs correspond to the coalescence of centromeric and pericentromeric regions of a chromosome and the NOR (if the chromosome bears a NOR). These heterochromatic structures function as genome organizer centers. Indeed, euchromatic chromosomal regions form loops that are 0.2-2 Mb long and are anchored to CCs [Fransz et al., 2002]. This organisation contributes to the overall structure of chromosome territories as described in the chromocenter-loop model [Fransz et al., 2002], also named the rosette-like model [van Driel and Fransz, 2004]. Furthermore, it was shown that highly repetitive elements and TEs located on euchromatic chromosomal arms colocalize with CCs and remain associated with CCs despite extensive demethylation of the genome [Soppe et al., 2002]. This suggests that TEs both anchor the euchromatin loops and organize the pericentromeric regions [Soppe et al., 2002].
Variations in the number, size, and shape of centromeric foci and CCs as well as the cell type-specific organisation of heterochromatin have been reported in a number of studies. The nuclei of most cells (e.g. parenchyma cells, epidermal guard cells, and root cells) exhibit a ‘classical CC' pattern, with 4-10 (mean ∼8) conspicuous CCs (fig. 2) [Fransz et al., 2002]. The heterochromatin index (HX), defined as the percentage of nuclei showing the classical CC pattern, was thus calculated in numerous studies to quantify heterochromatin distribution [Fransz et al., 2003]. However, nuclei with uniform DAPI fluorescent nucleoplasms have been reported in some cells such as the diploid interphase tapetal cells of premeiotic anthers [Weiss and Maluszynska, 2001; Talbert et al., 2002]. In the root tip, centromeric foci exist in a variety of shapes, from dots of 0.5 µm in diameter to discontinuous strings (1.0-2.0 µm in length) of smaller bead-like dots, suggesting that centromeres have a range of compaction ratios [Talbert et al., 2002]. Given that the root tip is actively dividing, this range in centromeric foci shape might be, at least partially, cell cycle dependent. Interestingly, nuclei of the triploid endosperm tissue also have a peculiar heterochromatin organisation, with small CCs and additional heterochromatic foci interspersed in euchromatin which is likely linked to parental dosage [Baroux et al., 2007].
In plants, endoreduplication cycles occur in differentiated cells, leading to ≥4C cell nuclei. A positive correlation between CC association and ploidy levels was reported for a number of plant species [Ceccarelli et al., 1998]. In A. thaliana, endoreduplicated sister centromere associations have also been reported using live cell imaging [Fang and Spector, 2005]. These associations are cell type-dependent, being for instance more frequent in root epidermal cells than in leaf epidermal cells [Fang and Spector, 2005]. Similar results were observed in fixed cell nuclei with an alignment of the majority of the sister centromeres up to 16C [Schubert et al., 2006]; however, surprisingly, a dispersed pattern was reported in 32C nuclei [Schubert et al., 2006].
Distribution of Heterochromatin in the Nuclear Space
In some species, chromosomes exhibit a polarized orientation with all centromeres clustered at one pole of the interphase nucleus and all telomeres at the other. This peculiar interphase nuclear organisation, originally observed in salamander cell nuclei, was named the Rabl configuration [Rabl, 1885]. It has been described in Allium cepa(onion), Hordeum vulgare (barley), Triticum aestivum (wheat), Secale cereale (rye), and Avena sativa (oats) [Stack and Clark, 1974; Schwarzacher et al., 1989; Dong and Jiang, 1998; Santos and Shaw, 2004; Roberts et al., 2009]. The Rabl configuration is possibly a remnant of the preceding mitotic event. Indeed, during mitosis, the chromosomes condense, separate, and move to opposite poles, with the centromeres being pulled by kinetochore microtubules and leading the way and the telomeres lagging behind.
The Rabl configuration is not present in A. thaliana interphase nuclei. Rather, the centromeres are located at CCs which preferentially occupy peripheral positions, and the telomeres are preferentially associated with the nucleolus [Armstrong et al., 2001; Fransz et al., 2002; Schubert et al., 2012]. Interestingly, it was observed that plants with large genomes, e.g. A. cepa, ∼149,000 Mb, tend to exhibit the Rabl pattern, whereas those with smaller genomes, e.g. A. thaliana,∼135 Mb, tend to exhibit a non-Rabl pattern. These data suggest a correlation between the Rabl configuration and genome size; however, the non-Rabl configuration was also reported in Sorghum bicolor (sorghum) and maize [Dong and Jiang, 1998], 2 species with quite large genomes. Thus, other determinants of the Rabl configuration may exist. Interestingly, the non-Rabl configuration appears to be tissue-specific in diploid rice; whereas the Rabl configuration is present in root xylem vessels, it is absent in other root tissues [Prieto et al., 2004]. Endoreduplication may occur in the large nuclei of vascular tissues and induce these changes in chromatin distribution, in agreement with the previously described correlation, or the large nucleolus of xylem cells might interfer with the redistribution of centromeres and telomeres.
The preferential locations of telomeres at the nucleolus and the dispersed peripheral distribution of centromeres were also observed during meiotic interphase in A. thaliana [Armstrong et al., 2001]. In meiotic prophase of most species (e.g. A. thaliana and maize), the ends of chromosomes cluster together on the inner surface of the nuclear envelope and form a structure called the ‘bouquet' [Franklin and Cande, 1999; Cowan et al., 2001; Tiang et al., 2012]. Thus, in maize the Rabl configuration is observed prior to the last premeiotic cell division and is lost during the following interphase [Bass et al., 1997] and a bouquet is formed in meiotic prophase. These observations demonstrate that the distribution of chromosomes in the nuclear volume is tightly regulated.
Many studies using fixed nuclei have reported that A. thaliana centromeres tend to preferentially localize to the nuclear periphery. This centromere distribution was confirmed by measuring the distances between centromeres and the nuclear envelope in 3D images of various diploid living cells from transgenic A. thaliana plants expressing HTR12-GFP [Fang and Spector, 2005]. However, this pattern might be more complex. Live-cell imaging also revealed that centromeres cluster transiently at opposite poles at the end of mitosis in root meristematic cells [Fang and Spector, 2005] and in root tip cells [Lindhout et al., 2007]. Lastly, in A. thaliana and A. lyrata interphase nuclei, CCs from NOR-bearing chromosomes 2 and 4 are more frequently located in close proximity to the nucleolus [Fransz et al., 2002; Berr et al., 2006; Schubert et al., 2012]. Using spatial statistics, a recent study showed that the 3D intra-nuclear distribution of CCs in leaf cell nuclei was not completely random and that this distribution was more regular than a completely random one [Andrey et al., 2010]. This finding was observed in both round and elongated nuclei of plant cells which differ in differentiation stage and ploidy level. This repulsive trend was evidenced based on the global analysis of the CC population. Therefore, it is not incompatible with some frequent associations of specific CCs, such as CC2s and CC4s. Observation of CCs in close proximity (CC clusters) has also been reported by de Nooijer et al. [2009]. However, in this study the frequency and the intensity of the phenomenon remain elusive as no quantification was provided. Therefore, the spatial distribution of CCs seems to obey to a global apparent repulsive tendency with some attractive trends for specific CCs. It remains to be determined whether this regular distribution of CCs can be fully explained by peripheral positioning or if additional constraints have to be invoked to explain the apparent mutual repulsion between CCs. For example, the existence of euchromatin loops anchored at CCs, as proposed by the rosette model of chromosome organisation [Fransz et al., 2002], could prevent CCs from coming into close proximity. Specific proteins may also be involved as recently demonstrated by the clustering of centromeres in CAP-D protein mutants [Schubert et al., 2013].
Dynamics of the Heterochromatin Compartment during Development
The plant life cycle is characterized by major developmental phase transitions and the reiterative production of plant phytomers but also by diverse adaptations to environmental changes. These events require transcriptional reprogramming events that modulate the expression of specific sets of genes. Recent studies showed that these transcriptional reprogramming events are accompanied by reorganization of heterochromatin compartments, illustrating that the nucleus is highly plastic [Baroux et al., 2011; Schubert and Shaw, 2011; van Zanten et al., 2012a]. Whether this reorganization participates in or is a consequence of gene regulation remains to be elucidated.
The female spore mother cell (or megaspore mother cell, MMC) differentiates from somatic cells within ovules and ultimately gives rise to female gametes. Large-scale chromatin reprogramming occurs during the specification of the MMC, and this probably contributes to the acquisition of the gametophyte fate [Baroux et al., 2011]. During this nuclear reorganization, the nucleolus and nucleus expand, the RHF and CC undergo a reduction in number, and the heterochromatin decondenses [She et al., 2013]. MMC chromatin reprogramming may be divided into 2 distinct phases: an early and rapid phase during which the composition of the nucleosome changes, followed by a late phase during which histone modifications undergo important changes [She et al., 2013].
In A. thaliana, embryonic development is completed about 10 days after pollination (DAP). After a phase of seed maturation, which involves the accumulation of sufficient reserves and desiccation (from 10-20 DAP), the seed undergoes a period of dormancy. Seed maturation is accompanied by 2 independent processes, nuclear shrinkage and chromatin compaction, which occur between 8 and 12 DAP and precede the major dehydration event of the maturing seed [Mansfield and Briarty, 1992; van Zanten et al., 2011, 2012b]. The RHF in embryonic cotyledon nuclei increases sharply during the maturation phase, while the 45S rDNA loci and the centromeric and pericentromeric repeats remain localized to the CCs. Interestingly, the nuclear volume is independent of both the moisture content and dormancy status of the seed but is developmentally controlled. ABSCISIC ACID INSENSITIVE3 (ABI3), a key transcription factor in seed maturation, participates in nuclear shrinkage which is thought to be a general adaptive response to desiccation tolerance [van Zanten et al., 2011]. During the early events of seed germination (48-72 h after imbibition), the nuclear volume increases again, and this increase requires the activity of LITTLE NUCLEI1 (LINC1) and LINC2, 2 lamin-like analogues [van Zanten et al., 2011, 2012b; Ciska et al., 2013]. Furthermore, chromatin reorganization accompanies this event. Whereas the 45S rDNA loci remain localized to CCs during germination, the centromeric and pericentromeric repeats are more dispersed at the onset of germination [van Zanten et al., 2012b]. These CCs are smaller than those present in mature seeds. The classical conspicuous CC pattern reappears later during seedling growth.
During floral transition, which corresponds to the short developmental switch from the vegetative to the reproductive phase, a transient reduction in both RHF and HX was observed in 3 accessions [Landsberg erecta(Ler), Col-0, Cvi] which was accompanied by the decompaction of pericentromeric regions and 5S rDNA chromatin, followed by their subsequent relocation to CCs 3 days after bolting [Tessadori et al., 2007a].
Dynamics of the Heterochromatin Compartment in Response to Environmental Cues
Two recent studies reported a correlation between heterochromatin organisation and ambient light intensity; specifically, the RHF and HX increase with a rise in light intensity [Tessadori et al., 2009; van Zanten et al., 2010, 2012a]. In the first study, Tessadori et al. [2009] analysed the HX in 21 A. thaliana accessions originating from different geographical habitats and identified a significant correlation between geographical latitude, which determines the photon flux density (light intensity) of the region, and the HX. Interestingly, the HX was found to plateau (at 100 µmol m2 s-1 for Col-0 and at 200 µmol m2 s-1 for Ler, a widely-used Central-European accession). The lowest HX was observed in the sub-tropical Cvi-0 accession which has smaller and fewer CCs than Ler. Furthermore, the Cvi-0 accession exhibited dispersed 5S rDNA and pericentromeric repeats, and the centromeric and 45S rDNA sequences remained in the reduced CCs. This chromatin arrangement is reminiscent of the one observed during floral transition. The second study showed that chromatin compaction progressively decreases after a reduction in light intensity from 200 to 15 µmol m2 s-1. This heterochromatic event is reversible with return to normal light conditions, and the intensity of the response varies in different accessions (with Col-0 being more sensitive than Ler) [van Zanten et al., 2010]. Therefore, chromatin plasticity seems to contribute to the plant's adaptation to environmental light conditions.
Alternatively, the heterochromatin response to low light can be viewed as an abiotic stress response. Upon exposure to another abiotic stress, namely prolonged heat stress, the transcription of centromeric and pericentromeric repeats is reactivated, and these regions exhibit a dispersed pattern in FISH [Pecinka et al., 2010]. Interestingly, throughout recovery, transcription of centromeric and pericentromeric repeats was progressively silenced, whereas decondensation persisted for up to 1 week. Therefore, this is another example showing that chromatin condensation status and gene expression can be uncoupled. Furthermore, such alterations did not occur in meristematic cells or in cells from leaves produced after a period of heat stress. It was proposed that the specific meristematic chromatin response indicates the existence of a safeguard mechanism that minimizes genome damage in the germline [Pecinka et al., 2010]. Interestingly, heterochromatin decompaction was not observed after freezing or UV-C treatments [Pecinka et al., 2010]. Therefore, decondensation of the heterochromatin compartments is either not a general stress response or each type of stress is associated with chromatin reorganization in a specific compartment or with a distinctive timing and amplitude pattern. It will be interesting to decipher the signaling mechanisms that induce large-scale chromatin reorganization in differentiated cells and prevent such reorganization in rapidly dividing cells. Reorganization of heterochromatin was also observed in response to biotic stress [Pavet et al., 2006]. A drastic reduction in RHF and CC number (with most nuclei having only 2 small CCs) and loosening of CCs were observed within 1 day of infection with the bacterial pathogen Pseudomonas syringae.
A drastic decondensation involving pericentromeric regions, 5S rDNA, centromeric repeats, and 45S rDNA was described during the isolation of A. thaliana protoplasts [Tessadori et al., 2007b]. Despite general NOR decondensation, a fraction remains partially condensed, participating in small CCs close to the nucleolus. The protoplast chromatin reorganization is accompanied by the acquisition of totipotency and major transcriptional reprogramming that affects, for example, chromatin-associated genes and genes encoding histone variants [Chupeau et al., 2013]. It remains to be determined whether the reorganization of protoplast chromatin results from a stress response due to enzymatic digestion and osmotic and light changes and/or is necessary for acquisition of totipotency and major transcriptional reprogramming [Chupeau et al., 2013].
Mechanisms Involved in the Spatial Heterochromatin Distribution
Two main patterns of heterochromatin distribution emerge from the previous examples: the first pattern involves the partial decondensation of CCs at the 5S and pericentromeric regions, and the second affects all heterochromatic compartments of the CCs. A detailed study of the progressive and sequential reformation of CCs during protoplast culture provided complementary information about the highly ordered structure of CCs [Tessadori et al., 2007b]. During sequential CC recompaction, the NOR regions (3.5-4 Mb) reorganize first, followed by the centromeric (0.4-1.4 Mb), 5S rDNA (0.1-0.3 Mb), and dispersed pericentromeric repeats, including transposons, suggesting that the recompaction timing and the size of the repeat arrays are correlated [Tessadori et al., 2007b]. Thus, the 5S and pericentromeric sequences might participate in one core domain of a CC, which is first mobilized in chromatin decondensation events, and the centromeric repeats and 45S rDNA in another CC core domain, with a more central location and/or different properties. Establishing whether this latter core domain decondenses independently of the other core domain would provide insights into the structure of CCs. The number of anchoring sites might also be proportional to the size of the arrays and may thus contribute to the kinetics and formation of sub-compartments of the CCs.
Finally, the underlying biochemical properties of heterochromatin, such as DNA methylation, epigenetic marks, or histone composition, are also expected to contribute to this sort of ‘CC breathing'. Heterochromatin dynamics have been considered as being either dependent or independent of epigenetic changes, suggesting that several mechanisms with possible self-reinforcing feedbacks exist. For instance, by using molecular approaches, the 5S rDNA arrays were shown to be hypomethylated when they loop out of CCs during seed germination [Mathieu et al., 2003], and demethylation of the centromeric and pericentromeric repeats was shown to accompany biotic-induced chromatin decondensation [Pavet et al., 2006]. However, no change in DNA methylation was observed at centromeric repeats during floral transition [Tessadori et al., 2007a], in protoplasts [Tessadori et al., 2007b], and in response to heat stress [Mittelsten Scheid et al., 2002; Pecinka et al., 2010]. Despite the large-scale reorganization, there is no change in H3K9me2 and H3K4me3 contents in protoplasts as determined by immunoblot analysis of total histones. In heat-stressed cell nuclei, a reduction in nucleosome occupancy with a small reduction in H3K9me2 was observed [Mittelsten Scheid et al., 2002; Pecinka et al., 2010]. From these data, it is tempting to speculate that the epigenetic-dependent pathway might contribute to the formation of a putative 5S-pericentromeric core domain and the independent pathway to that of the other putative core domain. However, it is important to note that most studies used methods with low sensitivity at the global nuclear scale to detect epigenetic changes and did not consider all of the chromatin marks and their combinations [Baubec et al., 2010]. Furthermore, all sub-compartments were not simultaneously analysed. Therefore, specific epigenetic variations may not have been identified yet. Alternatively, ‘CC breathing' may be seen as a continuous process with various amplitude and timing patterns. Finally, another key missing element is a better understanding of the higher-order structures of chromatin. The existence of 30-nm chromatin fibers is still a matter of debate, and an alternative chromatin model that involves interdigitation of nucleosomal arrays, which is more compatible with rapid conformational changes providing access to DNA, is currently proposed [Fussner et al., 2011; Luger et al., 2012] and might also impact on ‘CC breathing'.
A few mutations that have a marked impact on the formation and/or spatial distribution of conspicuous heterochromatin sub-compartments have been described (table 1). Three main classes of genetic determinants involved in heterochromatin dynamics can tentatively be distinguished based on their functions (table 1). The first class (class I) corresponds to genes involved in the formation of heterochromatin and the maintenance of silencing in A. thaliana(i.e. MET1, CMT3, NRPD2, and NRPE1)[Mittelsten Scheid et al., 2002; Soppe et al., 2002; Onodera et al., 2005; Vaillant et al., 2008; Douet et al., 2009]. It is important to note that mutations that affect silencing do not necessarily alter nuclear heterochromatin organisation. For instance, the nuclear shape and CC structure of the morpheus' molecule1 mutant (mom1), which is affected in an epigenetic regulator, are normal [Probst et al., 2003]. The second class (class II) includes genes encoding chromatin-associated proteins, such as the ATP-dependent SWI2/SNF2-like chromatin remodeling DDM1 factor [Soppe et al., 2002; Probst et al., 2003] which was shown to have specific functions in heterochromatin remodeling [Zemach et al., 2013], the histone-modifying enzyme HDA6 [Probst et al., 2004], and the chromatin assembly subunits FAS1 and FAS2 [Schonrock et al., 2006]. The first 2 classes of genetic determinants may participate in the epigenetic-dependent pathway which is involved in heterochromatin dynamics. The third emerging class (class III) contains genes that encode diverse nuclear structural proteins, such as the 2 lamin-like proteins LINC1 and LINC2 [Dittmer et al., 2007] that might be involved in an epigenetic-independent pathway. Based on the identification of nuclear lamina-associated chromatin domains in Drosophila and humans [Pickersgill et al., 2006; Guelen et al., 2008], we expect that mutations that affect the plant nuclear envelope [Graumann et al., 2010; Boruc et al., 2012] might have an impact on plant chromatin organisation. Furthermore, some subunits of the cohesion and condensin complexes, such as the SYN4 and CAP-D3 proteins, also play important roles in sister chromatid organisation and centromere distribution and thus affect the interphase chromatin architecture [Schubert et al., 2009, 2013].
Different signaling pathways involved in heterochromatin reorganization are expected to exist that might converge on key regulators of heterochromatin reorganization, such as the factors described in table 1. Indeed, development and environmental processes affect the spatial organisation of heterochromatin. These regulatory components might be specific to or shared by different signaling pathways. As described above, light intensity is an important environmental signal that controls chromatin compaction. The light-signaling pathway is mediated by well-described photoreceptors, some of which are localized to the nucleus. Among them, CRYPTOCHROME 2 (CRY2) and PHYTOCHROME B (PHYB) are involved in chromatin reorganization, whereas others, such as phototropins and CRYPTOCHROME1, are not [Tessadori et al., 2007a; van Zanten et al., 2010]. These data suggest that CRY2 and PHYB also participate in the spatial organisation of heterochromatin. Thus, many elements that affect the timing and molecular events of heterochromatin dynamics remain to be identified, and the impact of heterochromatic reorganization on the 3D organisation of the genome remains to be established.
Chromosome Territories
In plant and animal cell nuclei, each chromosome occupies a discrete portion of the nuclear space, named the chromosome territory (CT) [Boveri, 1909; Cremer and Cremer, 2010; Tiang et al., 2012]. Deciphering the rules and mechanisms that govern the spatial nuclear distribution and internal organisation of chromosomes is a highly dynamic and challenging research field, with the aim to better understand genome functions [Bickmore and van Steensel, 2013; de Graaf and van Steensel, 2013]. The chromosome conformation capture method (3C) and derived methods, such as high-throughput chromosome conformation capture (HiC), have been developed in animal systems to provide maps of interaction frequencies between genomic regions at the locus, chromosome, or whole genome scales [de Wit and de Laat, 2012]. Three-dimensional conformations of the genomes have thus been computationally reconstructed from these data [Lieberman-Aiden et al., 2009; Dekker et al., 2013]. Based on HiC results, the fractal globule model of CT organisation was proposed. According to this model, the chromatin fiber is packed into 1-Mb domains in meta-stable conformational states [Mirny, 2011; Dekker et al., 2013]. In plants, the 3C method has only recently been used [Hovel et al., 2012], and no equivalent or alternative model is available for plant CT organisation yet.
Plant CTs were observed in species both with [Abranches et al., 1998] and without [Lysak et al., 2001, 2003; Pecinka et al., 2004] Rabl configuration by FISH studies. Using a wheat line containing an extra pair of rye chromosomes, it was shown that the rye CTs were roughly parallel and elongated nuclear domains extending between the centromeric and telomeric nuclear poles in a Rabl configuration [Abranches et al., 1998]. More recently, this organization was shown to be present in meristematic but not in differentiated nuclei [Schubert et al., 2011]. In small genome species with non-Rabl configuration, such as Arabidopsis, CTs have a different spatial distribution. The association frequencies between A. thaliana chromosome pairs were computed based on simultaneous painting of all chromosome pairs [Pecinka et al., 2004]. Based on a computer model of CT formation determined by polymer decondensation, it was shown that these frequencies were not significantly different from those expected under randomness, except for NOR-bearing chromosomes which are frequently associated with each other [Pecinka et al., 2004]. Likewise, no differences were found between random expectations and observed associations of homologous genomic regions of ∼100 kb [Pecinka et al., 2004]. Similar results were found in related species [Berr et al., 2006] and in differentiated cells [Berr and Schubert, 2007]. Hence, the current view of CT organisation in A. thaliana and related species is globally random and contrasts with results from animal studies [Schubert et al., 2012]. It has, for example, been suggested that CTs in animals obey a size-dependent radial distribution, with smaller chromosomes being located toward the nuclear center. Other studies reported a gene density-dependent spatial distribution, with gene-rich chromosomes being located towards the center [van Driel and Fransz, 2004]. The absence of such patterns in A. thaliana may be due to its relatively uniform chromosome size and gene density and its low number of chromosomes (fig. 1).
Similarly, few arguments favor any cell type- and tissue-specific organisation of CTs in Arabidopsis and other plants. Chromosome painting in A. thaliana revealed that CTs were larger in endosperm nuclei than in nuclei of other cell types, corroborating the notion that less chromatin compaction occurs in this tissue [Baroux et al., 2007]. The tissue-specific spatial distribution of CTs has been reported in mouse, and CT distribution patterns were found to be maintained throughout mitosis [Parada and Misteli, 2002]. By contrast, the spatial distribution of CTs is not conserved between mother and daughter cells in Arabidopsis, and only a transient mirror-image distribution has been reported between daughter cells [Berr and Schubert, 2007]. The absence of maintenance of CT organisation during mitosis and within tissues is probably also related to the small genome and chromosome sizes of Arabidopsis.
No specific role of CTs in genome expression has been reported in plants until now [Tiang et al., 2012]. In animals, transcription sites are largely localized to the CT boundaries, and genes have been reported to relocalize to sites outside the CT upon activation [Geyer et al., 2011; Davidson et al., 2013]. By contrast, the relative localization of the flowering gene FWA to its CT did not seem to differ with its transcriptional status [Pecinka et al., 2004]. In wheat, transcription sites were not preferentially localized to the periphery of CTs [Abranches et al., 1998]. Only a few studies have investigated gene-to-CT positioning in relation to transcriptional regulation, and these studies were performed using coarse spatial descriptions. Further large-scale and systematic quantitative analyses are required to better document the role, if any, of chromosome organisation in the regulation of gene expression in plant nuclei.
Chromatin Dynamics at the Locus Scale
Numerous studies in yeast and animals have measured the chromatin motion of genomic regions to investigate the interplay between gene positioning and transcriptional status. In plants, the first evidence of chromatin dynamics and remodeling at the scale of the locus was provided by a 3D FISH experiment that revealed 2 chromatin conformations, one open and one closed, in the non-hair and hair cell nuclei of the A. thaliana root epidermis, respectively, in the genomic region of the GLABRA2 (GL2) transcription factor locus [Costa and Shaw, 2006]. GL2 is expressed in non-hair cells; however, mutant analysis revealed that neither GL2 expression nor the belonging to a specific cell lineage is required for the open chromatin conformation in the GL2 genomic region. The chromatin conformation in the GL2 region depends on positional information in root cells and is reset at mitosis.
Currently, our knowledge is limited due to the complexity of the nuclear system and technical problems in visualizing, recording, and analyzing small portions of the genome in wild-type environments. The chromatin beacon or chromatin-tagging method is a powerful technique for analyzing the dynamics of tagged genomic regions in living organisms and for quantifying chromatin structural parameters at these genomic locations [Amini et al., 2011]. In this technique, exogenous concatameric arrays (∼2-10 kb) of a DNA sequence, which is specifically recognized by a selected DNA-binding domain, are inserted into the genome, and the location of the arrays (termed beacon arrays) is molecularly characterized. The expression in trans of a chimeric protein resulting from the fusion between the selected DNA-binding domain and a fluorescent reporter protein allows visual tracking of chromatin movement at the tagged loci in the nuclei of live cells. A 2-component labeling system based on the recognition of tandem lac operator binding sites (lacO) by the lac repressor (LacI) was successfully used in yeast, Drosophila, mammals, and plants [Robinett et al., 1996; Marshall et al., 1997; Kato and Lam, 2001; Vazquez et al., 2001]. A bacterial tetracycline operator/repressor (tetO/TetR-YFP) system [Matzke et al., 2003] and a combination of the lacO/LacI-dsRed and tetO/TetR-YFP systems [Matzke et al., 2005] have been developed to analyze the 3D distribution of chromosomal sites in plants. By introducing a gene into the cassette bearing the lacO repeat array, it was possible to study chromatin movement in relation to transcriptional activity of the reporter gene at the location of the insertion [Rosin et al., 2008; Rosa et al., 2013].
The approaches described above were used to study the relationship between the physical and functional properties of chromatin. In animals, it was shown that chromatin undergoes random diffusive motion in interphase nuclei, and that this Brownian motion is constrained in a way that a given genomic segment moves within a limited nuclear sub-region. Complex motion patterns have been recorded for genomic regions which exhibit short- and long-range motion with different velocities. The long-range diffusive motion fluctuates over a 36-fold distance range during the cell cycle [Vazquez et al., 2001]. These findings support a highly organized nuclear space and the existence of mechanisms that control chromatin motility according to cell type and throughout the cell cycle. Furthermore, these findings suggest that chromatin motion participates in nuclear processes that require chromosome mobility and chromosome reorganization in the nuclear space. For instance, it was recently shown that the induction of double-strand breaks in the yeast Saccharomyces cerevisiae dramatically increases the mobility of the broken loci but also of the homologous unbroken loci without changing the speed at which chromosomal regions move. The increased mobility allows a wider exploration of the nuclear volume which facilitates the pairing of homologous regions [Mine-Hattab and Rothstein, 2012, 2013].
Using the chromatin beacon technique, a few studies have demonstrated that chromatin in the A. thaliana nucleus also exhibits constrained diffusive motion and that the amplitude and kinetics (diffusion coefficient) of the motion depend on the ploidy level and cell type [Kato and Lam, 2003; Rosin et al., 2008]. The constrained area increases with the ploidy level: it was 6.6 times larger, on average, in elongated polyploid epidermal pavement cell nuclei than in smaller disk-shaped diploid guard cell nuclei, whereas the diffusion coefficient was lower (<2.3 times) in polyploid cells than in diploid ones. The confinement radius in the A. thaliana nucleus ranges from 0.1-0.4 µm, depending on the ploidy level and activity of the locus [Kato and Lam, 2003; Rosin et al., 2008]. In A. thaliana, the chromatin compaction ratio, as determined by measuring the in vivo distance between 2 fluorescence-tagged transgene inserts on the same chromosome [Matzke et al., 2005], is approximately 670-fold. For instance, 2 transgenes, located 2.4 Mb apart (which corresponds to a DNA sequence of 0.8 mm) on chromosome 2, had an observed physical separation of 2.1 µm (380-fold), whereas another pair of transgenes, situated 25 Mb apart on chromosome 1, exhibited a physical separation of 8.7 µm (960-fold). Furthermore, the frequencies of homologous pairing and association with heterochromatin of transgenic repeats differed with the construct, the chromosomal insertion position, the cell type, and the number and repetitiveness of inserts [Pecinka et al., 2005].
The first evidence that gene position and transcriptional activity are linked in the plant cell nucleus emerged with the study of the nuclear organisation of transgenic FLC alleles tagged with the lacO repeat array (FLC-lacO) in response to vernalization [Rosa et al., 2013]. During cold treatment and after ethanol induction of the LacI-YFP-NLS fusion protein, a cluster of FLC-lacO alleles was observed with 1-2 foci in endoreduplicated root cells and only 1 in meristematic diploid root cells, whereas in non-vernalized plants, 6 or more foci were observed. The FLC-lacO clustering was quantitatively dependent on the cold treatment and dependent on Polycomb proteins of the PHD-PRC2 complex which are required for FLC silencing before and during the cold treatment. Therefore, physical repositioning of loci associated with transcriptional changes occurs in plants and can be mediated by Polycomb-mediated epigenetic mechanisms in both plants and animals, suggesting that the key mechanisms are conserved.
There are several limitations of the chromatin beacon technique: (i) the beacon array is not targeted precisely, because site-directed mutagenesis can not be performed in most plants; (ii) an artificial transgenic environment that differs from the original one may be created due to the interruption and/or rearrangement of the genomic region where the beacon is inserted, and the 3D organisation of the chromatin fiber resulting from the insertion may be modified; and (iii) the high number of tandem repeats can induce repeat-induced silencing and generate modifications in the epigenetic chromatin of the beacon array and/or in the adjacent genomic chromatin. Indeed, a high copy number of repeats alters the local chromatin arrangement, resulting in more frequent somatic pairing of lacO-harboring regions and more frequent association with heterochromatic CCs [Pecinka et al., 2005]. These changes depend on the size and number of repeat units per locus [Jovtchev et al., 2008]. To minimize recombination events, replication instability, and the repeat-induced silencing usually associated with tandem repeats, Rosa et al. [2013] used a low number of concatameric lacO binding sites of the lacO DNA sequence and inserted ∼10-bp random sequences between lacO DNA sequences [Jovtchev et al., 2008; Rosa et al., 2013]. There is an urgent need to develop alternative techniques to label specific genomic regions so that the relationship between the physical and functional properties of chromatin can be analysed more thoroughly. Zinc finger and transcription activator-like effector proteins have been customized to introduce targeted genome modifications (i.e. mutations, insertions, and substitutions) [Chen and Gao, 2013; Gaj et al., 2013]. These proteins have also been adapted to visualize heterochromatic repeat sequences, both in plants [Lindhout et al., 2007] and animals [Miyanari et al., 2013]. Novel optimization and adaptation techniques for single-copy gene detection would be tremendously useful. Furthermore, the development of programmable RNA-guided DNA labeling systems derived from RNA-guided nuclease technology [Belhaj et al., 2013; Gaj et al., 2013; Puchta and Fauser, 2013] might give rise to alternative useful techniques. A recent study using rolling-circle amplification of gene-specific padlock probes coupled with FISH reported an example of gene repositioning being associated with transcriptional activation [Feng et al., 2014].
Dynamics of Nuclear Protein Compartments
Chromatin occupies only a fraction of the interphase nuclear space (i.e. ∼5% for DNA and ∼20% for the 10-nm chromatin fiber) [Fussner et al., 2011]. Thus, interchromatin and perichromatin nuclear domains correspond to large functional spaces, harboring numerous and diverse nuclear bodies (NBs). NBs are membraneless subdomains that were originally described as being morphologically distinct from their surrounding neighborhood when observed by transmission electron microscopy (TEM) [Shaw and Brown, 2004; Dundr, 2012]. The definition was extended to subnuclear compartments observed by immunocytochemistry or fluorescence microscopy but without prior morphological evidence of their ultrastructure (i.e. Polycomb bodies or transcription factories).
NBs are nuclear microenvironments that function in specific biochemical processes, such as splicing (nuclear speckles and Cajal bodies), transcription (transcription factories), or gene silencing (Polycomb bodies). Most are highly conserved in animals and plants, but some, such as photobodies, are plant specific. Photobodies are discrete accumulations of photoreceptors with key regulatory functions in plant growth and development in response to light signals. They contain red and far-red sensing phytochromes (A to E) or cryptochrome blue light receptors (CRY2 and possibly CRY1) and other associated proteins. Light induces rapid conformational changes of the photoreceptors and directly controls the assembly of photobodies. Although they have been proposed to have functions in storage, transcriptional regulation, or protein degradation, these possibilities remain to be confirmed [Chen, 2008; Chen and Chory, 2011; Van Buskirk et al., 2012]. As previously described, light signaling can induce heterochromatin reorganization. It remains unclear whether photobodies interact with chromatin and participate in such reorganization. Based on fluorescence imaging studies, other NBs with specific functions in abscisic acid or auxin signaling pathways [Ng et al., 2004; Tao et al., 2005] or in the control of the circadian clock [Strayer et al., 2000] have been reported. Plant nuclear dicing bodies, which are located close to nucleoli, are sites of plant-specific miRNA processes [Fang and Spector, 2007; Liu et al., 2012].
Most NBs are highly dynamic in terms of composition, number, shape, and size, and these dynamics are modulated by the cell differentiation state, metabolic state, and transcriptional activity. Whereas animal NBs are fairly well characterized, the components, structures, and assembly mechanisms of plant NBs remain elusive, with the exception of the largest NB, the nucleolus. It was proposed that NBs generally result from a local dynamic equilibrium between assembly and disassembly activities of protein complexes. Two distinct assembly models for NBs have been proposed. The first model is based on an orderly manner of assembly that involves central scaffolding factors, whereas the second model depends on random aggregation [Matera et al., 2009]. Due to the spatial and temporal dynamics of NBs in the cell nucleus and their intricate functions in genomic activities (i.e. transcription, replication, and DNA repair), it is difficult to decipher the relationship between NB functions and genome organisation. How the spatial dynamics of NBs relate to chromatin dynamics is largely unknown. Indeed, the spatial distribution of NBs relative to CTs, chromatin domains, and other nuclear compartments is poorly documented.
The largest NB, the nucleolus, was first described in 1835 by Rudolf Wagner and is a well-studied multifunctional compartment in both animals and plants. Mainly known for its functions in rDNA transcription, rRNA processing, and pre-ribosome assembly, the nucleolus is also involved in the processing and assembly of ribonucleoproteins, transcriptional gene silencing, and mRNA surveillance and export [Brown and Shaw, 2008; Dundr, 2012; Morimoto and Boerkoel, 2013]. The nucleolus is a complex NB that contains different subdomains with their own dynamics (e.g. the fibrillar center, dense fibrillar and granular components, and peripheral heterochromatic shell) [Nemeth and Langst, 2011]. Interestingly, the NOR plays a crucial role in its assembly [McClintock, 1934]. In humans, a model was recently proposed for a role of distal (telomeric side) NOR-flanking regions as anchoring sites for active NORs in the perinucleolar heterochromatic region, whereas inactive NORs are relocated from the nucleolar periphery [Floutsakou et al., 2013]. The nucleolus has emerged as a genome organisation center that is involved in nuclear architecture [Nemeth et al., 2010; van Koningsbruggen et al., 2010]. Indeed, genomic regions associated with the nucleolus (nucleolar-associated domains, NADs) were identified and found to account for ∼4% of the genome. NADs are characterized by a high density of AT-rich sequence elements and satellite repeats, a low gene density, and a significant enrichment in transcriptionally repressed genes belonging to different gene families. It will be interesting to identify the cis and trans determinants of NAD formation and the mechanism underlying the dynamic maintenance of NADs in different physiological conditions. The plant nucleolus was also intensively studied [Brown and Shaw, 2008], and a large set of A. thaliana nucleolar proteins (217) was identified within the first extensive proteomic analysis of a plant NB [Pendle et al., 2005]. In A. thaliana, the nucleolar periphery seems to be an important functional interface with other NBs (D-bodies and Cajal bodies) and also with heterochromatic compartments (NORs, telomeres). It is thus very likely that NADs exist in plants.
In Drosophila and mammals, local enrichments in Polycomb group (PcG) proteins have been described as Polycomb bodies (PcG bodies) [Pirrotta and Li, 2012]. PcG proteins are key chromatin factors that maintain gene repression via several mechanisms, such as histone post-translational modifications, chromatin compaction, or formation of higher-order chromatin structures [Schuettengruber and Cavalli, 2009; Bantignies and Cavalli, 2011; Simon and Kingston, 2013]. PcG proteins participate in the formation of multiprotein complexes, such as Polycomb Repressive Complex1 (PRC1) and PRC2 which are the best-studied PRC complexes in eukaryotes [Schuettengruber and Cavalli, 2009]. Animal PcG bodies can be visualized in the cell nucleus by labeling subunits of PRC1, such as Polycomb and Polyhomeotic [Pirrotta and Li, 2012]. PcG bodies participate in gene silencing by creating a specific microenvironment in which remote PcG-silenced targets colocalize as a result of long-distance interactions between silent targets and chromatin-fiber folding [Bantignies et al., 2011; Cheutin and Cavalli, 2012; Delest et al., 2012]. Some RNAi components, such as Dicer-2, PIWI, and Argonaute1, which also form specific nuclear bodies, can colocalize with PcG bodies and participate in PcG repression [Grimaud et al., 2006]. Furthermore, PcG bodies are dynamic, with proteins being rapidly exchanged between the nucleoplasm and PcG bodies, and vary both in number and size with cell type; their number increases as cells differentiate. Time-lapse experiments also revealed a fast regime of PcG motion within volumes smaller than CTs as well as a slow regime of long-range motion that depended on coordinated large-scale chromatin movements which might be associated with displacements of whole or large parts of chromosomes and led to occasional association and dissociation events of PcG bodies [Cheutin and Cavalli, 2012]. However, whether PcG bodies are true nuclear entities or 3D chromosomal domains enriched in PcG proteins is still a matter of debate [Smigova et al., 2011; Pirrotta and Li, 2012]. Thus, the mechanism by which PcG bodies form, which may involve PRC1 components with polymerization properties [Isono et al., 2013], as well as the role of PcG bodies in transcriptional regulation remain to be clarified. In A. thaliana, LHP1, the plant functional homolog of Polycomb, represses gene expression [Turck et al., 2007; Zhang et al., 2007]. LHP1 is located in discrete nuclear foci, and its distribution changes during cell differentiation [Gaudin et al., 2001; Libault et al., 2005]. In young proliferating cells, a uniform LHP1 distribution pattern was reported. As root hair cells mature progressively from the root apex to the crown area, the number of foci in the root hair nuclei increases. The distribution of LHP1 in the nucleus was also shown to be dynamic throughout the cell cycle [Libault et al., 2005]. Therefore, LHP1 foci are reminiscent of Polycomb bodies in animals. LHP1 was shown to repress FLC expression. However, LHP1 does not seem to be required for the vernalization-induced clustering of FLC-lacO alleles, whereas PRC2 subunits participate in such processes [Rosa et al., 2013]. EMBRYONIC FLOWER1 (EMF1), a putative PRC1 subunit, has been shown to participate in chromatin compaction [Beh et al., 2012]. Whether EMF1 or others PcG proteins participate in the formation of plant PcG bodies and function in the spatial organisation of the genome as in animals requires further investigation.
First described in 1903 by Ramón y Cajal, Cajal bodies (CBs) are subnuclear structures that appear to contain many coiled threads (previously named coiled bodies) in TEM images. They are present in most animal species and in several plant species [Shaw and Brown, 2004; Dundr, 2012; Machyna et al., 2013]. CBs are mainly involved in the maturation of RNA species and the assembly and trafficking of transcription-related ribonucleoprotein particles involved in splicing, ribosome biogenesis, or telomere maintenance. In A. thaliana, CBs have specific functions in siRNA and miRNA biogenesis [Pontes and Pikaard, 2008]. CBs are composed of a wide range of proteins (some of which also occur in the nucleolus and histone locus bodies) and various types of small RNAs that are related to their specific and various functions [Fong et al., 2013; Machyna et al., 2013]. The coilin protein, which is considered as a signature of CBs, has functions in structural scaffolding and is a key regulator of CB formation and activity [Collier et al., 2006; Machyna et al., 2013]. The CBs are either free in the nucleoplasm and/or physically associated with specific regions of chromatin, such as the nucleolus, in line with the original description of CBs as nucleolar accessory bodies [Shaw and Brown, 2004]. The A. thaliana nucleus usually contains a single spherical CB of ∼1 µm in diameter which is often closely associated with the nucleolus of different cell types. A. thaliana CBs can be visualized by TEM or by using marker proteins such as AtCoilin [Collier et al., 2006] or the spliceosomal protein U2B′′ [Beven et al., 1995; Boudonck et al., 1999]. Mutants with multiple CBs have been described [Collier et al., 2006], suggesting the existence of plant factors involved in the maintenance of CB cohesion. As in animals [Platani et al., 2000], plant CBs are highly motile and travel within the nucleoplasm and nucleolus [Boudonck et al., 1999]. Their numbers fluctuate with the cell cycle, with coalescence occurring mainly at the nucleolar periphery [Boudonck et al., 1999].
Nuclear speckles (NSs), which are enriched in splicing regulators such as Ser/Arg-rich RNA-binding proteins and 3′-end processing factors, correspond to another type of NB that is also related to RNA splicing. These NBs are considered to act as storage and assembly environments for splicing regulators which are translocated to transcription sites for RNA maturation [Reddy et al., 2012; Spector and Lamond, 2011]. The NS components shuttle rapidly from the nucleoplasm and nucleolus to the NS which are motile within a constrained region and vary in size, shape, and number, as do other NBs.
Transcription factories (TFAs) are discrete local subnuclear regions enriched in RNA polymerase and ribonucleoproteins and are in close contact with chromatin [Davidson et al., 2013]. Nascent transcripts are synthesized at the surface of most TFAs, but some are not active. Genomic loci were found to relocate to TFAs in animals in a manner that was dependent on the transcriptional status of the loci, and it was proposed that TFAs co-regulate gene sets, with several transcription units being associated with a single TFA and some being remotely located. As for most NBs, TFAs differ in number, size, and distribution, depending on the cell type, the metabolic and transcriptional activity of the cell, and the cell cycle phase. Recent imaging of RNA production at the single cell level demonstrated that transcription is a discontinuous process, with bursts of transcriptional activity occurring [Chubb and Liverpool, 2010]. Furthermore, there is recent evidence that Polymerase II clusters form transiently, and due to the resulting molecular crowding occurring at these sites, the clusters might participate in the assembly of the transcriptional machinery during the rate-limiting transcription steps [Cisse et al., 2013]. In plants, parameters describing the dynamics of TFAs in the nucleus remain to be determined. A recent study using super-resolution microscopy highlighted distinct reticulate structures within euchromatin in A. thaliana and rye interphase nuclei that correspond to active and inactive RNA polymerases, rather than discrete factories as in animals [Schubert, this issue].
Modeling Approaches to Uncover Plant Nuclear Architecture
An increasing number of studies use multidisciplinary approaches based on quantitative image analysis and biophysical modeling to identify the organizational principles of the nucleus and underlying mechanisms. However, these new research tools need to be refined before they can adequately address these objectives.
Since nuclear patterns are dynamic and do not fully obey deterministic rules, statistical analysis over large samples of nuclei is required to better describe and analyze these patterns. However, the high variability in nuclear size and shape makes integration and comparison of data from different nuclei difficult. In some species, such as S. cerevisiae, nuclear landmarks can be used to superimpose different nuclei and to generate statistical representations of intra-nuclear distributions [Berger et al., 2008]. Since nuclear landmarks are absent in plant nuclei or have not yet been identified, alternative quantitative strategies for integrating data are required. The need for quantitative approaches also results from the remarkable properties of the spatial distributions in 3D volumes. For example, the outer shell of a half radius width represents 90% of a sphere volume, and this proportion remains as high as 50% when reducing the shell width to one-fifth of the sphere radius (fig. 3A). Since nuclei frequently have a spherical topology, peripheral positioning of nuclear compartments is thus likely to occur under random distribution. Therefore, demonstrating a specific peripheral positioning requires a quantitative analysis.
Before nuclear images can be subjected to quantitative analysis, objects of interest (e.g. the nucleus, CTs, genes) must be localized in 3D. This process, called image segmentation, is generally achieved by setting an intensity threshold that separates object voxels from the others. For example, nuclear contours are frequently identified using DAPI counterstaining. The difficulty in achieving this task automatically is generally underestimated. For example, the popular Otsu's thresholding method [Otsu, 1979] is sensitive to intensity distribution features [Xu et al., 2011; Xue and Zhang, 2012]. Setting an accurate threshold is all the more critical because errors are magnified in 3 dimensions. This point can be illustrated by segmenting a 3D image at 2 slightly different thresholds: the 2D section areas displayed in figure 3B appear to be almost identical, differing by <2%; by contrast, in 3D, the difference between nuclear volumes is 8%. When there is uncertainty about the intensity threshold, it is recommended that the potential impact on quantitative measurements be evaluated using, for example, sensitivity analysis. Also, threshold-insensitive measures could be used or measurements over a range of threshold values could be integrated [Eils et al., 1996]. Alternative methods for intensity thresholding can also use a priori information about the number or size of the expected objects. It may be important to include additional image processing steps before or after the segmentation steps, such as deconvolution, noise reduction, or shape regularization, using mathematical morphological operators [Ronneber-ger et al., 2008]. Some of these steps may be simplified when better images become available due to technological advances in microscopy. However, the need for accurate segmentations will remain.
Two broad approaches are used to analyze nuclear positioning data which typically consist of sets of points and sometimes also of the associated object sizes. In the first approach, object positions are analysed individually, thus focusing on the absolute positioning in the nucleus. The prominent paradigm is radial distance analysis in which object location is defined based on its distance from the nuclear envelope or nuclear centroid or on its relative position along the radius passing through it. The radial distance measurement is generally binned into a finite number of classes corresponding to concentric shells. To test for any preferential location towards/away from the envelope, the resulting histogram is compared to the expected distribution under randomness. Continuous variants have been proposed [Ballester et al., 2008] that avoid the loss of statistical power inherent in class binning. In plants, distance analysis was used to demonstrate the preferential localization of A. thaliana centromeres at the nuclear periphery of diploid cells [Fang and Spector, 2005]. The popularity of radial distance analysis in nuclear organisation studies [Shiels et al., 2007] is probably due to the simple spherical shape of cultured animal cell nuclei. The diversity of plant nuclear shapes (fig. 2) [Chytilova et al., 1999], however, challenges the general relevance of this approach in plant studies. Further, radial distance analysis entails a projection of 3D data onto a single dimension, thus resulting in a significant loss of spatial information (fig. 3C).
The second approach considers all object positions from a single nucleus together, thus evaluating their mutual positioning. In its simplest form, such an analysis relies on object association frequencies, with an association being defined by the inter-object distance below some fixed thresholds. Comparing measured frequencies with expected frequencies under random object distributions reveals associations or exclusions. This approach revealed a random association between homologous genomic regions of ∼100 kb in A. thaliana, thus demonstrating the absence of somatic homologous pairing in this species [Pecinka et al., 2004]. Similar results were later obtained in other species [Berr et al., 2006; Schubert et al., 2007]. Spatial statistics offer more elaborate tools for analyzing point patterns based on summary statistics such as cumulative distance functions [Diggle, 2003]. Standard spatial descriptors are the distance from any object to its nearest neighbor or the distance between any position and the nearest object. As above, measured distribution functions are compared to expected functions under the hypotheses of reference models, which are generally based on complete spatial randomness. Spatial statistics have historically been developed for ecology, forestry, and epidemiology studies, where data are positions recorded within a single sampling window. This contrasts with nuclear data which result from exhaustive position recordings over finite domains with replication over samples of nuclei. Initial attempts to apply spatial statistical tools to nuclear studies ignored these difficulties by considering nuclear data as classical spatial data sampled from virtually infinite point processes [Noordmans et al., 1998; Beil et al., 2005; Buser et al., 2007]. Adapting methods rather than data was recently proposed as an alternative with the introduction of spatial statistical tools specifically designed to analyze nuclear organisation data [Andrey et al., 2010].
Because of the lack of statistical power in per-individual analyses, uncovering spatial effects of small amplitude requires integrating data extracted from large populations of nuclei. Early studies incorporated no size or shape normalization and relied, for example, on average cumulative distance functions [Noordmans et al., 1998; Beil et al., 2005]. In other studies, inter-nucleus size normalization was achieved by dividing measured distances by nuclear size [Fang and Spector, 2005] or maximal observed distance [McManus et al., 2006]. Shape normalization was recently proposed for testing spatial models of object distributions, based on the comparison of nuclear patterns to model outcomes conditioned by nuclear morphology. Comparisons were either performed on an individual basis [Russell et al., 2009] or at the group level [Andrey et al., 2010]. The latter was made possible by introducing a normalized measure of model/data discrepancy for each nucleus. Results reported on simulated data later demonstrated the superior statistical power of this method as compared with other strategies for data integration over samples of nuclei [Weston et al., 2012]. This methodology is particularly relevant for nuclear organisation studies in plants, as plant nuclei have diverse shapes that deviate significantly from simple spherical ones [Chytilova et al., 1999]. Since this methodology is generic, it is also likely that it will be generalized in the future to evaluate more elaborate spatial models and to support inter-group comparisons.
Several studies in both animals and plants have used polymer modeling to analyze the mechanisms underlying the spatial organisation of the genome. The chromatin fiber is generally modeled as a linear chain of monomers, and an energy function is defined to incorporate physical forces and constraints, such as attraction, repulsion, and volume exclusion, between adjacent or distant monomers. Starting from initial configurations, computer simulations iteratively update monomer positions to progressively reach lower energy configurations. In the Spherical Chromatin Domain (SCD) model, which has been used both in animal and plant studies, chromosomes are modeled as strings of 1-Mb spherical domains connected by elastic springs and obeying short-range exclusion/long-range attraction potentials. Chromosome territories are obtained by relaxing initially condensed linear configurations representing metaphase chromosomes [Kreth et al., 2004]. By comparing observed association frequencies between homologous chromosomes with those predicted by the SCD model, it was shown that homologue associations in A. thaliana and the closely related species A. lyrata are globally not significantly different from random, except for chromosome pairs 2 and 4, which bear NORs [Pecinka et al., 2004; Berr et al., 2006]. Using a coarse-grain molecular dynamics model closely related to the SCD model, the contribution of non-specific physical interactions in the positioning of CCs and nucleoli in Arabidopsis leaf nuclei was examined [de Nooijer et al., 2009]. In that study, chromosomes were represented by heteropolymers that could form loops, and large monomers were used to model CCs. Based on a comparison of the different models of polymer structure, the authors concluded that non-specific interactions between polymers with CC-anchored loops were sufficient to explain the experimentally observed positioning of CCs and nucleoli, as well as the segregation of chromosomes into territories, thus supporting the previously proposed rosette model [Fransz et al., 2002]. One difficulty in interpreting the results of this model is that equilibrium configurations were obtained through chromosome condensation and reduction of the nuclear volume. It is also unclear how a completely non-specific determinism of heterochromatin positioning in A. thaliana nuclei can be reconciled with the heterochromatin reorganizations described previously. An important future challenge will be to shift from simulations in theoretical, simple nuclear shapes (spheres or ellipsoids) to actual shapes determined from 3D images. In parallel, the ability to associate heterogeneous properties with monomers along chromosomes will be essential for further expanding the potential of these models in investigations of the principles underlying nuclear organisation and of genetic determinants.
Conclusion
In the past decade, tremendous progress has been made in deciphering chromatin properties at the genomic scale in eukaryotes, and the molecular and biochemical diversity of chromatin has been highlighted. Despite its repetitiveness, which for a long time largely excluded it from molecular studies, heterochromatin has started to reveal its secrets and is no longer considered as a monotonous and silent compartment. Furthermore, besides key functions in chromosome segregation and genome stability and maintenance, a new function in genome organisation has emerged for heterochromatin, highlighting the interdependence of euchromatin and heterochromatin. The development of 3C and related methods, which have been used to unravel the 3D conformation of the genome and, in turn, have raised questions about the relationship between 3D genome organisation and its main functions in the interphase nucleus (i.e. transcription, replication, and DNA repair), has revolutionized chromatin biology. Combining these molecular approaches and classical cytological studies with modeling approaches promises to answer these questions while taking nucleus-to-nucleus variability and the stochasticity of nuclear processes into account.
In A. thaliana, most cytological studies have been performed on 2D nuclei. However, the number of 3D approaches is rapidly increasing, and this increase calls for a change in perspective and tools. The principles that govern plant heterochromatin organisation are slowly emerging. It will be interesting to determine whether these principles, which were established in a diploid model plant, extend to plants with complex and large genomes containing large heterochromatin fractions. The repertoire of known plant nuclear compartments has expanded tremendously. However, the mechanisms underlying their assembly and functions are still poorly understood, as is their functional interplay, particularly with the 3D conformation of the plant genome.
The complex and intricate interplay between nuclear components, nuclear architecture, and genome functions remains to be further investigated. We predict that much will be learnt about the interphase plant nucleus and its dynamics in the coming years and that this knowledge will depend on intensive interdisciplinary dialogues on subjects ranging from biophysics and modeling to imaging and cytology.
Acknowledgements
We apologize to those whose work could not be cited in this review, owing to space limitations. We thank Kathleen Farquharson for critical reading of the manuscript and the reviewers for interesting suggestions. S.D.P. and J.A. are supported by PhD fellowships provided by the European Commission Seventh Framework-People-2012-ITN project EpiTRAITS (Epigenetic regulation of economically important plant traits, No. 316965), and K.S. by the LabEx Saclay Plant Sciences.