Abstract
Mammalian adult neurogenesis has remained enigmatic. Two lines of research have emerged. One focuses on a potential repair mechanism in the human brain. The other aims at elucidating its functional role in the hippocampal formation, chiefly in cognitive processes; however, thus far it has been unsuccessful. Here, we try to recognize the sources of errors and conceptual confusion in comparative studies and neurobehavioral approaches with a focus on mice. Evolutionarily, mammalian adult neurogenesis appears as protracted juvenile neurogenesis originating from precursor cells in the secondary proliferation zones, from where newly formed cells migrate to target regions in the forebrain. This late developmental process is downregulated differentially in various brain structures depending on species and age. Adult neurogenesis declines substantially during early adulthood and persists at low levels into senescence. Short-lasting episodes in proliferation or reduction of adult neurogenesis may reflect a multitude of factors, and have been studied chiefly in mice and rats. Comparative studies face both species-specific variations in staining and technical abilities of laboratories, lacking quantification of important reference measures (e.g. granule cell number) and evaluation of maturational markers whose persistence might be functionally more relevant than proliferation rates. Likewise, the confusion about the functional role of variations in adult hippocampal neurogenesis has many causes. Prominent is an inferential statistical approach, usually with low statistical power. Interpretation is complicated by multiple theories about hippocampal function, often unrealistically extrapolating from humans to rodents. We believe that the field of mammalian adult neurogenesis needs more critical thinking, more sophisticated hypotheses, better statistical, technical and behavioral approaches, and a broader conceptual perspective incorporating comparative aspects rather than neglecting them.
Introduction
After decades of intensive research, mammalian adult neurogenesis has remained enigmatic. The initial protagonists and early followers are still unsure about the functional meaning of adult neurogenesis in general [Bonfanti, 2016] and adult hippocampal neurogenesis (AHN) in particular, although they rarely question its functional significance. A count of publications related to adult neurogenesis reveals a rapid increase from the year 2000 to 2015, the last 4 years revealing between 600 and 700 papers per year (fig. 1). The obvious benefit of the research is better insight into molecular mechanisms underlying development and differentiation of neurons and stem cells in the neurogenic zones of the mammalian brain. On the other hand, a research field in which some 7,500 publications have appeared without elucidating the functional meaning of adult neurogenesis requires consideration about what might have gone wrong.
Clearly, two main lines of research have emerged across the years. One focuses on adult neurogenesis as a potential repair mechanism in the human brain, investigating molecular mechanisms controlling it or at least mediating proliferation, cell fate, and survival of newly generated neurons, with emphasis on potential applications for brain repair [Martino et al., 2011; Lindvall and Kokaia, 2015; Alunni and Bally-Cuif, 2016]. This approach, which also includes comparative studies, is more fact-related, and emerging contradictions and conclusions can be reconciled. Nonetheless, one must still await better understanding and more studies to judge whether adult neurogenesis provides a substantial potential for brain repair or whether it is just leading to a (protracted) developmental end point imposed by structural rules of the mammalian brain, including evolutionary constraints and nerve tissue reactions [Weil et al., 2008; Bonfanti, 2011; Peretto and Bonfanti, 2014; Bonfanti, 2016].
The other research line emerged separately and tries to elucidate the functional role of adult neurogenesis, chiefly in cognitive processes related to hippocampal function, often referred to as AHN. Contradictory data and lack of replication are the main problems of this line or subfield, as evidenced by two text passages from papers conveying the same message in the title. One is a classic paper by Gould et al. [1999] entitled ‘Learning enhances adult neurogenesis in the hippocampal formation'. Some 14 years later, Epp et al. [2013] published a paper ‘Hippocampus-dependent learning influences hippocampal neurogenesis' and the abstract explains: ‘One particular area of research has examined the effects of hippocampus dependent learning on proliferation, survival, integration and activation of immature neurons in response to memory retrieval. Within this subfield there remains a range of data showing that hippocampus dependent learning may increase, decrease or alternatively may not alter these components of neurogenesis in the hippocampus'. Such statements do not facilitate interpretation of the findings in this field, even though the latter paper provides a useful listing of factors that might account for different results, notably in one subfield of AHN only.
Evidently, the problem with studies in adult neurogenesis is not the lack of results, but that there are too many findings with inconclusive and seemingly irreproducible results, specifically when behavioral changes are involved. Yet reproducibility is a cornerstone of good science [Morrison, 2014]. Since a detailed review of many thousand studies with conflicting or doubtful data would go beyond the space limits of this article, we aim at readers with limited experience in the field of adult neurogenesis and try to identify the sources of variation precluding a clear picture in the field. First, we will give a framework based on comparative evaluation of adult neurogenesis in different mammalian species, highlighting that ecological adaptations ultimately determine patterning and functionality of adult neurogenesis [Peretto and Bonfanti, 2014]. The second goal is to provide the nonexperts in the field with a set of criteria by which studies about the functionality of AHN can be weighted according to whether they ignored or respected known sources of methodological confounds or comparative aspects. To this end, we shall discuss selected high-impact papers and draw attention to their hidden methodological problems. For neurobehavioral studies with laboratory rodents aimed at identifying relations between AHN and behavior, we identify factors known to produce confounds in cell counts, but also confounds in behavioral testing that may blur relations between AHN and behavior. Finally, we will use published examples to show why the standard presentation of data in this field is masking important information. We emphasize that the term ‘field' does not only denote the leading laboratories, but encompasses the many follow-up studies employing (often uncritically) methods and terminology of the main players.
Findings from Comparative Research
Adult Neurogenesis Is Highly Heterogeneous in Vertebrates, Even among Mammals
Adult mammalian neurogenesis follows essentially the same rules as embryonic brain development: in early development, neuronal and glial precursor cells are generated in the periventricular germinal layers from where they migrate towards the external surface of the brain along radial guidelines formed by special glial cells [Bonfanti and Peretto, 2007; Kriegstein and Alvarez-Buylla, 2009]. Two regions remote from the ventricles contain secondary proliferation zones in which neural precursor cells continue to divide: the dentate gyrus and, temporarily, the superficial layers of the cerebellum. In the adult mammalian brain, however, regular proliferation of new cells became restricted evolutionarily to two zones harboring stem cells and coined ‘canonical' by Feliciano et al. [2015], namely the forebrain subventricular zone (SVZ; giving rise to a stream of inhibitory neurons for the olfactory bulb), and the subgranular zone (SGZ) of the granule cell layer in the dentate gyrus. In addition, there are a number of brain sites in which adult neurogenesis can be sporadically observed, depending on species, age, and physiological and pathological status, the ensemble coined ‘parenchymal' (non-canonical) and ‘lesion-induced' neurogenic sites, respectively [Feliciano et al., 2015]. Unsurprisingly, their irregular nature is fueling controversies in the field of neuroregenerative approaches since virtually all these noncanonical neurogenic processes do not produce neuronal functional integration, thus remaining ‘incomplete' [Bonfanti and Peretto, 2011].
From a comparative point of view, mammals do not show true adult neurogenesis as found in bird species with seasonal variation in behavior-controlling brain structures [Nottebohm, 2004]. Rather, adult neurogenesis appears as protracted juvenile neurogenesis that is downregulated differentially in various brain structures depending on species and age levels. Adult neurogenesis in the SVZ-olfactory bulb system and AHN can be found in most mammalian species, but its expression is highly variable [Amrein et al., 2011; Bonfanti and Peretto, 2011; Amrein, 2015]. Species with the lowest level of expression are humans [Sanai et al., 2011], nonhuman primates [Kornack and Rakic, 1999; Jabes et al., 2010], many bat species including both Microchiroptera and Megachiroptera [Amrein et al., 2007], and cetaceans [Parolisi et al., 2015; Patzke et al., 2015], while shrews can switch off AHN completely after the first hibernation [Bartkowska et al., 2008; Amrein, unpublished data]. Likewise, the proliferation levels are low in long-living subterranean mole species [van Dijk et al., 2016a]. Levels of AHN appear high in some rodents showing a daily turnover rate of up to 1.5% of adult dentate granule cells, counterbalanced by increased apoptosis [Amrein et al., 2004], as contrasted to humans with an estimated level of 0.004% at 40 years [Spalding et al., 2013].
Variable Expression of Markers for Juvenile Neurons
In most species, adult neurogenesis is followed by developmental stages of the newly born neurons characterized by the expression of so-called ‘juvenile proteins' thought to underlie functional plasticity. These include, among others, doublecortin (DCX), polysialylated neural cell adhesion molecule (PSA-NCAM; a low adhesive cell adhesion molecule involved in structural plasticity; see references in Bonfanti and Nacher [2012]), the transcription factor NeuroD, and others [Knoth et al., 2010]. DCX labeling is often used as a proxy for identification of newly generated neuroblasts since it is expressed transiently during the neurogenesis and migration process [Brown et al., 2003]. However, it cannot be used as the sole marker for newborn neurons. There are several studies showing that DCX staining can be observed in non-newly generated cells [Gomez-Climent et al., 2008; Luzzati et al., 2009; Klempin et al., 2011], such as the so-called ‘immature' neurons of the cortical layer II (fig. 2b) [reviewed in Bonfanti and Nacher, 2012]. Thus, markers for juvenile neurons are, per se, not markers of neurogenesis but might be associated more broadly with structural plasticity [Bonfanti and Nacher, 2012]. Hence, observing high numbers of DCX-stained neurons in a dentate gyrus indicates both some degree of proliferation as well as functional plasticity. Likewise, NeuroD is interesting as it is activated during neuronal differentiation, specifically regulating mossy fiber synapses [Wilke et al., 2012], but also responds to neuronal activation similarly as other activity markers such as c-fos [Konishi et al., 2001]. To compare species meaningfully, the minimal requirements for obtaining a comprehensive view is counting the number of both Ki67+ and DCX+ cells, possibly also NeuroD+ or PSA-NCAM+ cells, and finally counting the total number of granule cells to have a reference frame.
Generally, in AHN the numbers of cells carrying proliferation markers and those staining for DCX correlate well. For example, in red foxes Amrein and Slomianka [2010] report a correlation coefficient of r = 0.80 (n = 7), while the correlation coefficient in mice as shown by Ben Abdallah et al. [2010] is r = 0.98 (correlated means of 6 age classes).
Focus on Cell Numbers or Plasticity Features?
On the other hand, the ratio between DCX+ cells and cells with proliferation markers can be extremely different across species. In the mice above, there were on average about 4 times more DCX+ cells than Ki67+ cells, but about 70 times more DCX+ cells than Ki-67+ cells were observed in the red foxes, which, in turn, showed 15 times more DCX+ cells than elderly beagle dogs [Siwak-Tapp et al., 2007]. In old naked mole rats, residual PSA-NCAM immunoreactivity was observed many years after the disappearance of any proliferation activity. Similar observations were made in the human dentate gyrus in which DCX+ cells were found long after detectable proliferation [Knoth et al., 2010]. Observing such temporal dissociation between neurogenesis and persistence of developmental markers raises fundamental questions.
A comparative view suggests a dual strategy to achieve neuronal plasticity: short living species such as mice or rats use a strategy of producing new cells providing the required plasticity levels, while long-living species develop mechanisms to preserve juvenile properties of developmentally generated granule cells or may reactivate molecular switches, generating cellular plasticity in adult neurons. This might be necessary to cope with the increasing age-dependent restrictions on proliferation and migration of newly formed neurons (see below). Indeed, the study by Penz et al. [2015] showed that juvenile features such as excitability of granule cells from newborn naked mole rats were conserved throughout life and without accompanying neurogenesis; the findings were interpreted as an example of neoteny (the conservation of early developmental features). Similar neotenic mechanisms might operate in bats, although de novo generation of juvenile proteins facilitating synaptic plasticity might reflect another strategy to maintain an optimal status for hippocampal activity for 20 years and more.
Substantial Decline in Adult Neurogenesis Occurs Depending on Age, Mostly in Large-Brained Mammals
A general finding among mammals is that adult neurogenesis in both the SVZ and SGZ declines substantially during early adulthood and persists at low levels. In the SVZ of large-brained animals, it partially disappears at very young ages [Paredes et al., 2016]. For example, the rostral migratory stream of neuroblasts generated in the SVZ and directed to the olfactory bulb in most mammals is virtually absent in humans reaching 2 years of age [Sanai et al., 2011], and no periventricular germinal layer is detectable in neonatal dolphins [Parolisi et al., 2015] (fig. 3a). Such reduction is likely due to a weak sense of olfaction in humans, especially relative to a rodent, and its absence in aquatic mammals like dolphins [Parolisi and Bonfanti, in preparation].
Thus far, a common pattern in the rodent SGZ is an exponential decline from juvenile levels early in life to low levels in adulthood, tapering off with progressing age [Amrein et al., 2011; Lazic, 2012b; Couillard-Despres, 2013], with the duration of this phase showing a moderate correlation with lifespan [Lazic, 2012b]. Such a time-course may reflect the need to suppress rapidly uncontrollable juvenile neuronal excitability of dentate granule cells, while later episodes of short-lasting variations in proliferation during the tapering-off phase reflect a multitude of intervening factors of different origins and debated significance. Alternatively, as suggested by Couillard-Despres et al. [2011] newly born neurons may have different effects during various lifetime stages.
Detailed quantitative comparative evidence for the change in turnover rates is scarce since it requires (as in other comparative studies) both the counting of granule cell number as a reference, and counting cells with proliferation and maturation markers to obtain an age-dependent estimate of turnover rates in AHN. Thus far, this has been realized systematically in mice up to 9 months of age [Ben Abdallah et al., 2010], and to a lesser extent in macaque monkeys because of the low number of animals available [Jabes et al., 2010]. Jabes et al. reported in their four adult monkeys an estimated daily turnover rate of 0.02% in relation to the actual adult granule cells. A daily turnover rate in adult monkeys has been estimated (on the basis of BrdU injections and granule cell counts from the literature) as 0.004% [Kornack and Rakic, 1999]. Given the long lifespan of monkeys, this might allow for a lifetime replacement of neurons between 40 and 100%.
Quantifying AHN in Humans versus Mice: Problems and Real Errors
Conceptually, the question whether AHN declines exponentially or is an age-related process is important. If the decline of AHN reflected biological ageing comparable to the loss of elasticity in arteries, ongoing neurogenesis would represent a process preventing age-related loss of function in the dentate gyrus. However, rapid downregulation of proliferation means that the level of persisting AHN must be kept low for physiological reasons, or may simply reflect the end of a developmental growth program that is not totally repressive as observed in many other parts of the body (e.g. in wisdom teeth or skull bones). The scientific problem is then to recognize whether variations of low levels of AHN during ageing are functionally relevant or not, and to understand the reasons for an exponential decline from childhood to adulthood.
In humans, systematic morphological postmortem studies about the lifetime course of newly added dentate granule cells are lacking. To obtain comparatively meaningful data, one should know the actual number of mature granule cells at a given time point. However, human granule cell numbers must be estimated from a few studies including different age classes, which revealed a huge individual variability between 10 and 30 million granule cells [West, 1993; Simic et al., 1997].
Knoth et al. [2010] studied the occurrence of numerous proliferation and maturation markers during human lifespan up to 100 years, interpreting their data carefully without attempts to determine turnover rates. By using DCX as a proxy for proliferation, they confirmed the exponential decrease model, reporting observation of Ki67+ cells up to the age of 38 years, and MCM2+ cells (a proliferation marker with a larger temporal window) up to 64 years, while DCX-positive cells were found up to old ages. NeuroD+ neurons disappeared after 10 years of age. Another study in humans reported few Ki67+ cells in aged hippocampi, observing substantial individual variation [Reif et al., 2006].
On the other hand, an often-cited study [Spalding et al., 2013] has claimed that the early decay of AHN in humans is less severe, and that human AHN at 40 years is comparable with the C57BL/6 laboratory mouse (one of the most investigated strains) at 9-12 months, thus justifying translational studies in mice to explain functional changes in the adult and aging human hippocampus. These conclusions were derived from applying a well-established technique to study turnover in slowly reproducing tissue such as in the dentate gyrus, namely the incorporation of radioactive 14C left from nuclear explosions in the 1950s.
Surprisingly, one of the chief (and often cited) conclusions of the CELL paper is wrong: the authors infer that daily turnover of new granule cells in humans at 40 years roughly corresponds to the turnover rate of 1-year-old mice as reported by Ben Abdallah et al. [2010]. The error became evident by checking a graph using the same data [Bergmann et al., 2015], depicting the proportions in mice and humans. Figure 3b shows the true relation (humans: 700 new cells out of 20 million granule cells = 0.0035%; mice at 9 months: 416 out of 0.5 million granule cells = 0.083%). Thus, men and mice appear to differ in turnover rates by a factor of 15-20 at roughly corresponding ages. We do not deny that the dynamics of granule cell proliferation in humans might be different from mice, but the conclusion that both species have comparable turnover rates at corresponding age levels is untenable by the available data.
Other Lifetime Changes in the Dentate Gyrus
Likewise, attention should be directed to local nonproliferative growth processes accompanying adult neurogenesis in the dentate gyrus as they may equally affect hippocampal processing. Indeed, in the SGZ and the dentate gyrus, other slow growth processes occur and have their own time course. For example, a lifetime study in 69 guinea pigs, spanning a period from 5 to 1,100 days, showed that the density of recurrent mossy fiber collaterals being associated with presumably inhibitory basket cells increased in two spurts: an early one before puberty, and a later one in midlife at 600 days; no additional proliferation was observed thereafter [Wolfer and Lipp, 1995]. Dentate neurogenesis had not been measured, but it is safe to assume that it was probably reduced in the old guinea pigs [Akers et al., 2014]. Similarly, a lifetime study in 47 hybrid mice (F1 C57 × DBA/2) using the Gallyas myelin stain revealed ongoing myelination in the SGZ from day 40 to day 640; in old mice it was more prominent in males than in females [Vich-Plesko, 1999]. Given the repressive effect of myelin components on axonal growth [Caroni and Schwab, 1993; Fawcett et al., 2012], one can thus expect in ageing mice an increasing blocking of neurite growth towards CA3, impairing the normal functional integration of mossy fibers and with unknown behavioral consequences. Unfortunately, any suspected parallelism of age-related changes in dendritic trees, synaptic arrangements, and other features of hippocampal morphology require lifetime studies that have become rare in a field driven by the need for rapid publications, with the laudable exception being the study on naked mole rats [Penz et al., 2015].
AHN and Behavior
Biological and Behavioral Sources of Confounds
Age
Perhaps the most neglected confound is the age of the tested animals. For example, most mice are tested at the approximate age of 2-3 months, even though it is now known that this period is characterized by an exponential decline of cell proliferation [Ben Abdallah et al., 2010; Gil-Mohapel et al., 2013]. The rapid decline of AHN has also been observed in rats, both in laboratory strains and wild-caught Norway rats [Epp et al., 2009]. The dynamics of the decline may depend on many factors, but if the age of controls and experimental animals is not determined with utmost care and fully comparable, unrecognized age differences may introduce substantial variability of findings. Specifically, manipulations thought to induce increased or decreased proliferation might simply have altered the time course of the decline, but without knowledge of the dynamics of AHN in the own animal sample it is not possible to define whether a treatment has increased or decreased proliferation rates or survival of newborn cells. To illustrate this point, the study by Ben Abdallah et al. [2010] was not planned. During an irradiation experiment aimed at reducing AHN, we noticed that the controls at the age of 3 months had significantly less newborn cells than those at 2 months - an unpleasant finding as it indicated unstable control values, and made it necessary to document the time course of the decline. In a positive vein, comparing adult mice or rats at the age of 2 and 3 months can easily test whether different levels of AHN predict hypothesized cognitive or emotional differences - at least a first approach does not require experimental manipulation [Gil-Mohapel et al., 2013]. Thus, we strongly recommend checking in any given paper the age of the animals used. Studies in which mice or rats had been tested at the age of at least 4 months or beyond are more likely to provide reliable results than comparisons at earlier ages. There are recent examples considering this age confound [Nakashiba et al., 2012; van Dijk et al., 2016b]. We do realize that the exorbitant costs of animal husbandry are not in favor of keeping mice or rats for longer periods, but this should be taken into consideration when planning new studies.
On the other hand, the Frankland Group [Akers et al., 2014] has successfully exploited the dynamics of the early decline by asking what changes in memory can be observed in children with supposedly high levels of young granule cells. Children have natural deficits in long-term memory, predicting that high proliferation of granule cells might cause forgetting of acquired responses in adult mice. They carefully selected the time window for manipulating proliferation of granule cells, verified the age-dependent decline in their mouse samples, avoided BrdU injections, and could show in genetically modified mice that memory stability (retention of conditioned fear responses) was impaired by increasing AHN through voluntary running, and improved by reducing chemically AHN. Likewise, blocking AHN at juvenile stages mitigated retention. Moreover, precociously born species such as degus or guinea pigs showed less infantile amnesia, but revealed memory impairment when neurogenesis was increased by means of wheel-running. Taken together, the data strongly suggest a role of adult neurogenesis in erasing established memories/behaviors in different species.
Genetic Variation
With a few exceptions, the genetic background of the animals studied appears to be of little relevance in many studies of behavioral correlates of AHN. The most frequently used strain, C57BL/6, has become a de facto, yet undeserved, gold standard in many fields including neurobehavioral studies. Its main advantage is relatively robust learning under different conditions and a high proliferation rate in the SGZ [van Dijk et al., 2016b]. On the other hand, C57BL/6 mice suffer, as every inbred mouse strain, from various subtle genetic mutations in homozygous states. Behavior genetic studies have shown that F1 hybrids show usually hybrid vigor (mutual compensation of deficits) and are generally superior in a variety of behavioral tasks [Owen et al., 1997; Lipp and Wolfer, 2003]. In fact, the Banbury Conference in 1997 recommended the use of F1 hybrids between common inbred strains for neurobehavioral assessment because their genetic background is likely to buffer strain-specific peculiarities [Silva et al., 1997]. However, the literature is abundant with studying AHN in C57BL/6 only, while rare studies using F1 hybrids have shown substantial differences between inbred and hybrid (or wild) mice [Clark et al., 2009; Klaus et al., 2012]. A study using 52 recombinant mouse strains showed extreme genetic variability and identified 190 candidate genes playing a role in the control of adult neurogenesis [Kempermann et al., 2006]. On the other hand, many studies using transgenic mouse models to manipulate AHN use mice from poorly controlled mixed genetic backgrounds containing strains with defects of the nervous system and behavioral impairments (129/SvEv) [Sahay et al., 2011].
Sensitivity to Drugs, Hormones, and Stress
Due to the particular location of the stem cells in the SGZ [Paredes et al., 2016], up- or downregulation of adult neurogenesis is sensitive to chemical and hormonal signals carried in the blood and in the cerebrospinal fluid from nearby circumventricular organs. This makes it difficult to recognize whether such regulation depends on intrahippocampal signals (putatively originating from cognitive processing) or physiological signals, such as generated by wheel-running, or even by interaction between different sources. Various forms of experimentally induced stress in rodents (e.g. sleep deprivation or social defeat) have been shown to suppress cell proliferation in the SGZ, the changes in AHN being investigated to study associated changes in learning behavior or assessing effects of drugs known for ameliorating depression in humans. Yet using stress as an experimental tool may add a serious source of confound, particularly in mice since it is known to trigger epigenetic changes in the response of glucocorticoid and mineralocorticoid receptors in the hippocampus [Weaver et al., 2004]. Moreover, in other brain regions it can concomitantly induce long-lasting up- or downregulation of many promoter loci controlling expression of CNS-specific genes known as epigenetic tags. For example, social defeat stress in male mice showed 1,285 upregulated and 799 downregulated genes in the nucleus accumbens alone, but the changes were observed only in about 50% of stress-sensitive mice [Wilkinson et al., 2009].
The effects of potentially bimodal stress responses in an inbred strain on survival and functionality of newly generated neurons are difficult to anticipate, particularly since epigenetic changes in the dentate gyrus of mice are also observed after seemingly harmless perturbations such as exposure to novelty [Chandramohan et al., 2007; for a review see Stankiewicz et al., 2013]. Subtle stressors for rodents are also behavioral apparatus such as illuminated open fields and test chambers smelling of cleaning alcohol, specifically experimenters [Sorge et al., 2014], which most likely account in part for irreproducible behavioral results across laboratories [Crabbe et al., 1999; Chesler et al., 2002]. At least in mice, prolonged handling before experimentation does not remedy the problem [Longordo et al., 2011]. Thus, experimental manipulations aimed at analyzing the interaction between stress, AHN, and behavioral readouts mix differential stressors with differential epigenetic consequences as long as mice are handled by experimenters and face unfamiliar test situations.
Toxicity and Side Effects of Substances Assessing and Manipulating AHN
BrdU as a mitotic marker can have (strain-specific) toxicity on neural stem cells [Taupin, 2007], and is more prone to be incorporated by replication errors [Duque and Rakic, 2011]. Likewise, other organotypic stem cells in the body might by afflicted by BrdU, causing feedback to the hippocampus, which harbors a large amount of receptors for signals from the immune system [Lathe, 2001]. These signals may interfere not only with stem cell proliferation, but also with learning and memory, e.g. as shown for the Toll-like receptor [Okun et al., 2011]. Thus, BrdU might be used for tracking the fate of newly born neurons but should be avoided in studies aimed at showing behavioral readouts. Nonetheless, a huge majority of studies continues to use it, even though better and nontoxic proliferation markers are available. More subtle effects on neural stem cells may arise from drugs such as tamoxifen used to activate genes in conditional mutant mice [Chen et al., 2013] or from transgenic insertion of a tetracycline transactivator permitting activation of CNS-specific genes by means of ingesting tetracycline or doxycycline [Han et al., 2012].
Domestication
There is clear evidence that wheel-running in wild mice species do not have the stimulatory effect as commonly observed in mouse studies. Wild wood mice (Apodemus flavicollis) patrol very large territories but show no boost in AHN after wheel-running [Hauser et al., 2009]. Similar observations were made in laboratory-bred wild Mus musculus [Klaus and Amrein, 2012; Klaus et al., 2012; Schaefers, 2013], and wild Norway rats, which supposedly roam more, fall into the range of laboratory rat strains [Epp et al., 2009]. Likewise, studies of Siberian foxes indicated higher AHN in tame individuals [Huang et al., 2015]. Hence, domestication in mice and rats may have expanded considerably the spectrum of physiological factors regulating adult neurogenesis; however, this point is rarely considered [Kempermann, 2015]. Finally, the observation that proliferation levels of AHN in semidomesticated adult minks (1.5 years old) appear to correlate positively with stereotypic behavior [Malmkvist et al., 2012] presents another conceptual challenge for linking putative human hippocampal functions to AHN.
Translational Problems
From Man to Mouse and Vice Versa
A core assumption in translational research holds that observations in standard laboratory animals (mice and rats) can be extrapolated to humans and vice versa. While hippocampal neurophysiology and cellular features are indeed largely comparable, this is only partially true for connectivity, and rarely the case when behavior is included. Hippocampal functions have been tuned by evolution to provide optimal adaptation to the ecological niche of a species, and these niches are radically different for rodents and humans. Hippocampal malfunction in humans affects primarily memory, while rodents show multiple deficits in spatial and reversal learning as well as impairments in species-typical behaviors such as nest-building and the ability to clean out tubes filled with pebbles or food morsels [Deacon et al., 2002]. Hence, using mice to explain human hippocampal functions requires an understanding of what rodents actually use their hippocampus for during daily life in their habitual environment [Lipp and Wolfer, 2013].
Are Actual Behavioral Tests Ecologically Relevant for Rodents?
To illustrate this point, we will focus on pattern separation. The term denotes the ability to store and handle subtly different complex memories. The idea became a focus in AHN research because computational modeling showed that changing the excitability of relatively few dispersed granule cells could establish new input channels into the hippocampal subfields [Yassa and Stark, 2011]. The model provided an explanation for the vexing problem of how few newly generated cells in the dentate gyrus could have functional relevance, while partial removal of the hippocampus or a 25% reduction of the number of pyramidal cells by lesions did not affect water maze learning [Mohajeri et al., 2003].
The principle is illustrated by a frequently cited study by Sahay et al. [2011]. The authors used mice with transgenic control of survival of newly generated granule cells and tested various batches of manipulated and control mice in a battery of tasks. Mice with more juvenile neurons were indistinguishable in a battery of classic tests thought to measure hippocampal functions. Another mouse sample was tested in so-called ‘contextual fear discrimination learning'. They were first shocked in one chamber with 0.75 mA (causing vigorous jumping and squealing) and were then placed the following day either into the same chamber in which they had received shocks (A) or in a chamber of the same type but marked by a few different cues (B). The behavioral readout of such tests is the ratio of immobility responses (freezing) as compared to phases when the animal is moving for escape. In chamber A, mice continued to receive shocks at the end of the observation period, but not in chamber B. Initially both groups froze equally in either box (generalization), but gradually mice with more juvenile neurons (n = 11) froze moderately less in box B than controls (n = 9); significant differences emerged at days 5 and 6.
Yet, survival of a mouse in its natural environment depends on very fast decisions. It is difficult to see how such slowly developing fear-related discrimination learning could provide an evolutionary advantage for this species. Likewise, the ecological advantage of having proliferative processes in the dentate gyrus supporting such learning remains obscure. Nonetheless, the discrete behavioral result was taken as strong evidence for a selective effect of juvenile neurons on pattern discrimination. Confounding effects of repeated stress were not discussed, even though the same laboratory had reported earlier that heavy stress in mice entailed epigenetic changes in some 2,000 CNS genes outside the hippocampus.
Are Ethology-Based Approaches Better?
There are some recent reports indicating a recognizable correlation between individual levels of neurogenesis and spontaneous behavior, thus being more ethological and avoiding the problem of forcing the animals into short-lasting behavioral tests confounded by human handling and anxiety in test situations. For example, Freund et al. [2013, 2015] presented data showing that the emergence of individuality in exploratory roaming of mice kept in large enclosures correlated positively with proliferation levels, and negatively in mice prone to exploratory behavior independent of interaction with conspecifics.
This corresponds to the findings of van Dijk et al. [2016b] who conducted a well-designed study with inbred female mice of the strain DBA/2. A total of 106 mice obtained at exactly the same age served to assess spontaneous behavior and complex learning in automated home cages with four learning corners each (IntelliCages®). They compared two age classes (2-month-old vs. 5-month-old animals) for learning simple spatial and progressively more complex tasks including spatial rule learning and discount-delay behavior (suppressing a spontaneous behavior for a quick water reward in favor of a better sweet reward after some waiting), collecting a total of more than 1,500 behavioral readouts during a test period of 30 days. Using a Bayesian approach, they reduced false-positive correlations with proliferation rates, with surprising results. For one, there was no difference in newly generated neurons in mice having spent the same time in IntelliCages but without any learning - indicating that mental load in form of complex learning did not increase hippocampal neurogenesis. The other intriguing observation was that the number of newly generated cells did not correlate with any learning scores, but only negatively with ‘exploratory' behavior during the first day in the IntelliCages. This was observed only in the older group (being in the tapering off phase of proliferation), indicating that individuals with the highest proliferation rate were visiting the learning corners least. This might imply increased caution of adult mice with high proliferation levels in novel environments.
However, other explanations are possible if the older mice would have paid more attention to novel cage mates at the expense of climbing into learning corners. Future studies must address this issue, but at present it fits well to theories assuming that variations in levels of AHN predispose individuals of a given species to different levels of exploring and adapting to new environments within the natural lifespan. The strong correlations between propensity for sucrose preference in the home cage and AHN in rats [Hu et al., 2016] points in the same direction, as they were observed in both treated and control rats. Obviously, ethological observational approaches are not superior per se, but they avoid the notorious confounds in behavioral testing of mice, thus increasing the chance for observing reproducible correlations.
A last argument for paying more attention to ethological factors comes from an extensive study in conventional knockout mice lacking adult neurogenesis in both SVZ and SGZ [Jaholkowski et al., 2009]. These mice compensated the lack of AHN remarkably well when tested in a large battery of learning and memory tasks aimed at revealing hippocampal malfunction. On the other hand, they were impaired in olfactory functions, and showed significant deficits in species-typical behaviors such as food burrowing and nest building [Jednyak et al., 2012], corresponding to hippocampal lesion effects in mice [Deacon et al., 2002].
Interpreting Data
Inferential Conclusions
The most common form of data presentation in the field of behavioral AHN shows that a given treatment or genetic differences result in group differences of both cell counts (endophenotype) and behavioral readouts. The implicit assumption is that parallel changes in the endophenotype and the behavioral readout indicate a causal effect of the experimentally varied cell counts on the behavioral readout. This (widely occurring) logical error in neurobehavioral studies has been criticized repeatedly [Lazic, 2010, 2012a; Lazic et al., 2014], yet without much resonance. The latter study checked more than 900 papers claiming an effect of AHN on behavioral measures, but found only 11 papers reporting a correlation between natural or experimentally induced individual levels of AHN and the behavioral scores. A meta-analysis revealed that the contribution of individual variability to the behavior scores was statistically negligible and that major effects on behavior had to be attributed to other factors than cell counts.
The argument is shown in more detail in figure 4. The first part deals with the risk of accepting spurious brain-behavior associations based on group differences in unrelated samples of animals, clearly the weakest method for dissecting brain-behavior relationships. This approach is most frequently used by testing batches of animals known for increased or decreased proliferation, yet without actually analyzing cell numbers in the dentate gyrus of the tested mice [Sahay et al., 2011]. The covariance between individual cell numbers and behavioral scores is not known but simply inferred.
To demonstrate the fallacy of such assumptions, we used our own data showing that the mouse strains C3H and DBA/2 differ significantly in avoidance learning (fig. 4b). At the time of the study, we intended to show that this difference was related to the volume of synaptic fields in the CA3/CA4 region [Lipp et al., 1989]. Indeed, the poorly performing strain C3H showed both significantly larger molecular layers (fig. 4a) and more mossy fiber terminals along the basal dendrites of pyramidal cells in CA3/CA4, the infrapyramidal projection (fig. 4c). However, the structural and behavioral strain differences might be based on numerous factors unrelated to each other, such as body weight or hormonal status, and claiming a functional relation between hippocampal synaptic fields and behavior would represent a massive inferential leap not justified by the data. We then randomized noninvasively genetic factors affecting the size of the synaptic fields in CA3/CA4 simply by crossing the two strains and creating a F2 generation showing phenotypic variability of synaptic fields and avoidance learning. Figure 4d shows that variation in the size of the molecular layer was now largely uncorrelated with avoidance learning scores, indicating that the initially observed link was spurious. On the contrary, there was a strong and significant negative correlation between the volume of the IIP-MF and avoidance learning. Later studies showed that the extent of the IIP-MF correlated with a variety of hippocampus-dependent behaviors as well as with behaviors not suspected of being hippocampus dependent [Lipp et al., 1989; Lipp and Wolfer, 1995].
Hidden Information in Within-Group Variation of Data Points
In studies measuring neuronal numbers and behavior in the same animal, there is no need to create artificial variation because substantial within-group variation in both variables can provide valuable information if the relations are presented at least as scatter plots. In doing so, one can easily identify whether there are correlations between cell counts and behavior, either significant or at least as trend. But a total lack of within-group correlations is suspicious for spurious relations (fig. 4f) and the results must be considered as questionable. The presence of significant within-group correlations or at least trends (fig. 4g) indicates the possibility of a more causal relationship (without proving it), and papers showing them deserve more credit for their conclusions. Such inspection does not require large sample sizes. In a recent study, the number of BrdU-positive neurons was reduced by retinoic acid, followed by assessment of water maze learning and sucrose preference [Hu et al., 2016]. The data revealed that the proliferation rate appeared to be strongly correlated with individual sucrose preference within both control and treated rats, even though it included 6 rats per condition only. The lowest correlation coefficient was r = 0.79 and the best was r = 0.98, the distribution of data points not being skewed. One may also note that such correlations were observed during spontaneous behavior in the home cage but not in the water maze. Another paper presented an analysis of sex differences in rats subjected to variants of radial maze learning thought to reveal behaviorally neurophysiological pattern separation in the hippocampus, without attempts to change potential levels of AHN experimentally [Yagi et al., 2016]. In contrast to earlier reports investigating AHN and learning behavior [e.g. Sahay et al., 2011], they analyzed the cases of errors carefully, attributing them to differential strategies of problem solving. The small sample size and distribution of data points does not allow for definite conclusions of whether the survival of newly born granule cells is really linked to the behavior scores, but any reader can quickly make his own conclusions. For further examples of correlative studies, see Freund et al. [2013] and van Dijk et al. [2016b].
Thus, as a rule of thumb, the presence or absence of scatter plots in data presentations reporting changes in both adult neurogenesis and supposedly related behavior allows classifying publications as interesting and trustworthy, or, respectively, as inconclusive, thus greatly reducing the amount of papers deserving in-depth reading.
Conceptual Problems
Research Findings in AHN and Their Perception by the Public
An important mechanism driving research directions and interpretation of data in AHN is the role of the media. Widespread publicity attracts more research groups, and increases the pool of funding. In this respect, the seminal paper by van Praag et al. [1999] probably had the largest and longest-lasting impact. Specifically, the hypothesis that physical activity was improving cognition through stimulating AHN fit the public and clinical opinion well, even though the original report was cautious in this respect and did not provide a correlational analysis. On the other hand, follow-up papers reporting data not matching the prevailing public perception did not get much attention, even when their approaches were technically sound. For example, Japanese groups investigated the effects of differential masticatory activity on AHN in feeding soft or hardened pellets to C57BL/6 mice. Much in parallel to the first wheel-running study, they reported increased survival of BrdU-labeled neurons in the mice forced to chew more [Akazawa et al., 2013]. Moreover, the groups differed in the latest phases of water maze learning, and the hard chewers showed (shaky) superior probe trial performance exactly as in the paper by van Praag et al. [1999]. Unfortunately, neither paper showed scatter plots that would have provided clues about the behavioral relevance of the variation in BrdU-labeled neurons and the behavior scores. Nonetheless, it would be interesting to investigate whether strong chewing in elderly humans might convey the same mental benefits as other forms of more demanding physical exercise [Chen et al., 2015].
More Is Better or Less Is More?
The dominant belief in the field and in public opinion is that AHN is a process supporting cognition, memory, and emotional stability because it occurs in the hippocampus - a structure known to mediate cognition, memory, and emotionality. The circularity of the argument is rarely questioned. As proliferation seems to undergo an ageing process associated with progressive decline of memory capacities, stimulating AHN ought to improve cognition in the elderly, protect mental functions in the demented, and help in counteracting depressive states. This somewhat simplistic view has generated a plethora of behavioral AHN papers and corresponding subfields. Even though their results have been inconsistent, the acceptance of the ‘more-is-better' concept has prevailed.
Pending confirmation, the experimentally backed recent emergence of a ‘less-is-more' concept is clearly undermining the foundation for clinical prospects along the conventional route. For example, risking amnesia by stimulating AHN no longer seems attractive for the elderly. On the other hand, it has led to new approaches in the pathogenesis of schizophrenia by considering protracted proliferation of granule cells in the human dentate gyrus as a pathological trait [Walton et al., 2012]. The concept is also compatible with views relating the time course of temporal decline in AHN to the emergence or change of habits during the life history of a given species [Amrein and Lipp, 2009], and it integrates data from different species. Finally, the finding that increasing AHN experimentally may induce forgetting might partly explain so-called ‘paradoxical findings' [Ehninger and Kempermann, 2006; Saxe et al., 2007; Schoenfeld and Gould, 2013]. Hopefully, the ‘less-is-more' concept will lead to studies controlling age-related dynamics of AHN in the own experimental animals as an integral part of hypothesis and research plan.
Summary and Prospectus
(i) Adult neurogenesis and its role in brain repair and neural regeneration are far from being clear. Although the brains of different animal groups (mammalian and nonmammalian) harbor active neuronal progenitors that are able to produce new neurons throughout life and which can be stimulated to contribute to neuronal repair, differences between species are apparent with regard to the extent and efficiency of repair in different locations and injury contexts. Interspecies comparisons are needed to recognize divergent pathways affecting adult neurogenesis positively or negatively.
(ii) A comparative view of adult neurogenesis in the SGZ indicates that its occurrence is highly variable, species specific, without recognizable common function up to now, and sometimes even lacking. Thus, different species may have different needs for protracted adult neurogenesis, if any. Similar markers of developmental stages of juvenile neurons appear to occur in all mammals, but their temporal correlations and expression relative to proliferating cells can vary enormously. Some of them, especially DCX, can be expressed in adult, non-newly generated cell populations, marking sites of neuronal plasticity.
(iii) A common feature of hippocampal neurogenesis is downregulation in juvenile and young-adult ages (in rats and mice between 1.5 and 4 months). Reportedly, similar turnover rates for adult humans and mice are based on a calculation error. The turnover rate in older mice is 10-20 times higher than in humans.
(iv) To understand the effects of adult neurogenesis, the lifetime course of late maturational processes in the mammalian brain such as gliogenesis, dendritogenesis, myelination, axonal sprouting and rearrangement, and other forms of neuronal plasticity should be known better. Yet lifetime studies in experimental animals are mostly lacking.
(v) The analysis of the behavioral functions of AHN suffers from several biological confounds or neglects. Prominent is the rapid downregulation of AHN, which can bias the effects of manipulations when they fall into time windows during which rapid change in cell numbers occurs. Most studies use only one inbred strain (C57BL6), but should be done in F1 hybrids between well-characterized strains for buffering strain-specific genetic variation. Critical results should be checked in nondomesticated mice.
(vi) Stress-related hormones can exert a massive impact on AHN. However, sensitivity to stressors is individually different even within inbred strains, due to long-lasting epigenetic up- or downregulation of promoter loci outside the hippocampus, even by minor stressors. The safest way to reduce variability in behavioral studies is by keeping experimenters away from the animals, and by using noninvasive proliferation markers.
(vii) Novel behavioral tests such as pattern separation aiming at demonstrating a specific role of newly generated neurons appear irrelevant for the ecological situation of mice and rats and are thus likely to generate more unequivocal data.
(viii) Promising approaches for identifying the behavioral role of AHN in rodents should focus on ethology-based behavioral assessment without interference by experimenter and potentially anxiogenic test set-ups. Discovery of relations is now possible by automated behavioral observation and cognitive testing.
(ix) The statistical interpretation of data in behavioral research in AHN is biased by unjustified inferential leaps. In the worst case, causation is inferred by observing parallel changes in functional readouts after a treatment increasing or decreasing cell proliferation in different samples. Most studies in which cell counts and behavioral readouts were done in the same animals do not report covariance between the scores although it could be done easily.
(x) Further biases in interpretation arise from public perception of animal studies suggesting positive perspectives for improving mental health in ageing or depressed people, mostly by a ‘more-is-better' approach. The emergence of a ‘less-is-more' approach can explain paradoxical results and fits well with developmental and comparative data.
In conclusion, we believe that the field of adult neurogenesis has a solid basis for understanding the molecular and developmental mechanisms regulating neurogenesis. The functional consequences of adult neurogenesis for both regeneration and behavioral/cognitive effects will remain elusive and contradictory unless investigators are willing to make more efforts to analyze their material for processes accompanying induced or natural changes in cell proliferation, and try to integrate reports opposing their findings instead of ignoring them. More stringent reviews and editorial requests for proper statistics justifying conclusions would be helpful. Ultimately, the field may deflate by itself when scientists realize that the main promises of AHN cannot be realized with quick studies any longer. The curve of publications in the coming years will show whether the field of AHN is just leveling off, or whether we will see a progressive decline. Given the huge financial investments by governments and industry, one may hope for a continuation at least avoiding the pitfalls outlined here, ideally with novel approaches and ideas.
Acknowledgments
We appreciate the permission of Irmgard Amrein and Ottavia Palazzo in providing unpublished graphic material and the administrative help of Irina Lipp.