Aim: It is difficult to reliably detect the earliest signs of Alzheimer's disease (AD)-associated cognitive impairment. Our aim was to compare 3 psychometric methods of identifying amnestic mild cognitive impairment (aMCI) in a middle-aged longitudinal cohort enriched for AD risk. Methods: Wisconsin Registry for Alzheimer's Prevention (WRAP) participants with 3 waves of cognitive assessment over approximately 6 years were coded as meeting each of 3 psychometric aMCI definitions: (a) ‘aMCI standard-baseline' used published norms to establish cutoffs for baseline performance; (b) ‘aMCI robust-baseline' applied WRAP-specific robust norms to baseline, and (c) ‘aMCI robust-multiwave' applied these robust norms across 3 waves of assessment. Each group was compared to a cognitively healthy subset. Results: Half the aMCI standard-baseline and one third of the aMCI robust-baseline group reverted to normal ranges at follow-up. Only the aMCI robust-multiwave method had an aMCI × age interaction showing significantly worse age-related memory declines in the aMCI group compared to the cognitively healthy group over 6 years of follow-up. Conclusion: Both cross-sectional methods showed instability over time, with many reverting to normal performance after baseline. The multiwave approach identified a group who showed progressive memory declines over 3 visits. Being able to detect progressive decline in late middle age is a critical step in improving prevention efforts.
Dementia due to Alzheimer's disease (AD) is preceded by milder cognitive changes affecting episodic memory, executive function, or other cognitive skills [1,2]. The diagnostic category of mild cognitive impairment (MCI) was developed to define these changes and thus identify persons at risk for developing AD or other dementias . Over the past decade, diagnostic criteria for MCI have undergone several revisions (e.g. Winblad et al. ), the most recent being the recommendations of a joint National Institute on Aging and Alzheimer's Association consensus panel . The consensus panel focused on procedures to identify the symptomatic preclinical phase of AD, referred to as MCI due to AD. Clinical and cognitive criteria for MCI in general were specified as (a) concern about cognitive decline by the person or a significant other; (b) preservation of independence in functional abilities, and (c) impairment in 1 or more cognitive domains. For MCI due to AD, impairment in episodic memory was specified as the most likely early deficit, and the importance of obtaining evidence for progressive longitudinal cognitive decline was emphasized.
The leading model of AD pathogenesis based on the amyloid cascade  portrays cognitive change as a late-occurring event preceded by several years of clinically silent neurobiological changes. However, evidence from at-risk populations suggests that mild cognitive changes may be occurring in midlife and earlier in the cascade than previously believed. The Framingham offspring study  found reduced verbal and nonverbal memory performance among AD offspring at a mean age of 63 years; the differences in memory were significant among APOE ε4 carriers only and were strongest in participants with a maternal family history of AD. Parental dementia was associated with worsening executive function over time, regardless of APOE genotype. In the Wisconsin Registry for Alzheimer's Prevention (WRAP), we found alterations in verbal list learning commonly associated with early-stage AD in middle-aged persons with a parental family history of AD, independent of APOE genotype [8,9]. Both studies have also shown atrophy in AD-sensitive brain regions in their middle-aged samples, most notably in persons with a parental family history of AD [7,10]. Although neither the Framingham offspring study nor WRAP has linked these early cognitive and brain atrophy changes to increased risk of AD as yet, previous investigations not selected for family history have shown associations between mild deficits in memory in middle age and increased risk of AD over intervals of 10 or more years [11,12].
Because midlife may be a crucial phase of life for intervening to alter the trajectory of cognitive decline and the development of dementia, determining how best to identify middle-aged persons at risk for developing AD is a high priority. One important step in this process is to determine whether current definitions of MCI effectively capture mild cognitive deficits occurring in middle age. While several studies have compared methods of operationalizing MCI criteria in older adults (e.g. Jak et al. ; Ganguli et al. ), only a handful of studies have applied MCI criteria to middle-aged samples, and these have shown mixed results regarding stability and correlates of MCI diagnoses. In 3 studies that included samples with mean ages in the 60s, the longitudinal stability of psychometrically identified MCI varied from 29% [15,16] to 70% , and none of the relatively young participants developed clinical dementia over follow-up intervals of 4-5 years. In the largest of these investigations (the Path Through Life Study) , a more broadly defined category of mild cognitive disorder, comprising any of several more specific diagnoses, showed a higher rate of stability across a 4-year interval than MCI per se. However, in the latest follow-up from this cohort , now young-old in age (68-72 years), more than 45% of mild cognitive disorder diagnoses were unstable across a period of 8 years. These initial investigations suggest that potentially significant cognitive impairments can be detected in midlife, but that single-point estimates of MCI may not be ideal for identifying the highest-risk individuals in this age span. This is important because biomarker profiles in preclinical AD may vary by the type of MCI definition employed in middle age.
The main aims of the present study were to compare the prevalences, baseline characteristics, and cognitive trajectories of 3 psychometric methods for identifying mild cognitive deficit in WRAP, a longitudinal study of over 1,500 late middle-aged persons at risk for developing dementia because of a parental family history of AD . Given the centrality of declines in episodic memory in early-stage AD [1,5], we focused on amnestic MCI (aMCI), with or without other cognitive impairments. All 3 approaches entailed a psychometric classification of aMCI, supported by informant reports of minimal functional impairment and by medical history review. The first approach (aMCI standard-baseline) used published age-based norms  for a well-known episodic memory test to identify persons whose baseline scores were below expected levels. The second approach (aMCI robust-baseline) used robust norms specific to this sample to identify persons with episodic memory deficits at baseline in a manner conceptually similar to DeSanti et al. . The third approach (aMCI robust-multiwave) implemented the recent recommendation of the joint consensus panel for longitudinal evidence of decline  and required episodic memory deficits (relative to robust norms) on multiple assessments spanning an average of 6 years.
Our analyses examined several hypotheses. First, we hypothesized that the aMCI standard-baseline approach would identify the fewest people with aMCI, while the robust-baseline approach would identify the most people. Second, to aid in interpreting whether memory deficits identified by each method reflect cognitive decline as opposed to stable lower ability, we hypothesized that each of the 3 aMCI groups would be similar to the cognitively healthy (CH) group in terms of estimates of premorbid cognitive abilities. Last, we hypothesized that the aMCI robust-multiwave approach would prove most effective in identifying persons with progressive memory change during the follow-up assessment window.
WRAP is a longitudinal study of a sample of over 1,500 middle-aged adults predominantly between the ages of 40 and 65 years at baseline. In order to increase our power to detect decline (and associated predictors) in middle age, the WRAP sample is enriched for a family history of AD, with over 70% of WRAP participants having a parent with either autopsy-confirmed or probable AD as defined by National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer's Disease and Related Disorders Association research criteria . Follow-up assessments are underway, with second-wave assessments occurring approximately 4 years after baseline and all subsequent waves occurring at approximately 2-year intervals. As shown in figure 1, WRAP has a very low attrition rate, with less than 5% of the baseline sample being unavailable for cognitive follow-up (e.g. deceased or dropped out). A large subset of the baseline sample had not yet returned for their third-wave visit (n = 831, 52.9%) and were therefore not included in analyses of our study hypotheses. To be included in testing the first hypothesis, participants must have completed 3 waves of testing, be free of dementia at or before the third wave of assessment, and be free of neurological conditions (including stroke, meningitis, epilepsy, multiple sclerosis, and Parkinson's disease); 532 met these inclusion/exclusion criteria (see sample flow chart in fig. 1). To be included in analyses of hypotheses 2 and 3, participants also had to meet aMCI criteria for 1 or more of the 3 psychometric aMCI approaches or meet criteria for ‘CH' across all 3 assessment waves.
General Study Procedures
Each wave of assessment includes a battery of commonly used clinical neuropsychological tests (see Sager et al.  for a description of the baseline cognitive battery), completion of questionnaires about health history and lifestyle, laboratory tests, and APOE genotyping. The neuropsychological battery included the measures used as cognitive outcomes for this study (see below), as well as the Mini-Mental State Examination (MMSE) , and estimated full-scale intelligence quotient (FSIQ) . Questionnaires included measures of education, the Instrumental Activities of Daily Living scale (IADL) , and the Center for Epidemiologic Studies Depression scale (CES-D) . All study procedures have been approved by the Health Sciences Institutional Review Board of the University of Wisconsin-Madison.
Factor analysis using promax rotation and maximum likelihood estimation  was used to reduce the set of cognitive measures to a smaller number of factors and obtain weights used to combine the measures within each factor. The resulting 6 weighted factor scores were then standardized (∼N (0, 1)) into z-scores, using means and standard deviations (SDs) obtained from the whole baseline sample. Factors include 2 general ability indicators, Verbal Ability and Visuospatial Ability, comprised of the Wechsler Abbreviated Scale of Intelligence [24 ]subtests and related measures. There are 2 factors representing new learning and recall (Immediate Memory; Verbal Learning and Memory), both derived from the Rey Auditory Verbal Learning Test (AVLT) . There are also 2 factors reflecting components of executive function (Working Memory, derived from the Digit Span Forward, Digit Span Backward, and Letter-Number Sequence subtests of the Wechsler Adult Intelligence Scale-III ; Speed and Flexibility, derived from Trails A, Trails B, and Stroop Color-Word). Additional details on the factor analysis methods and results can be found in Dowling et al. , Jonaitis et al. , and online supplementary Appendix A (for all online suppl. material, see www.karger.com/doi/10.1159/000355682) in this paper. The first 2 factors are obtained only at baseline and wave 2, while the others, which are more likely to be sensitive to early cognitive decline, are obtained at all waves. Deficits in Immediate Memory or Verbal Learning and Memory are associated with aMCI and are the primary focus of this paper, while deficits in Working Memory or Speed and Flexibility are associated with nonamnestic MCI.
Estimates of Premorbid Functioning
Identifying performance that is indicative of subtle decline is challenging and requires appropriate population-based norms or individual baseline performance or both. Because baseline testing from earlier in life is generally lacking, demographically based prediction equations have been developed to estimate premorbid intellectual levels (e.g. Barona et al. ; Crawford et al. ; Griffin et al. ). Duff [35 ]and Duff et al.  have demonstrated that this approach can be extended to estimate premorbid memory abilities based on demographics and a measure of premorbid intellect; they further demonstrated that people with aMCI show significantly greater discrepancies between actual performance and estimated premorbid performance than CH peers. Adopting a similar approach, we used linear regression to develop prediction equations to estimate premorbid functioning for each of the aMCI-related factors using a combination of demographic variables [age, gender, and race (non-Hispanic Caucasian vs. other)] and premorbid intellect. Reading scores, used by Duff [35 ]and Duff et al. , have previously been shown to be reliable estimators of premorbid intellect [37,38]. Baseline Wide Range Achievement Test-III (WRAT-III)  raw reading scores were standardized using means and SDs from the WRAP baseline sample. Deciles from the standardized reading scores were then used in estimating premorbid functioning for the memory and executive function factors and in developing robust norms (as described in a subsequent section). Since the WRAP sample includes participants with siblings, the prediction equations for premorbid functioning used a subset of the baseline sample (n = 1,194) comprised of 1 randomly selected sibling per family to eliminate the influence of intra-family correlation on the regression coefficients; regression coefficients were then applied to the whole sample to obtain estimates of premorbid functioning corresponding to baseline age.
Development of Robust Norms
There are few published norms suitable for the relatively young age and high average education level of the WRAP cohort, especially on the AVLT, and those that are available lack information to assess sensitivity for identifying low performers in our sample. This problem has been addressed in other studies by developing robust internal norms (e.g. DeSanti et al. ; Sliwinski et al. ). Using the participants without a family history of AD as our reference group, we developed robust norms as follows. First, for each of the memory and executive function factors, we used baseline scores to develop lower prediction limits corresponding to a cutoff of -1.5 SDs below predicted based on age, gender, and WRAT reading decile; these are also sometimes referred to as ‘conventional norms' (e.g. DeSanti et al. ). Those who had at least 1 factor score at or below the corresponding lower prediction limit at both baseline and wave 2 (n = 21) met exclusion criteria for preclinical decline and were removed, and the process was repeated on the reduced set to obtain new lower prediction limits; the new limits are referred to as ‘robust norms'.
Categorizing CH versus aMCI Subjects
The factor scores from the memory domain were compared to the published and robust norms to categorize participants into either the CH group or an aMCI group for each of 3 aMCI variables (aMCI standard-baseline, aMCI robust-baseline, and aMCI robust-multiwave). For all 3 aMCI variables, the CH group consists of participants who had no scores below published norms or robust norms at any of the first 3 assessment waves (n = 335). The ‘aMCI standard-baseline' method comes closest to the typical psychometric procedure for identifying possible aMCI, clinically, and historically, in research [3,4] and includes people whose AVLT total or delayed scores at baseline fell at least 1.5 SDs below published norms (n = 24) . Given that limitations in norms can lead to underrecognition of mild cognitive deficits, especially in persons with high education or premorbid ability [40,41], the ‘aMCI robust-baseline' method includes people with 1 or more AVLT-based factor scores (Immediate Memory or Verbal Learning and Memory) falling at least 1.5 SDs below robust norms at baseline (n = 73). Finally, to reduce the odds of unstable classification frequently reported for nonclinical samples [15,42,43], the ‘aMCI robust-multiwave' group includes participants whose Immediate Memory and/or Verbal Learning and Memory factor scores fell below the robust norms on at least 2 of 3 waves of assessment (n = 61). For all categories, participants had normal-range performance on the MMSE (all had ≥25 points, 90% had ≥28 points) and informant reports of IADL function at wave 3 were also typically in the normal range (98% had ≥14 points).
Prevalence of aMCI in this sample was estimated for each aMCI method using a 95% confidence interval (CI) and total sample size of n = 532 (hypothesis 1). Sample characteristics were compared between groups (i.e. study sample vs. those yet to return for wave 3 and CH vs. each of the aMCI groups) using t tests for normally distributed data, two-sample Wilcoxon tests for continuous non-Gaussian data, and χ2 tests for categorical data. Estimates of premorbid functioning were also compared between the CH group and each of the aMCI groups using t tests (hypothesis 2). Longitudinal performance on each of the memory and executive function factors was compared across CH and aMCI groups using linear mixed models adjusting for baseline age, gender, and WRAT reading; possible random effects included family, individual, and individual slope. This approach to modeling facilitates comparisons of between-group changes, while also adjusting for intra-individual correlations . For each factor and aMCI method, the model including the aMCI method × age interaction was examined first to test whether rates of cognitive decline differed between the CH and aMCI groups (hypothesis 3). If the aMCI × age interaction was significant, indicating differing rates of decline between CH and aMCI, separate simple age intercepts and slopes were calculated for the CH and aMCI groups using the intercept, aMCI, and age parameters from the regression model . If the aMCI × age interaction was not significant (p ≥ 0.05), the interaction term was removed and the aMCI main effect was examined.
In secondary analyses, since APOE ɛ4 carrier status and a positive family history of AD increase risk of AD, we also examined the interaction of each of these with age in parallel models (i.e. same covariates and random effects). We also used χ2 tests to test whether each of these was independent of each of the 3 aMCI variables. All analyses were performed using SAS v.9.3; all statistical tests used an α of 0.05.
A total of 532 participants (98.7% non-Hispanic Caucasians) met inclusion criteria for the first hypothesis examining prevalence of aMCI across the 3 psychometric classification methods. The study sample was similar in gender and APOE ɛ4 carrier status to those who were eligible except for not having returned yet for their third visit (n = 831 in fig. 1). The study sample was more non-Hispanic Caucasian, younger, had more people with a college degree and family history of AD, had fewer people with elevated depression scores, and performed better on baseline assessment of IQ and literacy than those who have yet to return for a third visit. Details of the comparisons of these baseline characteristics are presented in online supplementary Appendix B.
In the study sample (n = 532), the percentage meeting aMCI criteria (and corresponding 95% CI) was 4.5% (2.7-6.3) for the aMCI standard-baseline approach, 13.7% (10.8-16.6) for the aMCI robust-baseline method, and 11.5% (8.8-14.2) for the aMCI robust-multiwave method. A total of 96 (18%) people in the sample met aMCI criteria for 1 or more methods; overlap in the 3 methods is shown in figure 2.
Stability of aMCI Baseline Classifications. Of the 24 participants who met aMCI standard-baseline criteria, more than one half (n = 14, 58.3%) reverted to the normal range at subsequent visits, slightly less than one fifth (n = 4, 16.7%) fell at least 1.5 SDs below published norms (i.e. met aMCI standard criteria) at all 3 visits, and the remainder showed inconsistent patterns across the 3 testings [1 participant (4.2%) met aMCI standard criteria at wave 2 but not wave 3, and 5 participants (20.8%) reverted to normal at wave 2 but met aMCI standard criteria again at wave 3]. Of the 73 participants who met aMCI robust-baseline criteria, 29 (39.7%) reverted to the normal range at subsequent visits, nearly one third met criteria at all 3 visits (n = 24, 32.9%), and the remainder varied across testings [8 participants (11.0%) met aMCI robust criteria at waves 1 and 2 only, and 12 participants (16.4%) met aMCI robust criteria at waves 1 and 3 only].
The remainder of our analyses compared the 335 participants who met psychometric criteria for CH with the aMCI group from each of the 3 classification methods. Subjects whose performance did not meet criteria for CH or 1 or more aMCI variables were omitted from these analyses. Table 1 summarizes sample characteristics for each group. The aMCI standard-baseline group differs significantly from the CH group in the following ways: it has more men, lower baseline IQ and WRAT reading scores, and lower estimates of premorbid functioning for all 4 factors representing memory and executive function. The aMCI robust-baseline group differs significantly from the CH group only in terms of baseline IQ. The aMCI robust-multiwave group is older, has higher baseline reading scores, and higher baseline Working Memory scores than the CH group.
Cognitive Decline in CH versus aMCI Groups
For each aMCI method and cognitive outcome, the first model examined included random effects, covariates, aMCI method, and the aMCI method × age interaction. When the interaction was not significant, it was removed and the model was rerun. Parameter estimates of the resulting 12 mixed effects models are summarized in table 2. For the aMCI standard-baseline analyses, the only significant aMCI × age interaction was for the Verbal Learning and Memory factor. Simple age slopes, calculated as described in table 2 and shown in figure 3a, indicate an interaction opposite to what one would expect of criteria that reliably identify decliners. Specifically, the analysis showed an age-related slope of -0.014 SDs/year in the CH group compared with an age-related slope of 0.027 SDs/year in the aMCI standard group. Thus, published norms identified a group of people whose verbal learning started lower but improved, while those in the CH group started higher and showed modest age-related decline. aMCI standard-baseline main effects were significant for Immediate Memory and Speed and Flexibility, with the CH group showing better performance than the aMCI group in both factors.
For the aMCI robust-baseline analyses, there were no significant aMCI × age interactions, and all aMCI main effects were significant, with the CH group scoring higher than the aMCI group across all 4 factors. Figure 3b depicts the nonsignificant aMCI × age interaction for Verbal Learning and Memory; the age-related slopes of the CH and aMCI robust groups were -0.014 and -0.002, respectively. The interaction was removed and rerun to obtain the other parameter estimates shown in table 2.
For the aMCI robust-multiwave analyses, there were significant aMCI × age interactions for both memory factors. Follow-up analyses indicated that those in the aMCI robust-multiwave group were lower on average and declined significantly faster than their CH peers on Immediate Memory (simple age slopes of -0.044 SDs/year for the aMCI robust-multiwave group vs. -0.020 SDs/year for the CH group). Patterns were similar in the Verbal Learning and Memory factor, with age slopes of -0.015 SDs/year for the CH group and -0.037 SDs/year for the aMCI robust-multiwave group, as shown in figure 3c. The 95% CIs for the age-related changes per year in Verbal Learning and Memory z-scores were -0.003 to 0.058 for aMCI standard-baseline, -0.019 to -0.004 for aMCI robust-baseline, and -0.056 to -0.018 for aMCI robust-multiwave.
Secondary χ2 analyses identified no significant associations between family history and aMCI status or APOE and aMCI status across any of the 3 aMCI methods. Longitudinal analyses did not reveal any significant family history or APOE main effects or interactions with age in any of the 4 cognitive factors among participants who had completed 3 waves of assessment.
One of the highest priorities in AD research is to find indicators of risk that are reliable and valid early in the lifespan when interventions to alter the disease course are likely to be most effective. Our study builds on previous work on MCI conducted primarily with older samples, extending categorization of mild cognitive deficit into a normally functioning middle-aged group. We focused on cognitive outcomes, particularly in episodic memory, in the belief that subtle variations in performance and patterns of decline may prove significant and may contribute meaningfully to the search for preclinical markers of AD susceptibility.
The standard approach to MCI identification, in which baseline memory performance was compared to published age norms, identified 4.5% of WRAP participants as having aMCI. This is similar to MCI prevalence estimates in 2 prior studies with samples in their early 60s [15,43], both of which found a 4% rate of aMCI based on similar methods. However, in our study, only 50% of persons with aMCI at baseline relative to published norms met criteria for multiwave aMCI. More importantly, the simple age slopes showed that instead of declining over time on the crucial Verbal Learning and Memory factor, the aMCI group identified by standard criteria showed improvement in learning and memory performance over the 6-year follow-up period. Two prior studies with middle-aged samples have also shown relatively low rates of stability for standard MCI diagnoses over follow-ups of a few years [15,43], whereas a third study, which combined middle-aged and elderly subjects in stability analyses, showed greater longitudinal consistency in MCI classification . These findings for middle-aged samples are similar to what has been observed in population-based (i.e. nonclinic) studies of elderly groups [14,42,46]. For older adults in the community diagnosed with MCI based on a single assessment, the most likely outcomes over a few years are either stable low performance or reversion to normal-range scores. Although only a relatively small percentage progress to AD, the rate of progression is nonetheless greater than that for persons with normal cognitive performance at study onset .
Using robust norms to identify low baseline memory performance resulted in more than twice as many persons (13.7% of the sample) being identified as having aMCI, and 60% of these individuals meeting aMCI robust criteria also met criteria for multiwave aMCI. However, for the aMCI robust-baseline group as a whole, there was no significant longitudinal change across the 6-year follow-up on any of the cognitive factors. While the use of robust norms can help to minimize oversight of significant cognitive deficit, which can be particularly likely in persons with high education or high ability [40,41], our data suggest that the application of robust norms at a single point in time could lead to identification of persons with mild memory and executive function difficulties that do not progress in a consistent manner in the midlife phase over the time span that subjects were followed here. Overall, it appears that neither of the approaches that used baseline data alone are optimal for identifying persons with subtle but reliable declines in episodic memory that may be the earliest cognitive indication of AD.
Using criteria that capitalized on longitudinal cognitive data, we found that nearly 12% of this late middle-aged sample have memory deficits consistent with aMCI. By requiring performance to be below expected levels on at least 2 of 3 testing occasions, the goal was to enhance the likelihood of identifying persons with relatively persistent memory problems who, despite normal activities of daily living function, may be at increased risk for cognitive decline and possible progression to AD. The longitudinal factor score data suggest that may be the case. The multiwave aMCI group declined more over the 6 years of follow-up than the CH group on both factors related to episodic memory; the age-related decline in the aMCI multiwave group was also more than either of the other 2 aMCI groups, as shown by the largely nonoverlapping CIs for the aMCI simple slopes. Executive function, while lower overall compared to the CH group, did not significantly decline over time in the multiwave aMCI group.
The aMCI robust-multiwave group most closely approximates cognitive criteria for MCI due to AD as recommended by the National Institutes of Health (NIH) workgroup, and it improves the sensitivity of detection of subtle decline by its use of internal norms that adjust for the high literacy and education of the sample. This multiwave group exhibited relatively persistent episodic memory deficit for age and literacy level, and they showed an accelerated rate of episodic memory decline across a period of 6 years. Despite these memory difficulties, they were typically independent in everyday function based on reports of close informants, and global cognitive performance as measured by the MMSE remained intact. In contrast to the workgroup criteria for MCI due to AD, however, WRAP participants with multiwave aMCI were not specifically selected for concern about memory decline.
A somewhat similar approach to MCI classification was taken by Collie et al. , who required impairment on word list learning on 3 consecutive testings, at 6-month intervals, for MCI classification. Compared to matched controls with normal word list learning on all testings, those with stable MCI performed worse on additional memory tests administered after the third test wave. Compared to WRAP, the Collie et al.  study was based on a smaller sample, included a greater proportion of elderly participants, and used much shorter test-retest intervals. However, their conclusion was similar to ours, i.e. that serial assessments are crucial for differentiating individuals with MCI from CH persons and from those with transient cognitive impairments. An accurate classification of MCI is needed to identify those at risk for AD and is particularly important in studies of preclinical AD, like WRAP, which are attempting to define the earliest neurobiology of AD in preparation for prevention clinical trials.
A concern for all methods of MCI identification is that criteria will select persons with characteristically weak memory performance unrelated to AD risk (so called ‘accidental MCI') [48,49]. In our data, the likelihood that this may have occurred appears strongest for the aMCI standard group, where baseline performance was lower on all cognitive factors compared to the CH group; premorbid performance, estimated from demographics and reading scores, was also significantly lower in the aMCI standard group for all cognitive factors. By contrast, for the multiwave aMCI group, estimated premorbid ability on secondary memory factors was comparable to that of the CH group, and the accelerated rate of decline in memory performance after baseline suggests a progressively deteriorating memory pattern. Using quantitative estimation of premorbid memory ability as a tool for identifying aMCI has also been supported by other recent research .
Parental family history of AD and APOE genotype were not associated with aMCI classification in these analyses. Biomarker evidence for preclinical AD has been reported among asymptomatic offspring of persons with AD by several research groups (e.g. Donix et al. ; Xiong et al. ), including our own (e.g. Johnson et al. ; Okonkwo et al. ), and we have also found evidence for subtle differences in memory strategies for persons with and without a family history of AD [8,9]. The absence of a significant family history effect may be due in part to the relatively small number of subjects meeting aMCI criteria and due to the fact that relatively few family history participants had completed 3 waves of testing at the time these analyses were conducted. There was also no evidence for elevated rates of aMCI in persons with at least 1 APOE ε4 allele. In addition to the small aMCI group limiting power to detect APOE effects, age and zygosity may have been factors in this negative finding. A recent meta-analysis of studies of APOE effects on cognition in nonclinical samples  concluded that having an ε4 allele is associated with a modest negative influence on episodic memory, but that the effect size is smaller in younger samples and for ε4 heterozygotes as opposed to homozygotes. The WRAP sample is relatively young and only 11.7% of our APOE-positive subgroup had 2 ε4 alleles.
Study limitations include the following: although word list learning and recall tests are among the most commonly recommended measures for detecting memory loss related to MCI and AD, aMCI categorization in our sample may have differed if longitudinal data from different memory tasks, or multiple memory measures, had been available. We were not able to verify the psychometric diagnoses with clinical exams since clinical diagnostic examinations were not a part of WRAP procedures in these early test waves. The relatively small number of subjects with aMCI limits statistical power to test associations with family history, APOE, or other predictors. Finally, although beyond the scope of this paper, it is important to recognize that we do not yet have biomarker evidence to indicate that aMCI is likely due to AD, nor do we know as yet whether participants with aMCI will progress to clinical AD.
Important future steps for WRAP will be to repeat these analyses when a larger proportion of the baseline sample has completed 3 waves of assessment. Subsequent analyses will also compare the multiwave approach with other methods of evaluating cognitive change, such as a reliable change index approach (e.g. Stein et al. ), and analysis of whether practice effects in early waves predict subsequent aMCI status in ways similar to other studies (e.g. Duff et al. ). It will also be critical to compare biomarker profiles for the aMCI robust-multiwave versus CH groups and to continue longitudinal follow-up to further assess cognitive trajectories and to establish dementia endpoints. The validity of the current multiwave aMCI classification will be supported if the aMCI group shows neuroimaging and/or CSF profiles consistent with preclinical AD, and ultimately, higher rates of AD. Equally important will be the characterization of factors (genetic, familial, and health- and lifestyle-related) that are associated with the earliest signs of aMCI, or alternatively, with remaining dementia free.
The authors gratefully acknowledge the assistance of the WRAP scheduling, assessment, and project management staff. We especially thank the WRAP participants. This research was supported by NIA grant R01AG27161 (Wisconsin Registry for Alzheimer Prevention: Biomarkers of Preclinical AD), NIH grant M01RR03186 (University of Wisconsin Clinical and Translational Research Core), the Helen Bader Foundation, Northwestern Mutual Foundation, and Extendicare Foundation. The project described was also supported by the Clinical and Translational Science Award (CTSA) program, through the NIH National Center for Advancing Translational Sciences (NCATS), grant UL1TR000427. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.