Abstract
Introduction: In this systematic review, we aimed to identify suitable assessment and measurement tools for screening individuals for cognitive impairment and Alzheimer’s disease (AD). We conducted a comprehensive evaluation of the reliability and validity of cognitive function assessment instruments. Based on our findings, we offer insightful suggestions for further research on cognitive function scale development and clinical researchers in AD. Methods: We searched the PubMed and CNKI databases for community-based studies aimed at developing or evaluating the validity or reliability of cognitive function assessment scales. Only studies written in English and Chinese that reported the development of cognitive function assessment scales and/or the validation of cognitive impairment severity in patients with cognitive impairment and AD were eligible for inclusion. The methodological quality of the studies, based on reliability (i.e., internal consistency and test-retest reliability) and validity (i.e., construct validity), was assessed using the consensus-based standards for the selection of health measurement instruments (COSMIN) according to the “worst score counts” principle. Subsequently, the measurement properties were rated qualitatively. Results were summarized and rated using the modified Grading of Recommendations, Assessment, Development, and Evaluation. Based on the results, recommendations were categorized into four levels: A, B, C, and D. Results: We retrieved a total of 804 studies. Following screening, a total of 62 articles were included, which reported 49 cognitive impairment assessment scales. The methodological quality of studies ranged from inadequate to very good, and the measurement properties varied from sufficient (+) to indeterminate (?). We found that the AD Assessment Scale-cognitive subscale (ADAS-Cog), Montreal Cognitive Assessment (MoCA), Baylor Profound Mental Status Examination (BPMSE), Clinical Dementia Rating (CDR), and the other 28 scales had sufficient validity and reliability. Conclusion: Our evaluation according to the COSMIN guidelines suggested that the ADAS-Cog, MoCA, BPMSE, CDR scale, and Mini-mental State Examination could be used to assess the degree of cognitive impairment in patients with AD. When developing cognitive function assessment scales, factors such as time and linguistic and cultural differences could be carefully considered.
Introduction
An aging global population has resulted in a rise in the incidence of neurological diseases. Of these diseases, Alzheimer’s disease (AD) is one of the most common dementias and has placed a considerable burden on patients and their families [1]. AD is a progressive neurodegenerative disease characterized by memory loss and cognitive impairment that severely affects patients’ ability to function in daily life. To date, the etiology and pathogenesis of AD have not been fully elucidated [2]. The National Institute on Aging-Alzheimer’s Association workgroup diagnostic guidelines state that patients with AD exhibit poor performance in one or more cognitive domains, and these impairments are expected to markedly deteriorate over time. Such alterations can manifest across a range of cognitive domains, such as memory, executive function, attention, language, and visuospatial skills. Impairment in episodic memory (the capacity to learn and retain new information) is most prevalent among patients with mild cognitive impairment (MCI) [3]. The initial symptoms of AD are often subtle and difficult to detect by caregivers. Therefore, patients are often in the moderate to severe stage of AD by the time caregivers become aware of abnormalities and have thus missed the opportunity for early and effective treatment. This can negatively impact patients’ health-related outcomes and quality of life. As the disease progresses, cognitive impairment gradually worsens, and patients encounter increasingly more difficulties in daily life.
Outcome measurement instruments (OMIs) constitute a standardized mechanism designed to quantify specific health-related constructs within defined clinical domains. The taxonomy of OMIs encompasses diverse modalities, including but not limited to: questionnaires (including patient-reported outcome measurement or clinician-reported outcomes), clinician-administered rating scales, biomarker analyses, and so on [4]. OMIs are used to monitor the health status of patients (including psychological and cognitive health) and evaluate treatment methods in research and clinical practice [5, 6]. OMIs are widely used in AD research and clinical practice to obtain subjective cognitive function data via systematic scales and scoring criteria. By assessing different aspects of cognitive function, these instruments can help determine the extent, type, and progression of a patient’s cognitive deficits. Their standardized implementation facilitates comparative outcome analyses while informing evidence-based clinical decision-making processes.
Cognitive measurement instruments represent economically viable, time-efficient screening solutions, demonstrating particular utility in preclinical screening paradigms for detecting individuals at elevated risk for developing symptomatic AD manifestations. Although extensive investigations and summaries have been conducted on the measurement characteristics such as reliability, validity, sensitivity, and responsiveness of cognitive assessment tools for AD, their measurement has not yet been used the internationally recognized methodology for structured quality rating of the scale. The comprehensive evaluation results of measurement attributes have not been integrated to form a clinical recommendation level [7, 8]. Determining whether existing scales for evaluating cognitive function require updating is vital. Therefore, this systematic review aimed to identify the most commonly used cognitive assessment instruments in AD screening, diagnosis, and evaluation. We assessed and compared the quality of measurement properties (i.e., reliability and validity) and the characteristics of cognitive function assessment scales to examine the quality of studies on MCI and AD according to the guidelines of Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) and the consensus-based standards for the selection of health measurement instruments (COSMIN). PRISMA-COSMIN (2024) is an expert consensus developed by the Enhancing the Quality and Transparency Of health Research (EQUATOR) guidelines and included a literature search, expert consultations, a Delphi study, a hybrid workgroup meeting, pilot testing, and an end-of-project meeting, with integrated patient/public involvement. The PRISMA-COSMIN methodology for OMIs, offering a detailed, standardized, and transparent framework, facilitates the selection and use of high-quality OMIs in both research and clinical practice [9‒12].
Methods
The systematic review was conducted and reported in accordance with the recommendations of the PRISMA statement. The methodological quality of the included studies was evaluated using the COSMIN checklist. We examined the psychometric properties of each instrument. The PRISMA checklist is provided in the Supplementary Material.
Inclusion and Exclusion Criteria
Instruments were included in the review if they satisfied the following criteria: (1) the study described the development (i.e., inauguration paper) and/or the evaluation of the psychometric properties and their relationships with other relevant cognitive constructs (i.e., validation paper) in AD patients; (2) the study population were adults with AD; (3) the instrument was designed to evaluate the degree of cognitive impairment for AD specifically; (4) the study only documented OMIs (the format is limited to cognitive measurement instruments such as questionnaires, Q&A tools, scales, and tests, which can obtain scores in a short period of time without the assistance of medical imaging equipment); and (5) studies were written in English or Chinese. Instruments were excluded if they met any of the following criteria: (1) nonscale formats (the types include biomarker analyses, laboratory tests, scores obtained through a physical examination or observations of an image, and so on); (2) randomized controlled studies using OMIs only as a measurement tool (i.e., clinical trials without any validation of the measurement tool); (3) dissertations, editorials, conference proceedings, and reports; (4) republished papers; and (5) papers with abstracts only or unavailable full texts.
Search Strategy
Articles were identified by conducting searches of two electronic databases: PubMed and China National Knowledge Infrastructure (CNKI). Searches were performed from inception to February 1, 2024. All identified articles were screened. We used 5 combined sets of search terms, which are provided in the Supplementary Material.
Data Extraction
The retrieved articles were organized using NoteExpress3.9.0 (Beijing Aegean Lezhi Technology Co., Ltd., Beijing, China). For studies that met the inclusion criteria, two reviewers independently extracted the following data: name of the instrument, author, country, number of samples, measurement domains, number of items, and evidence of reliability and validity. Data were summarized using Excel2019 (Product ID: 00405-32257-26357-AAOEM).
Quality Assessment
The methodological quality of the studies was assessed using the COSMIN checklist [9, 13, 14], which is a standardized tool for assessing the measurement properties of instruments across nine domains: internal consistency, reliability, measurement error, content validity, construct validity (subdivided into structural validity, hypothesis testing, and cross-cultural validity), convergent validity, and responsiveness. Each domain is rated using 5–18 items. The methodological quality of each item was rated as “very good,” “adequate,” “doubtful,” or “inadequate.” A methodological quality score for each measurement property was obtained by taking the lowest rating of any item (i.e., the “worst score counts” principle). The measurement property was then rated as sufficient (+), insufficient (−), inconsistent (±), or indeterminate (?). The quality of the instruments was graded as high, moderate, low, or very low using a modified grading of recommendations assessment, development, and evaluation (GRADE) approach. Finally, 4 recommendation levels (A, B, C, and D) were used to represent high, moderate, low, and very low research quality, respectively. Two independent reviewers assessed the methodological quality of the included studies using the checklist. Any disagreements in scoring were resolved through discussion with a third reviewer.
In addition, according to the GRADE approach for evaluating measurement properties in systematic reviews of OMIs, we considered the following 4 factors: (1) risk of bias (i.e., the methodological quality of the studies), (2) inconsistency (i.e., unexplained inconsistency of results across studies), (3) imprecision (i.e., total sample size of the available studies), and (4) indirectness (i.e., evidence from populations that differ from the current population of interest).
Results
Search Results
We obtained 804 articles from the database search. Of these, 32 were obtained from CNKI, and 772 were retrieved from PubMed. We excluded 376 articles because they described the development (inauguration paper) and/or assessment (validation paper) of psychometric properties that were not relevant to cognitive function in patients with AD or were not related to other constructs. We excluded 81 articles because the full text was unavailable. A further 26 review articles and 132 articles reporting unqualified measurement studies (e.g., studies using instruments only as clinical trial outcomes, studies without any validation, and studies missing important measurement properties) were excluded. We also excluded 58 studies that only used OMIs as a measurement tool or validation studies that used OMIs for other instruments. Fifteen studies that were not OMIs or specific OMIs were excluded. After excluding 28 non-Chinese or English articles, 62 valid articles were finally included. The article selection process is shown in Figure 1.
The studies collectively used a total of 49 scales. Our data analysis revealed that the AD Assessment Scale-cognitive subscale (ADAS-Cog) was the most frequently used scale, where nine articles evaluated the measurement properties of the ADAS-Cog. Four articles evaluating the Montreal Cognitive Assessment (MoCA) were included in English and Chinese. Three articles assessed the Baylor Profound Mental Status Examination (BPMSE), and two studies evaluated the Clinical Dementia Rating (CDR) scale. We included only one article on the measurement properties of the Mini-Mental State Examination (MMSE). Based on the grading criteria outlined in the COSMIN guidelines, we categorized the recommendation levels into 4 grades (A, B, C, and D) and compiled the recommendation levels of the 49 assessment tools into the Table 1.
Recommended level table for AD OMIs
OMIs . | Recommendation . |
---|---|
AD Assessment Scale-cognitive subscale (ADAS-Cog) | A |
Montreal Cognitive Assessment (MoCA) | A |
Baylor Profound Mental Status Examination (BPMSE) | A |
Clinical Dementia Rating (CDR) | A |
Mini-Mental State Examination (MMSE) | A |
Test for the Early Detection of Dementia from Depression (TE4D) | A |
Telephone cognitive screen (T-CogS) into Turkish version (T-CogS-TR) | A |
Self-Test (ST) | A |
Six spatial navigation-related tasks | A |
The Severe Cognitive Impairment Rating Scale (SCIRS) | A |
Subjective Cognitive Decline-Questionnaire 9 (SCD-Q9) | A |
Patient-Reported Outcomes in Cognitive Impairment (PROCOG) | A |
Ordinal Scales of Psychological Development was modified (M-OSPD) | A |
Mini-Mental State Examination (MMSE) | A |
The Mattis Dementia Rating Scale (MDRS-2) | A |
The London Psychogeriatric Rating Scale (LPRS) | A |
LeSCoD for Lewy body Screening scale in Cognitive Disorders | A |
The Korean version of the Fear of Alzheimer’s Disease Scale (K-FADS) | A |
Persian version of the FAST (the Functional Assessment Staging of dementia of the Alzheimer’s type) (I-FAST) | A |
HoNOS 65+ | A |
The Global Neuropsychological Assessment (GNA) | A |
Dementia Knowledge Assessment Scale of Chinese vision (DKAS-C) | A |
Columbia University Scale for Psychopathology in Alzheimer's Disease (CUSPAD) | A |
Cornell Scale for Depression in Dementia (CSDD-CV) | A |
The Chinese Version of the Relevant Outcome Scale for Alzheimer's Disease (CROSA) | A |
Cognitive-12 (Cog-12) | A |
The Bedford Alzheimer Nursing Severity Scale | A |
AQ-D | A |
The Abbreviated Mental Test Score (AMTS) | A |
Apathy Evaluation Scale (AES-I) | A |
Activities of Daily Living Questionnaire (ADLQ-CV) | A |
The Chinese version of Addenbrooke’s Cognitive Examination-Revised (ACE-R) | A |
The 7-Minute Screen (7 MS) | A |
The Korean version of the SIB (SIB-K) – the Severe Impairment Battery (SIB) | B |
Subjective Cognitive Decline-Questionnaire (SCD-Q) | B |
Italian revised memory and behavior problems checklist (It-RMBPC) | B |
The French Prospective and Retrospective Memory Questionnaire (PRMQ) | B |
The “Protocole d’Examen Cognitif de la Personne Agée – Lausanne” (PECPA-L) | B |
The Neurobehavioral Rating Scale (NRS) | B |
Neuropsychiatric Symptom Assessment (NPSA) scale | B |
The Neuropsychological Impairment Scale – Senior (NIS-S) | B |
Multifactorial Memory Questionnaire (MMQ) | B |
The Memory Alteration Test (M@T) | B |
Dementia Screening Questionnaire for Individuals with Intellectual Disabilities (DSDS) | B |
The Direct Assessment of Functional Status-Revised (DAFS-R) | B |
The Attention Questionnaire Scale (AQS) | B |
Taiwanese version of the Apathy Evaluation Scale, clinician version (AES-C) | B |
AD-IE | B |
The Alzheimer Disease 8 (AD8) | B |
OMIs . | Recommendation . |
---|---|
AD Assessment Scale-cognitive subscale (ADAS-Cog) | A |
Montreal Cognitive Assessment (MoCA) | A |
Baylor Profound Mental Status Examination (BPMSE) | A |
Clinical Dementia Rating (CDR) | A |
Mini-Mental State Examination (MMSE) | A |
Test for the Early Detection of Dementia from Depression (TE4D) | A |
Telephone cognitive screen (T-CogS) into Turkish version (T-CogS-TR) | A |
Self-Test (ST) | A |
Six spatial navigation-related tasks | A |
The Severe Cognitive Impairment Rating Scale (SCIRS) | A |
Subjective Cognitive Decline-Questionnaire 9 (SCD-Q9) | A |
Patient-Reported Outcomes in Cognitive Impairment (PROCOG) | A |
Ordinal Scales of Psychological Development was modified (M-OSPD) | A |
Mini-Mental State Examination (MMSE) | A |
The Mattis Dementia Rating Scale (MDRS-2) | A |
The London Psychogeriatric Rating Scale (LPRS) | A |
LeSCoD for Lewy body Screening scale in Cognitive Disorders | A |
The Korean version of the Fear of Alzheimer’s Disease Scale (K-FADS) | A |
Persian version of the FAST (the Functional Assessment Staging of dementia of the Alzheimer’s type) (I-FAST) | A |
HoNOS 65+ | A |
The Global Neuropsychological Assessment (GNA) | A |
Dementia Knowledge Assessment Scale of Chinese vision (DKAS-C) | A |
Columbia University Scale for Psychopathology in Alzheimer's Disease (CUSPAD) | A |
Cornell Scale for Depression in Dementia (CSDD-CV) | A |
The Chinese Version of the Relevant Outcome Scale for Alzheimer's Disease (CROSA) | A |
Cognitive-12 (Cog-12) | A |
The Bedford Alzheimer Nursing Severity Scale | A |
AQ-D | A |
The Abbreviated Mental Test Score (AMTS) | A |
Apathy Evaluation Scale (AES-I) | A |
Activities of Daily Living Questionnaire (ADLQ-CV) | A |
The Chinese version of Addenbrooke’s Cognitive Examination-Revised (ACE-R) | A |
The 7-Minute Screen (7 MS) | A |
The Korean version of the SIB (SIB-K) – the Severe Impairment Battery (SIB) | B |
Subjective Cognitive Decline-Questionnaire (SCD-Q) | B |
Italian revised memory and behavior problems checklist (It-RMBPC) | B |
The French Prospective and Retrospective Memory Questionnaire (PRMQ) | B |
The “Protocole d’Examen Cognitif de la Personne Agée – Lausanne” (PECPA-L) | B |
The Neurobehavioral Rating Scale (NRS) | B |
Neuropsychiatric Symptom Assessment (NPSA) scale | B |
The Neuropsychological Impairment Scale – Senior (NIS-S) | B |
Multifactorial Memory Questionnaire (MMQ) | B |
The Memory Alteration Test (M@T) | B |
Dementia Screening Questionnaire for Individuals with Intellectual Disabilities (DSDS) | B |
The Direct Assessment of Functional Status-Revised (DAFS-R) | B |
The Attention Questionnaire Scale (AQS) | B |
Taiwanese version of the Apathy Evaluation Scale, clinician version (AES-C) | B |
AD-IE | B |
The Alzheimer Disease 8 (AD8) | B |
Study Properties
Table 2 shows the 6 properties of the cognitive assessment instruments for evaluating AD patients. Sample sizes ranged from 15 to 5,704 patients. Studies were conducted in various countries, including China (n = 19), the USA (n = 10), and South Korea (n = 5). Instruments typically contained 3 to 10 dimensions (e.g., memory, number processing, orientation, language ability, and behavioral ability), with the number of items ranging from 6 to 103. More than half of the articles (n > 30) involved tests of memory. Some instruments (10 or so) emphasized changes in language abilities, and other instruments were focused on attentional and perceptual dysfunctions. Two articles did not describe specific measurement dimensions of the scale used [15, 16].
Properties of the included studies
Evaluation instrument . | Author(s) . | Country . | Sample Total N . | Number of items/domains . | Measurement domains . |
---|---|---|---|---|---|
The 7-Minute Screen (7 MS) | Ijuin et al. [17] (2008) | Japan | 154 | Four subtests | 1. Benton Temporal Orientation Test. 2. Enhanced cued recall. 3. Clock drawing. 4. Verbal fluency |
The Chinese version of Addenbrooke’s Cognitive Examination-Revised (ACE-R) | Fang et al. [18] (2014) | China | 151 | 5 domains/16 cognitive items | 1. Orientation, registration, attention, and concentration. 2. Memory (recall, anterograde memory, retrograde memory, recognition). 3. Verbal fluency. 4. Language (comprehension, writing, repetition, naming, comprehension, reading). 5. Visuospatial abilities and perceptual abilities |
Alzheimer Disease 8 (AD8) | Ryu et al. [19] (2009) | Korea | 155 | 8 items | 1. Problems with judgment. 2. Reduced interest in hobbies/activities. 3. Repeats questions, stories, or statements. 4. Trouble learning how to use a tool, appliance, or gadget. 5. Forgets correct month or year. 6. Difficulty handling complicated financial affairs. 7. Difficulty remembering appointments. 8. Consistent problems with thinking and/or memory |
The Italian version of Alzheimer’s Disease Assessment Scale (ADAS) | Inzitaria et al. [20] (1999) | Italy | 217 | 21 items | The cognitive portion of the scale consists of 11 items, assessing memory, language, and praxis function, while the 10 items of the noncognitive portion aim to evaluate mood, agitation, vegetative functions, delusions, and hallucinations |
Alzheimer’s Disease Assessment Scale-Cognitive Subscale (ADAS-Cog) | Ben et al. [21] (2017) | Tunisia | 182 | 11 items | 1. Word recall. 2. Naming (objects and fingers). 3. Following commands. 4. Constructions (drawing). 5. Ideational praxis. 6. Orientation. 7. Word recognition. 8. Recall of test instructions. 9. Spoken language ability. 10. Word-finding difficulty and comprehension of spoken language. 11. Assessing memory, praxis, and language |
Wei et al. [22] (2022) | China, Taiwan | 170 | |||
Jiang et al. [23] (2020) | China | 1276 | |||
Liu et al. [24] (2001) | China | 329 | |||
Cano et al. [25] (2010) | UK | 1421 | |||
Mavioglu et al. [26] (2006) | Turkey | 56 | |||
Ying-hong et al. [27] (2014) | China, Guangzhou | 76 | |||
Xia et al. [28] (2009) | China | 213 | |||
Alzheimer’s Disease Impression evaluation (AD-IE) | Yan’e et al. [29] (2021) | China | 5,704 | 8 domains/20 items | 1. Directional force (1–4) not myself. 2. Dialogue ability (articles 5–8) (9), (10–12). 3. Daily life, work, and social activities and roles (13). 4. Free activities (article 14). 5. Memory (15–18). 6. Mental symptoms (articles 19–20) and so on 8 parts |
Activities of Daily Living Questionnaire (ADLQ-CV) | Chu et al. [30] (2008) | China | 125 | 6 domains/28 items | 1. Self-care. 2. Household care. 3. Employment and recreation. 4. Shopping and money. 5. Travel. 6. Communication |
Taiwanese version of the Apathy Evaluation Scale, clinician version (AES-C) | Hsieh et al. [31] (2012) | China, Taiwan | 144 | 4 domains/18 items | Behavior, cognition, and emotion subscales |
Abbreviated Mental Test Score (AMTS) | Dionysios Tafiadis et al. [15] (2022) | Greece | 132 | 10 items | N/A |
AQ-D | Sato et al. [32] (2007) | Japan | 143 | 2 domains/30 items | Intellectual functioning and behavior |
Attention Questionnaire Scale (AQS) | Kim et al. [33] (2011) | Korea | 268 | 15 items | 8 negative items and 7 positive items |
Bedford Alzheimer Nursing Severity Scale | Bellelli et al. [34] (1997) | Italy | 99 | 3 domains/7 items | 1. Evaluating functional (ambulating, eating, dressing). 2. Cognitive (speech, eye contact) abilities. 3. Pathological symptoms (muscle rigidity, sleep-wake cycle disturbances) |
Baylor Profound Mental State Examination (BPMSE-cog) | Doody et al. [35] (1999) | USA | 208 | 25 question cognition subscale/4 subareas | 1. Orientation. 2. Language. 3. Attention. 4. Motor functioning |
Kørner et al. [36] (2012) | Denmark | 55 | |||
Salmerón et al. [37] (2016) | Spain | 100 | |||
Clinical Dementia Rating (CDR) | Nguyen et al. [38] (2020) | Vietnam | 153 | 6 domains | 1. Memory. 2. Orientation. 3. Judgment and problem solving. 4. Community affairs. 5. Home and hobbies. 6. Personal care |
Coley et al. [39] (2011) | France | 667 | |||
Cognitive‐12 Scale (COG-12) | Gang et al. [40] (2014) | China, Nanjing | 148 | 2 domains/12 items | 1. Judgment problems. 2. Decline of interest, change of hobbies, and decrease of activities. 3. Repeat the same thing. 4. Have difficulty in learning to use a simple everyday tool. 5. Lost count of the current month and year. 6. Have difficulty in dealing with complex personal economic affairs. 7. Unable to remember the agreement with others. 8. Problems in daily memory and thinking ability. 9. Volatile mood. 10. Changes in living habits or improper behavior. 11. Personality change. 12. Language problems. The first latitude included items C1, C2, C4, C5, C6, C7, C8, and C12, whereas the second latitude included items C3, C9, C10, and C11 |
The Chinese Version of the Relevant Outcome Scale for Alzheimer’s Disease (CROSA) | Zhang et al. [41] (2021) | China | 336 | 2 factors/3 domains/16 items | Factor 1 comprised three domains – cognition, communication, and function – and two items measuring PWAD QoL (item 15) and caregiver burden (item 16). The domain of cognition measured the remote (item 1) and recent (item 2) memory and complex ADLs (item 3) |
The domain of communication measured language expression (item 4) and communication (item 5) abilities and social competence (item 6). The domain of function measured basic ADLs (item 12), attention (item 13), and independence (item 14). Factor 2 (domain of behavior) measured levels of common BPSD, including irritability/aggression (item 7), restless (item 8), delusion (item 9), anxiety (item 10), and agitation (item 11) | |||||
Cornell Scale for Depression in Dementia (CSDD) | Ru-Jing et al. [42] (2008) | China | 60 | 5 domains/19 items | 1. Mood-related signs 2. Behavioral disturbances. 3. Physical signs. 4. Cyclic functions. 5. Ideational disturbances |
The Columbia University Scale for Psychopathology in Alzheimer's disease (the CUSPAD) | Devanand et al. [43] (1992) | USA | 91 | 5 domains/27 items | 1. Delusions. 2. General question (paranoid delusions, delusions of abandonment, somatic delusions, misidentification, other delusions). 3. Hallucinations (illusions). 4. Behavioral disturbances. 5. Depression |
The Direct Assessment of Functional Status-Revised (DAFS-R) | Fernanda et al. [44] (2010) | Brazil | 89 | 7 different domains | 1. Time orientation. 2. Communication skills. 3. Dealing with finances. 4. Shopping skills. 5. Grooming skills. 6. Eating skills. 7. Driving skills |
Dementia Knowledge Assessment Scale of Chinese vision (DKAS-C) | Zhao et al. [45] (2022) | China | 290 | 4 domains/25 items | 1. Causes and characteristics (7 items). 2. Communication and behavior (6 items). 3. Care considerations (6 items). 4. Risk and health promotion (6 items) |
Dementia Screening Questionnaire for Individuals with Intellectual Disabilities (DSDS) | Deb et al. [46] (2007) | UK | 193 | 9 domains/54 items | 1. Memory. 2. Confusion. 3. Loss of skills. 4. Social withdrawal. 5. Behavioral changes. 6. Psychological symptoms. 7. Physical symptoms. 8. Sleep disturbance. 9. Speech abnormalities |
The Global Neuropsychological Assessment (GNA) | Olson et al. [47] (2022) | USA | 105 | 7 subtests | 1. Story memory. 2. Digit span. 3. Perceptual comparison. 4. Verbal fluency [VF]. 5. Category switching. 6. Spatial span. 7. The 4-item Patient Health Questionnaire or PHQ-4 |
HoNOS 65+ | Spear et al. [48] (2002) | Australia | 42 | 12 scales | 1. Behavior. 2. Suicide. 3. Problem drinking. 4. Cognition. 5. Physical illness. 6. Hallucinations. 7. Depressed mood. 8. Other symptoms. 9. Relationship. 10. Activities of daily living. 11. Living conditions. 12. Activities |
The Persian version of the FAST (the Functional Assessment Staging of dementia of the Alzheimer’s type) (I-FAST) | Noroozian et al. [49] (2022) | Iran | 219 | 16 stage | Stages 1, 2, 3, 4, 5, 6 (a), 6 (b), 6 (c), 6 (d), 6 (e), 7 (a), 7 (b), 7 (c), 7 (d), 7 (e), and 7 (f) |
The Korean version of the Fear of Alzheimer’s Disease Scale (K-FADS) | Moon et al. [50] (2014) | Korea | 108 | 6 domains/30 items | 1. Basic questions. 2. Loss of memory. 3. Physical symptoms of fear/anxiety. 4. Vegetative symptoms of fear/anxiety. 5. Catastrophic attitude toward AD. 6. Family‐related concerns/loss of autonomy |
LeSCoD for Lewy body Screening Scale in Cognitive Disorders | Olivieri et al. [51] (2021) | France | 128 | 9 domains/36 items | 1. Attention. 2. Verbal fluency tests. 3. Iteral. 4. Visuospatial ability. 5. Visual perception. 6. Cognition. 7. Vigilance. 8. Insidious hallucinations. 9. Minor psychotic symptoms |
The London Psychogeriatric Rating Scale (LPRS) | Reid et al. [52] (1991) | Canada | 264 | 4 subscales/36 items | 1. Mental disorganization/confusion (13 items). 2. Physical disability (9 items). 3. Socially irritating behavior (8 items). 4. Disengagement (6 items) |
The Memory Alteration Test (M@T) | Rami et al. [53] (2007) | Spain | 400 | 4-5 domains/40–50 items | 1. Encoding. 2. Temporal orientation. 3. Semantic memory. 4. Free recall. 5. Cued recall |
The Mattis Dementia Rating Scale (MDRS-2) | Boycheva et al. [54] (2018) | Spain | 125 | 5 subscales/36 items | 1. Attention (AT). 2. Initiation/perseveration (I/P). 3. Construction (CT). 4. Conceptualization (CP). 5. Memory (ME) |
Multifactorial Memory Questionnaire (MMQ) | Yunyun et al. [55] (2018) | China, Chongqing | 326 | 3 domains/57 items | 1. Memory satisfaction (18 items). 2. Memory ability (20 items). 3. Memory strategy (19 items) |
Mini-Mental State Examination (MMSE) | Marshal et al. [56] (1975) | USA | 206 | 2 sections/11 items | Two sections, the first of which requires vocal responses only and covers orientation, memory, and attention. The second part tests ability to name, follow verbal and written commands, write a sentence spontaneously, and copy a complex polygon similar to a Bender-Gestalt figure |
The Montreal Cognitive Assessment (MoCA) | Pinto et al. [57] (2019) | Brazil | 229 | 7 cognitive domains | 1. Visuospatial/executive functions. 2. Naming. 3. Verbal memory registration and learning. 4. Attention. 5. Abstraction. 6. 5-min delayed verbal memory. 7. Orientation |
Hu et al. [58] (2013) | China, Hangzhou | 72 | |||
Kaya et al. [59] (2014) | Turkey | 474 | |||
Janelidze [60] (2017) | Georgia | 86 | |||
The Ordinal Scales of Psychological Development was modified (M-OSPD) | Auer et al. [61] (1994) | USA | 70 | 5 subscales | 1. Visual pursuit and object permanence.2. Means-ends. 3. Causality. 4. Spatial relations. 5. Schemes |
The Neuropsychological Impairment Scale – Senior (NIS-S) | O’Donnell et al. [62] (2001) | USA | 200 | 30 items | 18 items inquire about cognitive deficits (“Are you forgetful?”), which are summed to obtain a Global Measure of Impairment (GMI); 5 items gauge defensiveness (DEF; “Do you always tell the truth?”); 7 items measure affective disturbance (AFF; “Do you feel sad or blue?”) |
Neuropsychiatric Symptom Assessment (NPSA) scale | Yan’e et al. [63] (2022) | China | 4,677 | 15 items | 1. Reduced or rigid expression. 2. Decreased attention to daily activities and self. 3. Aggressive or agitated behavior. 4. Emotional, high voice. 5. Delusion. 6. Hallucinations. 7. Feeling depressed, pessimistic, and helpless. 8. Repeated questions about upcoming events, afraid to be alone. 9. Sleep disorder. 10. Poor self-control and judgment. 11. Unable to recognize people, places, or even oneself. 12. Wandering or repetitive behavior. 13. Changes in eating behavior. 14. Change of interests and hobbies. 15. Personality changes |
The Neurobehavioral Rating Scale (NRS) | Sultzer et al. [64] (1995) | USA | 15 | 28 items | 1. Inattention/reduced alertness. 2. Somatic concern disorientation. 3. Anxiety expressive deficit. 4. Emotional withdrawal conceptual. 5. Disorganization disinhibition. 6. Guilt feelings, memory deficit. 7. Agitation inaccurate insight. 8. Depressed mood, hostility/uncooperativeness. 9. Decreased initiative/motivation. 10. Suspiciousness. 11. Fatigability. 12. Hallucinations, motor retardation. 13. Unusual thought content blunted. 14. Affect excitement poor planning. 15. Mood lability/tension. 16. Comprehension-deficit speech. 17. Articulation defect/fluent aphasia |
The “Protocole d’Examen Cognitif de la Personne Agée – Lausanne” (PECPA-L) | Gunten et al. [65] (2006) | Switzerland | 379 | 10 domains/103 items | 1. Orientation in time. 2. Orientation in space. 3. Attention and calculation. 4. Immediate recall. 5. Language. 6. Remote memory. 7. Judgment and abstraction. 8. Gnosis. 9. Praxis. 10. Delayed recall |
The French Prospective and Retrospective Memory Questionnaire (PRMQ) | Guerdoux-Ninot et al. [66] (2019) | France | 488 | 16 items/2 domains | 8 items addressing prospective memory errors in daily life and 8 items concerning retrospective memory daily-life difficulties |
Patient-Reported Outcomes in Cognitive Impairment (PROCOG) | Frank et al. [16] (2006) | USA | 186 | 55 items | N/A |
Italian revised memory and behavior problems checklist (It-RMBPC) | Ottoboni et al. [67] (2019) | Italy | 355 | 3 domains/24 questionnaire items | 1. Memory problems (items 1–7). 2. Disrupted behaviors (items 8, 9, 10, 11, 13, 15, 16, and 24). 3. Depression (items 12, 14, and 17–23) |
Subjective Cognitive Decline-Questionnaire (SCD-Q) | Qing et al. [68] (2021) | China, Wuhan | 410 | 2 subscales/3 domains/24 items | The SCD-Q included two questionnaires: the MyCog and the known-person assessment of the elderly (TheirCog). MyCog consists of three cognitive domains: 1. Memory function. 2. Language function. 3. Executive function, with a total of 24 entries. TheirCog, similar to MyCog, is primarily an informed assessment of subjective cognitive decline in older adults |
English-Chinese version Subjective cognitive decline questionnaire 9 (SCD-Q9) | Lixiao et al. [69] (2019) | China, Shunyi | 200 | 2 domains、9 items | Overall memory function and time comparison (4 items), daily activity ability (5 items) |
The Severe Cognitive Impairment Rating Scale (SCIRS) | Choe et al. [70] (2008) | Korea | 267 | 11 items/5 cognitive areas | 1. Memory. 2. Orientation. 3. Language. 4. Frontal lobe function. 5. Visuospatial function |
The Korean version of the SIB (SIB-K) – the Severe Impairment Battery (SIB) | Suh et al. [71] (2006) | Korea | 65 | 40 items/9 subscales | 1. Social interaction. 2. Memory. 3. Orientation. 4. Language. 5. Attention. 6. Praxis. 7. Visuospatial ability. 8. Construction. 9. Orienting to name |
Six spatial navigation-related tasks | Allison et al. [72] (2019) | USA | 91 | 6 tasks | 1. Modified cognitive mapping task. 2. Experimental task. 3. Relational binding tasks. 4. Shape-color binding task. 5. Shape-location binding task. 6. Visual perspective taking task |
Self-Test (ST) | Stanonik et al. [73] (2005) | USA | 83 | 4 dimensions, 4 items | 1. A clock-drawing test. 2. Short-term memory. 3. Verbal fluency. 4. Orientation |
Telephone cognitive screen (T-CogS) into Turkish version (T-CogS-TR) | Naharci et al. [74] (2020) | Turkey | 104 | 5 domains/16 items | 1. Orientation. 2. Registration. 3. Recall. 4. Attention. 5. Language |
The TE4D (Test for the Early Detection of Dementia from Depression) | Mahoney [75] (2005) | UK | 203 | 7 subscales/8 items | 1. Immediate recall. 2. Semantic memory. 3. Clock drawing. 4. Category fluency. 5. Orientation. 6. Following commands (ideomotor praxis) |
Evaluation instrument . | Author(s) . | Country . | Sample Total N . | Number of items/domains . | Measurement domains . |
---|---|---|---|---|---|
The 7-Minute Screen (7 MS) | Ijuin et al. [17] (2008) | Japan | 154 | Four subtests | 1. Benton Temporal Orientation Test. 2. Enhanced cued recall. 3. Clock drawing. 4. Verbal fluency |
The Chinese version of Addenbrooke’s Cognitive Examination-Revised (ACE-R) | Fang et al. [18] (2014) | China | 151 | 5 domains/16 cognitive items | 1. Orientation, registration, attention, and concentration. 2. Memory (recall, anterograde memory, retrograde memory, recognition). 3. Verbal fluency. 4. Language (comprehension, writing, repetition, naming, comprehension, reading). 5. Visuospatial abilities and perceptual abilities |
Alzheimer Disease 8 (AD8) | Ryu et al. [19] (2009) | Korea | 155 | 8 items | 1. Problems with judgment. 2. Reduced interest in hobbies/activities. 3. Repeats questions, stories, or statements. 4. Trouble learning how to use a tool, appliance, or gadget. 5. Forgets correct month or year. 6. Difficulty handling complicated financial affairs. 7. Difficulty remembering appointments. 8. Consistent problems with thinking and/or memory |
The Italian version of Alzheimer’s Disease Assessment Scale (ADAS) | Inzitaria et al. [20] (1999) | Italy | 217 | 21 items | The cognitive portion of the scale consists of 11 items, assessing memory, language, and praxis function, while the 10 items of the noncognitive portion aim to evaluate mood, agitation, vegetative functions, delusions, and hallucinations |
Alzheimer’s Disease Assessment Scale-Cognitive Subscale (ADAS-Cog) | Ben et al. [21] (2017) | Tunisia | 182 | 11 items | 1. Word recall. 2. Naming (objects and fingers). 3. Following commands. 4. Constructions (drawing). 5. Ideational praxis. 6. Orientation. 7. Word recognition. 8. Recall of test instructions. 9. Spoken language ability. 10. Word-finding difficulty and comprehension of spoken language. 11. Assessing memory, praxis, and language |
Wei et al. [22] (2022) | China, Taiwan | 170 | |||
Jiang et al. [23] (2020) | China | 1276 | |||
Liu et al. [24] (2001) | China | 329 | |||
Cano et al. [25] (2010) | UK | 1421 | |||
Mavioglu et al. [26] (2006) | Turkey | 56 | |||
Ying-hong et al. [27] (2014) | China, Guangzhou | 76 | |||
Xia et al. [28] (2009) | China | 213 | |||
Alzheimer’s Disease Impression evaluation (AD-IE) | Yan’e et al. [29] (2021) | China | 5,704 | 8 domains/20 items | 1. Directional force (1–4) not myself. 2. Dialogue ability (articles 5–8) (9), (10–12). 3. Daily life, work, and social activities and roles (13). 4. Free activities (article 14). 5. Memory (15–18). 6. Mental symptoms (articles 19–20) and so on 8 parts |
Activities of Daily Living Questionnaire (ADLQ-CV) | Chu et al. [30] (2008) | China | 125 | 6 domains/28 items | 1. Self-care. 2. Household care. 3. Employment and recreation. 4. Shopping and money. 5. Travel. 6. Communication |
Taiwanese version of the Apathy Evaluation Scale, clinician version (AES-C) | Hsieh et al. [31] (2012) | China, Taiwan | 144 | 4 domains/18 items | Behavior, cognition, and emotion subscales |
Abbreviated Mental Test Score (AMTS) | Dionysios Tafiadis et al. [15] (2022) | Greece | 132 | 10 items | N/A |
AQ-D | Sato et al. [32] (2007) | Japan | 143 | 2 domains/30 items | Intellectual functioning and behavior |
Attention Questionnaire Scale (AQS) | Kim et al. [33] (2011) | Korea | 268 | 15 items | 8 negative items and 7 positive items |
Bedford Alzheimer Nursing Severity Scale | Bellelli et al. [34] (1997) | Italy | 99 | 3 domains/7 items | 1. Evaluating functional (ambulating, eating, dressing). 2. Cognitive (speech, eye contact) abilities. 3. Pathological symptoms (muscle rigidity, sleep-wake cycle disturbances) |
Baylor Profound Mental State Examination (BPMSE-cog) | Doody et al. [35] (1999) | USA | 208 | 25 question cognition subscale/4 subareas | 1. Orientation. 2. Language. 3. Attention. 4. Motor functioning |
Kørner et al. [36] (2012) | Denmark | 55 | |||
Salmerón et al. [37] (2016) | Spain | 100 | |||
Clinical Dementia Rating (CDR) | Nguyen et al. [38] (2020) | Vietnam | 153 | 6 domains | 1. Memory. 2. Orientation. 3. Judgment and problem solving. 4. Community affairs. 5. Home and hobbies. 6. Personal care |
Coley et al. [39] (2011) | France | 667 | |||
Cognitive‐12 Scale (COG-12) | Gang et al. [40] (2014) | China, Nanjing | 148 | 2 domains/12 items | 1. Judgment problems. 2. Decline of interest, change of hobbies, and decrease of activities. 3. Repeat the same thing. 4. Have difficulty in learning to use a simple everyday tool. 5. Lost count of the current month and year. 6. Have difficulty in dealing with complex personal economic affairs. 7. Unable to remember the agreement with others. 8. Problems in daily memory and thinking ability. 9. Volatile mood. 10. Changes in living habits or improper behavior. 11. Personality change. 12. Language problems. The first latitude included items C1, C2, C4, C5, C6, C7, C8, and C12, whereas the second latitude included items C3, C9, C10, and C11 |
The Chinese Version of the Relevant Outcome Scale for Alzheimer’s Disease (CROSA) | Zhang et al. [41] (2021) | China | 336 | 2 factors/3 domains/16 items | Factor 1 comprised three domains – cognition, communication, and function – and two items measuring PWAD QoL (item 15) and caregiver burden (item 16). The domain of cognition measured the remote (item 1) and recent (item 2) memory and complex ADLs (item 3) |
The domain of communication measured language expression (item 4) and communication (item 5) abilities and social competence (item 6). The domain of function measured basic ADLs (item 12), attention (item 13), and independence (item 14). Factor 2 (domain of behavior) measured levels of common BPSD, including irritability/aggression (item 7), restless (item 8), delusion (item 9), anxiety (item 10), and agitation (item 11) | |||||
Cornell Scale for Depression in Dementia (CSDD) | Ru-Jing et al. [42] (2008) | China | 60 | 5 domains/19 items | 1. Mood-related signs 2. Behavioral disturbances. 3. Physical signs. 4. Cyclic functions. 5. Ideational disturbances |
The Columbia University Scale for Psychopathology in Alzheimer's disease (the CUSPAD) | Devanand et al. [43] (1992) | USA | 91 | 5 domains/27 items | 1. Delusions. 2. General question (paranoid delusions, delusions of abandonment, somatic delusions, misidentification, other delusions). 3. Hallucinations (illusions). 4. Behavioral disturbances. 5. Depression |
The Direct Assessment of Functional Status-Revised (DAFS-R) | Fernanda et al. [44] (2010) | Brazil | 89 | 7 different domains | 1. Time orientation. 2. Communication skills. 3. Dealing with finances. 4. Shopping skills. 5. Grooming skills. 6. Eating skills. 7. Driving skills |
Dementia Knowledge Assessment Scale of Chinese vision (DKAS-C) | Zhao et al. [45] (2022) | China | 290 | 4 domains/25 items | 1. Causes and characteristics (7 items). 2. Communication and behavior (6 items). 3. Care considerations (6 items). 4. Risk and health promotion (6 items) |
Dementia Screening Questionnaire for Individuals with Intellectual Disabilities (DSDS) | Deb et al. [46] (2007) | UK | 193 | 9 domains/54 items | 1. Memory. 2. Confusion. 3. Loss of skills. 4. Social withdrawal. 5. Behavioral changes. 6. Psychological symptoms. 7. Physical symptoms. 8. Sleep disturbance. 9. Speech abnormalities |
The Global Neuropsychological Assessment (GNA) | Olson et al. [47] (2022) | USA | 105 | 7 subtests | 1. Story memory. 2. Digit span. 3. Perceptual comparison. 4. Verbal fluency [VF]. 5. Category switching. 6. Spatial span. 7. The 4-item Patient Health Questionnaire or PHQ-4 |
HoNOS 65+ | Spear et al. [48] (2002) | Australia | 42 | 12 scales | 1. Behavior. 2. Suicide. 3. Problem drinking. 4. Cognition. 5. Physical illness. 6. Hallucinations. 7. Depressed mood. 8. Other symptoms. 9. Relationship. 10. Activities of daily living. 11. Living conditions. 12. Activities |
The Persian version of the FAST (the Functional Assessment Staging of dementia of the Alzheimer’s type) (I-FAST) | Noroozian et al. [49] (2022) | Iran | 219 | 16 stage | Stages 1, 2, 3, 4, 5, 6 (a), 6 (b), 6 (c), 6 (d), 6 (e), 7 (a), 7 (b), 7 (c), 7 (d), 7 (e), and 7 (f) |
The Korean version of the Fear of Alzheimer’s Disease Scale (K-FADS) | Moon et al. [50] (2014) | Korea | 108 | 6 domains/30 items | 1. Basic questions. 2. Loss of memory. 3. Physical symptoms of fear/anxiety. 4. Vegetative symptoms of fear/anxiety. 5. Catastrophic attitude toward AD. 6. Family‐related concerns/loss of autonomy |
LeSCoD for Lewy body Screening Scale in Cognitive Disorders | Olivieri et al. [51] (2021) | France | 128 | 9 domains/36 items | 1. Attention. 2. Verbal fluency tests. 3. Iteral. 4. Visuospatial ability. 5. Visual perception. 6. Cognition. 7. Vigilance. 8. Insidious hallucinations. 9. Minor psychotic symptoms |
The London Psychogeriatric Rating Scale (LPRS) | Reid et al. [52] (1991) | Canada | 264 | 4 subscales/36 items | 1. Mental disorganization/confusion (13 items). 2. Physical disability (9 items). 3. Socially irritating behavior (8 items). 4. Disengagement (6 items) |
The Memory Alteration Test (M@T) | Rami et al. [53] (2007) | Spain | 400 | 4-5 domains/40–50 items | 1. Encoding. 2. Temporal orientation. 3. Semantic memory. 4. Free recall. 5. Cued recall |
The Mattis Dementia Rating Scale (MDRS-2) | Boycheva et al. [54] (2018) | Spain | 125 | 5 subscales/36 items | 1. Attention (AT). 2. Initiation/perseveration (I/P). 3. Construction (CT). 4. Conceptualization (CP). 5. Memory (ME) |
Multifactorial Memory Questionnaire (MMQ) | Yunyun et al. [55] (2018) | China, Chongqing | 326 | 3 domains/57 items | 1. Memory satisfaction (18 items). 2. Memory ability (20 items). 3. Memory strategy (19 items) |
Mini-Mental State Examination (MMSE) | Marshal et al. [56] (1975) | USA | 206 | 2 sections/11 items | Two sections, the first of which requires vocal responses only and covers orientation, memory, and attention. The second part tests ability to name, follow verbal and written commands, write a sentence spontaneously, and copy a complex polygon similar to a Bender-Gestalt figure |
The Montreal Cognitive Assessment (MoCA) | Pinto et al. [57] (2019) | Brazil | 229 | 7 cognitive domains | 1. Visuospatial/executive functions. 2. Naming. 3. Verbal memory registration and learning. 4. Attention. 5. Abstraction. 6. 5-min delayed verbal memory. 7. Orientation |
Hu et al. [58] (2013) | China, Hangzhou | 72 | |||
Kaya et al. [59] (2014) | Turkey | 474 | |||
Janelidze [60] (2017) | Georgia | 86 | |||
The Ordinal Scales of Psychological Development was modified (M-OSPD) | Auer et al. [61] (1994) | USA | 70 | 5 subscales | 1. Visual pursuit and object permanence.2. Means-ends. 3. Causality. 4. Spatial relations. 5. Schemes |
The Neuropsychological Impairment Scale – Senior (NIS-S) | O’Donnell et al. [62] (2001) | USA | 200 | 30 items | 18 items inquire about cognitive deficits (“Are you forgetful?”), which are summed to obtain a Global Measure of Impairment (GMI); 5 items gauge defensiveness (DEF; “Do you always tell the truth?”); 7 items measure affective disturbance (AFF; “Do you feel sad or blue?”) |
Neuropsychiatric Symptom Assessment (NPSA) scale | Yan’e et al. [63] (2022) | China | 4,677 | 15 items | 1. Reduced or rigid expression. 2. Decreased attention to daily activities and self. 3. Aggressive or agitated behavior. 4. Emotional, high voice. 5. Delusion. 6. Hallucinations. 7. Feeling depressed, pessimistic, and helpless. 8. Repeated questions about upcoming events, afraid to be alone. 9. Sleep disorder. 10. Poor self-control and judgment. 11. Unable to recognize people, places, or even oneself. 12. Wandering or repetitive behavior. 13. Changes in eating behavior. 14. Change of interests and hobbies. 15. Personality changes |
The Neurobehavioral Rating Scale (NRS) | Sultzer et al. [64] (1995) | USA | 15 | 28 items | 1. Inattention/reduced alertness. 2. Somatic concern disorientation. 3. Anxiety expressive deficit. 4. Emotional withdrawal conceptual. 5. Disorganization disinhibition. 6. Guilt feelings, memory deficit. 7. Agitation inaccurate insight. 8. Depressed mood, hostility/uncooperativeness. 9. Decreased initiative/motivation. 10. Suspiciousness. 11. Fatigability. 12. Hallucinations, motor retardation. 13. Unusual thought content blunted. 14. Affect excitement poor planning. 15. Mood lability/tension. 16. Comprehension-deficit speech. 17. Articulation defect/fluent aphasia |
The “Protocole d’Examen Cognitif de la Personne Agée – Lausanne” (PECPA-L) | Gunten et al. [65] (2006) | Switzerland | 379 | 10 domains/103 items | 1. Orientation in time. 2. Orientation in space. 3. Attention and calculation. 4. Immediate recall. 5. Language. 6. Remote memory. 7. Judgment and abstraction. 8. Gnosis. 9. Praxis. 10. Delayed recall |
The French Prospective and Retrospective Memory Questionnaire (PRMQ) | Guerdoux-Ninot et al. [66] (2019) | France | 488 | 16 items/2 domains | 8 items addressing prospective memory errors in daily life and 8 items concerning retrospective memory daily-life difficulties |
Patient-Reported Outcomes in Cognitive Impairment (PROCOG) | Frank et al. [16] (2006) | USA | 186 | 55 items | N/A |
Italian revised memory and behavior problems checklist (It-RMBPC) | Ottoboni et al. [67] (2019) | Italy | 355 | 3 domains/24 questionnaire items | 1. Memory problems (items 1–7). 2. Disrupted behaviors (items 8, 9, 10, 11, 13, 15, 16, and 24). 3. Depression (items 12, 14, and 17–23) |
Subjective Cognitive Decline-Questionnaire (SCD-Q) | Qing et al. [68] (2021) | China, Wuhan | 410 | 2 subscales/3 domains/24 items | The SCD-Q included two questionnaires: the MyCog and the known-person assessment of the elderly (TheirCog). MyCog consists of three cognitive domains: 1. Memory function. 2. Language function. 3. Executive function, with a total of 24 entries. TheirCog, similar to MyCog, is primarily an informed assessment of subjective cognitive decline in older adults |
English-Chinese version Subjective cognitive decline questionnaire 9 (SCD-Q9) | Lixiao et al. [69] (2019) | China, Shunyi | 200 | 2 domains、9 items | Overall memory function and time comparison (4 items), daily activity ability (5 items) |
The Severe Cognitive Impairment Rating Scale (SCIRS) | Choe et al. [70] (2008) | Korea | 267 | 11 items/5 cognitive areas | 1. Memory. 2. Orientation. 3. Language. 4. Frontal lobe function. 5. Visuospatial function |
The Korean version of the SIB (SIB-K) – the Severe Impairment Battery (SIB) | Suh et al. [71] (2006) | Korea | 65 | 40 items/9 subscales | 1. Social interaction. 2. Memory. 3. Orientation. 4. Language. 5. Attention. 6. Praxis. 7. Visuospatial ability. 8. Construction. 9. Orienting to name |
Six spatial navigation-related tasks | Allison et al. [72] (2019) | USA | 91 | 6 tasks | 1. Modified cognitive mapping task. 2. Experimental task. 3. Relational binding tasks. 4. Shape-color binding task. 5. Shape-location binding task. 6. Visual perspective taking task |
Self-Test (ST) | Stanonik et al. [73] (2005) | USA | 83 | 4 dimensions, 4 items | 1. A clock-drawing test. 2. Short-term memory. 3. Verbal fluency. 4. Orientation |
Telephone cognitive screen (T-CogS) into Turkish version (T-CogS-TR) | Naharci et al. [74] (2020) | Turkey | 104 | 5 domains/16 items | 1. Orientation. 2. Registration. 3. Recall. 4. Attention. 5. Language |
The TE4D (Test for the Early Detection of Dementia from Depression) | Mahoney [75] (2005) | UK | 203 | 7 subscales/8 items | 1. Immediate recall. 2. Semantic memory. 3. Clock drawing. 4. Category fluency. 5. Orientation. 6. Following commands (ideomotor praxis) |
Instrument Analysis
We have listed the articles and instruments used in online supplementary Table 2 (for all online suppl. material, see https://doi.org/10.1159/000545907), including instruments that were only used in one study. For this review, we specifically considered the clinical application of the instrument and thus focused on the measurement tools that appeared in more than one study. We placed the review results (Measurement Characteristic Evaluation Form and Bias Risk Evaluation Form) in the supplementary table.
ADAS-Cog
The nine studies that evaluated the measurement properties of the ADAS-Cog are listed in Table 3. The ADAS-Cog was developed in 1984 and is a widely used neuropsychological screening tool for cognitive impairment in patients with AD [76]. According to Rosen et al., the scale was not designed as a diagnostic instrument but rather as a valuable tool for evaluating functional impairments and facilitating research on AD and other forms of dementia. Its applications encompass psychiatric medication studies, assessments of nursing environments, and longitudinal investigations, underscoring its multifaceted potential in the field [76]. The ADAS comprises 44 items. The maximum score of the ADAS-Cog is 70, and a higher score indicates greater cognitive impairment. The study by Weng et al. [27] did not examine the convergent validity of the ADAS-Cog. The other eight studies measured and evaluated the reliability and validity of the ADAS-Cog and considered the MMSE as the gold standard to determine the convergent validity of the ADAS-Cog. The Pearson and Spearman correlation coefficient showed a significant negative correlation between ADAS-Cog scores and MMSE scores (|r| > 0.6), which indicated a “very good” methodological quality rating. The structural validity of the ADAS-Cog was evaluated in only two studies [21, 27], which yielded a sufficient (+) rating for measurement properties and an inadequate rating for methodological quality. Internal consistency (as measured by Cronbach’s alpha) of the other seven articles was rated as sufficient, and methodological quality was rated as “very good.” The studies by Ben et al. [21] and Mavioglu et al. [15] achieved a sufficient (+) rating for test-retest validity and a “very good” rating for methodological quality. The methodological quality of the study by Cano et al. [25] was rated as “doubtful” because the timing of the retest was not stated. Although Li et al. [28] used the Spier correlation coefficient to measure retest effectiveness, no relevant data were found. Therefore, the methodological quality of the study was rated as “inadequate.” Four articles measured ADAS-Cog reliability using the intragroup correlation coefficient (ICC). The results were rated as sufficient (+), and methodological quality was rated as “very good.” The final recommended level for the ADAS-Cog was A.
Evidence of the reliability and validity of the Alzheimer's Disease Assessment Scale (ADAS-Cog)
Author(s) . | Country (language) . | n . | Validity . | Construct validity . | Internal consistency . | n . | Test-retest reliability . | Reliability . | Recommendation . | |||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | |||||
Xia et al. [28] (2009) | China | 213 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.77; (+) AUC = 0.949(+) | N/A | N/A | N/A | N/A | 213 | Inadequate | Spearman p < 0.05(?) | N/A | N/A | B |
Ben et al. [21] (2017) | Tunisia | 182 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.86; p < 0.001(+) | Inadequate | PCA three factors explain 72% of the variance(+) | Very good | Cronbach’s alpha = 0.82(+) | 182 | Very good | r = 0.97(+) | N/A | N/A | A |
Wei et al. [22] (2022) | China, Taiwan | 170 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.673; p < 0.001(−) | N/A | N/A | Very good | Cronbach’s alpha = 0.727(+) | N/A | N/A | N/A | N/A | N/A | A |
Jiang et al. [23] (2020) | China | 1276 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.631; p < 0.001(−) | N/A | N/A | Very good | Cronbach’s alpha = 0.766(+) | N/A | N/A | N/A | N/A | N/A | A |
Cano et al. [25] (2010) | UK | 1421 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.74; p < 0.001(+) | N/A | N/A | Very good | Cronbach’s alpha = 0.84(+) | 1421 | Doubtful | Cronbach’s alpha = 0.94(+) | Very good | ICC = 0.75 to 0.83(+) | A |
Liu et al. [24] (2001) | China | 329 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.85; p < 0.001(+) | N/A | N/A | Very good | Cronbach’s alpha = 0.87(+) | N/A | N/A | N/A | Very good | ICC = 0.96 to 0.99(+) | A |
Mavioglu et al. [26] (2006) | Turkey | 56 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.739(+) | N/A | N/A | Very good | Cronbach’s alpha = 0.80(+) | 56 | Very good | ICC = 0.91(+) | Very good | ICC = 0.98 to 0.99(+) | A |
Ying-hong et al. [27] (2014) | China, Guangzhou | N/A | N/A | N/A | Inadequate | Cronbach’s alpha >0.70(+) | Very good | Cronbach’s alpha = 0.966(+) | N/A | N/A | N/A | N/A | N/A | B |
D. Inzitaria [20] (1999) | Italy | 217 | Very good | Spearman correlation coefficient (MMSE) r = 0.73 p < 0.05(+) | N/A | N/A | N/A | N/A | N/A | N/A | N/A | Very good | ICC = 0.91 to 0.99(+) | A |
Author(s) . | Country (language) . | n . | Validity . | Construct validity . | Internal consistency . | n . | Test-retest reliability . | Reliability . | Recommendation . | |||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | |||||
Xia et al. [28] (2009) | China | 213 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.77; (+) AUC = 0.949(+) | N/A | N/A | N/A | N/A | 213 | Inadequate | Spearman p < 0.05(?) | N/A | N/A | B |
Ben et al. [21] (2017) | Tunisia | 182 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.86; p < 0.001(+) | Inadequate | PCA three factors explain 72% of the variance(+) | Very good | Cronbach’s alpha = 0.82(+) | 182 | Very good | r = 0.97(+) | N/A | N/A | A |
Wei et al. [22] (2022) | China, Taiwan | 170 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.673; p < 0.001(−) | N/A | N/A | Very good | Cronbach’s alpha = 0.727(+) | N/A | N/A | N/A | N/A | N/A | A |
Jiang et al. [23] (2020) | China | 1276 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.631; p < 0.001(−) | N/A | N/A | Very good | Cronbach’s alpha = 0.766(+) | N/A | N/A | N/A | N/A | N/A | A |
Cano et al. [25] (2010) | UK | 1421 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.74; p < 0.001(+) | N/A | N/A | Very good | Cronbach’s alpha = 0.84(+) | 1421 | Doubtful | Cronbach’s alpha = 0.94(+) | Very good | ICC = 0.75 to 0.83(+) | A |
Liu et al. [24] (2001) | China | 329 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.85; p < 0.001(+) | N/A | N/A | Very good | Cronbach’s alpha = 0.87(+) | N/A | N/A | N/A | Very good | ICC = 0.96 to 0.99(+) | A |
Mavioglu et al. [26] (2006) | Turkey | 56 | Very good | Pearson’s correlation coefficient (MMSE) r = −0.739(+) | N/A | N/A | Very good | Cronbach’s alpha = 0.80(+) | 56 | Very good | ICC = 0.91(+) | Very good | ICC = 0.98 to 0.99(+) | A |
Ying-hong et al. [27] (2014) | China, Guangzhou | N/A | N/A | N/A | Inadequate | Cronbach’s alpha >0.70(+) | Very good | Cronbach’s alpha = 0.966(+) | N/A | N/A | N/A | N/A | N/A | B |
D. Inzitaria [20] (1999) | Italy | 217 | Very good | Spearman correlation coefficient (MMSE) r = 0.73 p < 0.05(+) | N/A | N/A | N/A | N/A | N/A | N/A | N/A | Very good | ICC = 0.91 to 0.99(+) | A |
Montreal Cognitive Assessment
The four articles that measured the characteristics of the MoCA are described in Table 4. The MoCA, developed by Nasreddine et al. [77] in 2005, is a short and useful screening tool with high sensitivity and specificity for detecting cognitive dysfunction in people who score normally on the MMSE. Because the MoCA can effectively detect MCI, it is suited to clinical use when time is limited (<10 min) [77]. The MoCA is a single-page assessment that evaluates seven cognitive domains: visuospatial/executive functions, naming, verbal memory registration and learning, attention, abstraction, 5-min delayed verbal memory, and orientation. The scores of the MoCA range from 0 to 30, and a higher score indicates higher cognitive function. In all four papers, the convergent validity of the MoCA was measured by the Pearson correlation coefficient or area under the curve AUC, and Cronbach’s alpha coefficient was used to measure the internal consistency of the MoCA. The measurement properties were all sufficient (+), and the method quality rating was “very good.” Pinto et al. [57] and Janelidze et al. [60] measured retest reliability using ICC and Cronbach’s alpha, respectively, and both achieved a sufficient (+) rating for measurement property, with an explicit reference to retest time, and a “very good” rating for methodological quality. Hu and colleagues achieved a methodological quality rating of “doubtful” because the timing of the retest was not explicitly stated. The final recommended level for the MoCA was A.
Evidence of the reliability and validity of the Montreal Cognitive Assessment (MoCA)
Author(s) . | n . | Validity . | Internal consistency . | n . | Test-retest reliability . | Reliability . | Recommendation . | ||||
---|---|---|---|---|---|---|---|---|---|---|---|
meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | ||||
Pinto et al. [57] (2019) | 229 | Very good | Pearson’s correlation coefficient (MMSE) r = 0.75, p < 0.001 AUC = 0.94–0.95(+) | Very good | Cronbach’s alpha = 0.77 (+) | 229 | Very good | ICC = 0.91 (95% CI 0.74–0.96; p < 0.001)(+) | Very good | ICC = 0.96 (95% CI 0.94–0.98; p < 0.001)(+) | A |
Hu et al. [58] (2013) | 72 | Very good | AUC = 0.928 (95% CI: 0.921; 0.936)-0.962 (95% CI: 0.955; 0.970)(+) | Very good | Cronbach’s alpha = 0.967 (+) | 72 | Doubtful | ICC = 0.862 (95%CI: 0.789; 0.918) (p < 0.001)(+) | N/A | N/A | B |
Kaya et al. [59] (2014) | 474 | Very good | Pearson’s correlation coefficient (MMSE) r = 0.598–0.807, p < 0.001 AUC: 0.641–0.943, 95% CI: 0.641–0.885, p < 0.001(+) | Very good | Cronbach’s alpha = 0.81 (+) | N/A | N/A | N/A | N/A | N/A | A |
Janelidze et al. [60] (2017) | 86 | Very good | AUC = 0.88 and 0.95(+) | Very good | Cronbach’s alpha = 0.86–0.92 (+) | 20 | Very good | Cronbach’s alpha = 0.86–0.92 (+) | N/A | N/A | A |
Author(s) . | n . | Validity . | Internal consistency . | n . | Test-retest reliability . | Reliability . | Recommendation . | ||||
---|---|---|---|---|---|---|---|---|---|---|---|
meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | ||||
Pinto et al. [57] (2019) | 229 | Very good | Pearson’s correlation coefficient (MMSE) r = 0.75, p < 0.001 AUC = 0.94–0.95(+) | Very good | Cronbach’s alpha = 0.77 (+) | 229 | Very good | ICC = 0.91 (95% CI 0.74–0.96; p < 0.001)(+) | Very good | ICC = 0.96 (95% CI 0.94–0.98; p < 0.001)(+) | A |
Hu et al. [58] (2013) | 72 | Very good | AUC = 0.928 (95% CI: 0.921; 0.936)-0.962 (95% CI: 0.955; 0.970)(+) | Very good | Cronbach’s alpha = 0.967 (+) | 72 | Doubtful | ICC = 0.862 (95%CI: 0.789; 0.918) (p < 0.001)(+) | N/A | N/A | B |
Kaya et al. [59] (2014) | 474 | Very good | Pearson’s correlation coefficient (MMSE) r = 0.598–0.807, p < 0.001 AUC: 0.641–0.943, 95% CI: 0.641–0.885, p < 0.001(+) | Very good | Cronbach’s alpha = 0.81 (+) | N/A | N/A | N/A | N/A | N/A | A |
Janelidze et al. [60] (2017) | 86 | Very good | AUC = 0.88 and 0.95(+) | Very good | Cronbach’s alpha = 0.86–0.92 (+) | 20 | Very good | Cronbach’s alpha = 0.86–0.92 (+) | N/A | N/A | A |
CI, confidence interval.
Baylor Profound Mental Status Examination
The three articles that assessed the measurement characteristics of the BPMSE are described in Table 5. Our group (R.S.D. and E.P.F.) developed the BPMSE in 1988, which consists of a 25-question cognition subscale (BPMSE-cog), a 10-item dichotomous behavior subscale (BPMSE-behav), and 2 descriptive questions concerning communication and social interactions [35]. The cognition subscale is divided into 4 subareas: orientation, language, attention, and motor functioning. The test can be administered in 5 min and uses common items (e.g., watches, pencils, and keys) for the assessment, which allows administration of the test in almost any setting without the need for designated testing rooms or specialized auditor training. All three papers examined the test-retest reliability (using ICC or correlation coefficients) and convergent validity (using Pearson correlation coefficients) of the BPMSE. For both test-retest reliability and convergent validity, the measurement feature rating was rated as sufficient (+), and methodological quality was rated as “very good.” It is worth noting that although Kørner used Wilcoxon’s test to measure the reliability of the BPMSE, the results were not reported and thus the measurement properties were rated indeterminate (?), and the methodological quality rating was “doubtful.” In addition, Salmeron et al. used factor analysis to measure the structural validity of the BPMSE, and although the measurement properties were rated as sufficient (+), the methodological quality was rated as “inadequate.” The BPMSE received an A level Table 6.
Evidence of the reliability and validity of Baylor Profound Mental Status Examination (BPMSE)
Author(s) . | n . | Validity . | Construct validity . | Internal consistency . | n . | Test-retest reliability . | Reliability . | Recommendation . | |||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | ||||
Doody et al. [35] (1999) | 208 | Very good | Pearson’s correlation coefficient (MMSE) r = 0.81 p < 0.001(+) | N/A | N/A | Very good | Cronbach’s alpha = 0.94(+) | 208 | Very good | r = 0.95 (p < 0.001)(+) | N/A | N/A | A |
Alex Kørner et al. [36] (2012) | 55 | Very good | Pearson’s correlation coefficient (MMSE) r = 0.74 p < 0.001(+) | N/A | N/A | N/A | N/A | 49 | Very good | r = 0.78 to 0.93(+) | Doubtful | Wilcoxon's test (p = 0.33)(?) | B |
Salmerón et al. [37] (2016) | 100 | Very good | Pearson’s correlation coefficient (MMSE) r = 0.92 p < 0.001(+) | Inadequate | 6 factors explain 77.2% of the variance (p < 0.001)(+) | Very good | Cronbach’s alpha = 0.84, p < 0.001(+) | 100 | Very good | ICC = 0.97–0.99 (p < 0.001)(+) | N/A | N/A | A |
Author(s) . | n . | Validity . | Construct validity . | Internal consistency . | n . | Test-retest reliability . | Reliability . | Recommendation . | |||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | ||||
Doody et al. [35] (1999) | 208 | Very good | Pearson’s correlation coefficient (MMSE) r = 0.81 p < 0.001(+) | N/A | N/A | Very good | Cronbach’s alpha = 0.94(+) | 208 | Very good | r = 0.95 (p < 0.001)(+) | N/A | N/A | A |
Alex Kørner et al. [36] (2012) | 55 | Very good | Pearson’s correlation coefficient (MMSE) r = 0.74 p < 0.001(+) | N/A | N/A | N/A | N/A | 49 | Very good | r = 0.78 to 0.93(+) | Doubtful | Wilcoxon's test (p = 0.33)(?) | B |
Salmerón et al. [37] (2016) | 100 | Very good | Pearson’s correlation coefficient (MMSE) r = 0.92 p < 0.001(+) | Inadequate | 6 factors explain 77.2% of the variance (p < 0.001)(+) | Very good | Cronbach’s alpha = 0.84, p < 0.001(+) | 100 | Very good | ICC = 0.97–0.99 (p < 0.001)(+) | N/A | N/A | A |
Evidence of the reliability and validity of Clinical Dementia Rating (CDR)
Author(s) . | n . | Validity . | Construct validity . | Internal consistency . | Recommendation . | |||
---|---|---|---|---|---|---|---|---|
meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | |||
Nguyen et al. [38] (2020) | 153 | Very good | AUC = 0.968 [95% CI 0.924–1,000](+) | N/A | N/A | Very good | Cronbach’s alpha = 0.93, p < 0.001(+) | A |
Coley et al. [39] (2011) | 667 | Very good | AUC = 0.77(+) | Inadequate | 6 factors explained 47% of the variance (p < 0.001)(+) | Very good | Cronbach’s alpha = 0.88, p < 0.001(+) | B |
Author(s) . | n . | Validity . | Construct validity . | Internal consistency . | Recommendation . | |||
---|---|---|---|---|---|---|---|---|
meth qual . | result (rating) . | meth qual . | result (rating) . | meth qual . | result (rating) . | |||
Nguyen et al. [38] (2020) | 153 | Very good | AUC = 0.968 [95% CI 0.924–1,000](+) | N/A | N/A | Very good | Cronbach’s alpha = 0.93, p < 0.001(+) | A |
Coley et al. [39] (2011) | 667 | Very good | AUC = 0.77(+) | Inadequate | 6 factors explained 47% of the variance (p < 0.001)(+) | Very good | Cronbach’s alpha = 0.88, p < 0.001(+) | B |
CDR Scale
The CDR scale was evaluated in two studies. The CDR scale was initially developed as a staging instrument for AD severity and encompasses six categories: memory, orientation, judgment and problem solving, community affairs, home and hobbies, and personal care. The global rating for the CDR scale, which is calculated using a complex algorithm, ranges from 0 (no dementia) to 3 (severe dementia) [78]. The CDR scale focuses solely on the evaluation of cognitive function and meticulously excludes the confounding effects of noncognitive factors. Therefore, the CDR scale is invaluable to a diverse array of multidisciplinary specialists for conducting comprehensive and accurate global assessments of cognitive function in older adults across a wide spectrum of mild to severe dementia [79]. Convergent validity was measured by AUC, and internal consistency was measured by Cronbach’s alpha. For both of these measures, the measurement properties were rated as sufficient (+), and methodological quality was rated as “very good.” Coley et al. [39] used factor analysis to measure the structural validity of the CDR scale. Although the measurement property was rated as sufficient (+), the amount of evidence was insufficient and the methodological quality was “inadequate.” The final recommended level for the CDR scale was A.
Discussion
This review aimed to examine the quality of OMIs used to assess cognitive impairment in patients with AD. The ADAS-Cog, MoCA, BPMSE, CDR, and other 28 scales were rigorously evaluated according to COSMIN guidelines, and all instruments demonstrated satisfactory levels of validity (i.e., structural validity) and reliability (i.e., internal consistency and test-retest reliability). They were all assigned the highest recommended grade of A, which highlighted their credibility in assessing the severity of cognitive impairment. The COSMIN guidelines were applied to ensure the rationality of the scales rather than to assess measurement bias. However, the scales do use various formats to assess the same cognitive dimension. The methodological quality of the four scales was under control, and the recommended levels were comparable across all four scales. Therefore, the specific setting and contents of the scale could be considered when conducting AD-related clinical research to ensure the most appropriate assessment scales are used.
When evaluating the same cognitive dimension, the use of different items from different scales can significantly impact performance in terms of sensitivity and coverage. For example, for immediate memory testing, the MMSE [56] and MoCA [57] involve the oral presentation of 3 or 5 words, whereas the ADAS-Cog [21] evaluates delayed memory by using word recognition. Therefore, the MMSE and MoCA are more suited to illiterate patients. Furthermore, the cultural background of subjects has a significant impact on ADAS-Cog performance. Future research could prioritize the development of culturally adapted OMIs and validate these instruments across diverse populations, given the documented impact of cultural factors on measurement validity and applicability.
In terms of usage scenarios and conditions, the ADAS-Cog boasts versatility; it can be applied to a range of scenarios, such as AD and other dementia studies, psychotropic medication trials, care setting evaluations, and longitudinal research [76]. The MoCA, renowned for its brevity, ease of use, and heightened sensitivity toward MCI patients, has widespread clinical application [77]. In contrast, the BPMSE is distinct in that it requires no test kit, specialized reviewer training, or informant dependency, which simplifies the administration process. Thus, the BPMSE is an effective tool for evaluating cognitive function in patients with severe AD in both research and clinical settings [35, 37, 36].
In terms of research subject, the efficacy of the ADAS-Cog for detecting MCI is relatively low. It is more suitable for evaluating patients with moderate to severe AD [25, 80]. The MoCA is more appropriate for patients with MCI during the early stage of AD [57‒60, 81]. The CDR scale encompasses a broad spectrum of dementia severity and integrates multiple patient characteristics, which makes it a practical tool for neurologists, psychiatrists, and other specialists alike. However, to accurately evaluate the cognitive function of patients, the CDR scale requires that users have a certain knowledge base (e.g., neurologists and psychiatrists) [38, 39, 79]. The content described above is also in line with current diagnostic and treatment guidelines and previous literature [82‒84]. It is evident that each existing scale has its relative limitations and shortcomings. However, with technological advancements, an emerging approach – digital cognitive assessment platforms and adaptive testing methodologies – has demonstrated considerable potential. This technology can systematically identify and categorize issues within existing scales while dynamically integrating assessment content based on individual patient characteristics (e.g., cognitive ability, cultural background, and clinical needs). As a result, it generates highly personalized, tailored scales. The field is rapidly evolving through expanded validation initiatives incorporating larger-scale cohorts and longitudinal investigation designs [85‒87]. We suggest that our future research could focus on rigorous longitudinal validation of these technologies to establish their psychometric robustness in tracking preclinical AD progression trajectories and predicting clinical conversion patterns.
Additionally, Hu et al. [58] examined the measurement characteristics of the MoCA in eastern Chinese communities and discovered that cultural span affects the response rate of MoCA testing and proposed suggestions for improvement. Kaya et al. [59] and Janelidze et al. [60] studied the measurement properties of the Turkish and Georgian versions of the MoCA, respectively, and reported that both versions had good reliability and validity. In addition, owing to the change of times and differences of region, the use and elimination of paper letters vary among regions with diverse historical backgrounds. In the current version of the ADAS-Cog, there is an operational issue regarding how patients can send paper letters to post office. Therefore, the universal applicability and compatibility must be carefully considered when translating or applying these evaluation tools. We suggest that clinical researchers studying AD could consider time, linguistic, and cultural variations when selecting instruments because scale translation, adaptability, and cultural sensitivity can influence the accuracy and effectiveness of the scales.
To test the convergent validity of instruments, 27 articles calculated Pearson correlation coefficients between the study instrument score and the MMSE score. Although we only screened one study examining the relevant measurement properties of the MMSE [88], it is well established that the MMSE has high clinical significance in the assessment of cognitive function in AD patients. The MMSE [56] has become the most commonly used screening tool in clinical and community settings that provides an overall measure of cognitive impairment. The maximum score for the MMSE is 30, with a lower score indicating greater cognitive impairment [56]. However, there has been limited systematic evaluation of the diagnostic accuracy and measurement properties of the MMSE [89]. Harrell et al. argued that the MMSE score range is too limited to distinguish between moderate and severe AD and subsequently developed a new instrument based on the MMSE: the Severe Mini-mental State Examination (SMMSE). The SMMSE is simple, easy to administer, and requires no specialized equipment. Moreover, it has a high tolerance for patients with severe cognitive impairment.
Despite retrieving a large number of studies on measurement attributes, the methodological quality and structural validity of almost all studies were rated “inadequate” and “doubtful,” respectively. Although few studies measured structural validity according to the COSMIN guidelines, most provided evidence that convergent validity could have achieved a sufficient (+) rating. However, the association criteria for each study could not be controlled, and the results of the instrument measurements may have been influenced by the subjective decisions of the researchers. The included articles provided little evidence of content validity, although there was also no compelling evidence to suggest inadequate content validity. Therefore, we did not include content validity as a measurement property in our review. The COSMIN rating method ensures the feasibility of validity rating, even when information on the content effectiveness is insufficient. Therefore, conclusions regarding content validity are based largely on limited evidence, and thus, our results, as guided by the COSMIN, should be interpreted with caution [85‒87].
Conclusion
We conducted a systematic evaluation of assessment tools for evaluating the cognitive function of AD patients by rating the measurement characteristics of the instruments and analyzing the methodological quality of each measurement method. We found four commonly used scales with high therapeutic efficacy: the ADAS Cog, MoCA, BPMSE, CDR, and MMSE scale. However, for clinical research on AD, the research objectives, usage scenarios, scale dimensions, and subtle differences in scale items must be carefully considered. Furthermore, the impact of time, region, and culture on the reliability and validity of the scales could be examined.
Acknowledgments
The author extended heartfelt gratitude to Professor Lyv, the corresponding author, for her invaluable guidance and profound insights shared by all mentors. Additionally, the author would like to thank the literature review and screening team for their significant contributions. We thank Sarina Iwabuchi, PhD, from Liwen Bianji (Edanz) (https://www.liwenbianji.cn) for editing the English language of a draft of this manuscript.
Statement of Ethics
The research design of this review did not involve human participants. Thus, no ethical review was required.
Conflict of Interest Statement
The authors have no conflicts of interest to declare.
Funding Sources
This work was supported by the 15th batch of free exploration projects of the Institute of Clinical Basic Medicine of Chinese Academy of Traditional Chinese Medicine: performance study of outcome measurement tool for patient-reported outcome measures of Alzheimer’s disease based on clinical measurement.
Author Contributions
Zhe-Zuo Zhang: literature analysis, and writing-original draft; Xiao-Ying Lyu: conceptualizaton, writing-review and editing, supervision, and project administration; Xiang-Wei Dai, Jian-Ni Cong, and Fu-Xia Yang: providing revision suggestions and editing from the perspetives of clinical medicine and scales methodilogy.
Data Availability Statement
All data generated or analyzed during this study are included in this article and its supplementary material files. Further inquiries can be directed to the corresponding author.