Introduction: Alzheimer’s disease (AD) is the most prevalent type of dementia and can cause abnormal cognitive function and progressive loss of essential life skills. Early screening is therefore necessary for the prevention of and intervention in AD. Speech dysfunction is an early symptom in patients with AD. Recent studies have demonstrated the promise of automated acoustic assessment using acoustic or linguistic features extracted from speech. However, most previous studies have relied on manual transcription to extract linguistic features, which reduces the efficiency of automated assessment. The present study therefore investigates the effectiveness of automatic speech recognition (ASR) in building an end-to-end automated speech analysis model for AD detection.

Methods: We implemented three publicly available ASR engines and compared their classification performance on the ADReSS-IS2020 dataset. The SHapley Additive exPlanations (SHAP) algorithm was then used to identify the critical features that contributed most to model performance.

Results: The three automatic transcription tools produced transcripts with mean word error rates of 32%, 43%, and 40%, respectively. These automated transcripts achieved similar or even better results than manual transcripts in detecting dementia, with classification accuracies of 89.58%, 83.33%, and 81.25%, respectively.

Conclusion: Our best model, which uses ensemble learning, is comparable to state-of-the-art methods based on manual transcription, suggesting that an end-to-end medical assistance system for AD detection built on ASR engines may be feasible. Moreover, the critical linguistic features might provide insight for further studies on the mechanism of AD.
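For readers unfamiliar with the word error rate (WER) reported in the Results, the following is a minimal sketch of the standard definition: the word-level edit distance (substitutions, deletions, and insertions) between a manual reference transcript and an ASR transcript, divided by the number of words in the reference. The function name `word_error_rate`, the lowercase-and-split tokenization, and the example sentences are assumptions made for illustration; the source does not state how the authors normalized text or averaged WER across recordings.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Tokenization here (lowercase, split on whitespace) is an assumption;
    the study's own text normalization is not described in the abstract.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical sentences, invented for illustration only.
manual = "the boy is taking cookies from the jar"
automatic = "the boy taking cookie from the jar"
print(word_error_rate(manual, automatic))  # one deletion + one substitution over 8 words
```

A mean WER for an ASR engine would then be obtained by aggregating this value over all recordings, for example as a simple per-transcript average, which is itself an assumption about the aggregation used.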
