Introduction: Current cognitive assessments suffer from floor and ceiling effects, practice effects under repeated administration, and poor psychometric performance in mild cases. This study explores digital speech analysis as an alternative tool for detecting cognitive impairment, focusing on the digital speech biomarkers associated with cognitive impairment and its severity.

Methods: We recruited older adults with varying cognitive health. Their speech, recorded with a wearable microphone while they read a standard passage aloud, was processed to derive digital biomarkers such as timing, pitch, and loudness. Group differences were quantified with Cohen’s d effect sizes, and each parameter was correlated with the Montreal Cognitive Assessment (MoCA) score. A stepwise approach using a Random Forest model was implemented to distinguish cognitive states from speech data and to predict MoCA scores from highly correlated features.

Results: The study comprised 59 participants: 36 with cognitive impairment and 23 cognitively intact controls. Among all assessed parameters, similarity, as determined by Dynamic Time Warping (DTW), showed the strongest positive correlation with MoCA scores (rho = 0.529, p < 0.001), while a timing parameter, the ratio of extra words, showed the strongest negative correlation (rho = −0.441, p < 0.001). Optimal discriminative performance was achieved with a combination of four speech parameters: total pause time, speech-to-pause ratio, similarity via DTW, and intelligibility via DTW. Precision and balanced accuracy were 88.1 ± 1.2% and 76.3 ± 1.3%, respectively.

Discussion: Our findings suggest that speech recorded during reading aloud can differentiate cognitively impaired individuals from cognitively intact, age-matched older adults. Specifically, timing- and similarity-based speech parameters provide an effective gauge of the severity of cognitive impairment. These results support speech analysis as a viable digital biomarker for early detection and monitoring of cognitive impairment, offering novel approaches in dementia care.
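For readers unfamiliar with the quantities named in the abstract, the following is a minimal sketch of a classic DTW distance and of the two reported classification metrics. It is not the authors' pipeline: the abstract does not state which signals are compared with DTW or how a distance is turned into a similarity score, so the 1-D sequences, the absolute-difference cost, the function names, and the confusion-matrix counts below are all illustrative assumptions.

```python
from math import inf


def dtw_distance(a, b):
    """Classic dynamic time warping distance between two 1-D sequences.

    Assumption: absolute difference as the local cost, with the three
    standard moves (advance both, advance a only, advance b only).
    """
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j - 1],  # advance both sequences
                cost[i - 1][j],      # advance a only
                cost[i][j - 1],      # advance b only
            )
    return cost[n][m]


def precision(tp, fp):
    """Fraction of cases predicted as impaired that are truly impaired."""
    return tp / (tp + fp)


def balanced_accuracy(tp, fn, tn, fp):
    """Mean of sensitivity and specificity, useful when groups differ in size."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


if __name__ == "__main__":
    # Toy contours (hypothetical): the second is a slowed-down copy of the first.
    reference = [1.0, 2.0, 3.0, 2.0, 1.0]
    stretched = [1.0, 1.0, 2.0, 3.0, 3.0, 2.0, 1.0]
    print("DTW distance:", dtw_distance(reference, stretched))

    # Toy confusion-matrix counts (hypothetical, not the study's results).
    print("Precision:", precision(tp=8, fp=4))
    print("Balanced accuracy:", balanced_accuracy(tp=8, fn=2, tn=6, fp=4))
```

Because DTW lets one sequence stretch in time relative to the other, a reading that follows the same contour at a slower pace stays close to the reference, whereas a point-by-point comparison would penalize the slower pace. Balanced accuracy averages the per-class recall, so it is less flattering than raw accuracy when the groups are unequal in size, as with the 36 impaired and 23 control participants here.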
