Introduction: Studying what older adults say can provide important insights into cognitive, affective, and social aspects of aging. Available language analysis tools generally require audio-recorded speech to be transcribed into verbatim text, a task that has historically been performed by humans. However, recent advances in AI-based language processing open up the possibility of replacing this time- and resource-intensive task with fully automatic speech to text. Methods: This study evaluates the accuracy of two common automatic speech-to-text tools – OpenAI’s Whisper and otter.ai – relative to human-corrected transcripts. Based on two speech tasks completed by 238 older adults, we used the Linguistic Inquiry and Word Count (LIWC) to compare language features of text generated by each transcription method. The study further assessed the degree to which manual tagging of filler words (e.g., “like,” “well”) common in spoken language impacts the validity of the analysis. Results: The AI-based LIWC features evidenced very high convergence with the LIWC features derived from the human-corrected transcripts (average r = 0.98). Further, the manual tagging of filler words did not impact the validity for all LIWC features except the categories filler words and netspeak. Conclusion: These findings support that Whisper and otter.ai are valuable tools for language analysis in aging research and provide further evidence that automatic speech to text with state-of-the art AI tools is ready for psychological language research.

1.
Rohr
MK
,
Wieck
C
,
Kunzmann
U
.
Age differences in positive feelings and their expression
.
Psychol Aging
.
2017
;
32
(
7
):
608
20
.
2.
Luo
M
,
Schneider
G
,
Martin
M
,
Demiray
B
.
Cognitive aging effects on language use in real-life contexts: a naturalistic observation study
.
The 41st annual meeting of the cognitive science society; 2019 July 24–27; Montreal, QC
.
CogSci
;
2019
.
3.
Boyd
RL
,
Ashokkumar
A
,
Seraj
S
,
Pennebaker
JW
.
The development and psychometric properties of LIWC-22
.
The University of Texas at Austin
;
2022
.
4.
Pentland
SJ
,
Fuller
CM
,
Spitzley
LA
,
Twitchell
DP
.
Does accuracy matter? Methodological considerations when using automated speech-to-text for social science research
.
Int J Soc Res Methodol
.
2022
;
26
(
6
):
661
77
.
5.
Pfeifer
VA
,
Chilton
TD
,
Grilli
MD
,
Mehl
MR
.
How ready is speech-to-text for psychological language research? Evaluating the validity of AI-generated English transcripts for analyzing free-spoken responses in younger and older adults
.
Behav Res Methods
.
2024
;
56
(
7
):
7621
31
.
6.
Radford
A
,
Kim
JW
,
Hallacy
C
,
Ramesh
A
,
Goh
G
,
Agarwal
S
, et al
.
Learning transferable visual models from natural language supervision
.
Int Conf Machine Learn
.
2021
:
8748
63
.
7.
Loakes
D
.
Automatic speech recognition and the transcription of indistinct forensic audio: how do the new generation of systems fare
.
Front Commun
.
2024
;
9
.
8.
Zhao
R
,
Choi
ASG
,
Koenecke
A
,
Rameau
A
.
Quantification of automatic speech recognition system performance on d/deaf and hard of hearing speech
.
Laryngoscope
.
2025
;
135
(
1
):
191
7
.
9.
Molina
JTG
,
Gaspar
PA
,
Figueroa-Barra
A
.
Automatic speech recognition in psychiatric interviews: a rocket to diagnostic support in psychosis
.
Revista Colombiana de Psiquiatría
.
2024
.
10.
Laserna
C
,
Seih
Y
,
Pennebaker
J
.
Um… who like says you know: filler word use as a function of age, gender, and personality
.
J Lang Soc Psychol
.
2014
;
33
(
3
):
328
38
.
11.
Koo
TK
,
Li
MY
.
A guideline of selecting and reporting intraclass correlation coefficients for reliability research
.
J Chiropr Med
.
2016
;
15
(
2
):
155
63
.
12.
Romana
A
,
Niu
M
,
Perez
M
,
Provost
EM
.
FluencyBank timestamped: an updated data set for disfluency detection and automatic intended speech recognition
.
J Speech Lang Hear Res
.
2024
;
67
(
11
):
4203
15
.
13.
Romana
A
,
Koishida
K
,
Provost
EM
.
Automatic disfluency detection from untranscribed speech
.
IEEE/ACM Trans Audio Speech Lang Process
.
2024
;
32
:
4727
40
.
14.
Wagner
D
,
Bayerl
SP
,
Baumann
I
,
Noeth
E
,
Riedhammer
K
,
Bocklet
T
.
Large language models for dysfluency detection in stuttered speech
. (No. arXiv:2406.11025).
2024
.
15.
Pennebaker
JW
,
Stone
LD
.
Words of wisdom: language use over the life span
.
J Pers Soc Psychol
.
2003
;
85
(
2
):
291
301
.
16.
Ryan
L
,
Hay
M
,
Huentelman
MJ
,
Duarte
A
,
Rundek
T
,
Levin
B
, et al
.
Precision aging: applying precision medicine to the field of cognitive aging
.
Front Aging Neurosci
.
2019
;
11
(
1
):
128
.
You do not currently have access to this content.