Introduction: Dataset creation is one of the first tasks required for training AI algorithms but is underestimated in pathology. High-quality data are essential for training algorithms and data should be labelled accurately and include sufficient morphological diversity. The dynamics and challenges of labelling a urine cytology dataset using The Paris System (TPS) criteria are presented. Methods: 2,454 images were labelled by pathologist consensus via video conferencing over a 14-day period. During the labelling sessions, the dynamics of the labelling process were recorded. Quality assurance images were randomly selected from images labelled in previous sessions within this study and randomly distributed throughout new labelling sessions. To assess the effect of time on the labelling process, the labelled set of images was split into 2 groups according to the median relative label time and the time taken to label images and intersession agreement were assessed. Results: Labelling sessions ranged from 24 m 11 s to 41 m 06 s in length, with a median of 33 m 47 s. The majority of the 2,454 images were labelled as benign urothelial cells, with atypical and malignant urothelial cells more sparsely represented. The time taken to label individual images ranged from 1 s to 42 s with a median of 2.9 s. Labelling times differed significantly among categories, with the median label time for the atypical urothelial category being 7.2 s, followed by the malignant urothelial category at 3.8 s and the benign urothelial category at 2.9 s. The overall intersession agreement for quality assurance images was substantial. The level of agreement differed among classes of urothelial cells – benign and malignant urothelial cell classes showed almost perfect agreement and the atypical urothelial cell class showed moderate agreement. Image labelling times seemed to speed up, and there was no evidence of worsening of intersession agreement with session time. Discussion/Conclusion: Important aspects of pathology dataset creation are presented, illustrating the significant resources required for labelling a large dataset. We present evidence that the time taken to categorise urine cytology images varies by diagnosis/class. The known challenges relating to the reproducibility of the AUC (atypical) category in TPS when compared to the NHGUC (benign) or HGUC (malignant) categories is also confirmed.

1.
McAlpine
ED
,
Michelow
P
.
The cytopathologist’s role in developing and evaluating artificial intelligence in cytopathology practice
.
Cytopathology
.
2020
;
31
(
5
):
385
92
.
2.
Peikari
M
,
Salama
S
,
Nofech-Mozes
S
,
Martel
AL
.
A cluster-then-label semi-supervised learning approach for pathology image classification
.
Sci Rep
.
2018
;
8
(
1
):
7193
13
.
3.
Tizhoosh
HR
,
Pantanowitz
L
.
Artificial intelligence and digital pathology: challenges and opportunities
.
J Pathol Inform
.
2018
;
9
(
1
):
38
.
4.
Marée
R
.
The need for careful data collection for pattern recognition in digital pathology
.
J Pathol Inform
.
2017
[cited 2018 Dec 2];
8
:
19
. Available from: http://www.jpathinformatics.org/text.asp?2017/8/1/19/204200.
5.
Abels
E
,
Pantanowitz
L
,
Aeffner
F
,
Zarella
MD
,
vd Laak
J
,
Bui
MM
,
.
Computational pathology definitions, best practices, and recommendations for regulatory guidance: a white paper from the Digital Pathology Association
.
J Pathol
.
2019
;
249
(
3
):
286
94
.
6.
Géron
A
.
Hands-on machine learning with scikit-learn, keras & tensorFlow
. 2nd ed.
O’Reilly Media Inc.
;
2019
. p.
23
32
.
7.
Guo
Z
,
Chen
R
,
Zhang
K
,
Pan
Y
,
Wu
J
.
The impairing effect of mental fatigue on visual sustained attention under monotonous multi-object visual attention task in long durations: an event-related potential based study
.
PLoS One
.
2016
;
11
(
9
):
1
13
.
8.
Barkan
GA
,
Wojcik
EM
,
Nayar
R
,
Savic-Prince
S
,
Quek
ML
,
Kurtycz
DF
,
.
The Paris System for reporting urinary cytology: the quest to develop a standardized terminology
.
Adv Anat Pathol
.
2016 [cited 2018 Dec
223
]
;(
3
):
193
201
.
9.
Reid
MD
,
Osunkoya
AO
,
Siddiqui
MT
,
Looney
SW
.
Accuracy of grading of urothelial carcinoma on urine cytology: an analysis of interobserver and intraobserver agreement
.
Int J Clin Exp Pathol
.
2012
;
5
(
9
):
882
91
.
10.
Long
T
,
Layfield
LJ
,
Esebua
M
,
Frazier
SR
,
Giorgadze
DT
,
Schmidt
RL
.
Interobserver reproducibility of the Paris system for reporting urinary cytology
.
Cytojournal
.
2017
;
14
(
1
):
17
.
11.
Kurtycz
DFI
,
Barkan
GA
,
Pavelec
DM
,
Rosenthal
DL
,
Wojcik
EM
,
VandenBussche
CJ
,
.
Paris Interobserver Reproducibility Study (PIRST)
.
J Am Soc Cytopathol
.
2018
;
7
(
4
):
174
84
.
12.
Wang
YH
,
Hang
JF
,
Wen
CH
,
Liao
KC
,
Lee
WY
,
Lai
CR
.
Diagnostic agreement for high-grade urothelial cell carcinoma in atypical urine cytology: a nationwide survey reveals a tendency for overestimation in specimens with an N/C ratio approaching 0.5
.
Cancers
.
2020
;
12
(
2
):
272
.
13.
Stanzione
N
,
Ahmed
T
,
Fung
PC
,
Cai
D
,
Lu
DY
,
Sumida
LC
,
.
The continual impact of the Paris System on urine cytology, a 3-year experience
.
Cytopathology
.
2020
;
31
(
1
):
35
40
.
14.
Wei
J
,
Suriawinata
A
,
Vaickus
L
,
Ren
B
,
Liu
X
,
Wei
J
,
.
Generative image translation for data augmentation in colorectal histopathology images
.
Proc Mach Learn Res
.
2019
;
116
:
10
24
.
15.
Cuff
J
,
Higgins
JP
.
Statistical analysis of surgical pathology data using the R program
.
Adv Anat Pathol
.
2012
[cited 2020 Feb 25
]
;
19
:
131
9
.
16.
R Core Team
.
R: a language and environment for statistical computing [Internet]
.
Vienna, Austria
:
R Foundation for Statistical Computing
;
2017
. Available from: https://www.r-project.org/.
17.
Hou
L
,
Agarwal
A
,
Samaras Di
,
Kurc
TM
,
Gupta
RR
,
Saltz
JH
.
Robust histopathology image analysis: to label or to synthesize
.
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit
.
2019 Jun
;
2019
:
8525
34
.
18.
Zhang
ML
,
Guo
AX
,
VandenBussche
CJ
.
Morphologists overestimate the nuclear-to-cytoplasmic ratio
.
Cancer Cytopathol
.
2016
;
124
(
9
):
669
77
.
19.
Chollet
F
.
Deep learning with python
. 1st ed.
Greenwich, CT, USA
:
Manning Publications Co.
;
2017
.
You do not currently have access to this content.