Abstract
Introduction: Programmed death-ligand 1 (PD-L1) expression is used for treatment prediction in non-small cell lung cancer (NSCLC). While cytology may be the only available material in the routine clinical setting, testing in clinical trials has mainly been based on biopsies. Methods: We included 2 retrospective cohorts of paired, concurrently sampled, cytological specimens and biopsies. Also, the literature on PD-L1 in paired cytological/histological samples was reviewed. Focus was on the cutoff levels ≥1 and ≥50% positive tumor cells. Results: Using a 3-tier scale, PD-L1 was concordant in 40/47 (85%) and 66/97 (68%) of the paired NSCLC cases in the 2 cohorts, with kappa 0.77 and 0.49, respectively. In the former cohort, all discordant cases had lower score in cytology. In both cohorts, concordance was lower in samples from different sites (e.g., biopsy from primary tumor and cytology from pleural effusion). Based on 25 published studies including about 1,700 paired cytology/histology cases, the median (range) concordance was 81–85% (62–100%) at cutoff 1% for a positive PD-L1 staining and 89% (67–100%) at cutoff 50%. Conclusions: The overall concordance of PD-L1 between cytology and biopsies is rather good but with significant variation between laboratories, which calls for local quality assurance.
Introduction
In non-small cell lung cancer (NSCLC), immunohistochemical (IHC) staining for programmed death-ligand 1 (PD-L1) is used to predict response to immune checkpoint inhibitors [1-5]. Different PD-L1 assays have been used for the various programmed death-1 (PD-1)/PD-L1 inhibitors, and a lot of studies have presented data on comparison of assays [6-8]. Essentially, the PD-L1 clones 22C3, 28-8, and SP263 (especially the former two) exhibit a similar staining pattern, while SP142 differs from the others. Laboratory-developed tests may give the same results as approved assays, but a highly varying number of laboratory-developed tests have shown inadequate quality [9-11].
Other technical factors may also potentially affect PD-L1 staining results. Short fixation time, some decalcification procedures, older paraffin blocks, and long storage time of slides before staining have been linked to reduced staining of tumor cells [12-16]. In contrast, delayed or prolonged formalin fixation has not shown any significant effect on PD-L1 staining [13, 17].
Several biological factors that may affect PD-L1 staining results have also been investigated. PD-L1 expression may differ between primary tumor and metastasis with either gain or loss of expression in the metastasis [18-22]. A longer time interval between sampling has been reported to correlate with greater intertumor discrepancy (indicating a change in PD-L1 expression over time) [19], while the metastatic site has been reported not to influence PD-L1 expression [23]. Impact of intratumor heterogeneity has also been explored, and, for example, concordance of PD-L1 expression between paired biopsies and resected tumors has been reported to be 63–92% for <1 versus ≥1% positive tumor cells [24-28].
As immune checkpoint inhibitors are used in advanced and recurrent lung cancer, biopsies have been the main specimen type for PD-L1 evaluation in treatment studies (although resected tumor tissue was also allowed in most) [1, 2, 4, 5, 29-31] and is regarded as the gold standard. However, cytology is not seldom the only available material in the clinical setting, and some (e.g., technical) factors may differ between cytological and histological specimens. The aim of the present study was to explore if PD-L1 testing in NSCLC is comparable for cytological specimens and biopsies based on the standard procedures in southern Sweden and to review the current literature.
Materials and Methods
In the present study, paired cytology/biopsy specimens from NSCLC cases stained with PD-L1 were included from 2 sites in southern Sweden: The Department of Genetics and Pathology in Lund and the Department of Pathology and Cytology in Halm-stad.
Study Material
The 2 cohorts included all available NSCLC cases from 2017 to 2019 in Lund and 2003 to 2019 in Halmstad (the Halmstad cases were all stained in 2016–2020), respectively, with a paired cell block and biopsy stained for PD-L1. For the Lund cohort, all PD-L1 stains were from the clinical setting, while for the Halmstad cohort, stains were either from the clinical setting or identically stained as part of the present study.
For both cohorts, only cases where the paired specimens were part of the same diagnostic workup were included (<4 weeks between the samples). Only 1 cell block and 1 biopsy were stained with PD-L1 for each individual, but sometimes the cell block contained material from >1 cytological sample, for example, both bronchial brush and endobronchial ultrasound (EBUS)-guided lymph node aspiration combined into a single cell block. Cases with <100 evaluable tumor cells in either sample were excluded. Also, 2 cases were excluded where the biopsy and cytology showed different histological types, namely, squamous cell carcinoma and adenocarcinoma (presumably adenosquamous carcinoma) and large cell neuroendocrine carcinoma and adenocarcinoma (either combined large cell neuroendocrine carcinoma or 2 synchronous tumors), respectively. Comparisons of preparation of cytology and PD-L1 staining (including assays) for the 2 cohorts are summarized in Table 1 and are presented in detail in online suppl. material (for all online suppl. material, see www.karger.com/doi/10.1159/000517078) together with images of control tissue stained for PD-L1 (online suppl. Fig. 1).
PD-L1 Evaluation
All evaluations were performed using a conventional light microscope. All involved pathologists assess PD-L1 in daily practice. Hematoxylin-eosin-stained sections (and diagnostic IHC staining when performed) were available at time of evaluation of PD-L1. PD-L1 was scored <1%, 1–49%, and ≥50%, based on any intensity of partial or complete linear membranous staining of tumor cells in line with assessment manuals (Agilent/pharmDx, Santa Clara, CA, USA) [32, 33].
For the Lund cohort, all PD-L1 slides were assessed by at least 2 pathologists. Review was performed as part of the study by pathologists D.V. or H.B. and in most cases also by the pathology resident R.M. Cases were reviewed by all 3 and/or an additional pathologist (K.E.L.) for confirmation when needed.
For the Halmstad cohort, all PD-L1 slides were assessed as part of the study by the certified cytotechnologist M.S.I.M., and one or, when needed, both the pathologists T.S. and H.B. Cases with discordant PD-L1 between biopsy and cytology were also assessed by the pathologist A.D.
As all cases were primarily assessed by 2 investigators, interobserver agreement could be evaluated, although not representing the concordance between the same 2 investigators for all 144 cases. PD-L1 was scored the same by the 2 investigators for 138 biopsies and 131 cytological specimens (no significant difference with Fisher’s exact test; kappa 0.93 and 0.86, respectively).
Statistical Analysis
Kappa and Wilcoxon signed-rank test were used for diagnostic concordance and comparison of PD-L1 expression, respectively, in the paired cases. Fisher’s exact test was used for comparison of concordance between cohorts. The analyses were performed with MedCalc 14.12.0 (MedCalc Software bvba, Ostend, Belgium).
Literature Search
PubMed was searched using “PD-L1 cytology lung cancer” as the search term. Original articles in English until December 2020 that included NSCLC cases with PD-L1 analyzed on cytological material and paired histological specimens were selected. Further articles were identified through the “similar articles” utility of PubMed and individual journals as well as cited references in the retrieved articles. Studies not presenting data for pure cytological versus pure histological specimens were excluded [23, 28, 34-37].
Results
PD-L1 Concordance
Concordance of PD-L1 expression in paired biopsies and cytological specimens from NSCLC cases in Lund 2017–2019 and Halmstad 2003–2019 (stained 2016–2020), respectively, is found in Table 2. As evident, 40 (85%) of 47 cases (from 47 different individuals) were concordant in the Lund cohort. All discordant cases had lower PD-L1 score in cytology, and the difference in score was significant between biopsies and cytology (Wilcoxon test p = 0.02). Unweighted kappa was 0.77 (95% CI: 0.62–0.93). The concordance was 94% (44/47) for the 1% cutoff (i.e., <1 or ≥1% positive tumor cells) and 89% (42/47) for the 50% cutoff.
Concordance of PD-L1 expression in 47 and 97 paired biopsies and cytological specimens from NSCLC, from Lund and Halmstad, respectively

In the Halmstad cohort, 66 (68%) of 97 cases (from 97 different individuals) were concordant, and unweighted kappa was 0.49 (95% CI: 0.35–0.63). The concordance was 81% (79/97) for the 1% cutoff (i.e., <1 or ≥1% positive tumor cells) and 85% (82/97) for the 50% cutoff. As evident from Table 2, there were more cases with lower PD-L1 score in cytology in the Halmstad cohort, with a nonsignificant trend (Wilcoxon test p = 0.055).
The number of concordant cases was significantly lower in the Halmstad cohort compared to the Lund cohort (Fisher’s exact test p = 0.043). See Figures 1 and 2 for examples of a concordant and a discordant case (using the 3-tier scale <1%/1–49%/≥50%), respectively, and online suppl. Figures 2 and 3 for additional cases.
An adenocarcinoma with concordant PD-L1 expression (both ≥50%) between paired EBUS cytology (a, b) and bronchial biopsy (c, d). Hematoxylin-eosin (a, c) and PD-L1 clone 22C3 (b, d). Scale bar, 20 μm. PD-L1, programmed death-ligand 1; EBUS, endobronchial ultrasound.
An adenocarcinoma with concordant PD-L1 expression (both ≥50%) between paired EBUS cytology (a, b) and bronchial biopsy (c, d). Hematoxylin-eosin (a, c) and PD-L1 clone 22C3 (b, d). Scale bar, 20 μm. PD-L1, programmed death-ligand 1; EBUS, endobronchial ultrasound.
A squamous cell carcinoma with discordant PD-L1 expression (<1% vs. ≥50%) between paired EBUS cytology (a, b) and bronchial biopsy (c, d). Hematoxylin-eosin (a, c) and PD-L1 clone 22C3 (b, d). Scale bar, 20 μm. PD-L1, programmed death-ligand 1; EBUS, endobronchial ultrasound.
A squamous cell carcinoma with discordant PD-L1 expression (<1% vs. ≥50%) between paired EBUS cytology (a, b) and bronchial biopsy (c, d). Hematoxylin-eosin (a, c) and PD-L1 clone 22C3 (b, d). Scale bar, 20 μm. PD-L1, programmed death-ligand 1; EBUS, endobronchial ultrasound.
Characteristics of the Specimens
The diagnosis of the 47 cases in the Lund cohort was adenocarcinoma in 32 (whereof 29 concordant PD-L1), squamous cell carcinoma in 13 (9 concordant), and NSCLC not otherwise specified in 2 cases (both concordant). The biopsies were bronchial biopsies in 42 cases, transthoracic core biopsies in 3 cases, liver biopsy in 1 case, and biopsy from a cervical lymph node in 1 case. The cytological specimens were bronchial brush in 13 cases, other bronchial cytology including suction catheter, bronchoalveolar lavage (BAL), or mix of any of the two with bronchial brush in 5 cases, EBUS-guided lymph node aspirations in 15 cases, mix of EBUS from lymph nodes and bronchial cytology in 12 cases, and pleural effusions in 2 cases.
The samples were from the same site in 18 cases (whereof 16 concordant PD-L1), partly from the same site in 12 cases (11 concordant), or from different sites in 17 cases (13 concordant), respectively. The samples from the same site were all from the primary tumor. The samples partly from the same site were biopsies from the primary tumor and mixed cytological specimens (all mixed bronchial cytology and EBUS from lymph nodes).
The diagnosis of the 97 cases in the Halmstad cohort was adenocarcinoma in 67 (whereof 43 concordant PD-L1) and squamous cell carcinoma in 30 cases (23 concordant). The biopsies were bronchial biopsies in 62 cases and transthoracic core biopsies in 35 cases. The cytological specimens were bronchial brush in 13 cases, BAL in 53 cases, both bronchial brush and BAL in 7 cases, pleural effusion in 17 cases, EBUS-guided lymph node aspirations in 2 cases, fine-needle aspiration (FNA) of the lymph node in 2 cases, and both BAL and either pleural effusion, EBUS, or FNA of the lymph node in 1 case each.
The samples were from the same site in 73 cases (whereof 54 concordant PD-L1), partly from the same site in 3 cases (all concordant), or from different sites in 21 cases (9 concordant), respectively. The samples partly from the same site were biopsies from the primary tumor and mixed cytological specimens (mixed BAL and either pleura, EBUS, or FNA of the lymph node). The paraffin blocks had been stored for at least 3 years before PD-L1 staining in 36 of the cases (whereof 25 concordant PD-L1) and shorter than 3 years in the remaining 61 cases (41 concordant).
Literature Data
Based on 25 published studies, the mean, median, and range for PD-L1 concordance in studies on paired cytology/histology cases were 83–85%, 81–85%, and 62–100%, respectively, at cutoff 1% for a positive PD-L1 staining and 87–89%, 89%, and 67–100%, respectively, at cutoff 50% [38-62]. The reason for the intervals (here and below) is that some studies presented separate data for >1 cytological preparation, PD-L1 assay, or histological specimen type. The numbers remained the same if the data from the present study were included. For further details on individual studies, see online suppl. Table 1.
Summation of all paired cytology/histology cases in the studies (including the cases from the present study) resulted in a concordance of 82–83% (1,465/1,785–1,488/1,792 cases) at cutoff 1% for a positive PD-L1 staining and 88–89% (1,398/1,587–1,408/1,580 cases) at cutoff 50% [38-62]. If excluding studies with resections, 6 studies (including the present) remained, reporting a concordance in paired cytology/biopsy cases of 81–86% (228/283–248/290 cases) at cutoff 1% for a positive PD-L1 staining and 86–90% (249/290–254/283 cases) at cutoff 50% [41, 43, 46, 49, 59].
In studies with cytology fixed in formalin, that is, excluding studies with nonformalin or mixed formalin/nonformalin fixation, the concordance with paired histological specimens was 81–82% (333–336/410 cases) at cutoff 1% and 88–89% (316–317/358 cases) at cutoff 50% based on 7 studies [38, 39, 48, 50, 58, 60, 62]. If instead, only including studies with cytology fixed in nonformalin fixatives, excluding mixed formalin/nonformalin fixation, the concordance with paired histological specimens was 79–81% (500–512/633 cases) at cutoff 1% and 88–89% (554–565/633 cases) at cutoff 50% based on 9 studies including the Lund cohort from the present study [41, 45, 46, 49, 52, 55, 56, 62].
The concordance in studies using only cytological cell blocks compared to histology was 82–84% (1,181/1,436–1,217/1,443 cases) at cutoff 1% and 88–89% (1,085/1,238–1,091/1,231 cases) at cutoff 50% based on 23 studies including the present [38-44, 47-51, 53-62]. Correspondingly, the concordance for cytological smears versus histology was 79–80% (327–329/413 cases) at cutoff 1% and 93% (383–384/413) cases at cutoff 50% based on 5 studies [43, 45, 46, 49, 52].
Discussion
The paired cytology/biopsy NSCLC cases of our study support that PD-L1 expression is lower in cytological specimens, with significance in one and trend in the other of our 2 cohorts. Indeed, the PD-L1 score was lower in the cytological specimen for all discordant cases in the Lund cohort. It is noteworthy that the concordance was quite limited in the Halmstad cohort, with a kappa value of 0.49. Although discordant expression was more common in paired samples from different sites (e.g., biopsy from the primary tumor and cytology from pleura or lymph nodes) with 58% concordance (22/38 cases) in our 2 cohorts combined, discordant cases were also observed in paired samples from the same site (concordance 77%; 70/91). Concordance was similar for adenocarcinomas and squamous cell carcinomas and for new and archival blocks.
An important difference between, but also within, our 2 cohorts was the time in alcohol-based fixative (typically 1–3 days vs. <24 h, but also some with only formalin fixation) and the use of CellientTM versus manual cell blocks. While fixation and processing of biopsies is essentially the same at all pathology departments, there are substantial differences in handling of cytological specimens for IHC. Apart from formalin, cells may be fixed in various ethanol- or methanol-containing liquids (e.g., CytoLyt®, CytoRichTM Red, Novaprep®, and PreservCyt®), while different techniques such as direct smears, CytospinTM preparations, or cell blocks (e.g., plasma-thrombin, agar, CellientTM, and ShandonTM) may be used [63], also evident from the published studies compiled in online suppl. Table 1.
Alcohol fixation of tissue has been suggested to affect PD-L1 staining negatively [14], and some studies, like our Lund cohort, report a lower PD-L1 score for alcohol-fixed cytology in all discordant paired cytology/histology cases [45, 64]. Also, Koomen et al. [62] clearly demonstrated that alcohol-fixed CellientTM cell blocks result in lower concordance with histology compared to formalin-fixed agar-based cell blocks. However, the compiled literature data did not show any obvious difference between formalin and nonformalin fixation. Also, Lou et al. [60] presented a very good concordance for formalin-fixed cell blocks with or without prefixation with CytoLyt® [60], while a perfect concordance was seen in the study by Gosney et al. [65] with EBUS cytology fixed in formalin versus alcohol. Fixation in, for example, CytoLyt®, is very rapid, and it has been suggested that <1-h fixation has no effect on IHC staining (personal communication). The area thus merits future investigation.
Furthermore, specimen types differed between our cohorts, with more bronchial biopsies, brushes, and EBUS-guided lymph node aspirations in the Lund cohort, while the Halmstad cohort contained more transthoracic core biopsies, BAL, and pleural effusions. Both, but especially the Lund cohort, contained cell blocks mixed from different cytological samples (for increased tumor cell content). This complicates analyses of sample types but reflects the real-world diagnostic situation in our setting.
Grosu et al. [50] and Zou et al. [57] showed a higher concordance for PD-L1 in pleural effusions compared to matched histological samples (87–97% at 1% cutoff) than did Yoshimura et al. [37] and Jug et al. [58] for EBUS-guided samples (70–84%). Such a trend is not seen when comparing our 2 cohorts but may be concealed by other differences with greater impact. There were too few pleural effusions in the Lund cohort and too few EBUS-guided aspirations in the Halmstad cohort for comparisons of these 2 sample types within each cohort.
Interestingly, in the Halmstad cohort, biopsy/cytology PD-L1 concordance was lower for bronchial brushes and pleural effusions than for BAL specimens, while there was no obvious difference between bronchial and transthoracic core biopsies. However, our cohorts do not contain enough cases for adequate multivariable analyses, and ideally future studies may include multiple samples from the same patient for further analysis of potential effect of sample type on PD-L1 expression.
Still, the overall concordance of PD-L1 expression between cytology and histology was rather good based on our results and previously published studies. In the literature, discordant cases typically show either higher score in cytology or in histology [39, 42, 43, 55, 66], which is expected due to heterogeneity of PD-L1 expression [24-28, 67]. However, the range of concordance was quite broad, 62–100% at cutoff 1% and 67–100% at cutoff 50% based on 25 publications. Also, several studies present a positive PD-L1 expression (≥1%) in <40% of NSCLC cases when using cytology [41, 42, 44, 52, 54, 61, 68, 69], which is notably lower than what has been reported in both early treatment studies and large studies with real-life data, with PD-L1 ≥1% in 53–86% [70] and 56–63% [71, 72] of NSCLC, respectively (61% for cytology [71]). Furthermore, for example, the large study by Kuempers et al. [52] showed a high concordance (93%) at the 50% cutoff for a positive staining, but still no case with >50% in both cytology (mainly imprints from resections) and the paired resected tumors, which makes the applicability of the results questionable. We believe it would be reasonable for any department that use cytology for PD-L1 to (a) investigate the proportion of PD-L1-positive NSCLC in 50–100 nonselective cytological specimens and (b) evaluate the concordance between biopsies and cytological specimens in 20–30 paired cases. A lab with <55% PD-L1-positive cases and/or concordance <85% should consider investigating possible causes and potential improvements.
Biopsies are currently the standard for PD-L1 testing. However, based on histology, PD-L1 is not considered an optimal predictor of immunotherapy response [1-5, 29-31]. Also, there is a variation of positive PD-L1 expression between studies for biopsies as well (e.g., see online suppl. Table 1, but also seen in treatment studies [70]). Thus, it may be argued that the predictive value of PD-L1 in cytology (and preferably also the concordance with biopsies) should be evaluated in a population of patients treated with immunotherapy. Such studies are missing today.
PD-L1 expression in immune cells, relevant for a different PD-L1 assay (SP142), was not a focus of the present study. The representation of tumor-infiltrating immune cells in small samples has been discussed, and the interobserver concordance for PD-L1 has been shown to be low for immune cells [6, 7]. We noted in our Halmstad cohort that >1% positive lymphocytes were seen in 7 (7%) of 94 and >1% positive macrophages in 60 (65%) of 93 cytological specimens.
Some strengths and weaknesses of the present study must be discussed. Importantly, our paired samples were from the same diagnostic workup, which eliminates any potential effect of systemic therapy and time on PD-L1 expression, and only biopsies (not resections) and cases with >100 evaluable tumor cells were included. We did not limit the material to cases with sampling from the same site to eliminate another potentially confounding factor, but data for samples from the same and different sites, respectively, are presented. All cases in each cohort were stained with the same marker, 28-8 or 22C3, respectively. As very similar staining patterns have been reported for these 2 PD-L1 clones [6-8], the staining is probably comparable. The Halmstad cohort contained some old archival cases (which is linked to lower PD-L1 expression and not recommended [7, 15]), but this was about equal for the cell blocks and biopsies, which probably limits the impact on the results. In the Lund cohort, there was a slight selection bias for PD-L1-positive cases as a cytological cell block was not ordered as often in the clinical setting when a biopsy was PD-L1 negative. However, this probably does not affect the results significantly. The inclusion of cell blocks with mixed material from different sites limits the possibility to fully evaluate any possible impact of the sampling site on PD-L1. Also, it may be of note that interobserver discrepancy was 17% (3/18) for these cell blocks compared to 8% (10/126) for the remaining cases.
In conclusion, there is a rather good concordance for PD-L1 expression in NSCLC between cytology and histology, but with quite substantial variation between studies. As fixatives and preparation techniques for cytology may probably affect PD-L1 expression, it is of importance that each laboratory’s quality of PD-L1 staining in cytology is kept high, preferably by ascertaining a good correlation with biopsies.
Acknowledgment
The authors wish to thank the cytology units at the Department of Genetics and Pathology, Lund, and at the Department of Pathology and Cytology, Halmstad. The study was supported by the Regional Agreement on Medical Training and Clinical Research (ALF), Regional Research and Development Funding (FoU), and the Swedish Cancer Society.
Statement of Ethics
The study was conducted in adherence to the Declaration of Helsinki and was approved by the Regional Ethical Review Board in Lund (Dnr 2019-04782 and Dnr 2006-399 with addition 2017-708, respectively).
Conflict of Interest Statement
The authors have no conflicts of interest to disclose.
Funding Sources
The study was supported by the Regional Agreement on Medical Training and Clinical Research (ALF), Regional Research and Development Funding (FoU), and the Swedish Cancer Society. The funding sources had no role in the design or conduct of the study.
Author Contributions
M.S.I.M., T.S., D.V., K.D., A.D., M.P., and H.B. planned the study. L.T. and K.H. performed staining. M.S.I.M., K.E.L., T.S., U.M., R.M., B.H., D.V., A.D., and H.B. collected data. M.S.I.M. and H.B. analyzed the data and wrote the manuscript, and all other authors revised the manuscript for its intellectual content.