Multicenter evaluation of the PASS score as a negative predictive tool and the impact of inter-observer variability in pheochromocytoma and paraganglioma risk stratification

Article information

J Pathol Transl Med. 2026;60(2):202-213
Publication date (electronic) : 2026 February 23
doi : https://doi.org/10.4132/jptm.2025.11.05
1Department of Pathology, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Korea
2Department of Pathology, Gangnam Severance Hospital, Yonsei University College of Medicine, Seoul, Korea
3Department of Pathology, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam, Korea
4Department of Pathology, Yongin Severance Hospital, Yonsei University College of Medicine, Yongin, Korea
5Department of Pathology, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
6Department of Pathology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Korea
Corresponding Author: Jae-Kyung Won, MD, PhD, Department of Pathology, Seoul National University Hospital, Seoul National University College of Medicine, 101 Daehak-ro, Jongno-gu, Seoul 03080, Korea Tel: +82-2-2072-4895, Fax: +82-2-743-5530, E-mail: jkwon@snuh.org
Received 2025 August 21; Revised 2025 October 13; Accepted 2025 November 5.

Abstract

Background

The Pheochromocytoma of the Adrenal Gland Scaled Score (PASS) is widely used for risk stratification in pheochromocytoma and paraganglioma (PPGL), but its clinical utility is limited by inter-observer variability of its parameters and inconsistent predictive performance.

Methods

We conducted a multicenter retrospective study of 1,518 patients with PPGL from five tertiary referral centers in Korea. Prognostic utility of PASS system was assessed using logistic regression, Kaplan-Meier analysis, and receiver operating characteristic (ROC) curve analysis. Inter-observer variability was inferred by comparing area under the ROC curve (AUCs) across institutions. Simplified PASS systems were developed based on multivariable analysis of key histopathological parameters.

Results

The PASS system was a significant predictor of adverse events and recurrence-free survival. Although the PASS system demonstrated only modest discriminative ability (AUC, 0.673), it showed a high negative predictive value (NPV, 0.885), supporting its usefulness as a screening tool for benign behavior. However, there was significant inter-institutional variability in PASS performance (AUC; range, 0.513 to 0.727; p < .05). The 3-factor Simple PASS, which incorporates necrosis, spindling, and mitotic figures, exhibited less inter-observer variation. The 4-factor Simple PASS, which adds vascular invasion to the 3-factor model, also showed reduced inter-observer variability and improved AUC and NPV compared to the original PASS system.

Conclusions

In this multicenter cohort, the PASS system demonstrated high NPV and screening potential, but significant inter-observer variability remains a challenge. Simplification of the PASS system and enhanced pathologist training may improve reproducibility and clinical utility in PPGL risk stratification.

Graphical abstract

INTRODUCTION

Pheochromocytomas, which are now categorized as adrenal paragangliomas (PPGL) and PPGL are rare neuroendocrine tumors arising from the adrenal medulla and the paraganglia of the autonomic nervous system, respectively, with diverse clinical behaviors ranging from benign to highly aggressive forms [1-3]. Accurate prognostication remains a challenge due to the heterogeneous nature of these tumors and the lack of universally accepted histopathological grading systems [1-3]. The Pheochromocytoma of the Adrenal Gland Scaled Score (PASS) has been proposed to stratify the risk of malignancy and adverse clinical outcomes [4,5]. However, limitations such as inter-observer variability and inconsistent predictive performance have been reported, raising concerns about its clinical utility [6-9].

The PASS, originally developed by Thompson, incorporates multiple histopathological parameters but lacks quantitative standards for each, leading to variability in interpretation and application across institutions [4,6-11]. Previous research has demonstrated that the PASS, despite its modest overall performance, maintains a high negative predictive value (NPV), suggesting a potential role as a screening tool [5,12-14].

Given these challenges, there is a need to refine prognostic models for PPGL to improve reproducibility and predictive accuracy. Recent efforts to develop a simplified scoring system, focusing on key histopathological features with the strongest associations to adverse events, have shown promise in reducing inter-observer variability and enhancing prognostic performance [5,11,15-18]. This multicenter study aims to evaluate the prognostic significance of the PASS across five tertiary referral centers in Korea, assess inter-institutional variability, and validate the utility of simplified PASS scoring systems for risk stratification in PPGL patients.

MATERIALS AND METHODS

Study cohort and data collection

Five tertiary referral centers in Korea participated in this multi center study investigating the prognostic factors for paragangliomas. The participating institutions were Seoul National University Hospital (SNUH), Seoul National University Bundang Hospital (SNUBH), Samsung Medical Center (SMC), Seoul St. Mary’s Hospital, and Severance Hospital. All data were fully anonymized and merged prior to analysis, so that investigators were blinded to the identity of the contributing institutions.

A total of 1,523 patients diagnosed with PPGL from 2007 to 2020 in 5 hospitals were initially identified. Among these, three cases of gangliocytic paraganglioma and two cases of cauda equina paraganglioma were excluded from the cohort, as these entities are currently classified as composite gangliocytoma/neuroma and neuroendocrine tumor, and cauda equina neuroendocrine tumor, respectively, according to the latest World Health Organization (WHO) classification of endocrine and neuroendocrine tumors [1]. Therefore, the final study cohort comprised 1,518 patients.

Of the total cohort, PASS information was available in 1,079 cases.

Definition of clinical outcomes

Adverse events were defined as the presence of recurrence, metastasis, or multifocality. Recurrence was defined as tumor relapsing in the vicinity of the operative bed. Metastasis was defined as pathologically or radiologically confirmed tumor spread to sites outside the operative bed, specifically bone and lymph nodes [17,19-21]. Multifocality was defined as the presence of additional tumors, excluding bone or lymph node, detected at the time of pathological diagnosis. Recurrence-free survival (RFS) was defined as the interval from the date of surgery to the most recent follow-up or the time when an adverse event occurred.

Statistical analysis

All statistical analyses were performed using R Software (R Foundation for Statistical Computing, Vienna, Austria). Continuous variables were compared using analysis of variance (ANOVA), and categorical variables were analyzed with Fisher’s exact test. Survival outcomes were evaluated with Kaplan-Meier plots and the log-rank test in the subset of 363 cases with available RFS data. Univariable and multivariable Cox proportional hazards models were used to assess prognostic factors for RFS in these 363 cases. For adverse event outcomes, univariable and multivariable logistic regression analyses were conducted separately for patients with available PASS data (n = 1,079). Pair-wise Delong test was performed for comparison of area under the receiver operating characteristic curve (AUC) among five institutes. A two-sided p-value < .05 was considered statistically significant.

RESULTS

Baseline characteristics

The baseline characteristics of the study cohort are summarized in Table 1. Mean age, sex distribution, tumor size, tumor localization, PASS, and disease status are presented for each hospital. Significant differences in mean age were observed among the institutes (ANOVA, p < .05); specifically, hospital D had a higher mean age compared to hospitals A and E. PASS also differed significantly among groups, with hospital A exhibiting higher scores than the other hospitals. For categorical variables such as sex, tumor localization, and disease status, the proportions are reported in Table 1. For tumor localization, cases of paraganglioma in which the localization could not be identified in the electronic medical records were classified as unknown. Statistically significant differences among hospitals were identified for localization and disease state (Fisher’s exact test; p-values are shown in Table 1).

Baseline characteristics of this cohort

PASS as prognostic indicator of adverse events

Univariable and multivariable logistic regression analyses were performed to evaluate the odds ratios of each variable in the total cohort of 1,518 cases (Table 2). In univariable analysis, all variables except sex were statistically significant. However, in multivariable analysis, only age (per 10 years) (odds ratio [OR], 0.68; 95% confidence interval [CI], 0.60 to 0.76; p < .001) and PASS category (intermediate risk [4–5]: OR, 2.05; 95% CI, 1.22 to 3.35; high risk (≥6): OR, 4.83; 95% CI, 3.12 to 7.50; p < .001) remained significant.

Logistic regression analysis for adverse events

Kaplan-Meier survival analyses were conducted for the 363 cases with available survival data. Survival curves were stratified by age (≤45 years vs. >45 years), sex, tumor localization (adrenal vs. extra-adrenal), tumor size (≤5.0 cm vs. >5.0 cm), and PASS category (low risk [≤3], intermediate risk [4–5], high risk [≥6]), as shown in Fig. 1. According to the log-rank test, significant differences in survival were observed for age and PASS score categories (all p < .001). In contrast, sex, tumor localization, and tumor size did not show statistically significant differences in survival.

Fig. 1.

Kaplan-Meier survival curves for 363 cases with available survival data. Significant differences in survival were observed for age (B) and Pheochromocytoma of the Adrenal Gland Scaled Score (PASS) score (E) categories (p < .001, log-rank test). No statistically significant differences are found for sex (A), tumor localization (C), or tumor size (D).

Cox proportional hazards regression analysis for disease-free survival indicated that all variables except sex were statistically significant in univariable analysis (Table 3). In multivariable analysis, age (per 10 years) (hazard ratio [HR], 0.71; 95% CI, 0.60 to 0.83; p < .001), localization (extra-adrenal: HR, 2.96; 95% CI, 1.71 to 5.14; p < .001), and PASS category (intermediate risk [4–5]: HR, 6.12; 95% CI, 2.3 to 16.3; high risk [≥6]: HR, 12.5; 95% CI, 5.49 to 28.5; p < .001) remained significant. Both multivariable logistic regression and Cox proportional hazards analyses demonstrated the prognostic significance of histopathological grading systems.

Cox proportional HR analysis for DFS

The screening role of PASS: high NPV offsets modest predictive performance

The predictive performance of the PASS score for adverse events was further evaluated using logistic regression-based receiver operating characteristic (ROC) curve analysis, as summarized in Fig. 2A. The PASS demonstrated an AUC of 0.673, sensitivity of 0.511, specificity of 0.836, NPV of 0.885, and positive predictive value (PPV) of 0.409. The grid plot demonstrated a low incidence of adverse events in the low-risk group (53 of 720 patients [7.4%]) and a comparatively higher incidence in both the intermediate-risk group (32 of 169 patients [18.9%]) and the high-risk group (59 of 190 patients [31.1%]) (Fig. 2B). Previous studies, including a recent meta-analysis, have also suggested that the high NPV of the PASS supports its utility as a screening tool for risk stratification in pheochromocytoma and PPGL [14]. Our findings are consistent with these prior reports, indicating that while the predictive accuracy of the PASS system alone may be suboptimal, its high NPV renders it a valuable prognostic grading system.

Fig. 2.

Predictive performance of Pheochromocytoma of the Adrenal Gland Scaled Score (PASS) for adverse events and its inter-observer variability. (A) Receiver operating characteristic (ROC) curve for the PASS score and (B) grid plot showing the distribution of adverse events by PASS risk group. (C) Comparison of the area under the ROC curve (AUC) for the PASS score across five institutions. Notable variability in AUC values illustrates inter-observer differences in predictive performance among centers. NPV, negative predictive value; PPV, positive predictive value.

Inter-observer variability of the PASS

The PASS is composed of multiple histopathological parameters, many of which lack quantitative or objective criteria. As a result, concerns have been raised regarding inter-observer variability in its application [8,9]. To further assess this issue, the predictive performance of the PASS for adverse events was evaluated across five different hospitals. The AUC for each institution was as follows: hospital A, 0.698; hospital B, 0.513; hospital C, 0.727; hospital D, 0.665; and hospital E, 0.687 (Fig. 2C). Pairwise DeLong tests revealed statistically significant differences in AUC between hospital A and hospital B, as well as between hospital B and hospital C (p < .05) (Table 4). These findings indirectly demonstrate inter-observer variability in the application of the PASS score, or at the very least, highlight discrepancies in its predictive performance among different institutions.

AUC of PASS score by institutes

The distribution of positive cases for each PASS parameter in the adverse event group revealed that nuclear pleomorphism, capsular invasion, and nuclear hyperchromasia were the most frequently observed features (Fig. 3A). Notably, when the proportion of each PASS parameter was further analyzed across the four participating institutes, substantial variation was observed in several parameters—including capsular invasion and nuclear hyperchromasia (Fig. 3B). These findings not only identify which parameters are most commonly seen in the adverse event group but also suggest inter-observer variability in PASS parameter assessment among institutions, underscoring the limitation of applying PASS.

Fig. 3.

Proportion of Pheochromocytoma of the Adrenal Gland Scaled Score (PASS) parameters in adverse event group. (A) Proportion of positive cases for each PASS parameter in the adverse event group. Parameters are displayed in order of frequency across the entire cohort. (B) Proportion of positive cases for each PASS parameter in the adverse event group, stratified by institute. The variable distribution across centers highlights inter-observer variability.

Multivariable analysis of PASS parameters and development of the Simple PASS

Multivariable logistic regression analysis was conducted on 560 samples with available PASS parameter scoring to evaluate the association between individual PASS parameters and adverse events (Table 5). Among the PASS parameters, only confluent necrosis, tumor cell spindling, and mitotic figures (>3 per 10 high-power fields [HPF] were statistically significant. Specifically, confluent necrosis was associated with an OR of 3.38 (95% CI, 1.76 to 6.52; p < .001), tumor cell spindling with an OR of 1.89 (95% CI, 1.13 to 3.12; p =.016), and mitotic figures (>3/10 HPF) with an OR of 2.00 (95% CI, 1.10 to 3.58; p = .023). These three parameters were included in the 3-factor Simple PASS, while vascular invasion—despite its marginal significance (OR, 1.64; 95% CI, 0.94 to 2.83; p = .080)—was added to the 4-factor Simple PASS. This decision took into account that other scoring systems, such as GAPP (grading of adrenal pheochromocytoma and paraganglioma) and COPPS (COmposite Pheochromocytoma/paraganglioma Prognostic Score), also incorporate vascular invasion as a parameter [5,18].

ORs of PASS parameter for adverse event

Based on these odds ratios, the 3-factor Simple PASS was developed by assigning 3 points to confluent necrosis, 2 points to spindling, and 2 points to increased mitotic figures. ROC curve analysis identified an optimal cut-off of 1 point according to the Youden index, indicating that the presence of any single parameter was sufficient to classify a case as high-risk. When high risk was defined as the presence of one or more parameters, subsequent ROC analysis yielded an AUC of 0.646, sensitivity of 0.519, specificity of 0.740, NPV of 0.837, and PPV of 0.374 (Fig. 4A). Furthermore, the 4-factor Simple PASS was created by adding vascular invasion, assigned one point, to the original 3-factor model. Using the same optimal cut-off of 1 point, as determined by the Youden index, ROC analysis demonstrated an AUC of 0.701, sensitivity of 0.497, specificity of 0.891, NPV of 0.938, and PPV of 0.374 (Fig. 4B). Notably, the 4-factor Simple PASS showed improved AUC, specificity, and NPV compared to the original PASS.

Fig. 4.

Simplified Pheochromocytoma of the Adrenal Gland Scaled Score (PASS) models. (A) Receiver operating characteristic (ROC) curve analysis of the 3-factor Simple PASS, constructed by only confluent necrosis, spindling, and high mitotic count (3>10 high-power fields). (B) ROC curve analysis of the 4-factor Simple PASS, adding vascular invasion to the 3-factor model. The 4-factor Simple PASS demonstrates improved area under the ROC curve (AUC), specificity, and negative predictive value compared to the original PASS system. (C, D) Comparison of the AUC for both Simple PASS systems between hospitals A and B. Unlike the original PASS system, the differences of AUC between the two institutions in both simplified models are not statistically significant, suggesting reduced inter-observer variability with the simplified scoring system. NPV, negative predictive value; PPV, positive predictive value.

When comparing the performance of the simplified PASS systems between hospitals A and B, the difference in AUC was less pronounced than that observed with the original PASS in both simplified models (Fig. 4C, D). Importantly, the difference in AUC between the two hospitals for the simplified PASS was not statistically significant (DeLong test p = .658 for the 3-factor model and .465 for the 4-factor model), in contrast to the significant difference observed with the original PASS score (p = .005). These findings suggest that applying the simplified PASS reduces inter-observer variability.

DISCUSSION

The clinical utility of the PASS system for risk stratification in PPGL has been demonstrated in numerous previous studies, supporting their roles as prognostic indicators for adverse clinical outcomes [10-14,16,22]. In the present multicenter study, we validated the prognostic impact of PASS using what is, to the best of our knowledge, the largest cohort to date. Our results align with prior research, confirming that PASS is associated with adverse events and RFS. Specifically, multivariable logistic regression analysis in our cohort of 1,518 cases revealed that age and PASS were independent predictors of adverse events, with age per 10 years having an OR of 0.68 (95% CI, 0.60 to 0.76; p < .001) and high-risk PASS scores (≥6) conferring an OR of 4.83 (95% CI, 3.12 to 7.50; p < .001). ROC curve analysis showed that the PASS score had an AUC of 0.673, sensitivity of 0.511, specificity of 0.836, and a notably high NPV of 0.885. These findings are consistent with previous meta-analyses and large-scale studies, which have emphasized the high NPV of the PASS as a screening tool, despite its modest overall discriminative performance [5,12-14]. The association between younger age and adverse events may be explained by the fact that adverse events often involve multiplicity, and younger patients are more likely to carry germline mutations and exhibit multiplicity [23-25]. Further studies are warranted to precisely define adverse events using genetic profiles, to distinguish true adverse events from mere multiplicity.

A major advantage of the PASS system is its simplicity and cost-effectiveness, as it can be assessed solely on histopathological grounds without the need for additional molecular or clinical data [5,12]. However, the PASS system incorporates numerous parameters, many of which lack quantitative definitions, leading to substantial inter-observer variability [6-9]. This issue has been widely recognized in the literature, and our multicenter analysis provides further robust evidence that both the diagnostic validity and interpretative application of the PASS score are subject to considerable variability across institutions. Specifically, diagnostic performance as measured by AUC ranged from 0.513 to 0.727 among the five participating centers, with statistically significant pairwise differences observed in certain comparisons. Furthermore, our detailed exploration of individual PASS parameters demonstrated substantial heterogeneity in their assessment: for instance, capsular invasion and nuclear hyperchromasia showed pronounced discrepancies in the proportion of positive cases among centers. Such inter-institutional differences are likely attributable to variability in pathologists’ interpretation and implementation of the PASS criteria. These results underscore the persistent challenge of inter-observer variability.

In response to the limitations of the PASS, several alternative grading systems have been introduced, including the GAPP and other composite models [15-18]. However, each of these systems has its own constraints, such as limited validation, reliance on additional clinical or biochemical data, or continued subjectivity in parameter assessment. Despite these limitations, the PASS remains a valuable screening tool due to its high NPV, and ease of use based solely on histopathology [14].

To address inter-observer variability and improve the prognostic performance of PASS, we conducted multivariable analysis to identify the parameters most strongly associated with adverse events. Based on these results and previous studies, confluent necrosis, tumor cell spindling, and increased mitotic figures (>3/10 HPF) were incorporated into a 3-factor simplified PASS scoring system, while vascular invasion was additionally included to develop a 4-factor simplified PASS. Notably, the 4-factor model—where single presence of vascular invasion, necrosis, spindling and increased mitosis can be assigned as high-risk group—demonstrated improved AUC (0.701), specificity (0.891), and NPV (0.938) compared to the original PASS score, despite the reduced number of parameters. Furthermore, when comparing the performance of both simplified PASS systems between hospitals A and B, the difference in AUC was not statistically significant, suggesting reduced inter-observer variability relative to the original PASS system.

This study has several limitations. First, we did not review pathological slides or revise the initially diagnosed PASS to avoid retrospective bias, as the main goal of this study was to evaluate how well the PASS system performs in real-world practice with respect to diagnostic accuracy and inter-observer variability. During data collection, we realized that the absence of a refined consensus on the definitions of each parameter might have contributed to inter-observer variability, underscoring the high demand for precise and standardized definitions of each factor. As a multicenter retrospective analysis, the completeness and consistency of data collection may be suboptimal. While the relationship between PASS score and adverse events was the primary focus, survival data and detailed PASS parameter information was limited, resulting in smaller sample sizes (n = 560) for some analyses. Furthermore, we did not directly assess inter-observer agreement by having multiple pathologists independently review the same slides; rather, we inferred variability indirectly by comparing AUCs for adverse event prediction across institutions. Future studies should incorporate direct inter-observer concordance assessments to more precisely quantify variability. Another limitation concerns the definition of adverse events. Given that PPGL can arise anywhere along the paraganglionic system, distinguishing between multifocality, metastasis, and recurrence can be challenging, and these categories may overlap biologically [26,27]. We included multifocal tumors as adverse events but recognize that their biological behavior may differ from truly metastatic or recurrent tumors, potentially diminishing the predictive power of the PASS [28,29]. Additionally, previous studies suggest patients with germline mutations are more prone to multifocal PPGL, and immunohistochemistry (IHC) for S-100 and SOX-10 can be used to distinguish metastasis from multifocal tumors [18,23-25]. Taken together, future research should incorporate genetic data and additional IHC study to distinguish multifocal cases from metastasis and refine the definition of adverse events accordingly.

In conclusion, this study validates the prognostic impact of the PASS in the largest cohort of PPGL patients to date, confirming the high NPV and screening potential of the PASS. However, significant inter-observer variability remains a major challenge. Efforts to reduce this variability—such as enhanced training for pathologists and simplification of the PASS system—are warranted. Our findings highlight the importance of focusing on key histopathological parameters and support ongoing initiatives to streamline and standardize risk stratification in paraganglioma.

Notes

Ethics Statement

The study protocol was approved by the Institutional Review Board of Seoul National University Hospital (IRB no. H-2108-062-1243) and conducted in accordance with the principles of the Declaration of Helsinki. Formal written informed consent was not required with a waiver by the appropriate IRB and/or national research ethics committee.

Availability of Data and Material

Data supporting the findings of this study are available from the corresponding author on reasonable request.

Code Availability

Not applicable.

Author Contributions

Conceptualization: SJ, JKW. Formal analysis: SJ. Investigation: all authors. Methodology: SJ, JKW. Project Administration: CKJ. Supervision: JKW, KCJ. Visualization: SJ. Writing—original draft: SJ. Writing—review and editing: JKW, KCJ. Approval of final manuscript: all authors.

Conflicts of Interest

C.K.J. and S.Y.P., the editors-in-chief and S.W.H., contributing editor of the Journal of Pathology and Translational Medicine, were not involved in the editorial evaluation or decision to publish this article. All remaining authors have declared no conflicts of interest.

Funding Statement

This research was supported by The Korean Society of Pathologists Grant No. KSPG2021-01.

References

1. WHO Classification of Tumours Editorial Board. Endocrine and neuroendocrine tumours 5th edth ed. Lyon: IARC; 2022.
2. Fishbein L. Pheochromocytoma and paraganglioma: genetics, diagnosis, and treatment. Hematol Oncol Clin North Am 2016;30:135–50. 10.1016/j.hoc.2015.09.006. 26614373.
3. Lenders JW, Duh QY, Eisenhofer G, et al. Pheochromocytoma and paraganglioma: an endocrine society clinical practice guideline. J Clin Endocrinol Metab 2014;99:1915–42. 10.1210/jc.2014-1498. 24893135.
4. Thompson LD. Pheochromocytoma of the adrenal gland scaled score (PASS) to separate benign from malignant neoplasms: a clinicopathologic and immunophenotypic study of 100 cases. Am J Surg Pathol 2002;26:551–66. 10.1097/00000478-200205000-00002. 11979086.
5. Kimura N, Takayanagi R, Takizawa N, et al. Pathological grading for predicting metastasis in phaeochromocytoma and paraganglioma. Endocr Relat Cancer 2014;21:405–14. 10.1530/erc-13-0494. 24521857.
6. Bima C, Bioletto F, Lopez C, et al. Clinical and pathological tools for predicting recurrence and/or metastasis in patients with pheochromocytoma and paraganglioma. Biomedicines 2022;10:1813. 10.3390/biomedicines10081813.
7. Stenman A, Zedenius J, Juhlin CC. Over-diagnosis of potential malignant behavior in MEN 2A-associated pheochromocytomas using the PASS and GAPP algorithms. Langenbecks Arch Surg 2018;403:785–90. 10.1007/s00423-018-1679-9. 29779047.
8. Wachtel H, Hutchens T, Baraban E, et al. Predicting metastatic potential in pheochromocytoma and paraganglioma: a comparison of PASS and GAPP scoring systems. J Clin Endocrinol Metab 2020;105:e4661–70. 10.1210/clinem/dgaa608. 32877928.
9. Wu D, Tischler AS, Lloyd RV, et al. Observer variation in the application of the Pheochromocytoma of the adrenal gland scaled score. Am J Surg Pathol 2009;33:599–608. 10.1097/pas.0b013e318190d12e. 19145205.
10. Mondal A, Sengupta M, Dey S, et al. Clinicopathological study of adrenal pheochromocytoma and extra-adrenal paragangliomas with reference to GAPP and PASS scoring systems. Indian J Pathol Microbiol 2024;67:852–8. 10.4103/ijpm.ijpm_859_23. 38847206.
11. Yamazaki Y, Gao X, Pecori A, et al. Recent advances in histopathological and molecular diagnosis in pheochromocytoma and paraganglioma: challenges for predicting metastasis in individual patients. Front Endocrinol (Lausanne) 2020;11:587769. 10.3389/fendo.2020.587769. 33193100.
12. Koh JM, Ahn SH, Kim H, et al. Validation of pathological grading systems for predicting metastatic potential in pheochromocytoma and paraganglioma. PLoS One 2017;12e0187398. 10.1371/journal.pone.0187398. 29117221.
13. Kimura N, Takekoshi K, Naruse M. Risk stratification on pheochromocytoma and paraganglioma from laboratory and clinical medicine. J Clin Med 2018;7:242. 10.3390/jcm7090242. 30150569.
14. Stenman A, Zedenius J, Juhlin CC. The value of histological algorithms to predict the malignancy potential of pheochromocytomas and abdominal paragangliomas-a meta-analysis and systematic review of the literature. Cancers (Basel) 2019;11:225. 10.3390/cancers11020225. 30769931.
15. Cho YY, Kwak MK, Lee SE, et al. A clinical prediction model to estimate the metastatic potential of pheochromocytoma/paraganglioma: ASES score. Surgery 2018;164:511–7. 10.1016/j.surg.2018.05.001. 29929757.
16. Li Q, Lan Z, Jiang Y, Wang R, Li Z, Jiang X. Validation and evaluation of 5 scoring systems for predicting metastatic risk in pheochromocytoma and paraganglioma. Am J Surg Pathol 2024;48:855–65. 10.1097/pas.0000000000002238. 38712603.
17. Park SS, Ahn CH, Lee S, et al. Preoperative prediction of metastatic pheochromocytoma and paraganglioma using clinical, genetic, and biochemical markers: a cohort study. J Intern Med 2024;296:68–79. 10.1111/joim.13791. 38659304.
18. Pierre C, Agopiantz M, Brunaud L, et al. COPPS, a composite score integrating pathological features, PS100 and SDHB losses, predicts the risk of metastasis and progression-free survival in pheochromocytomas/paragangliomas. Virchows Arch 2019;474:721–34. 10.1007/s00428-019-02553-5. 30868297.
19. Ku EJ, Kim KJ, Kim JH, et al. Diagnosis for pheochromocytoma and paraganglioma: a joint position statement of the Korean pheochromocytoma and paraganglioma task force. Endocrinol Metab (Seoul) 2021;36:322–38. 10.3803/enm.2020.908. 33820394.
20. Yokomoto-Umakoshi M, Umakoshi H, Tsuiki M, et al. Paraganglioma as a risk factor for bone metastasis. Endocr J 2018;65:253–60. 10.1507/endocrj.ej17-0368. 29187703.
21. Zhu X, Huang Z, Dong L, Zhao H, Lu H. A case report of primary colonic paraganglioma with lymph node metastasis. Front Surg 2022;9:961514. 10.3389/fsurg.2022.961514. 36017518.
22. Agarwal A, Mehrotra PK, Jain M, et al. Size of the tumor and pheochromocytoma of the adrenal gland scaled score (PASS): can they predict malignancy? World J Surg 2010;34:3022–8. 10.1007/s00268-010-0744-5. 20703467.
23. Barontini M, Levin G, Sanso G. Characteristics of pheochromocytoma in a 4- to 20-year-old population. Ann N Y Acad Sci 2006;1073:30–7. 10.1196/annals.1353.003. 17102069.
24. Else T, Greenberg S, Fishbein L. Hereditary paraganglioma-pheochromocytoma syndromes. In : Adam MP, Bick S, Mirzaa GM, Pagon RA, Wallace SE, Amemiya A, eds. GeneReviews Seattle: University of Washington, Seattle; 2018.
25. Pamporaki C, Hamplova B, Peitzsch M, et al. Characteristics of pediatric vs adult pheochromocytomas and paragangliomas. J Clin Endocrinol Metab 2017;102:1122–32. 10.1210/jc.2016-3829. 28324046.
26. Asa SL, Ezzat S, Mete O. The diagnosis and clinical significance of paragangliomas in unusual locations. J Clin Med 2018;7:280. 10.3390/jcm7090280. 30217041.
27. Juhlin CC, Mete O. Advances in adrenal and extra-adrenal paraganglioma: practical synopsis for pathologists. Adv Anat Pathol 2023;30:47–57. 10.1097/pap.0000000000000365. 36136370.
28. Brewczynski A, Kolasinska-Cwikla A, Jablonska B, Wyrwicz L. Pheochromocytomas and paragangliomas-current management. Cancers (Basel) 2025;17:1029. 10.3390/cancers17061029. 40149362.
29. Mete O, Juhlin CC. Recent progress in the pathologic classification of pheochromocytomas and paragangliomas. Best Pract Res Clin Endocrinol Metab 2024;38:101958. 10.1016/j.beem.2024.101958. 39609157.

Article information Continued

Fig. 1.

Kaplan-Meier survival curves for 363 cases with available survival data. Significant differences in survival were observed for age (B) and Pheochromocytoma of the Adrenal Gland Scaled Score (PASS) score (E) categories (p < .001, log-rank test). No statistically significant differences are found for sex (A), tumor localization (C), or tumor size (D).

Fig. 2.

Predictive performance of Pheochromocytoma of the Adrenal Gland Scaled Score (PASS) for adverse events and its inter-observer variability. (A) Receiver operating characteristic (ROC) curve for the PASS score and (B) grid plot showing the distribution of adverse events by PASS risk group. (C) Comparison of the area under the ROC curve (AUC) for the PASS score across five institutions. Notable variability in AUC values illustrates inter-observer differences in predictive performance among centers. NPV, negative predictive value; PPV, positive predictive value.

Fig. 3.

Proportion of Pheochromocytoma of the Adrenal Gland Scaled Score (PASS) parameters in adverse event group. (A) Proportion of positive cases for each PASS parameter in the adverse event group. Parameters are displayed in order of frequency across the entire cohort. (B) Proportion of positive cases for each PASS parameter in the adverse event group, stratified by institute. The variable distribution across centers highlights inter-observer variability.

Fig. 4.

Simplified Pheochromocytoma of the Adrenal Gland Scaled Score (PASS) models. (A) Receiver operating characteristic (ROC) curve analysis of the 3-factor Simple PASS, constructed by only confluent necrosis, spindling, and high mitotic count (3>10 high-power fields). (B) ROC curve analysis of the 4-factor Simple PASS, adding vascular invasion to the 3-factor model. The 4-factor Simple PASS demonstrates improved area under the ROC curve (AUC), specificity, and negative predictive value compared to the original PASS system. (C, D) Comparison of the AUC for both Simple PASS systems between hospitals A and B. Unlike the original PASS system, the differences of AUC between the two institutions in both simplified models are not statistically significant, suggesting reduced inter-observer variability with the simplified scoring system. NPV, negative predictive value; PPV, positive predictive value.

Table 1.

Baseline characteristics of this cohort

Characteristic Hospital A (n = 366) Hospital B (n = 188) Hospital C (n = 532) Hospital D (n = 117) Hospital E (n = 315) p-value
Sex .849a
 Female 184 (50.3) 97 (51.6) 277 (52.1) 66 (56.4) 162 (51.4)
 Male 182 (49.7) 91 (48.4) 255 (47.9) 51 (43.6) 153 (48.6)
Age (yr) 49.59 ± 15.90 51.71 ± 14.38 51.36 ± 15.27 54.80 ± 14.26 49.49 ± 15.27 .005b
Tumor size (cm) 4.67 ± 2.90 4.55 ± 2.59 4.44 ± 2.79 4.42 ± 2.49 4.09 ± 2.91 .255b
Localization <.001c
 Adrenal gland 218 (59.6) 110 (58.5) 326 (61.3) 71 (60.7) 178 (56.5)
 Head and neck 32 (8.7) 19 (10.1) 47 (8.8) 9 (7.8) 55 (17.5)
 Thorax 8 (2.2) 0 4 (0.8) 0 2 (0.6)
 Abdomen/pelvis 99 (27.0) 50 (26.6) 117 (22.0) 33 (28.2) 71 (22.5)
 Spinal cord 9 (2.5) 5 (2.7) 7 (1.3) 2 (1.7) 6 (1.9)
 Unknown 0 4 (2.1) 31 (5.8) 2 (1.7) 3 (1.0)
PASS score 3.79 ± 3.27 3.03 ± 3.04 2.52 ± 2.73 2.26 ± 2.37 2.44 ± 2.70 <.001b
Status of disease <.001c
 NED 286 (78.1) 164 (87.2) 502 (94.4) 106 (90.6) 301 (95.6)
 Recurrence 21 (5.7) 1 (0.5) 4 (0.8) 6 (5.1) 3 (1.0)
 Metastasis 38 (10.4) 9 (4.8) 18 (3.4) 2 (1.7) 5 (1.6)
 Multiplicity 21 (5.7) 14 (7.4) 8 (1.5) 3 (2.6) 6 (1.9)

Values are presented as number (%) or mean±SD.

PASS, Pheochromocytoma of the Adrenal Gland Scaled Score; NED, no evidence of disease; SD, standard deviation.

a

Pearson's chi-squared test;

b

One-way analysis of means,

c

Fisher’s exact test.

Table 2.

Logistic regression analysis for adverse events

Variable Univariable analysis Multivariable analysis
OR (95% CI) p-value OR (95% CI) p-value
Sex - .731 .562
 Female - -
 Male 0.95 (0.70–1.29) 0.90 (0.62–1.29)
Age (per 10 years) 0.75 (0.68–0.83) <.001 0.68 (0.60–0.76) <.001
Tumor size (per 1 cm) 1.13 (1.07–1.18) <.001 1.05 (0.98–1.11) .167
Localization <.001 .055
 Adrenal - -
 Extra-adrenal 0.37 (0.26–0.54) 0.67 (0.44–1.01)
PASS score <.001 <.001
 Low risk (0–3) - -
 Intermediate risk (4–5) 2.52 (1.59–3.93) 2.05 (1.22–3.35)
 High risk (≥6) 5.41 (3.69–7.94) 4.83 (3.12–7.50)

OR, odds ratio; CI, confidence interval; PASS, Pheochromocytoma of the Adrenal Gland Scaled Score.

Table 3.

Cox proportional HR analysis for DFS

Variable Univariable analysis Multivariable analysis
10-Year DFS (%) HR (95% CI) p-value HR (95% CI) p-value
Sex .400
 Female 79 -
 Male 78 0.81 (0.50–1.33)
Age (per 10 years) 0.76 (0.65–0.89) <.001 0.71 (0.60–0.83) <.001
Tumor size (per 1 cm) 1.13 (1.05–1.22) .003 1.05 (0.95–1.15) .300
Localization .002 <.001
 Adrenal 84 - -
 Extra-adrenal 72 2.14 (1.31–3.50) 2.96 (1.71–5.14)
PASS score <.001 <.001
 Low risk (0–3) - -
 Intermediate risk (4–5) 4.38 (1.99–9.67) 6.12 (2.30–16.3)
 High risk (≥6) 7.75 (3.94–15.2) 12.5 (5.49–28.5)

HR, hazard ratio; DFS, disease-free survival; CI, confidence interval; PASS, Pheochromocytoma of the Adrenal Gland Scaled Score.

Table 4.

AUC of PASS score by institutes

Institute AUC p-value; pair-wise Delong test
vs. A vs. B vs. C vs. D vs. E
A 0.698 - .005 .654 .719 .943
B 0.513 .008 .128 .265
C 0.727 .535 .796
D 0.665 .896
E 0.687

AUC, area under the receiver operating characteristic curve; PASS, Pheochromocytoma of the Adrenal Gland Scaled Score.

Table 5.

ORs of PASS parameter for adverse event

Variable Univariable analysis Multivariable analysis
OR (95% CI) p-value OR (95% CI) p-value
Capsular invasion 1.58 (1.06–2.35) .024 1.14 (0.67–1.90) .622
Vascular invasion 2.26 (1.38–3.64) .001 1.64 (0.94–2.83) .080
Extension into adipose tissue 1.70 (1.05–3.64) .031 1.00 (0.54–1.86) .994
Large cell nest or diffuse growth >10% 1.97 (1.29–2.98) <.001 1.35 (0.83–2.19) .222
Confluent necrosis 4.26 (2.33–7.86) <.001 3.38 (1.76–6.52) <.001
High cellularity 1.61 (1.01–2.53) .045 1.01 (0.59–1.70) .971
Tumor cell spindling 2.05 (1.27–3.28) .004 1.89 (1.13–3.12) .016
Cellular monotony 1.97 (0.77–4.71) .150 1.11 (0.39–2.94) .832
Mitotic figure (>3/10 HPF) 2.69 (1.58–4.51) <.001 2.00 (1.10–3.58) .023
Atypical mitosis 1.41 (0.44–3.88) .537 0.49 (0.14–1.55) .232
Pleomorphic nuclei 1.85 (1.24–2.76) .002 1.26 (0.78–2.03) .345
Hyperchromatic nuclei 1.70 (1.13–2.54) .010 1.23 (0.75–2.00) .412

OR, odds ratio; PASS, Pheochromocytoma of the Adrenal Gland Scaled Score; CI, confidence interval; HPF, high-power field.