Cite this asYu J, Falagario U, Winks SG, Angell K, Fulcher AS, et al. (2022) Comparison of Gleason score of Prostate Cancer at Ultrasound/MRI Fusion Target Biopsy vs. Systematic Transrectal Ultrasound Guided Biopsy with Final Gleason score at Radical Prostatectomy. Arch Urol Res 6(2): 017-021. DOI: 10.17352/aur.000039
Copyright© 2022 Yu J, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Purpose: To compare accuracy in predicting final Gleason Grade Group (GGG) of Prostate Cancer (PCa) of US/MRI fusion guided target (fBx) vs. systematic Transrectal Ultrasound-Guided (TRUS) biopsy, using histopathologic analysis at prostatectomy as the gold standard.
Materials and methods: After obtaining IRB approval, we retrospectively reviewed records of patients who underwent Radical Prostatectomy (RP) from January 2014 through May 2019 with prior US/MRI fusion guided target or TRUS biopsy. The rates of upgrading (RP GGG > BX GGG), downgrading (RP GGG < BX GGG), and concordance (RP GGG = BX GGG) were compared between the fBx and TRUS groups. Age, PSA, PSA density, and prostate volume were also noted for all patients. Statistical analysis was utilized to assess the data.
Results: A total of 348 men with complete clinical data were included in this study. The rate of downgrading and upgrading in the fBx group was less than in the TRUS biopsy group (14% vs. 19.6%, and 13.2% vs. 19.6%, respectively). The concordance rate was higher in the US-MR fBx group (72.9% vs. 60.7%, p < 0.05)) across all GGG. Notably, lower rates of concordance were found for GGG 1 (24.1%) and GGG 4 (3.6%) in the TRUS Bx group. Patients who underwent US-MR fBx had higher average PSA (9.4 vs. 6.5 ng/ml), higher PSA density (0.3 vs. 0.2 ng/ml2), and lower prostate volume (31 vs. 42 cc). Additionally, biopsy results showed a lower rate of GGG 1 (3.1% vs. 13.2%) and a higher rate of GGG 5 (14.7% vs. 5.5%) in the US-MR fBx group.
Conclusions: Target biopsy has a higher GGG concordance compared to TRUS biopsy (72.9% vs. 60.7%, p < 0.05). In addition, there was less downgrading or upgrading of final PCa GGG in the fBx groups compared to TRUS Bx (14% vs. 19.6%, 13.2% vs. 19.6%, respectively). This finding may have important implications for treatment decisions.
An elevated PSA value often causes concern for prostate malignancy and typically requires a biopsy for further evaluation [1,2]. After histopathologic analysis, these biopsy samples are then assigned a Gleason Grade Group (GGG), which will rate the aggressiveness of the disease and help to dictate the patient’s course of treatment [3,4].
Traditionally, there have been several ways to perform prostate biopsies in individuals with elevated PSA values and concern for malignancy [5,6]. Traditionally, most patients underwent systematic Transrectal Ultrasound biopsy (TRUS Bx), with representative samples taken throughout the prostate in a systematic manner [2,5]. Another method, however, that has gained recent popularity and is quickly becoming the method of choice in many institutions is Transrectal US-MR fusion-guided targeted biopsy (US-MR fBx) [7,8]. A multiparametric MRI (mpMRI) of the prostate is obtained prior to the biopsy, allowing for the identification of specific “targets,” areas suspicious for Prostate Cancer (PCa). These targets are marked on the images prior to the biopsy. At the time of biopsy, these images are then fused with those from real-time transrectal ultrasound, providing a guide for sampling the areas suspicious of malignancy [9-11]. Standard systematic samples can also be obtained in the same setting in order to obtain what is believed to be a very comprehensive evaluation of the prostate [7,8].
While the accuracy of detecting clinically significant PCa using US-MR fBx has shown to be superior to non-fusion methods [5,12-14], some questions remain as to the accuracy and clinical utilization in diagnosing the severity of the disease [5,10,15]. Prior studies show that US-MR fBx identifies a higher GGG PCa (higher risk) than systematic TRUS Bx alone [6,15]. Our concern is that the US-MR fBx may overestimate the GGG for the entire tumor and provide a poor representation of the most prevalent grade of underlying cancer. This higher score could lead to more aggressive treatment than needed, with potential increased treatment side effects or complications, such as impotence or urinary incontinence, among others [16-18].
With the increased utilization of fBx [7,8], our study aims to answer the question of whether US-MR fusion-guided target biopsies provide a representative versus unrepresentative high GGG of prostate cancer compared to systematic TRUS biopsy alone.
The University Institutional Review Board (IRB) approved the retrospective study. From January 2014 through May 2019, the records of 348 patients who had undergone radical prostatectomy were reviewed. Prior to the prostatectomy, these patients had received either systematic TRUS biopsies (n = 219) or US-MR fBx (n = 129).
Every patient who underwent fBx at our institution had a prebiopsy multi-parametric MRI (mpMRI) of the prostate. Multiparametric MRI included T2-weighted, dynamic contrast-enhanced, and diffusion-weighted imaging at 3T or 1.5T with an endorectal coil. Lesions were graded based on the Prostate Imaging Reporting and Data System (PI-RADS) v2 by two experienced genitourinary radiologists. All patients who had a lesion with PI-RADS 3 or higher were offered US/MRI fusion guided biopsy (UroNav device, Invivo, Philip). All patients who received fBx had a 12-core standard template, in addition to multiple, typically 2-4 samples from each target lesion.
In the TRUS Bx group, 12 core samples were biopsied for each patient in a systematic fashion, with samples from the bilateral apex, mid gland, and base, medially and laterally.
In the biopsy specimen, the Gleason score was assigned for each systematic sample as well as each target, and the highest Gleason score determined the grade group. In some cases, multiple targets of the prostate were detected and biopsied, in which cases, the highest GS of the target was used to determine the grade group. Biopsy and surgical specimens were evaluated by experienced GU pathologists, with expertise in the pathology of the prostate.
After prostatectomy, the surgical margins of the prostate specimen were examined for histological evidence of cancer during the frozen section. After separating the seminal vesicles, the entire gland was serially sectioned into approximately 3mm slices. Alternate slices were submitted for histopathologic examination. Tissue sections of 4 μm - 6 μm thickness were made from each paraffin block, stained with hematoxylin and eosin, and reviewed. The slides were evaluated microscopically for tumor, and the Gleason score was determined with the sum of the two most predominant patterns (primary and secondary) in terms of surface area. The grade group of the tumor was then determined. For a radical prostatectomy specimen, the highest Gleason grade observed was considered the final grade.
The primary objective of the study was to evaluate the concordance between bx and prostatectomy GGG in a population of patients who received either US-MR fBx or TRUS Bx. Concordance was defined as biopsy GGG the same as the final GGG from the surgical specimen. Upgrading or downgrading was defined as any increase or decrease in the Grade group or any change in the order of primary grade and second grade when the biopsy GGG was compared to the final GGG based on surgical histopathological analysis. First descriptive statistics were performed. Continuous variables are reported as medians and Interquartile ranges and were tested by the Kruskal-Wallis test. Categorical variables are shown as frequency and proportion and were tested by the chi-square test. The rates of upgrading (RP GGG > BX GGG), downgrading (RP GGG < BX GGG), and the concordance rate (RP GGG = BX GGG) were compared between the two groups.
Statistical analyses were performed using Stata 14 (StataCorp LP, College Station, TX, USA). All tests were 2-sided with a significance level set at p < 0.05.
A total of 348 men with complete clinical data were included in this study. Descriptive characteristics of the population of the two groups are shown in Table 1. Patients who underwent US-MR fBx had a higher PSA (9.4 vs. 6.5 ng/ml, p < 0.0001), higher PSA density (0.3 vs. 0.2 ng/ml2, p < 0.0001), and a lower prostate volume (31 vs. 42 cc, p < 0.0001).
The concordance rate with final GGG was higher in the US-MR fBx group as compared to the TRUS Bx group (94 out of 129, 72.9% vs. 133 out of 219, 60.71%, p < 0.05). Upgrading from Bx to RP was less common in patients who underwent fBx (17 out of 129, 14% vs. 43 out of 219, 19.6%). Downgrading from Bx to RP was also less common in patients who underwent fBx (18 out of 129, 13.2% vs. 43 out of 219, 19.6%). Between both groups, these differences in upgrading and downgrading did not reach statistical significance.
Table 2 shows concordance rates between Bx GGG and RP GGG in the two groups. US-MRI fBx patients had a higher concordance rate across all GGG. The biopsy results showed a similar lower rate of concordance of GGG 1 (25% vs. 24.1%) and GGG 4 (10% vs. 3.6%) in the US-MR fBx group vs the TRUS Bx group, as compared to the GGG 2, 3 and 5.
The adoption of US-MR fBx in recent years has led to improved detection of clinically significant PCa [5,12-14]. However, there is a concern that, in the US-MR fBx, the target biopsy specimen obtained is from the most suspicious area within a lesion seen at MRI, which likely represents an area of the lesion with the highest Gleason score. Thus, there is concern that this method of biopsy may result in a downgrading of the PCa at the time of RP and potentially lead to treatment that is more aggressive than necessary [16-18].
In our study comparing the GGG of a standard non-fusion TRUS bx to that of the US-MR fBx, we looked specifically at the rates of GGG upgrading (RP GGG > BX GGG), downgrading (RP GGG < BX GGG), and concordance (RP GGG = BX GGG). We had anticipated seeing the US-MR fBx downgrading rate higher than that of the TRUS Bx, based on previous assumptions and data . We were surprised to see, however, that the GGG of the US-MR fBx in fact had a lower downgrading rate than that of the TRUS Bx, 14% vs. 19.6% (Table 1).
The reasoning for this may have to do with the way core samples are collected and evaluated in each biopsy technique. In the TRUS Bx, generally, 12 cores are taken from different regions of the prostate and each is analyzed individually [19,20]. In our US-MR fBx method, multiple cores are taken from a single target (average of 4.0) in different areas of the tumor itself. These are then provided together as one sample for histopathologic analysis. In this way, if there are both high-grade cores and low-grade cores from the same target, they may average out to a score more representative of the final overall post-prostatectomy GGG. Conversely, in the TRUS Bx, there may only be one core passing through the PCa, and if it passes through an area with the highest malignancy, then it will present a high-grade score for the entire tumor with no other scoring from that area to average against.
We also found it interesting that the US-MR fBx provided a lower rate of upgrading than that of the TRUS Bx, 13.2% vs. 19.6% (Table 1). This may be for the same reason as that of the downgrading. When the urologist or radiologist performing the prostate biopsy has the guidance of the MRI-US fusion images, they may have greater accuracy in sampling the suspicious areas, particularly when taking samples from the different parts of the suspicious area (average 4.0). In other words, the suspicious area might be better represented by fBx with multiple passes through the different parts of the area, as compared to the TRUS Bx with only one pass through a potential lesion. Whatever the reasoning may be, the GGG from TRUS Bx in general has historically proven to be suboptimal in comparison with final pathology [21,22], a finding which was supported by our results. Consequently, the concordance rate was higher in the US-MR fBx group than the TRUS Bx group (72.9% vs. 60.7%, p < 0.05)) across all GGG (Tables 1,2).
Another interesting finding was a difference in the characteristic of prostate size between the TRUS Bx group and the US-MR fBx group (Table 1). The prostate sizes in the TRUS Bx group were on average, about 11 ml larger than those of the US-MR fBx group. One explanation for this may have to do with the fBx group having more patients with intermediate and high GGG (49.6%) as compared to the TRUS Bx group (40.02%) (Table 1). We observed that in many patients who had localized intermediate and high-grade PCa, the prostate sizes were smaller than in the normal population. This explanation, however, would require a more dedicated investigation which has been ongoing in our institution.
The weakness of our study lies in the retrospective design and its inherent selection bias. As our cohort is derived from a large tertiary referral center, some of our patients underwent TRUS bx at outside institutions. This can create a bias in the selection criteria, as biopsies are likely done using different US and biopsy systems, and the experience and skill of the Urologists performing the biopsy vary between institutions. This limitation may be unavoidable in every retrospective analysis. Therefore, a prospective, single-institution design with larger sample size is needed to validate our results.
In conclusion, our data has shown that US-MR fBx has a higher GGG concordance with final pathology from RP compared to TRUS biopsy (72.9% vs. 60.7%, p < 0.05). This may have clinically important implications in choosing treatment options for patients with PCa. In addition, downgrading and upgrading of final PCa GGG by target biopsy are also less frequent compared to TRUS Bx (14% vs. 19.6%, 13.2% vs. 19.6%, respectively), further supporting the impression that US-MR fBx presents a significant advantage over TRUS Bx, providing a GGG that is more accurate and representative of the most dominant underlying PCa.
Subscribe to our articles alerts and stay tuned.