ABSTRACT
Objective
This study aimed to evaluate whether adding a broad set of pre-treatment [18F] fluorodeoxyglucose positron emission tomography/computed tomography (18F-FDG PET/CT) metabolic and volumetric parameters to routine clinical variables improves the prediction of overall survival (OS) in patients with gastric cancer (GC). Secondary objectives were to assess the prognostic value of blood- and spleen-normalized metabolic indices and to explore associations between PET metrics and HER2 status.
Methods
In this retrospective cohort, pre-treatment 18F-FDG PET/CT data were analyzed to extract standardized uptake value (SUV)- and volume-based PET metrics, BLR_mean, and SLR_mean. Clinical variables, pathological features, treatment details, HER2 status, and survival outcomes were obtained from institutional records. OS was calculated based on the date of initial management. Prognostic performance was evaluated using Cox models, calibration metrics, time-dependent area under the curve (AUC), and decision curve analysis (DCA). Nested models (clinical-only vs. clinical+PET) were compared to determine the incremental value.
Results
SUV- and volume-based PET metrics showed variable but directionally consistent associations with OS. Metabolic tumor volume (MTV_40) and total lesion glycolysis (TLG_40) demonstrated trends toward worse outcomes, although these effects did not consistently reach statistical significance in multivariable analyses. Blood- and spleen-normalized parameters (BLR_mean and SLR_mean) showed stronger effects in the PET-only model but became attenuated after adjustment for clinical covariates. Incorporating PET parameters into the clinical model modestly improved discrimination and yielded acceptable calibration. HER2-positive tumors exhibited higher metabolic activity; however, no significant interaction was observed between HER2 status and the prognostic effect of PET metrics. Across clinically relevant decision thresholds (10–40%), the combined Clinical+PET model achieved higher net benefit than the clinical model alone. The combined model demonstrated a higher net benefit at 12 and 24 months.
Conclusion
Pretreatment 18F-FDG PET/CT appears to provide additional prognostic information beyond routine clinical variables in GC. The inclusion of SUV-based, volumetric, and normalized metabolic parameters modestly improves risk stratification and is associated with favorable decision-analytic performance. These findings support integrating quantitative PET metrics into prognostic evaluation frameworks for patients undergoing management for GC.
INTRODUCTION
GC is still one of the leading causes of cancer-related death worldwide, despite improvements in diagnosis and treatment (1). The 2019 WHO classification emphasizes the biological diversity of gastric adenocarcinoma, distinguishing intestinal, diffuse, mixed, and other subtypes with distinct patterns of behavior and prognosis (2). This variability diminishes the predictive utility of standard clinicopathological variables and underscores the necessity for quantitative markers that more precisely represent tumor biology. [18F] fluorodeoxyglucose positron emission tomography/computed tomography (18F-FDG PET/CT) is used in selected patients to identify distant metastases and refine treatment decisions. Several studies have shown that staging 18F-FDG PET/CT can reveal otherwise unsuspected metastatic disease and modify management in patients considered for radical therapy (3, 4). These findings suggest that PET-derived metabolic information may also offer predictions that go beyond conventional anatomical staging. 18F-FDG PET/CT quantifies tumor glucose uptake through metabolic indices such as standardized uptake value maximum (SUVmax) and SUVmean. A meta-analysis demonstrated that higher pre-treatment SUVmax is associated with worse OS, although study results vary with tumor subtype and methodology (5). Cohort studies likewise report that SUV-based metrics correlate with more aggressive disease and poorer outcomes (6). Volumetric PET parameters, including MTV and TLG, may better represent total metabolic burden than SUV alone. In resectable and locally advanced disease, higher MTV or TLG consistently predicted shorter OS or recurrence-free survival and in several reports outperformed SUVmax as prognostic indicators (7-9). Several reports indicate that MTV and TLG retain prognostic value even within biologically defined subgroups, for example, in c-MET–positive GCs, suggesting that these volumetric indices add information beyond conventional staging (10). In parallel, there is increasing interest in intratumoral metabolic heterogeneity. Initial data suggest that non-uniform FDG uptake within the primary lesion, summarized by various heterogeneity indices, can predict survival independently of MTV and TLG (11). Taken together, these observations raise the possibility that combining volumetric parameters with heterogeneity measures may better reflect underlying tumor biology. HER2 status is also an important modifier of GC behavior. Although available series are relatively small, several have reported higher FDG uptake and distinct metabolic patterns in HER2-positive tumors, with signals of an association with clinical outcome (12, 13). However, evidence remains limited and heterogeneous, and is rarely integrated into multivariable survival models. Overall, while metabolic and volumetric PET parameters show prognostic value in GC, it is not yet clear how much they improve established clinical risk models or whether integrating multiple PET-derived indices enhances prediction of OS in real-world settings (5-11). Few studies have examined calibration or clinical utility using decision-analytic methods.
In this study, we evaluated the incremental prognostic value of a broad panel of 18F-FDG PET/CT parameters—including SUVmax, SUVmean, SUV corrected for lean body mass peak (SULpeak), MTV_40, and TLG_40 as well as normalized measures such as BLR_mean and SLR_mean—when added to clinical prognostic models. We also assessed their relationship with HER2 expression and analyzed discrimination, calibration, and decision-analytic performance.
MATERIALS AND METHODS
Patient Selection
This retrospective observational study included consecutive patients with biopsy-confirmed primary gastric adenocarcinoma who underwent baseline 18F-FDG PET/CT for initial staging at Göztepe Prof. Dr. Süleyman Yalçın City Hospital between April 2016 and March 2025. Clinical, imaging, and follow-up data were obtained from institutional electronic records. No formal sample size calculation was performed; all eligible patients within the study period were included.
Eligibility criteria included age ≥18 years, histologically confirmed primary gastric adenocarcinoma, and availability of a pretreatment 18F-FDG PET/CT acquired before the initiation of oncologic therapy. Patients with missing imaging data, unavailable pathology reports, or insufficient follow-up information were excluded. In total, 92 patients were initially screened. Three patients were excluded: one with missing pre-treatment PET/CT, one with non-adenocarcinoma histology, and one with incomplete baseline clinical data. The final study cohort consisted of 89 patients.
The study was approved by the Noninterventional Clinical Research Ethics Committee of Göztepe Prof. Dr. Süleyman Yalçın City Hospital (approval number: 2025/0053; date: 31.07.2025) and was conducted in accordance with the Declaration of Helsinki. Owing to the retrospective design of the study, informed consent was waived.
PET/CT Imaging Protocol
All PET/CT examinations were performed on the same integrated device using a standard institutional protocol. Patients fasted for at least 6 hours, and a weight-based dose of 18F-FDG (3.7 MBq/kg) was administered intravenously. Image acquisition started 60–70 minutes after injection.
Low-dose CT (140 kV, 40–60 mAs, 5-mm slices) was used for attenuation correction and anatomical localization. PET data were acquired from the skull base to mid-thigh and reconstructed in axial, coronal, and sagittal planes using consistent parameters throughout the study period.
Quantitative PET Metrics
Tumor segmentation was performed on the primary gastric lesion using semiautomated three-dimensional volumes of interest, and all VOIs were jointly reviewed and finalized by two nuclear medicine physicians with more than 12 years of PET/CT experience. To reduce potential bias, all quantitative PET metrics were extracted using a standardized protocol. The retrospective, single-center nature of the study, heterogeneity in treatment pathways, and missing HER2 data represent the main potential sources of bias. We sought to limit selection bias by including all eligible consecutive patients and to minimize measurement bias by using consistent PET/CT acquisition and reconstruction protocols throughout the study period. Survival bias was reduced by defining overall survival (OS) as beginning at the start of cancer management.
The following quantitative parameters were recorded:
SUVmax, SUVmean, SUVpeak, SULmax_James, SULpeak_James, SULmean_James, metabolic tumor volume at 40% of SUVmax (MTV_40) and total lesion glycolysis (TLG_40), BLR_mean = tumor SUVmean / bone marrow SUVmean, and SLR_mean = tumor SUVmean / spleen SUVmean. Tumor size on CT was measured as the largest axial diameter.
Lean body mass was estimated using the James equation.
For men: LBM = 1.10 × weight (kg) − 128 × [weight (kg)/height (m)]².
For women: LBM = 1.07 × weight (kg)− 148 × [weight (kg)/height (m)]².
SUL values were calculated by normalizing SUV to estimated lean body mass.
For bone marrow assessment, three-dimensional volumes of interest were placed within the T11–L4 vertebral bodies on PET/CT. A 75% SUVmax isocontour was used to delineate intramedullary uptake while excluding cortical bone and focal benign or metastatic lesions. After verification in the axial, coronal, and sagittal planes, the SUVmean values from the selected vertebrae were averaged and recorded as the bone marrow SUVmean.
Survival Endpoints
The primary endpoint was OS. OS was defined as the time (in months) from diagnosis or treatment initiation (OS_start in the dataset) to death from any cause or last known follow-up.
There was no loss to follow-up; censoring occurred only due to administrative end-of-study cutoffs.
Statistical Analysis
1. Descriptive Statistics
Continuous variables were described using the mean and standard deviation (SD), the median and interquartile range (IQR), and the observed minimum–maximum values. Categorical variables were summarized as counts and percentages.
2. Univariate Cox Regression
As an initial screening step, each PET/CT-derived quantitative metric (SUV/SUL measures, MTV_40, TLG_40, BLR_mean, SLR_mean, and tumor size) was entered separately into a Cox proportional hazards model. For these analyses, we estimated hazard ratios (HRs) with 95% confidence intervals (CIs) and reported the corresponding p-values.
3. Multivariable Cox Modeling
The main analysis consisted of three predefined Cox models:
a) Clinical Model
Included covariates were age, sex, and tumor differentiation (coded as 1 = well, 2 = moderate, 3 = poor/low). Lymph node metastasis was recorded as present or absent. Distant metastasis was defined as peritoneal carcinomatosis or the presence of at least one metastatic lesion in the bone, liver, lung, adrenal glands, or brain.
b) PET-Only Model
The included PET variables were SULmax_James, SULpeak_James, SULmean_James, MTV_40, TLG_40, BLR_mean, SLR_mean, and tumor size.
c) Clinical+PET Combined Model
The model was constructed by adding all PET variables to the clinical model. For each model, HR, 95% CI, p-values, and the concordance index (c-index) were obtained.
Potential confounding was addressed by prespecified multivariable Cox models including all clinically relevant covariates. Missing HER2 values (n = 19) were handled by listwise deletion for analyses involving HER2; no imputation was performed.
Model Handling and Software
HER2 status was missing for 19 patients; all other clinical and PET/CT variables had complete data.
There was no loss to follow-up; censoring occurred only due to administrative end-of-study cut-offs.
Differentiation was encoded ordinally, and sex was binarized (M = 0; W = 1).
All analyses were performed in Python 3.10 using the lifelines package (v0.30) and the CoxPHFitter function.
Survival Modelling and Model Performance
Cox proportional hazards models were fitted for three predefined predictor sets: (i) clinical variables (age, gender, tumor differentiation, lymph node metastasis, distant metastasis); (ii) PET-only variables (SULmax, SULpeak, SULmean, MTV_40, TLG_40, BLR_mean, SLR_mean, tumor size); and (iii) a combined Clinical+PET set including all of these covariates. Individual risk scores were derived from the model’s linear predictor. For each model, patients were dichotomized into low- and high-risk groups using the cohort median, and OS was compared with Kaplan–Meier curves and two-sided log-rank tests (Figure 1a–c). Model discrimination was summarized using the c-index.
Model Calibration at Multiple Time Horizons
For model calibration at 12, 36, and 60 months, predicted survival probabilities from the Clinical+PET combined Cox model were obtained by evaluating the baseline survival function at each time point and applying the individual linear predictors. Predicted probabilities were grouped into 10 deciles, and the corresponding observed survival within each decile was estimated using Kaplan–Meier curves at the same horizon. Calibration was visualized using scatter plots of observed versus predicted survival, overlaid with a LOESS-smoothed curve and the ideal 45° reference line (Figure 2a–c).
Decision-Curve Analysis
To evaluate the potential clinical utility of the prognostic models, we performed decision curve analysis (DCA) at 12 and 24 months. Model-based risks of death by time t were obtained from the Cox models, and patients censored before t were excluded from that horizon-specific analysis. Net benefit was calculated for threshold probabilities between 10% and 40% using the standard formula:
NB = [(TP / N) − (FP / N) × (pt / (1 − pt)]
Decision curves for the Clinical, PET-only, and Clinical+PET models were compared with the “treat all” and “treat none” reference strategies.
Presentation of multivariable effects
For each of the three multivariable Cox models (Clinical, PET-only, and Clinical+PET), HRs with 95% CIs were estimated and graphically summarized using forest plots.
Bootstrap-corrected calibration at 60 months
To further assess and adjust for potential optimism in model calibration, we performed a bootstrap-based out-of-bag (OOB) correction at the 60-month horizon for the Clinical+PET model. In each of 200 bootstrap replicates, a Cox model was refitted on the bootstrap sample, and 60-month survival probabilities were predicted only for OOB patients. For each patient, OOB predictions were then averaged across replicates to yield a bootstrap-corrected survival estimate. Apparent and OOB-corrected predictions were each grouped into 10 deciles, and observed survival within each decile was estimated from Kaplan–Meier curves. Calibration error was quantified using the mean absolute difference between predicted and observed probabilities (E_avg) and its 90th percentile (E90).
Distribution of predicted risk
For descriptive purposes, 60-month death risks derived from the Clinical+PET Cox model [1−S(60|X)] were plotted as a histogram to illustrate the distribution of patient-level predictions. The cohort median predicted risk was displayed as a reference line.
Risk Distribution Analysis
Risk distribution plots were generated to illustrate the spread of predicted probabilities of death among patients. For each time horizon (12, 24, 36, and 60 months), predicted survival probabilities were computed using the combined Clinical+PET Cox model by evaluating the baseline survival function at the specified time points. Individual predicted risks were defined as 1– S(t|X), sorted in ascending order, and displayed as waterfall plots, with the cohort median risk indicated by a horizontal reference line. For tertile-based Kaplan–Meier analyses, patients were stratified according to the distribution of the Clinical+PET model–estimated 60-month mortality risk using tertile cutoffs of approximately 0.70 and 0.91.
Performance Metrics
Time-dependent discrimination was assessed at 12, 24, 36, and 60 months using an inverse-probability-of-censoring-weighted estimator applied to each model’s linear predictor. Restricted mean survival time (RMST) was calculated for up to 60 months in low- and high-risk groups defined by the median risk score of the Clinical+PET model. For visual stratification of survival, the predicted 60-month risk was computed from the baseline survival function and used to generate tertile-based Kaplan–Meier curves.
Follow-Up Time Estimation (Reverse Kaplan–Meier)
Follow-up duration was estimated using the reverse Kaplan–Meier method, in which censoring and death indicators are exchanged to obtain the distribution of potential follow-up time. The median follow-up and its 95% CI were extracted from the survival function of the reversed-event process.
Exploratory ROC Analysis for HER2 Prediction
To assess the discriminatory ability of PET-derived quantitative metrics for HER2 status, we included patients with complete HER2 data. Continuous PET parameters (SUVmax, SUVpeak, SUVmean, MTV_40, TLG_40, SUL-derived metrics, BLR_mean, SLR_mean, and tumor size) were evaluated individually using Receiver Operating Characteristic (ROC) analysis. Area under the curve (AUC) values and 95% CIs were estimated using bootstrap resampling (n = 1000 iterations). Given the limited number of HER2-positive cases (n = 12), this analysis was considered exploratory.
Study Design and Reporting
This observational study was conducted in accordance with the STROBE reporting principles. The design, data collection procedures, variables, and analytical methods were structured to meet these guidelines, and the completed STROBE checklist is included among the submission files.
RESULTS
Patient Characteristics
The study included 89 patients. The mean age was 64.2 ± 12.6 years, and 75.3% of the cohort were male. Lymph node metastasis was present in 67.4%, distant metastasis in 30.3%, and peritoneal carcinomatosis in 12.4% of patients. Detailed distributions of continuous and categorical variables are provided in Table 1. During follow-up, 64 deaths occurred, corresponding to an event rate of 71.9%.
Size refers to the longest diameter of the primary tumor, as measured on CT. SUV-based metrics were extracted from baseline 18F-FDG PET/CT scans acquired before treatment. MTV and TLG were calculated using a 40% SUVmax threshold. Continuous variables are reported as mean ± SD, median with IQR, and minimum–maximum values.
Univariate Analyses
In univariable Cox regression analyses, SULmean_James (HR 1.075, p = 0.048) was significantly associated with worse OS, while SULpeak_James (p = 0.053) and TLG_40 (p = 0.057) showed borderline significance. Among the background-normalized ratios, SLR_mean suggested the most pronounced increase in risk (HR 2.97; 95% CI, 0.67–13.12), whereas BLR_mean indicated a potential protective effect (HR 0.64); however, both estimates were statistically non-significant due to wide CIs (p = 0.150 and p = 0.304, respectively). Conventional SUV parameters and the remaining volumetric metric (MTV_40) demonstrated weaker and generally non-significant trends (Table 2).
Univariable Cox proportional hazards models were used to assess the relationships between quantitative baseline PET/CT parameters and OS. HRs are presented with 95% CIs and Wald p-values.
Each parameter was analyzed in a separate Cox model; n_used denotes the number of patients with available data for that variable (total analyzed cohort, n = 89). MTV_40 and TLG_40 were calculated using a 40% SUVmax threshold.
Multivariable Analyses
1. Clinical Model
In the clinical model, two variables were independently associated with OS:
Age (HR = 1.04; 95% CI, 1.01–1.06; p = 0.006)
Presence of distant metastasis (HR = 2.16; 95% CI, 1.14–4.09; p = 0.018)
The model’s discriminative performance was c-index = 0.669.
2. PET-Only Model
In the PET-only model, two background-normalized ratios stood out as independent predictors. Higher BLR_mean was associated with a lower risk of mortality (HR 0.26; p = 0.010), whereas SLR_mean showed the opposite pattern, with a markedly increased risk (HR 6.82; p = 0.032). By contrast, conventional SUV/SUL measures and the volumetric parameters (MTV_40 and TLG_40) did not reach independent significance.
3. Clinical + PET Combined Model
Incorporating PET-derived variables into the clinical model modestly improved prognostic performance. Within the combined model, MTV_40 (HR ≈ 1.02; p = 0.068) and BLR_mean (HR ≈ 0.32; p = 0.079) showed borderline associations with OS. In contrast, in the PET-only model, BLR_mean (p = 0.010) and SLR_mean (p = 0.032) were statistically significant predictors. The Clinical+PET combined model achieved the highest discrimination among the three predefined models (c-index = 0.728). Full results are presented in Table 3.
Multivariable Cox models were fitted using three predefined predictor sets: a clinical model (age, gender, tumour differentiation, lymph-node metastasis, any distant metastasis), a PET-only model (SULmax, SULpeak, SULmean, MTV_40, TLG_40, BLR_mean, SLR_mean, tumour size), and a combined Clinical + PET model including all variables from both sets. For each covariate, HRs with 95% CIs and Wald p-values are reported.
Overall Interpretation
Among the three prespecified models, the combined Clinical+PET model provided the highest prognostic performance, whereas the PET-only and clinical models demonstrated more limited but complementary discriminatory performance. Kaplan–Meier curves for these risk models are presented in Figure 1a–c.
Survival curves were compared using the log-rank test. Time shown in months.
Calibration of the Clinical+PET model showed good overall agreement between predicted and observed survival across all three time horizons (12, 36, and 60 months).
At 12 months, predicted probabilities aligned closely with observed outcomes, with minimal systematic deviation (Figure 2a).
At 36 months, the model maintained broadly acceptable calibration, although some variability was noted in mid-range risk strata (Figure 2b).
At 60 months, calibration remained acceptable, but the model tended to modestly overestimate survival in higher predicted-risk groups, consistent with a mild upward drift of the LOESS curve relative to the ideal 45° line (Figure 2c).
Each point represents one decile of predicted survival probabilities, averaged across patients within that decile. Observed survival was estimated from Kaplan–Meier curves at the corresponding horizons (12, 36, or 60 months). The dashed diagonal line denotes perfect calibration, while the solid line shows a LOESS-smoothed estimate of the observed–predicted relationship. The model showed close agreement between predicted and observed outcomes at 12 months, moderate variability at 36 months, and mild overestimation of survival probabilities in higher predicted-probability ranges at 60 months.
DCA indicated that the Clinical+PET model performed best at 12 and 24 months, offering greater net benefit than either the clinical-only or the PET-only model within the accepted 0.10–0.40 clinical threshold range.
The combined model demonstrated a superior net-benefit profile at 12 months, particularly for threshold probabilities between approximately 0.15 and 0.35 (Figure 3a).
At 24 months, all three models performed similarly, but the combined model again yielded a slightly higher net benefit across most thresholds (Figure 3b).
In multivariable Cox regression analysis (Figure 4), age and distant metastasis remained independent clinical predictors of OS. Among PET-derived parameters, higher BLR_mean was associated with improved survival, whereas higher SLR_mean indicated increased mortality risk. In the combined Clinical+PET model, clinical factors retained their prognostic significance, whereas PET metrics did not substantially alter effect directions but improved overall model discrimination (c-index ≈0.73).
At 60 months, apparent calibration of the Clinical+PET model showed a mean absolute calibration error (E_avg) of 0.129 and a 90th-percentile error (E90) of 0.279. Bootstrap OOB–corrected estimates were very similar (E_avg = 0.136, E90 = 0.294), indicating only modest optimism and broadly preserved calibration after correction (Figures 5a–b).
The distribution of predicted 60-month mortality risk from the Clinical+PET model was right-skewed, indicating that most patients clustered within the lower predicted risk range. Predicted risks had a median of 0.81 (range, 0.33–1.00). This pattern aligns with the overall high-risk profile of the cohort (Figure 6).
Each bar represents the number of patients within a given probability interval. The dashed vertical line indicates the cohort median predicted risk (0.81). The distribution displays a right-shifted pattern, consistent with the high-risk profile of the cohort.
The risk plots showed that predicted mortality varied widely across patients. At 12, 24, 36, and 60 months, the predicted risk did not follow a symmetric distribution; instead, most patients had moderate risk values, with only a small fraction in the high-risk tail. Median predicted risk increased as the horizon extended, in line with the accumulation of events over time. The skew was most marked at 60 months, reflecting the greater long-term mortality burden in this cohort (Figure 7).
In the time-dependent AUC analysis, the Clinical+PET model maintained AUC values between 0.85 and 0.89 at all evaluated time points, and these values were higher than those of the Clinical-only and PET-only models. Taken together, a multidimensional comparison of model performance across discrimination, calibration, and clinical utility metrics is presented in Figure 8. In the RMST analysis, low- and high-risk groups differed clearly, with mean restricted survival times of 33.9 months and 17.7 months, respectively. Kaplan–Meier curves based on tertiles of the predicted 60-month risk showed a stepwise separation, consistent with a strong prognostic signal of the model.
Median follow-up, calculated using the reverse Kaplan–Meier method, was 60.0 months (95% CI, 42.1–79.7 months) (Figure 9).
The reverse Kaplan–Meier method was used to estimate the distribution of potential follow-up time, treating deaths as censored observations and censoring events as failures. The vertical dashed line denotes the median follow-up of 60.0 months (95% CI, 42.1–79.7 months). Shaded areas represent the 95% confidence bands.
HER2 Status Prediction (Exploratory Analysis)
A subset of 70 patients with available HER2 data (HER2–: n=58; HER2+: n=12) was used to explore whether PET-derived metabolic and volumetric parameters could discriminate HER2 status. Overall discrimination was limited, with AUC values ranging from 0.536 to 0.609 across SUV/SUL- and volume-based metrics. The highest (yet still modest) performance was observed for SUVmean (AUC 0.609; 95% CI 0.406–0.796) and SULmean_James (AUC 0.606; 95% CI 0.404–0.787). BLR_mean and SLR_mean showed no discriminatory ability (AUC < 0.36). These results should be interpreted as exploratory due to the small number of HER2-positive cases (Table 4).
We conducted receiver operating characteristic (ROC) analysis for HER2 positivity based on individual PET-derived metrics (n=70; HER2–=58, HER2+=12). Values represent AUCs with 95% CIs. Given the small number of HER2-positive patients, results should be interpreted cautiously.
DISCUSSION
In this study, we evaluated whether a broad panel of 18F-FDG PET/CT metabolic and volumetric parameters improves the prediction of OS beyond established clinical variables in patients with GC. The need for more refined prognostication is well recognized. Gastric cancer (GC) remains a substantial global health concern (1), and the biological heterogeneity described in the 2019 WHO classification continues to hinder consistent risk stratification (2). These differences in tumor behavior encourage the search for quantitative biomarkers that capture tumor aggressiveness and complement routine clinicopathological features (14).
The potential value of 18F-FDG PET/CT in this setting is supported by evidence showing that PET can reveal occult metastases and influence management in a substantial subset of patients considered for radical treatment (3, 4, 15). In addition to staging, metabolic data provide prognostic information that may indicate the underlying biology of the tumor. A number of prior studies have found that SUV-based metrics, especially SUVmax, correlate with worse clinical outcomes (5, 6). Because these values depend on technical factors such as uptake timing and image reconstruction (7-9,16-18), they offer only a partial view of tumor metabolism and may vary across institutions.
Volumetric parameters such as MTV and TLG, which reflect the overall metabolic burden, have shown more consistent prognostic value (19). Studies in stage III and locally advanced GC frequently report that higher MTV or TLG is associated with inferior survival or an increased likelihood of early recurrence (7-10). Our findings are consistent with this literature. MTV_40 and TLG_40 showed trends toward contributing to the model, although these effects did not consistently reach statistical significance. Their inclusion was associated with a modest improvement in discrimination, while calibration remained acceptable. These results are consistent with earlier observations that volumetric tumor burden remains prognostically relevant even in biologically distinct subsets, such as c-MET–positive tumors (10). The link between metabolic burden and c-MET activation supports a biological rationale for the observed associations.
Heterogeneity in FDG uptake has also emerged as a feature associated with aggressive tumor biology. Early work demonstrated that intratumor metabolic heterogeneity predicts survival independent of conventional metabolic or volumetric indices (11). Additional prognostic evidence has also been reported from interim and restaging PET studies in metastatic or recurrent gastric cancer (20, 21). More recent analyses confirmed that heterogeneity-based metrics provide important prognostic information, likely reflecting variations in cell density, hypoxia, or necrosis (22). In our cohort, although voxel-level heterogeneity metrics were not computed, BLR_mean emerged as a statistically significant PET-derived predictor and may act as a practical surrogate for metabolic variability, albeit within the limitations of our sample size.
Histological diversity remains an important determinant of FDG avidity. Diffuse-type and mucinous GCs typically show lower FDG uptake than intestinal-type tumors, which can complicate prognostic interpretation (23). Such histological differences need to be considered when PET-derived biomarkers are incorporated into survival models. In our cohort, PET-based parameters showed directionally consistent associations with outcome after adjustment for clinicopathological variables, although these associations were attenuated. This suggests that metabolic tumor burden may capture risk across histological subtypes.
The link between FDG uptake and HER2 status has also attracted attention. Several reports have described higher SUV values in HER2-positive GCs (12, 13) or have proposed distinct metabolic patterns according to HER2 expression (24-27), although these studies are heterogeneous and generally involve modest sample sizes. In the present analysis, PET-derived metabolic parameters showed prognostic trends that were not fully explained by HER2 status. However, the number of HER2-positive cases was limited, and these findings should be interpreted with caution.
More recently, developments in image analysis have expanded the range of PET-based prognostic tools. Radiomics approaches, in particular, have been used to explore associations with survival, lymphovascular invasion, and treatment response (28-33). Machine learning approaches, including deep learning models, have demonstrated encouraging performance in estimating survival or immunotherapy response (31). Although these methods were outside the scope of our study, our findings may serve as a foundation for future multimodal prognostic models that incorporate handcrafted and derived PET features.
PET has also been used extensively to assess response during neoadjuvant therapy. Several reports have shown that early metabolic changes after the first cycles of chemotherapy are associated with histopathological tumor regression and longer-term outcomes (34-37), although some subgroups appear to benefit less from this approach (35). Our study centered on pretreatment scans rather than treatment response, but early metabolic change remains a promising area for future research, particularly when evaluated alongside volumetric PET measures.
Radiomics-based and advanced quantitative analyses have also been used to predict metastatic risk. Baseline PET radiomics features were associated with subsequent distant metastases in gastro-esophageal cancers in a recent prospective investigation (38). Our results suggest that PET-derived signatures may help to identify patients at higher risk who could be candidates for more intensive systemic therapy or closer surveillance.
Taken together with previous work, this supports a complementary, rather than standalone, role for PET-based metrics. In our dataset, adding PET parameters to clinical factors improved model discrimination and increased net benefit across relevant decision thresholds while maintaining acceptable calibration. These observations argue for incorporating PET information into prognostic models, especially when treatment decisions depend on estimated risk.
Study Limitations
It was a retrospective, single-center study, which may limit the generalizability of the findings. Diffuse-type tumors with intrinsically low FDG uptake may have attenuated the apparent prognostic value of metabolic parameters. We did not investigate texture-based radiomics features, which could provide complementary information but would require robust standardization before clinical use. Treatment heterogeneity, particularly with respect to HER2-targeted regimens, also restricted the depth of subgroup analyses, and HER2 status was unavailable in a subset of patients. However, all PET/CT studies were acquired and processed using uniform protocols and a standardized segmentation workflow, which likely reduced measurement variability and supported the internal consistency of the results.
CONCLUSION
Our results indicate that 18F-FDG PET/CT–derived metabolic and volumetric parameters offer complementary prognostic information beyond routine clinical factors in GC. The consistent associations observed for MTV_40, TLG_40, SULpeak, and BLR_mean support their use as practical imaging biomarkers that can strengthen survival models. Integrating these PET-derived measures with clinical variables may improve risk stratification and support more informed therapeutic decision-making in selected patients.


