Predicting the probability of complications during prostatectomy in pa-tients with prostate cancer using machine learning methods

Cover Page

Cite item

Abstract

Objective. To determine the probabilities of predicting possible complications after surgery in patients with the diagnosis of prostate cancer using artificial intelligence methods.

Materials and methods. Case histories of 701 patients who underwent prostatectomy were analyzed in the study. The anamnesis, findings of clinical, laboratory and instrumental study, as well as objective data of clinical observations were evaluated. The average age was 64.72. On the basis of the set of examination results, patients were selected according to the following inclusion criteria: prostate cancer patients without confirmed metastases with disease stage from T1N0M0 to T3N0M0; absence of previous and concomitant special treatment (immunotherapy or targeted therapy); informed consent to the surgery. Logistic regression, a binary classifier using a sigmoidal activation function on linear combinations of features, was used as a machine learning model.

Results. It was determined that the logistic regression model based on selected parameters (prostate volume, pain syndrome, disease duration), predicts the probability of complications quite well (TPR = 1). The overall accuracy of the model is: Accuracy = 0.98. At the same time, it can be noticed from the agreement matrix that the trained model plays it safe and classifies some cases without complications incorrectly in 5.3 % (FNR = 0.053). However, the model never made an error and did not classify cases with a high risk of complications as those in which such a possibility was unlikely.

Conclusions. The results obtained show that on the basis of just three parameters (prostate volume, pain syndrome, duration of the disease), it is possible to build a fairly good predictive model of the probability of complications after prostatectomy based on such machine learning method as logistic regression.

Full Text

Introduction

The incidence of prostate cancer (PC) has been rapidly increasing over the last decade in Russia. PC is on the 4th place (6.9 % of tumors of all localizations) after lung cancer, gastric cancer and skin tumors in the structure of malignant neoplasm morbidity among males [1–4]. The number of patients with localized forms of prostate cancer has increased significantly after the implementation of screening programs using prostate specific antigen (PSA) testing [5–7]. A recurrence of PC occurs among 10–30 % of patients after surgical interventions. PC is determined by an increase in PSA level values in the early stages [8–11]. Improvement of the prostatectomy technique proceeds accordingly to the evolution of the study of the anatomy of this area, more accurate understanding of the peculiarities of the location and structure of the fascial layers and functionally important anatomical structures [12; 13]. Due to the active development of AI, it is possible to create an aid system for making medical decisions on predicting the occurrence of complications of various diseases, including PC. Currently, clinical decision support systems for physician based on retrospective analysis of outpatient charts and clinical history are already being developed and implemented; real-time systems for ICU patients that allow to warn the medical personnel about the onset of critical conditions; wearable systems for monitoring and subsequent retrospective analysis of anamnesis data.

One of the ways of improving the outcomes of post-prostatectomy PC treatment is to identify and predict the postoperative survival rate of patients and the rate of complications at an early stage by using gradient-boosting methods, which will undoubtedly be able to greatly simplify the construction and strategy of treatment.

The aim of the study is to determine the possibilities of predicting the probability of complications after surgical intervention among patients diagnosed with PC using AI methods.

Materials and methods

The study analyzed data from the clinical histories of 701 patients who had a prostatectomy. The anamnesis, data of the clinical laboratory and instrumental methods of research, as well as objective data of clinical observations were conducted. The average age was 64,72 y. All included in the study patients received a comprehensive examination according to clinical guidelines for diagnosis and treatment of prostate cancer patients. Morphologic examinations of the obtained material (after surgical treatment) was conducted according to the standard technology. The slices colored by hematoxylin and eosin were used in the observational morphological analysis to determine the histological type of the tumor, the degree of differentiation, the severity of secondary changes, and the prevalence of the tumor process according to the WHO classification. eosin were used to determine the histological type of tumor, the degree of differentiation, the intensity of secondary changes and the prevalence of the tumor process according to the WHO classification. Patients were selected according to a set of examination results. They met the following inclusion criteria: cancer patients without confirmed metastases with the disease stage from T1N0M0 to T3N0M0; absence of previous and concomitant special treatment (immunotherapy or targeted therapy); informed consent to undergoing surgical intervention and participation in the study. The exclusion criteria were: PC patients with confirmed metastases, previous and concomitant special treatment, and also the presence of exacerbations of chronic diseases. During clinical examination, PSA levels were determined to range from 3.98 to 30.49 ng/mL; the Glisson number was from 3 to 7, and the prostate tumor size ranged from 33.04 to 143.88 cm3.

Logistic regression is a binary classifier that uses a sigmoid activation function on linear combinations of features. It was used as a machine learning model. This machine learning method is the simplest classifier that still shows reasonably good results for certain tasks. At the same time, it allows us to find out the presence of linearly dependent parameters of the dataset.

The following metrics were used here:

Accuracy=TP+TNTP+TN+FP+FN

An approval matrix in the form of:

TPRFNRFPRTNR,

where

TPR=TPTP+FP;   FPR=FPTP+FP;TNR=TNTN+FN;   FNR=FNTN+FN;

TPR is the share of patients who had a complication and the model predicted the complication, out of all patients who had a predicted complication; FPR is the share of patients who did not have a complication, but the model predicted a complication, out of all patients who had a predicted complication; FNR is the share of patients who had complications but the model did not predict a it, out of all patients who had a predicted absence of complications; TNR is the share of patients who did not have a complication and the model predicted the absence of a complication, out of all patients who had a predicted absence of a complication; TP is the amount of patients who had a complication and the model predicted the complication; FP is the amount of patients who did not have a complication but the model predicted a complication; FN is the amount of patients who had complications but the model did not predict a complication; TN is the amount of patients who did not have a complication and the model predicted the absence of a complication.

Permission for conducting this study was reflected by the Local Ethical Committee (LEC) of the V.I. Razumovsky Saratov State Medical University (LEC protocol No. 2 of 16.09.2023). The study was conducted in the presence of voluntary informed consent of patients in accordance with the declaration of compliance with international as well as Russian ethical principles and standards (excerpt from Minutes No. 19 of the Bioethics Committee of 26th October, 2018). The study was conducted in accordance with the requirements of the World Medical Association Declaration of Helsinki (revised in 2013).

Results and discussion

In addition to TNM staged diagnoses (at the time of hospitalization and after histological confirmation), the collected data set contained the following parameters
(I – range of values, m – average, s – standard deviation), shown in Table 1.

 

Table 1

Parameters of the studied dataset

Name of parameter

Value range

Code

Age, years

I = [50…80]

m = 64.73

s = 8.14

AGE

Duration of disease, months

I = [7…120]

m = 26.87

s = 19.08

DD

PSA level before surgery, ng/mL'

I = [3.98...30.49]

m = 17.21

s = 7.74

PSABS

TNM Glisson score for surgery

I = [3.00…7.00]

m = 4.90

s = 1.42

GLISSONFS

Name of parameter

Value range

Code

 

 

 

Prostate ultrasound at the time of hospitalization, cm

I = [3.00…5.89]

m = 4.28

s = 0.71

US1

Prostate ultrasound after surgery, cm

I = [2.91…8.78]

m = 4.23

s = 0.83

US2

Prostate ultrasound at the time of discharge, cm

I = [2.89…9.70]

m = 4.25

s = 0.86

US3

Prostate volume, cm3

I = [25.90…180.20]

m = 87.84

s = 32.05

PV

Was there residual urine

Yes/No

RU

Infected urine before surgery

(All patients had a value of “No”.

The parameter was excluded from the study)

Yes/No

 

Comorbidity

Yes/No

COMORB

Coexisting diseases of the cardiovascular system

Yes/No

CCVD

Coexisting gastrointestinal diseases

Yes/No

GIT

Coexisting diseases of the respiratory system

Yes/No

RS

Surgical history

Yes/No

SH

Surgery type (patients underwent the following surgeries depending on the stage of the tumor process: posterior radical prostatectomy; laparoscopic posterior radical prostatectomy; radical perineal prostatectomy)

Posterior prostatectomy

Laparoscopic prostatectomy

Perineal prostatectomy

SURT

TNM Glisson score after surgery

I = [3.00…10.00]

m = 6.45

s = 2.17

GLISSONAS

Diagnostic concordance according to the Glisson scale

Yes/No

GLISSONCON

Impurity of blood in urine after surgery

Yes/No

BLOODURINE

Duration of hospitalization after surgery, days

I = [7.00…41.00]

m = 19.69

s = 8.34

HOSPIT

Discharged with a catheter

Yes/No

CATHETER

Blood loss

Yes/No

BLOODL

Demand for blood transfusion

Yes/No

TRANSF

Interoperative complications

Yes/No

INTEROP

Postoperative complications

Yes/No

POSTOP

Complications which are not directly related to the surgery

Yes/No

COMPLIC

Sluggish urine stream before surgery

Yes/No

SLUGSTREAM

Severe pain syndrome

Yes/No

PAINSYN

Nocturia

Yes/No

NOCT

 

Figure 1 shows the percentage of patients according to the type of surgery, figure 2 shows the distribution of patients according to age.

 

Fig. 1. Percentage of patients by surgery type

 

According to the data of Fig. 1, the information set is unbalanced by the type of performed operation. The majority of patients (58.6 %) had a retropubic prostatectomy. Laparoscopic prostatectomy was conducted for 26.2 % of patients and perineal prostatectomy for 15.1 %. At the same time, the amount of patients with and without complications was approximately the same, as can be seen from Fig. 2.

 

Fig. 2. Patients distribution by presence and absence of complications: 0 – there were no complications, 1 – there were complications

 

For further study, parameters with values that were either unique or the same for all patients were removed. As a result, the following parameters remained: “AGE” (age of the patients), ‘DD’ (duration of disease (in months)), ‘TNM.T’ (tumor size according to TNM classification), ”TNM. N“ (stages with lymph node involvement according to TNM classification), ‘PSABS (preoperative PSA level, ng/mL’)”, “GLISSONFS” (TNM Glisson score for surgery), “US1” (prostate ultrasound at the time of hospitalization, cm), “US2” (prostate ultrasound after surgery, cm), “US3” (prostate ultrasound at the time of discharge, cm), “PV” (prostate volume, cm3), ‘RU’ (was there residual urine), ‘CCVD’ (coexisting diseases of the cardiovascular system), ‘GIT’ (coexisting gastrointestinal diseases), ‘RS’ (coexisting diseases of the respiratory system), ‘SH’ (surgery history), ‘SURT’ (surgery type (patients underwent the following surgeries depending on the stage of the tumor process: posterior radical prostatectomy; laparoscopic posterior radical prostatectomy; radical perineal prostatectomy), “GLISSONAS” (TNM Glisson score after surgery), “gTNM. T” (histologic verification of tumor according to TNM classification), ‘GLISSONCON’ (diagnostic concordance according to the Glisson scale), ‘BLOODURINE’ (impurity of blood in urine after surgery), “CATHETER” (discharged with a catheter), ‘BLOODL’ (blood loss), ‘TRANSF’ (demand for blood transfusion), ‘PAINSYN’ (severe pain syndrome), ‘NOCT’ (nocturia). Target variable for predicting “POSTOP” (postoperative complications).

Logistic regression was used to identify parameters that were linearly dependent from the others. The calculation results of the significance of the remaining linearly independent parameters are summarized in Table 2.

 

Table 2

Calculation results of the importance of the remaining independent parameters

Model

Logit

Method

MLE

Dependent Variable:

AS

Pseudo R-squared:

0.853

Date:

2024-03-29 20:26

AIC:

186.4373

No. Observations:

701

BIC:

291.1450

Df Model:

22

Log-Likelihood:

-70.219

Df Residuals:

678

LL-Null:

-478.59

Converged:

1.0000

LLR p-value:

1.6073e-158

No. Iterations:

11.0000

Scale:

1.0000

                 Coef.      Std.Err.      z      P > |z|      [0.025   0.975]

AGE

-0.0220

0.0270

-0.8158

0.4146

-0.0750

0.0309

DD

-0.0204

0.0091

-2.2558

0.0241

-0.0382

-0.0027

TNM.T

-0.0465

0.6236

-0.0745

0.9406

-1.2688

1.1759

TNM.N

0.0198

1.9323

0.0102

0.9918

-3.7674

3.8069

PSABS

0.0146

0.0320

0.4565

0.6480

-0.0480

0.0772

GLISSONFS

-0.1571

0.1754

-0.8955

0.3705

-0.5009

0.1867

US1

-0.0702

0.3461

-0.2028

0.8393

-0.7486

0.6082

US2

-0.4671

0.2545

-1.8349

0.0665

-0.9659

0.0318

US3

-0.0195

0.2300

-0.0850

0.9323

-0.4704

0.4313

PV

-0.0148

0.0073

-2.0188

0.0435

-0.0292

-0.0004

RU

0.0001

0.0329

0.0018

0.9985

0.0645

0.0646

CCVD

-1.0520

1.1430

-0.9204

0.3574

-3.2923

1.1883

GIT

-0.6683

0.5550

-1.2041

0.2286

-1.7560

0.4195

RS

1.1087

1.5938

0.6956

0.4866

-2.0150

4.2324

SH

1.0268

0.9198

1.1163

0.2643

-0.7760

2.8297

SURT

0.3699

0.4353

0.8499

0.3954

-0.4832

1.2231

GLISSONAS

-0.0891

0.1207

-0.7381

0.4605

-0.3256

0.1475

gTNM.T

1.1078

0.8687

1.2752

0.2022

-0.5948

2.8104

GLISSONCON

-0.0751

0.5762

-0.1303

0.8963

-1.2044

1.0542

BLOODURINE

0.0498

1.4686

0.0339

0.9730

-2.8286

2.9281

CATHETER

-0.9088

0.7519

-1.2087

0.2268

-2.3824

0.5649

BLOODL

-0.0786

1.4563

-0.0539

0.9570

-2.9329

2.7758

PAINSYN

10.4449

1.5913

6.5636

0.0000

7.3259

13.5638

          

 

As we can see from the data presented in Table 2, the most important parameters determining the likelihood of complications are prostate volume (PV, p = 0.0435), pain syndrome (PAINSYN, p = 0.0000), and disease duration (DD, p = 0.0241).

Then, logistic regression was trained on these parameters to determine the probability of complications.

The original data set was divided in the proportion of 70 %/30 % for training and metrics calculation such that the distributions of the target variable (AS) were statistically indistinguishable in the training and variation metrics.

The Accuracy metric was 0.98 as a result of testing the trained model on the validation sample. The concordance matrix is shown in Fig. 3. As it can be seen, the share of patients who had complications and among patients to whom the model predicted complications was TPR = 1. The model never made an error and did not categorize patients with complications to patients without complications (FPR = 0). In this case, the model “reinsured” and it predicted the occurrence of complications for 5.3 % of patients, although they did not get a complication (FNR = 0.053 and TNR = 0.95).

 

Fig. 3. Approval matrix

 

It is necessary to mention that the certificate of state registration of computer programs “System of prediction of complications prediction during prostatectomy for prostate cancer” (No. 2024613673)1 has also been obtained to date.

Conclusions

As can be seen from the obtained metrics, the logistic regression model predicts the probability of complications reasonably well (TPR = 1) on the selected parameters (prostate volume (PV), pain syndrome (PAINSYN), duration of disease (DD)). The overall accuracy of the model is 0.98. However, as can be seen from the concordance matrix, the model “reinsures” and classifies a part of cases without complications incorrectly. Thus, 5.3 % (FNR = 0.053) were misclassified as cases with a high likelihood of complications. At the same time, the model never made an error in categorizing cases in which there was a high probability of complications to cases where such a possibility was low.

Thus, the obtained results show that on the basis of only three parameters (prostate volume (PV), pain syndrome (PAINSYN), duration of disease (DD)), it is possible to build a reasonably good predictive model of the probability of complications after prostatectomy based on such a machine learning method as logistic regression. If the model metrics need to be improved, further the patient sample can be increased and the model can be trained using more sophisticated machine learning and AI methods.

×

About the authors

M. A. Polidanov

University «Reaviz»; Medical University «Reaviz»

Author for correspondence.
Email: maksim.polidanoff@yandex.ru
ORCID iD: 0000-0001-7538-7412

Research Department Specialist, Assistant of the Department of Biomedical Disciplines, Postgraduate Student of the Department of Surgical Diseases

Russian Federation, Saint Petersburg; Saratov

M. A. Barulina

Perm State National Research University; Institute of Problems of Precision Mechanics and Control of RAS

Email: maksim.polidanoff@yandex.ru
ORCID iD: 0000-0003-3867-648X

DSc (Physics and Mathematics), Director of the Institute of Physics and Mathematics, Head of the Laboratory «Analysis and Synthesis of Dynamic Systems in Precision Mechanics», Chief Researcher

Russian Federation, Perm; Saratov

V. S. Marchenko

Saratov State Medical University named after V.I. Razumovsky

Email: maksim.polidanoff@yandex.ru
ORCID iD: 0009-0006-8652-5298

resident of the Department of Urology

Russian Federation, Saratov

K. A. Volkov

Saratov State Medical University named after V.I. Razumovsky

Email: maksim.polidanoff@yandex.ru
ORCID iD: 0000-0002-3803-2644

2nd-year student of the Medical Faculty

Russian Federation, Saratov

A. P. Dyagel

Saratov State Medical University named after V.I. Razumovsky

Email: maksim.polidanoff@yandex.ru
ORCID iD: 0009-0004-5983-2116

2nd-year student of the Medical Faculty

Russian Federation, Saratov

N. A. Luzhnov

Samara State Medical University

Email: maksim.polidanoff@yandex.ru
ORCID iD: 0009-0008-0628-4389

5th-year student of the Institute of Pediatrics

Russian Federation, Samara

V. N. Kudashkin

Samara State Medical University

Email: maksim.polidanoff@yandex.ru
ORCID iD: 0000-0001-9099-3517

6th-year student of the Institute of Pediatrics

Russian Federation, Samara

N. V. Kolpakova

Saratov State Medical University named after V.I. Razumovsky

Email: maksim.polidanoff@yandex.ru
ORCID iD: 0009-0006-4837-584X

6th-year student of the Medical Faculty

Russian Federation, Saratov

References

  1. Sekhoacha M., Riet K., Motloung P., Gumenku L., Adegoke A., Mashele S. Prostate Cancer Review: Genetics, Diagnosis, Treatment Options, and Alter-native Approaches. Molecules. 2022; 27 (17): 5730.
  2. Алипов В.В., Тахмезов А.Э., Полида-нов М.А., Мусаелян А.Г., Кондрашкин И.Е., Волков К.А., Алипов А.И. Улучшение результатов лечения и диагностики послеоперационных осложнений в абдоминальной хирургии с применением многофункционально-го устройства. Медицинская наука и образование Урала. 2023; 24 (1–113): 67–71 / Alipov V.V., Takhmezov A.E., Polidanov M.A., Musaelyan A.G., Kondrashkin I.E., Volkov K.A., Alipov A.I. Improvement of the results of treatment and diagnosis of postoperative complications in abdominal surgery with the use of multifunctional device. Medi-cal Science and Education of the Urals 2023; 24 (1–113): 67–71 (in Russian).
  3. Wasim S., Lee S.Y., Kim J. Complexities of Prostate Cancer. Int J Mol Sci. 2022; 23 (22): 142–157.
  4. Desai K., McManus J.M., Sharifi N. Hormonal Therapy for Prostate Cancer. Endocr Rev. 2021; 42 (3): 354–373.
  5. Achard V., Putora P.M., Omlin A., Zilli T., Fischer S. Metastatic Prostate Cancer: Treatment Options. Oncology. 2022; 100 (1): 48–59.
  6. Williams I.S., McVey A., Perera S., O'Bri-en J.S., Kostos L., Chen K., Siva S., Azad A.A., Murphy D.G., Kasivisvanathan V., Lawrentschuk N., Frydenberg M. Mod-ern paradigms for prostate cancer detection and management. Med J Aust. 2022; 217 (8): 424–433.
  7. Rizzo A., Santoni M., Mollica V., Fiorentino M., Brandi G., Massari F. Microbiota and prostate cancer. Semin Cancer Biol. 2022; 86: 1058–1065.
  8. Eifler J.B., Feng Z., Lin B.M., Partin M.T., Humphreys E.B., Han M., Epstein J.I., Walsh P.C., Trock B.J., Partin A.W. An updated prostate cancer staging nomogram (Partin tables) based on cases from 2006 to 2011. BJU Int. 2023; 111 (1): 22–29.
  9. Пушкарь Д.Ю., Раснер П.И. Диагностика и лечение локализованного рака предстательной железы. М.: МЕД-прессинформ 2008: 320 / Pushkar D.Y., Rasner P.I. Diagnostics and treatment of localized prostate cancer. Moscow: MEDpress-Inform 2008: 320 (in Russian).
  10. Велиев Е.И., Голубцова Е.Н., Томилов А.А. Клинический случай: хирургическое лечение прогрессирующего рака предстательной железы. Онкоурология. 2014; 3: 95–100 / Veliev E.I., Golubtsova E.N., Tomilov A.A. Surgical treatment for progressive prostate cancer: A clinical case. Onkourologiya 2014; 3: 95–100 (in Russian).
  11. Велиев Е.И., Томилов А.А., Богда-нов А.Б. Спасительная лимфаденэктомия у пациентов с подтвержденным ПЭТ/КТ олигометастатическим рецидивом рака предстательной железы. Онкоурология. 2018; 4: 79–86 / Veliev E.I., Tomilov A.A., Bogdanov A.B. Salvage lymphadenectomy in patients with PET/CT-confirmed oligometastatic recurrence of prostate cancer. Oncourology 2018; 4: 79–86 (in Russian).
  12. Евсюкова О.И., Черняев В.А., Халмурзаев О.А., Хафизов К.А., Хачатурян А.В., Тхакохов М.М., Матвеев В.Б. Оценка безопасности и целесобразности сальважной лимфаденэктомии у пациентов с лимфогенными метастазами рака предстательной железы после радикального лечения. Онкоурология. 2017; 4: 64–69 / Evsyukova O.I., Chernyaev V.A., Khalmurzaev O.A., Khafizov K.A., Khachaturyan A.V., Tkhakokhov M.M., Matveev V.B. Evaluation of safety and feasibility of salvage lymphadenectomy in patients with lymphogenic metastases of prostate cancer after radical treatment. Oncourology 2017; 4: 64–69 (in Russian).
  13. Перепечай В.А., Васильев О.Н. Лапароскопическая радикальная простатэктомия. Вестник урологии. 2018; 3: 57–72 / Perepechai V.A., Vasiliev O.N. Laparoscopic radical prostatectomy. Bulletin of Urology 2018; 3: 57–72 (in Russian).

Supplementary files

Supplementary Files
Action
1. JATS XML
2. Fig. 1. Percentage of patients by surgery type

Download (61KB)
3. Fig. 2. Patients distribution by presence and absence of complications: 0 – there were no complications, 1 – there were complications

Download (62KB)
4. Fig. 3. Approval matrix

Download (290KB)

Copyright (c) 2024 Eco-Vector



СМИ зарегистрировано Федеральной службой по надзору в сфере связи, информационных технологий и массовых коммуникаций (Роскомнадзор).
Регистрационный номер и дата принятия решения о регистрации СМИ: серия ЭЛ № ФС 77 - 75489 от 05.04.2019 г
.