Development of a prognostic model for diagnosis of prostate cancer based on radiomics of biparametric magnetic resonance imaging apparent diffusion coefficient maps and stacking of machine learning algorithms

封面


如何引用文章

全文:

详细

BACKGROUND: Prostate cancer is one of the most common cancers among men [1, 2]. In recent years, a number of prognostic models based on texture analysis of biparametric magnetic resonance images have been created. The research has shown that radiomics features extracted from apparent diffusion coefficient maps are the most reproducible [3]. However, the models were limited in accuracy, since they are built using a single machine learning algorithm, which takes into account only linear dependences [4–6].

AIM: Increasing the accuracy of a prognostic model diagnosing prostate cancer through the use of stacking machine learning algorithms that takes into account not only linear, but also nonlinear dependencies based on radiomics of biparametric magnetic resonance imaging apparent diffusion coefficient maps.

MATERIALS AND METHODS: A single-center cohort retrospective study of patients with suspected prostate cancer was conducted in the X-ray Diagnostics and Tomography Department of the United Hospital and Polyclinic (Moscow, Russia) from 2017 to 2023. The presence of prostate cancer was confirmed by biopsy or radical prostatectomy. Statistical analyses was performed using Python 3.11.

RESULTS: The study involved 67 men aged 60 [54; 66] years, of which 57 were diagnosed with prostate cancer, and 10 — with benign prostate formation. The LIFEx software identified 96 radiomic features.

Statistically significant differences were found for: PARAMS_ZSpatialResampling (the voxel size of the image: Z dimension) (p=0.001), SHAPE_Sphericity[onlyFor3DROI] (how spherical a Volume of Interest is) (p=0.006), SHAPE_Compacity[onlyFor3DROI] (how compact the Volume of Interest is) (p=0.004), GLRLM_HGRE (p=0.039), GLRLM_SRHGE (p=0.041), GLRLM_RLNU (p=0.039), where GLRLM — Grey-Level Run Length Matrix. Univariate logistic regression showed that SHAPE_Compacity[onlyFor3DROI] (R2=15%) and PARAMS_ZSpatialResampling (R2=18%) had a statistically significant effect on the outcome. First, using the multivariate logistic regression method, a prognostic model was built that takes into account only linear dependencies. The model includes 3 features that together have a statistically significant effect on the outcome (R2=23%): SHAPE_Sphericity[onlyFor3DROI], PARAMS_ZSpatialResampling and GLRLM_RLNU.

To describe nonlinear relationships, another model was built based on the “Decision Tree” algorithm. It included 4 indicators (R2=58%): DISCRETIZED_HISTO_Entropy_log10 (the randomness of the distribution), SHAPE_Sphericity[onlyFor3DROI], PARAMS_ZSpatialResampling and GLRLM_SRE.

Stacking of algorithms, which consists of calculating the arithmetic mean between the predictions of the multivariate logistic regression and “Decision Tree” algorithms, made it possible to construct a model that takes into account both linear and nonlinear dependencies. The model includes 5 features (R2=77%). The constructed model formed the basis of the developed calculator program [7], currently introduced into a radiology practice.

CONCLUSION: The new model built on the basis of apparent diffusion coefficient maps performs better (area under ROC-curve 99.0% [97.7; 100.0]) than the existing models with area under ROC-curve 83.6% [78.3; 88.9], which also show high heterogeneity (I2=71%). The accuracy of the new model was increased due to the use of stacking machine learning technologies, which made it possible to take into account both linear and nonlinear effects from features on the outcome.

全文:

BACKGROUND: Prostate cancer is one of the most common cancers among men [1, 2]. In recent years, a number of prognostic models based on texture analysis of biparametric magnetic resonance images have been created. The research has shown that radiomics features extracted from apparent diffusion coefficient maps are the most reproducible [3]. However, the models were limited in accuracy, since they are built using a single machine learning algorithm, which takes into account only linear dependences [4–6].

AIM: Increasing the accuracy of a prognostic model diagnosing prostate cancer through the use of stacking machine learning algorithms that takes into account not only linear, but also nonlinear dependencies based on radiomics of biparametric magnetic resonance imaging apparent diffusion coefficient maps.

MATERIALS AND METHODS: A single-center cohort retrospective study of patients with suspected prostate cancer was conducted in the X-ray Diagnostics and Tomography Department of the United Hospital and Polyclinic (Moscow, Russia) from 2017 to 2023. The presence of prostate cancer was confirmed by biopsy or radical prostatectomy. Statistical analyses was performed using Python 3.11.

RESULTS: The study involved 67 men aged 60 [54; 66] years, of which 57 were diagnosed with prostate cancer, and 10 — with benign prostate formation. The LIFEx software identified 96 radiomic features.

Statistically significant differences were found for: PARAMS_ZSpatialResampling (the voxel size of the image: Z dimension) (p=0.001), SHAPE_Sphericity[onlyFor3DROI] (how spherical a Volume of Interest is) (p=0.006), SHAPE_Compacity[onlyFor3DROI] (how compact the Volume of Interest is) (p=0.004), GLRLM_HGRE (p=0.039), GLRLM_SRHGE (p=0.041), GLRLM_RLNU (p=0.039), where GLRLM — Grey-Level Run Length Matrix. Univariate logistic regression showed that SHAPE_Compacity[onlyFor3DROI] (R2=15%) and PARAMS_ZSpatialResampling (R2=18%) had a statistically significant effect on the outcome. First, using the multivariate logistic regression method, a prognostic model was built that takes into account only linear dependencies. The model includes 3 features that together have a statistically significant effect on the outcome (R2=23%): SHAPE_Sphericity[onlyFor3DROI], PARAMS_ZSpatialResampling and GLRLM_RLNU.

To describe nonlinear relationships, another model was built based on the “Decision Tree” algorithm. It included 4 indicators (R2=58%): DISCRETIZED_HISTO_Entropy_log10 (the randomness of the distribution), SHAPE_Sphericity[onlyFor3DROI], PARAMS_ZSpatialResampling and GLRLM_SRE.

Stacking of algorithms, which consists of calculating the arithmetic mean between the predictions of the multivariate logistic regression and “Decision Tree” algorithms, made it possible to construct a model that takes into account both linear and nonlinear dependencies. The model includes 5 features (R2=77%). The constructed model formed the basis of the developed calculator program [7], currently introduced into a radiology practice.

CONCLUSION: The new model built on the basis of apparent diffusion coefficient maps performs better (area under ROC-curve 99.0% [97.7; 100.0]) than the existing models with area under ROC-curve 83.6% [78.3; 88.9], which also show high heterogeneity (I2=71%). The accuracy of the new model was increased due to the use of stacking machine learning technologies, which made it possible to take into account both linear and nonlinear effects from features on the outcome.

×

作者简介

Anton Kuznetsov

Moscow Aviation Institute (National Research University)

编辑信件的主要联系方式.
Email: drednout5786@yandex.ru
ORCID iD: 0000-0003-2182-5792
SPIN 代码: 8824-9080
俄罗斯联邦, Moscow

参考

  1. Mottet N, van den Bergh RCN, Briers E, et al. EAU-EANM-ESTRO-ESUR-SIOG Guidelines on Prostate Cancer—2020 Update. Part 1: Screening, Diagnosis, and Local Treatment with Curative Intent. Eur Urol. 2021;79(2):243–262. doi: 10.1016/j.eururo.2020.09.042
  2. Smelov PA, Nikitina SYu, editors. Health care in Russia. 2021: statistical compendium. Moscow: Federal'naya sluzhba gosudarstvennoi statistiki; 2021. (In Russ).
  3. Shah V, Turkbey B, Mani H, et al. Decision support system for localizing prostate cancer based on multiparametric magnetic resonance imaging. Med Phys. 2012;39(7):4093–4103. doi: 10.1118/1.4722753
  4. He D, Wang X, Fu C, et al. MRI-based radiomics models to assess prostate cancer, extracapsular extension and positive surgical margins. Cancer Imaging. 2021;21(1):46. doi: 10.1186/s40644-021-00414-6
  5. Lu Y, Li B, Huang H, et al. Biparametric MRI-based radiomics classifiers for the detection of prostate cancer in patients with PSA serum levels of 4~10 ng/mL. Front Oncol. 2022;12. doi: 10.3389/fonc.2022.1020317
  6. Chen T, Li M, Gu Y, et al. Prostate Cancer Differentiation and Aggressiveness: Assessment With a Radiomic-Based Model vs. PI-RADS v2. Journal of Magnetic Resonance Imaging. 2019;49(3):875–884. doi: 10.1002/jmri.26243
  7. Certificate for the computer software 2023669718/ 19.09.2023. Schepkina EV, Kruchkova OV, Kuznetsov AI. Computer Software for the Prognosis of Probability of Prostate Cancer in Patients Based on MRI. Available from: https://www.elibrary.ru/item.asp?id=54657407 [cited 2024 Jan 28]. (In Russ). EDN: FNYFEB

补充文件

附件文件
动作
1. JATS XML

版权所有 © Eco-Vector, 2024

Creative Commons License
此作品已接受知识共享署名-非商业性使用-禁止演绎 4.0国际许可协议的许可。

СМИ зарегистрировано Федеральной службой по надзору в сфере связи, информационных технологий и массовых коммуникаций (Роскомнадзор).
Регистрационный номер и дата принятия решения о регистрации СМИ: серия ПИ № ФС 77 - 79539 от 09 ноября 2020 г.


##common.cookie##