346
Views
1
CrossRef citations to date
0
Altmetric
Original Articles: BiGART 2023 Issue

Consistency in contouring of organs at risk by artificial intelligence vs oncologists in head and neck cancer patients

ORCID Icon, ORCID Icon, ORCID Icon, , ORCID Icon, , , ORCID Icon, ORCID Icon, ORCID Icon, , , ORCID Icon, ORCID Icon, ORCID Icon, , , ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon show all
Pages 1418-1425 | Received 21 May 2023, Accepted 04 Sep 2023, Published online: 13 Sep 2023
 

Abstract

Background

In the Danish Head and Neck Cancer Group (DAHANCA) 35 trial, patients are selected for proton treatment based on simulated reductions of Normal Tissue Complication Probability (NTCP) for proton compared to photon treatment at the referring departments. After inclusion in the trial, immobilization, scanning, contouring and planning are repeated at the national proton centre. The new contours could result in reduced expected NTCP gain of the proton plan, resulting in a loss of validity in the selection process. The present study evaluates if contour consistency can be improved by having access to AI (Artificial Intelligence) based contours.

Materials and Methods

The 63 patients in the DAHANCA 35 pilot trial had a CT from the local DAHANCA centre and one from the proton centre. A nationally validated convolutional neural network, based on nnU-Net, was used to contour OARs on both scans for each patient. Using deformable image registration, local AI and oncologist contours were transferred to the proton centre scans for comparison. Consistency was calculated with the Dice Similarity Coefficient (DSC) and Mean Surface Distance (MSD), comparing contours from AI to AI and oncologist to oncologist, respectively. Two NTCP models were applied to calculate NTCP for xerostomia and dysphagia.

Results

The AI contours showed significantly better consistency than the contours by oncologists. The median and interquartile range of DSC was 0.85 [0.78 − 0.90] and 0.68 [0.51 − 0.80] for AI and oncologist contours, respectively. The median and interquartile range of MSD was 0.9 mm [0.7 − 1.1] mm and 1.9 mm [1.5 − 2.6] mm for AI and oncologist contours, respectively. There was no significant difference in ΔNTCP.

Conclusions

The study showed that OAR contours made by the AI algorithm were more consistent than those made by oncologists. No significant impact on the ΔNTCP calculations could be discerned.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The data used in this study are part of a clinical trial and are not available.

Additional information

Funding

Supported by the Novo Nordisk Foundation (NNF18OC0034612), DCCC Radiotherapy - The Danish National Research Center for Radiotherapy, Danish Cancer Society (grant no. R191-A11526), Danish Comprehensive Cancer Center, University of Southern Denmark Faculty of Health Sciences Scholarship, and Odense University Hospital.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.