Publication details

Assessing the Accuracy of a Deep Learning Method to Risk Stratify Indeterminate Pulmonary Nodules

Authors

MASSION P. P. ANTIC S. ATHER S. ARTETA C. BRABEC Jan CHEN H. D. DECLERCK J. DUFEK David HICKES W. KADIR T. KUNST Jonáš LANDMAN B. A. MUNDEN R. F. NOVOTNY P. PESCHL H. PICKUP L. C. SANTOS C. SMITH G. T. TALWAR A. GLEESON F.

Year of publication 2020
Type Article in Periodical
Magazine / Source American Journal of Respiratory And Critical Care Medicine
MU Faculty or unit

Faculty of Medicine

Citation
Web https://www.atsjournals.org/doi/10.1164/rccm.201903-0505OC#aff5
Doi http://dx.doi.org/10.1164/rccm.201903-0505OC
Keywords early detection; risk stratification; neural networks; lung cancer; computer-aided image analysis
Description Rationale: The management of indeterminate pulmonary nodules (IPNs) remains challenging, resulting in invasive procedures and delays in diagnosis and treatment. Strategies to decrease the rate of unnecessary invasive procedures and optimize surveillance regimens are needed. Objectives: To develop and validate a deep learning method to improve the management of IPNs. Methods: A Lung Cancer Prediction Convolutional Neural Network model was trained using computed tomography images of IPNs from the National Lung Screening Trial, internally validated, and externally tested on cohorts from two academic institutions. Measurements and Main Results: The areas under the receiver operating characteristic curve in the external validation cohorts were 83.5% (95% confidence interval [CI], 75.4-90.7%) and 91.9% (95% CI, 88.7-94.7%), compared with 78.1% (95% CI, 68.7-86.4%) and 81.9 (95% CI, 76.1-87.1%), respectively, for a commonly used clinical risk model for incidental nodules. Using 5% and 65% malignancy thresholds defining low- and high-risk categories, the overall net reclassifications in the validation cohorts for cancers and benign nodules compared with the Mayo model were 0.34 (Vanderbilt) and 0.30 (Oxford) as a rule-in test, and 0.33 (Vanderbilt) and 0.58 (Oxford) as a rule-out test. Compared with traditional risk prediction models, the Lung Cancer Prediction Convolutional Neural Network was associated with improved accuracy in predicting the likelihood of disease at each threshold of management and in our external validation cohorts. Conclusions: This study demonstrates that this deep learning algorithm can correctly reclassify IPNs into low- or high-risk categories in more than a third of cancers and benign nodules when compared with conventional risk models, potentially reducing the number of unnecessary invasive procedures and delays in diagnosis.