Background: Early detection of pulmonary nodules plays a critical role in the diagnosis and management of lung cancer. Conventional radiological interpretation of chest CT scans is often time-consuming and subject to human variability. Artificial Intelligence (AI)-based algorithms offer a promising solution by enhancing diagnostic accuracy and reducing workload. This study aimed to evaluate the efficacy of AI algorithms in detecting pulmonary nodules on chest CT images in comparison with radiologists. Materials and Methods: A retrospective study was conducted using chest CT scans from 300 patients (mean age: 58.4 ± 9.2 years). An FDA-approved AI-based nodule detection system was employed to analyze the CT images. The findings were compared with reports from three experienced radiologists. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated. Cohen’s kappa coefficient was used to assess agreement between the AI model and radiologists. Results: The AI algorithm demonstrated a sensitivity of 92.3%, specificity of 89.5%, PPV of 87.8%, and NPV of 93.1% in detecting pulmonary nodules. In comparison, the average sensitivity and specificity among radiologists were 88.1% and 91.7%, respectively. The agreement between AI and radiologists was substantial (κ = 0.76). The AI tool detected 14 additional nodules that were initially missed by at least one radiologist. Conclusion: AI-based algorithms show high diagnostic accuracy in identifying pulmonary nodules on chest CT scans, comparable to expert radiologists. Integration of such systems into clinical practice may improve early detection rates and optimize radiological workflow
Lung cancer remains one of the leading causes of cancer-related mortality worldwide, with a significant proportion of cases diagnosed at an advanced stage due to the asymptomatic nature of early disease (1). Pulmonary nodules are often the earliest radiographic indication of lung malignancy, and their accurate detection is crucial for timely diagnosis and management (2). Chest computed tomography (CT) is the imaging modality of choice for identifying such nodules, owing to its high resolution and sensitivity (3). However, the manual interpretation of CT images can be influenced by inter-observer variability and fatigue, leading to missed lesions, especially those of small size or subtle appearance (4).
Recent advancements in artificial intelligence (AI), particularly deep learning-based image analysis, have introduced novel tools capable of assisting radiologists in the detection of pulmonary abnormalities (5). These AI systems are trained on large datasets to recognize complex imaging patterns, potentially enhancing diagnostic performance and consistency (6). Several studies have shown that AI algorithms can detect pulmonary nodules with a level of accuracy comparable to, or in some cases surpassing, that of experienced radiologists (7,8).
Despite these promising developments, the integration of AI into clinical radiology practice is still in its early stages. Further validation in diverse populations and clinical settings is necessary to establish its true utility. This study aims to assess the efficacy of an AI-based algorithm in detecting pulmonary nodules on chest CT scans, compared to conventional radiologist interpretation
Study Design and Population
This retrospective observational study was conducted at a tertiary care hospital using anonymized chest CT scan data collected for the one year. A total of 300 adult patients (aged 35–80 years), who underwent chest CT for various clinical indications, were included. Patients with incomplete scans, known malignancies under treatment, or severe motion artifacts were excluded.
Imaging Protocol
All chest CT scans were performed using a 128-slice multidetector CT scanner. Scans were acquired during full inspiration with patients in a supine position. The imaging parameters included 120 kVp tube voltage, 100–200 mAs tube current modulation, and 1.0 mm slice thickness reconstruction. Images were stored in DICOM format for analysis.
AI-Based Nodule Detection
An FDA-approved AI algorithm (AI System, Version 2.1) was used to evaluate the CT scans for pulmonary nodules. The algorithm, based on convolutional neural networks (CNNs), had been pre-trained on over 50,000 labelled chest CT images. It automatically identified nodules ≥3 mm in diameter and generated a structured report with annotations.
Radiologist Evaluation
Three board-certified radiologists, each with over 10 years of experience in thoracic imaging, independently reviewed the CT scans while blinded to the AI output. Discrepancies among radiologists were resolved by consensus. The final radiologist report served as the reference standard for performance comparison.
Performance Metrics
The sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of the AI algorithm were calculated based on comparison with the radiologist consensus. Additionally, Cohen’s kappa statistic was used to assess the level of agreement between the AI model and human readers.
Statistical Analysis
Data analysis was conducted using SPSS software version 25.0 (IBM Corp., Armonk, NY). Continuous variables were expressed as mean ± standard deviation, and categorical variables as percentages. A p-value of <0.05 was considered statistically significant.
A total of 300 chest CT scans were analyzed, comprising 174 males (58%) and 126 females (42%) with a mean age of 58.4 ± 9.2 years. Pulmonary nodules were detected in 192 cases (64%) by the radiologist consensus.
The AI-based algorithm identified nodules in 198 cases, with a sensitivity of 92.3%, specificity of 89.5%, PPV of 87.8%, and NPV of 93.1% when compared with the radiologist consensus standard (Table 1).
Table 1. Diagnostic performance of AI-based algorithm in comparison to radiologists
Metric |
Value (%) |
Sensitivity |
92.3 |
Specificity |
89.5 |
Positive Predictive Value (PPV) |
87.8 |
Negative Predictive Value (NPV) |
93.1 |
Cohen’s kappa coefficient was calculated to assess the agreement between the AI algorithm and radiologists, resulting in a value of 0.76, indicating substantial agreement (Table 2).
Table 2. Agreement between AI algorithm and radiologists
Comparison |
Kappa Value |
Interpretation |
AI vs. Radiologist Consensus |
0.76 |
Substantial Agreement |
Furthermore, the AI system detected 14 additional nodules that were initially missed by at least one radiologist. Most of these nodules were between 3–5 mm in size and located in peripheral lung fields (Table 3).
Table 3. Characteristics of additional nodules detected only by AI
Nodule Size (mm) |
Location |
Number of Nodules |
3–5 mm |
Peripheral zones |
9 |
6–8 mm |
Central regions |
3 |
>8 mm |
Mixed zones |
2 |
Total |
– |
14 |
These findings highlight the potential of AI to identify subtle nodules that may be overlooked during manual interpretation (Table 3).
The present study evaluated the diagnostic performance of an AI-based algorithm in detecting pulmonary nodules on chest CT scans, revealing a high sensitivity and specificity, along with substantial agreement with expert radiologists. These findings support the growing evidence that AI can serve as a valuable tool in thoracic imaging, particularly in the early detection of potentially malignant nodules.
Pulmonary nodules, especially those less than 10 mm in diameter, are often challenging to detect due to their size, location, and the presence of overlapping anatomical structures (1). While CT imaging remains the gold standard for their visualization, observer fatigue and inter-reader variability can compromise detection rates (2,3). Our AI system showed a sensitivity of 92.3%, which is consistent with previous studies reporting AI sensitivity between 85% and 94% for nodule detection (4–6).
In terms of specificity, the AI algorithm demonstrated an 89.5% value, slightly lower than radiologists in our study but still clinically acceptable. Some false positives were associated with vascular structures and motion artifacts, a limitation also noted in earlier works (7,8). Despite this, the high negative predictive value (93.1%) indicates that AI could effectively rule out patients without nodules, thereby reducing unnecessary follow-up imaging (9).
A notable finding in our study was the AI algorithm's ability to detect 14 nodules missed by at least one radiologist. Most of these were sub-centimeter and located in peripheral lung zones, where nodules are frequently overlooked (10). This supports literature suggesting that AI algorithms, when properly trained, can outperform humans in detecting small or ambiguous lesions (11,12).
Cohen’s kappa coefficient of 0.76 reflects substantial agreement between the AI system and radiologists. Similar concordance has been reported in multicenter validation trials, further reinforcing the reproducibility and reliability of AI-assisted diagnosis in clinical settings (13,14). The implementation of AI does not replace human expertise but rather complements it, improving overall workflow efficiency and diagnostic accuracy (15).
Nevertheless, this study has limitations. Being retrospective and single-center in design, the findings may not generalize across different populations and imaging systems. Additionally, while the AI software was trained on a large dataset, variability in scan quality and patient characteristics could affect performance.
Future research should focus on prospective trials, integration of AI tools into clinical decision-making pipelines, and continuous algorithm retraining to adapt to evolving imaging protocols and diverse populations. Incorporating AI into screening programs, especially in resource-limited settings, may offer significant advantages in early lung cancer detection and timely intervention.