Senior Expertise and Peer Consensus: A Comparative Analysis of AI and Clinician Measurements in Multi-Curve Scoliosis Assessment

Frank Ibañez Elijorde; Joselito F. Villaruz; Ma Beth S. Concepcion; Mylo N. Soriaso

doi:10.3844/jcssp.2026.886.897

Research Article Open Access

Frank Ibañez Elijorde¹, Joselito F. Villaruz², Ma Beth S. Concepcion³ and Mylo N. Soriaso⁴

¹ Division of Information Technology, College of Information and Communications Technology, West Visayas State University, Iloilo, Philippines
² Department of Pediatrics, College of Medicine, West Visayas State University, Iloilo, Philippines
³ Department of Information Systems, College of Information and Communications Technology, West Visayas State University, Iloilo, Philippines
⁴ Department of Orthopedics, West Visayas State University Medical Center, Iloilo, Philippines

Abstract

Given the scarcity of literature on the topic, this study explored the use of AI for multi-curve scoliosis assessment. Its performance was compared against a group of clinicians composed of one senior and five non-senior orthopedic surgeons. The analysis focused on Cobb angle measurement and identification of vertebral endplates across three curve regions, namely Main Thoracic (MT), Proximal Thoracic (PT), and Thoracolumbar/Lumbar (TL/L). The results show strong agreement in the MT region, with a low Mean Absolute Difference (MAD) of 2.21 and high intraclass correlation coefficients (ICC 0.94–0.98), suggesting clinically reliable AI performance in this area of the spine. Moderate agreement was observed in the TL/L region (ICC 0.74–0.89), whereas the PT region presented significant challenges, with high MAD values and ICC values near zero. This highlights variations in end vertebra selection due to anatomical and image quality limitations, which significantly affect the respective Cobb angle measurements of the human observers. Qualitative observations also revealed subjectivity in identifying vertebral landmarks, which is especially apparent in low-quality radiographs. An interesting finding is that most of the AI's measurements aligned more closely with the group consensus of non-senior clinicians than with the senior expert, possibly signifying its inclination towards combined human patterns rather than expert-level preference. Caudal endplate identification showed higher agreement across evaluators than cranial endplate identification, implying that certain anatomical landmarks are more consistently identifiable. These results indicate AI's potential for standardizing scoliosis evaluation, particularly in the MT and TL/L regions, despite its underperformance in the PT region.
The study therefore concludes that enhanced algorithm development, improved training datasets, and, above all, integrated expert oversight are needed. The alignment of AI with general clinician consensus underlines its potential as a reliable, standardizing tool in clinical practice, but expert input must remain a crucial part of the process.
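
For readers unfamiliar with the two agreement metrics named above, the following is a minimal Python sketch of how a MAD and an ICC can be computed from paired Cobb angle measurements. It is an illustration under stated assumptions, not the authors' analysis code: the abstract does not say which ICC form was used, so the two-way random-effects, absolute-agreement, single-measure form, ICC(2,1), is assumed here, and the angle values and the function names `mean_absolute_difference` and `icc_2_1` are hypothetical.

```python
from statistics import mean


def mean_absolute_difference(a, b):
    """Mean absolute difference between two equally long lists of angles."""
    if len(a) != len(b):
        raise ValueError("both raters must measure the same set of curves")
    return mean(abs(x - y) for x, y in zip(a, b))


def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single measure.

    `ratings` holds one row per curve and one column per rater.
    The choice of ICC(2,1) is an assumption; the source does not name the form.
    """
    n = len(ratings)        # number of curves (subjects)
    k = len(ratings[0])     # number of raters
    grand = mean(v for row in ratings for v in row)
    row_means = [mean(row) for row in ratings]
    col_means = [mean(row[j] for row in ratings) for j in range(k)]

    # Two-way ANOVA sums of squares.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in ratings for v in row)
    ss_err = ss_total - ss_rows - ss_cols

    # Mean squares for subjects, raters, and residual error.
    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Hypothetical Cobb angles in degrees for three curves (not study data).
ai_angles = [40, 25, 55]
clinician_angles = [42, 24, 58]

print("MAD:", mean_absolute_difference(ai_angles, clinician_angles))
print("ICC(2,1):", icc_2_1([list(p) for p in zip(ai_angles, clinician_angles)]))
```

Because ICC(2,1) measures absolute agreement, a rater who is consistently offset from another by a fixed amount scores slightly below 1 even though the two sets of measurements are perfectly correlated, which is one reason the two metrics are usually reported together.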

Journal of Computer Science

Volume 22 No. 3, 2026, 886-897

DOI: https://doi.org/10.3844/jcssp.2026.886.897

Submitted On: 7 July 2025 Published On: 9 March 2026

How to Cite: Elijorde, F. I., Villaruz, J. F., Concepcion, M. B. S. & Soriaso, M. N. (2026). Senior Expertise and Peer Consensus: A Comparative Analysis of AI and Clinician Measurements in Multi-Curve Scoliosis Assessment. Journal of Computer Science, 22(3), 886-897. https://doi.org/10.3844/jcssp.2026.886.897

Copyright: © 2026 Frank Ibañez Elijorde, Joselito F. Villaruz, Ma Beth S. Concepcion and Mylo N. Soriaso. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Keywords

Artificial Intelligence and Deep Learning
Scoliosis Assessment
Multi-Curve Scoliosis
Vertebral Endplate Selection
Cobb Angle Measurement