Laryngeal motility assessment is essential for diagnosing and managing laryngeal disorders. However, paralysis evaluations suffer from high inter-rater variability, necessitating a more objective and quantitative approach. This study introduces a novel AI-driven pipeline that leverages computer vision techniques to classify 155 video-laryngoscopies into unilateral paralysis (n = 68), bilateral paralysis (n = 50), and healthy laryngeal function (n = 37). Our approach includes several advancements over existing literature. We extract the vocal fold positions from each video and automatically identify the most informative, noise-cleaned video segments for classification. We define novel movement-based features that quantitatively capture the restricted mobility characteristics of laryngeal paralysis. These features are used to train two classification models using a 5-fold cross-validation strategy: one model for binary classification (healthy vs. paralyzed) and the other for multi-class classification (healthy vs. unilateral paralysis vs. bilateral paralysis). To assess the importance of these features, we conduct an ablation study using Shapley values. Our method achieves a precision of 0.83, sensitivity (recall) of 0.85, F1-score of 0.84, and balanced accuracy of 0.85 for distinguishing between healthy and paralyzed individuals. For multi-class classification (healthy vs unilateral paralysis vs bilateral paralysis), our model achieves a precision of 0.80, sensitivity of 0.83, F1-score of 0.81, and a balanced accuracy of 0.83. These results highlight the effectiveness of our method and underscore the relevance of our features, further validated by the ablation study. Our AI-grounded approach enhances the accuracy and reliability of automatic laryngeal motility assessment. By introducing novel metrics to quantify paralysis severity, we provide a more objective, reproducible, and clinically valuable evaluation tool.

Artificial intelligence in otolaryngology: redefining automatic laryngeal paralysis assessment for optimal healthcare / Agrimi, Emanuele; Pietrogiacomi, Francesco; Fiorini, Linda; Mularoni, Francesca; Vilaseca, Isabel; Peretti, Giorgio; Taboni, Stefano; Ferrari, Marco; Carobbio Andrea Luigi, Camillo; Nicolai, Piero; Sampieri, Claudio; Gnecco, Giorgio Stefano. - In: SN COMPUTER SCIENCE. - ISSN 2662-995X. - 7:(2026). [10.1007/s42979-025-04606-w]

Artificial intelligence in otolaryngology: redefining automatic laryngeal paralysis assessment for optimal healthcare

Agrimi Emanuele;Pietrogiacomi Francesco;Fiorini Linda;Gnecco Giorgio
2026

Abstract

Laryngeal motility assessment is essential for diagnosing and managing laryngeal disorders. However, paralysis evaluations suffer from high inter-rater variability, necessitating a more objective and quantitative approach. This study introduces a novel AI-driven pipeline that leverages computer vision techniques to classify 155 video-laryngoscopies into unilateral paralysis (n = 68), bilateral paralysis (n = 50), and healthy laryngeal function (n = 37). Our approach includes several advancements over existing literature. We extract the vocal fold positions from each video and automatically identify the most informative, noise-cleaned video segments for classification. We define novel movement-based features that quantitatively capture the restricted mobility characteristics of laryngeal paralysis. These features are used to train two classification models using a 5-fold cross-validation strategy: one model for binary classification (healthy vs. paralyzed) and the other for multi-class classification (healthy vs. unilateral paralysis vs. bilateral paralysis). To assess the importance of these features, we conduct an ablation study using Shapley values. Our method achieves a precision of 0.83, sensitivity (recall) of 0.85, F1-score of 0.84, and balanced accuracy of 0.85 for distinguishing between healthy and paralyzed individuals. For multi-class classification (healthy vs unilateral paralysis vs bilateral paralysis), our model achieves a precision of 0.80, sensitivity of 0.83, F1-score of 0.81, and a balanced accuracy of 0.83. These results highlight the effectiveness of our method and underscore the relevance of our features, further validated by the ablation study. Our AI-grounded approach enhances the accuracy and reliability of automatic laryngeal motility assessment. By introducing novel metrics to quantify paralysis severity, we provide a more objective, reproducible, and clinically valuable evaluation tool.
2026
Artificial intelligence applied to medicine, Computer vision, Otolaryngology, Laryngeal paralysis assessment
File in questo prodotto:
File Dimensione Formato  
s42979-025-04606-w.pdf

accesso aperto

Descrizione: Artificial Intelligence in Otolaryngology: Redefining Automatic Laryngeal Paralysis Assessment for Optimal Care
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 4.31 MB
Formato Adobe PDF
4.31 MB Adobe PDF Visualizza/Apri
Artificial_Intelligence_in_Otolaryngology__Redefining_Laryngeal_Paralysis_Assessment_for_Optimal_Care.pdf

accesso aperto

Descrizione: Postprint - Artificial Intelligence in Otolaryngology: Redefining Automatic Laryngeal Paralysis Assessment for Optimal Care
Tipologia: Documento in Post-print
Licenza: Creative commons
Dimensione 5.56 MB
Formato Adobe PDF
5.56 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11771/38099
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
social impact