๐Ÿง‘๐Ÿผโ€๐Ÿ’ป Research - September 5, 2025

Lung lobe segmentation: performance of open-source MOOSE, TotalSegmentator, and LungMask models compared to a local in-house model.

๐ŸŒŸ Stay Updated!
Join AI Health Hub to receive the latest insights in health and AI.

โšก Quick Summary

This study evaluated the performance of open-source lung lobe segmentation modelsโ€”MOOSE, TotalSegmentator, and LungMaskโ€”against a local in-house model. The findings revealed that while open-source tools excel in simpler cases, they struggle with complex scenarios, highlighting the importance of training on diverse datasets for improved accuracy.

๐Ÿ” Key Details

  • ๐Ÿ“Š Dataset: 164 computed tomography scans
  • ๐Ÿงฉ Task Difficulty: Classified as easy, moderate, or hard
  • โš™๏ธ Models Assessed: MOOSE, TotalSegmentator, LungMask, and a local nnU-Net model
  • ๐Ÿ† Performance Metrics: Dice similarity coefficient (DSC), robust Hausdorff distance (rHd95), normalized surface distance (NSD)
  • ๐ŸŒ External Validation: Evaluated on an external dataset (LOLA11, nโ€‰=โ€‰55)

๐Ÿ”‘ Key Takeaways

  • ๐Ÿ“Š TotalSegmentator outperformed MOOSE in DSC and NSD across all difficulty levels (pโ€‰<โ€‰0.001).
  • ๐Ÿ’ก MOOSE and TotalSegmentator surpassed LungMask in all metrics (pโ€‰<โ€‰0.001).
  • ๐Ÿ† The local nnU-Net model exceeded all other models in performance on both internal and external datasets (pโ€‰<โ€‰0.001).
  • ๐Ÿ” Missing lobes were identified correctly by the local model and LungMask in 3 and 1 of 7 cases, respectively.
  • ๐ŸŒ Open-source tools perform well in straightforward cases but struggle with complex anomalies.
  • ๐Ÿ“ˆ Data diversity is more critical than sheer quantity for improving model performance.
  • ๐Ÿฅ Accurate segmentation can enhance presurgical planning and patient outcomes in respiratory care.
  • ๐Ÿ”„ Generalizability of models can be improved through training on diverse datasets.

๐Ÿ“š Background

Lung lobe segmentation is essential for assessing lobar function, particularly in the context of nuclear imaging prior to surgical interventions. Accurate segmentation can significantly influence treatment decisions and patient outcomes in respiratory and thoracic care. However, the effectiveness of existing deep learning models, particularly those trained on non-specialized datasets, has been questioned, especially in complex clinical scenarios.

๐Ÿ—’๏ธ Study

This study aimed to evaluate the performance of various lung lobe segmentation models, including open-source tools and a locally developed nnU-Net model. Researchers compiled an internal dataset of 164 CT scans, categorizing them by task difficulty to assess how well each model performed across different scenarios.

๐Ÿ“ˆ Results

The results indicated that TotalSegmentator consistently outperformed MOOSE in key metrics, while both models surpassed LungMask. Notably, the local nnU-Net model demonstrated superior performance across all metrics and difficulty levels, emphasizing the importance of training on a clinically representative dataset.

๐ŸŒ Impact and Implications

The findings of this study have significant implications for the field of respiratory care. By highlighting the limitations of open-source segmentation tools in complex cases, the research underscores the necessity for training models on diverse datasets. This approach not only enhances the accuracy of lung lobe segmentation but also optimizes clinical decision-making and patient outcomes, paving the way for improved presurgical assessments and treatment strategies.

๐Ÿ”ฎ Conclusion

This study illustrates the critical role of data diversity in enhancing the performance of lung lobe segmentation models. While open-source tools show promise, their limitations in complex cases cannot be overlooked. The integration of specialized, representative datasets into model training is essential for achieving better accuracy and improving patient care in respiratory and thoracic medicine. Continued research in this area is vital for advancing clinical practices.

๐Ÿ’ฌ Your comments

What are your thoughts on the performance of open-source segmentation tools in clinical settings? We invite you to share your insights and engage in a discussion! ๐Ÿ’ฌ Leave your comments below or connect with us on social media:

Lung lobe segmentation: performance of open-source MOOSE, TotalSegmentator, and LungMask models compared to a local in-house model.

Abstract

BACKGROUND: Lung lobe segmentation is required to assess lobar function with nuclear imaging before surgical interventions. We evaluated the performance of open-source deep learning-based lung lobe segmentation tools, compared to a similar nnU-Net model trained on a smaller but more representative clinical dataset.
MATERIALS AND METHODS: We collated and semi-automatically segmented an internal dataset of 164 computed tomography scans and classified them for task difficulty as easy, moderate, or hard. The performance of three open-source models-multi-organ objective segmentation (MOOSE), TotalSegmentator, and LungMask-was assessed using Dice similarity coefficient (DSC), robust Hausdorff distance (rHd95), and normalized surface distance (NSD). Additionally, we trained, validated, and tested an nnU-Net model using our local dataset and compared its performance with that of the other software on the test subset. All models were evaluated for generalizability using an external competition (LOLA11, nโ€‰=โ€‰55).
RESULTS: TotalSegmentator outperformed MOOSE in DSC and NSD across all difficulty levels (pโ€‰<โ€‰0.001), but not in rHd95 (pโ€‰=โ€‰1.000). MOOSE and TotalSegmentator surpassed LungMask across metrics and difficulty classes (pโ€‰<โ€‰0.001). Our model exceeded all other models on the internal dataset (nโ€‰=โ€‰33) in all metrics, across all difficulty classes (pโ€‰<โ€‰0.001), and on the external dataset. Missing lobes were correctly identified only by our model and LungMask in 3 and 1 of 7 cases, respectively. CONCLUSION: Open-source segmentation tools perform well in straightforward cases but struggle in unfamiliar, complex cases. Training on diverse, specialized datasets can improve generalizability, emphasizing representative data over sheer quantity. RELEVANCE STATEMENT: Training lung lobe segmentation models on a local variety of cases improves accuracy, thus enhancing presurgical planning, ventilation-perfusion analysis, and disease localization, potentially impacting treatment decisions and patient outcomes in respiratory and thoracic care. KEY POINTS: Deep learning models trained on non-specialized datasets struggle with complex lung anomalies, yet their real-world limitations are insufficiently assessed. Training an identical model on a smaller yet clinically diverse and representative cohort improved performance in challenging cases. Data diversity outweighs the quantity in deep learning-based segmentation models. Accurate lung lobe segmentation may enhance presurgical assessment of lung lobar ventilation and perfusion function, optimizing clinical decision-making and patient outcomes.

Author: [‘Amini E’, ‘Klein R’]

Journal: Eur Radiol Exp

Citation: Amini E and Klein R. Lung lobe segmentation: performance of open-source MOOSE, TotalSegmentator, and LungMask models compared to a local in-house model. Lung lobe segmentation: performance of open-source MOOSE, TotalSegmentator, and LungMask models compared to a local in-house model. 2025; 9:86. doi: 10.1186/s41747-025-00623-9

Share on facebook
Facebook
Share on twitter
Twitter
Share on linkedin
LinkedIn
Share on whatsapp
WhatsApp

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.