Segmentation of organs at risk in CT images of chest cavity
- Manko, Maksym
- Javier Ramírez Pérez de Inestrosa Co-director
- Anton Popov Co-director
Defence university: Universidad de Granada
Fecha de defensa: 19 June 2024
- Begoña Acha Piñero Chair
- Fermín Segovia Román Secretary
- Andres Ortiz García Committee member
Type: Thesis
Abstract
In the last decade, deep learning algorithms have significantly advanced the field of medical image analysis, particularly in the segmentation of anatomical structures. Currently, deep neural network-based segmentation systems are capable of addressing almost any segmentation task in medical imaging across various modalities, demon- strating remarkable reliability and efficiency. However, despite the substantial benefits of deep learning over traditional segmentation techniques, it faces certain challenges and limitations. The reliance on data-driven approaches means that the performance of these networks heavily depends on the training data’s characteristics. A notable limitation within medical datasets is their constrained size and the lack of sufficient variability in samples, potentially failing to cover all possible physiological conditions. Such constraints lead to overfitting, where the machine learning model overly spe- cializes on the idiosyncrasies of the training data, failing to generalize to the broader population, leading to reduced model performance at the inference stage. In order to address this, implementing various strategies to mitigate overfitting is crucial, aiming to increase the model’s generalizability and accuracy. This thesis focuses on the development of a deep learning-based segmentation system for identifying organs at risk within the thoracic cavity. Considering the limitations posed by the small size of the considered dataset, this research investigate innovative and effective methodologies to prevent model overfitting. Through this focus, the thesis contributes to enhancing the robustness and applicability of deep learning models in the critical area of medical image segmentation, offering potential improvements in diagnostic accuracy and patient care. The research is methodically divided into three main areas, each incorporating advanced methodologies to prevent overfitting and improve model performance. The first study introduces a novel consistency regularization technique characterized by training two structurally identical neural networks with different weight coefficient initializations in a fully supervised task. The consistency loss is calculated based on pseudo-labels, which are the predictions made by one model for the other and vice versa. Additionally, this research evaluates the beneficial effect of employing an exponential moving average in generating pseudo-labels. The subsequent study focude on development of a novel neural network architec- ture, embodying a U-Net-like structure with custom self-attention blocks that utilize cosine similarity for computing attention maps. The core concept was to combine local and global context processing by fusing convolutional layers with self-attention blocks and refining the self-attention block architecture to optimize model accuracy, while ensuring adequate resources consumption and high speed performance. Integrating this novel architecture with consistency regularization succeeded in achieving a state-of-the-art level performance, surpassing all previously published methods for the trachea and aorta. The final investigation explored the generalization capabilities of foundation seg- mentation model called Segment Anything Model. This study focused on several tasks, including automating Segment Anything Model through its integration with an object detection model to generate visual prompts automatically, eliminating manual prompting. The performance of the automated segmentation model was evaluated in a zero-shot setting for the segmentation of organs at risk, and a method for end-to-end fine-tuning of the combined foundation model with the object detector was developed, focusing solely on optimizing the model’s segmentation head. The experiments demon- strated relatively favorable outcomes, though the performance level of the custom U-Net -like architecture with consistency regularization wasn’t reached. Nonetheless, these results provide valuable insights, outlining the direction for future research in this domain. In summary, this thesis delineates the effectiveness of deep learning methodologies in the domain of medical image segmentation, particularly focusing on the segmen- tation of organs at risk within the thoracic cavity. It brings to light the difficulties encountered in training deep neural networks on limited datasets and proposes inno- vative techniques for network regularization and the mitigation of overfitting. The proposed methods significantly improve the accuracy of segmentation in CT imagery. Furthermore, the thesis acknowledges ongoing challenges and gaps in the current state of research, providing a roadmap for future investigations and potential enhancements to the introduced methods. The outcomes of this research are poised to contribute significantly to the advancement of organ segmentation tasks, including organs at risk segmentation. It aims to move the field closer towards achieving a segmentation quality that enables accurate and precise localization of organs at risk, thus contributing to the saving of patient lives and well-being.