Attention-Based Neural Network for Cardiac MRI Segmentation: Application to Strain and Volume Computation - 30/07/24
, Catherine Achard b, Saud Khan a, Vincent Nguyen a, Mikael Prigent c, Mohamed Zarai c, Khaoula Bouazizi a, c, Johanne Sylvain d, Alban Redheuil a, c, e, Gilles Montalescot d, Nadjia Kachenoura a, c, 1, Thomas Dietenbeck a, c, 1Abstract |
Context |
Deep learning algorithms have been widely used for cardiac image segmentation. However, most of these architectures rely on convolutions that hardly model long-range dependencies, limiting their ability to extract contextual information. Moreover, the traditional U-net architecture suffers from the difference of semantic information between feature maps of the encoder and decoder (also known as the semantic gap).
Material and method |
To address this issue, a new network architecture relying on attention mechanism was introduced. Swin Filtering Blocks (SFB), that use Swin Transformer blocks in a cross-attention manner, were added between the encoder and the decoder to filter information coming from the encoder based on the feature map from the decoder. Attention was also employed at the lowest resolution in the form of a transformer layer to increase the receptive field of the network.
We conducted experiments to assess both generalization capability and to evaluate how training on all frames of the cardiac cycle rather than only the end-diastole and end-systole impacts strain and segmentation performances.
Results and conclusion |
Visual inspection of feature maps suggested that Swin Filtering Blocks contribute to the reduction of the semantic gap. Performing attention between all patches using a transformer layer brought higher performance than convolutions. Training the model with all phases of the cardiac cycle resulted in slightly more accurate segmentations while leading to a more noticeable improvement for strain estimation. A limited decrease in performance was observed when testing on out-of-distribution data, but the gap widens for the most apical slices.
Le texte complet de cet article est disponible en PDF.Graphical abstract |
Highlights |
• | Neural network architecture relying on attention to bridge the semantic gap. |
• | Tested on both ACDC and an in-house dataset containing 271 patients. |
• | Generalization capabilities assessed by testing on out-of-distribution data. |
• | Comparison between training on all cardiac phases or only on ED and ES frames. |
• | State of the art performance and accurate volumetric indices are obtained. |
Keywords : Segmentation, Deep learning, Transformers, Cardiac, MRI
Plan
Vol 45 - N° 4
Article 100850- août 2024 Retour au numéroBienvenue sur EM-consulte, la référence des professionnels de santé.
L’accès au texte intégral de cet article nécessite un abonnement.
Déjà abonné à cette revue ?
