west china medical publishers

Article Figure Table

Relevance Release date Cited by PDF Downloads

find Keyword "Mamba" 4 results

The dual-stream feature pyramid network based on Mamba and convolution for brain magnetic resonance image registration

Journal of Biomedical Engineering 2024, 41(6): 1177-1184

Deformable image registration plays a crucial role in medical image analysis. Despite various advanced registration models having been proposed, achieving accurate and efficient deformable registration remains challenging. Leveraging the recent outstanding performance of Mamba in computer vision, we introduced a novel model called MCRDP-Net. MCRDP-Net adapted a dual-stream network architecture that combined Mamba blocks and convolutional blocks to simultaneously extract global and local information from fixed and moving images. In the decoding stage, we employed a pyramid network structure to obtain high-resolution deformation fields, achieving efficient and precise registration. The effectiveness of MCRDP-Net was validated on public brain registration datasets, OASIS and IXI. Experimental results demonstrated significant advantages of MCRDP-Net in medical image registration, with DSC, HD95, and ASD reaching 0.815, 8.123, and 0.521 on the OASIS dataset and 0.773, 7.786, and 0.871 on the IXI dataset. In summary, MCRDP-Net demonstrates superior performance in deformable image registration, proving its potential in medical image analysis. It effectively enhances the accuracy and efficiency of registration, providing strong support for subsequent medical research and applications.

Release date：2024-12-27 03:50 Export PDF Favorites Scan
Lung nodule segmentation method based on multiscale feature interaction and coordinate information

Journal of Biomedical Engineering 2026, 43(2): 353-361

For pulmonary nodules in computed tomography (CT) images, which exhibit complex morphology and blurred boundaries, existing segmentation methods still fall short in modelling cross-level dependencies of multi-scale features, thereby limiting their performance in pulmonary nodule segmentation tasks. To address these challenges, this paper proposes a semantic segmentation method for pulmonary nodules based on multiscale feature interaction and cross-level coordinate attention (MFI-CLCA). This U-shaped network incorporated three architectures: a convolutional neural network (CNN), a Transformer, and Mamba. During the encoding phase, combining CNN and Mamba learning paradigms capured both global and local information in the input data. The convolutional component extracted complex boundary features of the target by combining multi-scale convolutional operations with adaptive fusion operations. Global and local multi-head attention mechanisms were introduced in the bottleneck layer and decoding phase respectively to model these hierarchical feature dependencies. The skip-connection section incorporated a multi-level coordinate attention module to adaptively focus on the information being passed through. Experimental results on the Lung Image Database Consortium (LIDC) dataset demonstrated that this approach achieved Dice scores of 90.52% and sensitivity of 91.93%, which outperforms existing state-of-the-art methods and validates its effectiveness for lung nodule segmentation tasks.

Release date： Export PDF Favorites Scan
Electroencephalogram emotion recognition based on state-space models combined with spatio-temporal feature

Journal of Biomedical Engineering 2026, 43(2): 311-318

To address the challenges of spatiotemporal feature heterogeneity, insufficient utilization of frequency band information, and weak cross-subject generalization in electroencephalogram (EEG)-based emotion recognition, this paper proposes a hierarchical spatiotemporal feature learning architecture named spatio-temporal mamba (ST-Mamba) based on state space models. Firstly, the proposed conv-spatio-temporal (CST) dual-branch collaborative module integrates the local feature extraction capability of convolutional neural network (CNN) with the global modeling ability of state space models. Through adaptive weighted fusion, it effectively mitigates the issue of inadequate modeling of inter-channel relationships in EEG signals. Secondly, the designed multi-band spatio-temporal feature pyramid (MBSTP) module adaptively weights features from different frequency bands via a frequency-band attention mechanism, while capturing spatial topological dependencies across brain regions through a hierarchical fusion strategy. Additionally, a data augmentation framework efficiently enhances the model’s cross-subject generalization by applying augmentations in the frequency, temporal, and spatial domains. The proposed model achieves average accuracies of 95.56% and 84.47% on the Shanghai Jiao Tong University emotion EEG dataset (SEED), version III (SEED-III) and version IV (SEED-IV), respectively. Experiments demonstrate that the state space model effectively alleviates the over-smoothing issue in deep networks, offering a novel solution to spatiotemporal heterogeneity and cross-subject generalization challenges in EEG-based emotion recognition.

Release date： Export PDF Favorites Scan
Research on surgical instrument segmentation algorithm based on frequency-domain adaptive feature decomposition and visual Mamba

Journal of Biomedical Engineering 2026, 43(3): 554-561, 570

For the endoscopic surgical instrument segmentation task, existing methods have failed to address the semantic gap caused by the mismatch between high-frequency spatial details and low-frequency semantic features in the U-shaped network (U-Net) architecture. This study proposes a U-Net algorithm based on frequency-domain adaptive feature decomposition and visual Mamba, namely frequency-domain decoupling Mamba U-Net (FDMUNet), for surgical instrument segmentation. This algorithm embeds a frequency-domain adaptive feature enhancement module into the skip connections, and decomposes features into high-frequency and low-frequency components through the Fourier transform and a learnable filter, followed by channel weighting and fusion, so as to enhance surgical instrument edge information and bridge the semantic gap between encoder and decoder features. On the Endoscopic Vision Challenge 2017 public dataset, FDMUNet achieved Intersection over Union, mean Intersection over Union, and mean class Intersection over Union scores of 70.79%, 74.25%, and 69.50%, respectively. In addition, the ablation experiment further verified the effectiveness of the proposed module. This method not only provides a new solution for instrument segmentation in complex scenes, but also provides a new research idea for the application of frequency-domain information in medical image segmentation.

Release date： Export PDF Favorites Scan

Search

Article Figure Table

The dual-stream feature pyramid network based on Mamba and convolution for brain magnetic resonance image registration

Lung nodule segmentation method based on multiscale feature interaction and coordinate information

Electroencephalogram emotion recognition based on state-space models combined with spatio-temporal feature

Research on surgical instrument segmentation algorithm based on frequency-domain adaptive feature decomposition and visual Mamba

Format

Content