Evaluation of grouped capsule network for intracranial hemorrhage segmentation in CT scans

Wang, Lingying; Tang, Menglin; Hu, Xiuying

doi:10.1038/s41598-023-30581-4

Download PDF

Article
Open access
Published: 01 March 2023

Evaluation of grouped capsule network for intracranial hemorrhage segmentation in CT scans

Lingying Wang^1,2,
Menglin Tang¹ &
Xiuying Hu²

Scientific Reports volume 13, Article number: 3440 (2023) Cite this article

2384 Accesses
13 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Intracranial hemorrhage is a cerebral vascular disease with high mortality. Automotive diagnosing and segmentation of intracranial hemorrhage in Computed Tomography (CT) could assist the neurosurgeon in making treatment plans, which improves the survival rate. In this paper, we design a grouped capsule network named GroupCapsNet to segment the hemorrhage region from a Non-contract CT scan. In grouped capsule network, we constrain the prediction capsules for output capsules produced from different groups of input capsules with various types in each layer. This method can reduce the number of intermediate prediction capsules and accelerate the capsule network. In addition, we modify the squashing function to further accelerate the forward procedure without sacrificing its performance. We evaluate our proposed method with a collected dataset containing 210 intracranial hemorrhage CT scan slices. In experiments, our proposed method achieves competitive results in intracranial hemorrhage area segmentation compared to the existing methods.

Intracranial hemorrhage segmentation and classification framework in computer tomography images using deep learning techniques

Article Open access 17 May 2025

A joint convolutional-recurrent neural network with an attention mechanism for detecting intracranial hemorrhage on noncontrast head CT

Article Open access 08 February 2022

Noncontrast CT-based deep learning for predicting intracerebral hemorrhage expansion incorporating growth of intraventricular hemorrhage

Article Open access 31 August 2025

Introduction

Intracranial hemorrhage, which causes bleeding within brain tissue, is a life-threatening cerebral disease with high incidence and mortality rates^1,2. Non-contrast Computed Tomography (CT) scan is the first imaging modality to diagnose intracranial hemorrhage(ICH), due to its rapidity, convenience, and sensitivity to blood. The location and volume of hemorrhage derived from CT are crucial for neurosurgeons to make operation plans. The volume of ICH can be a predictor of mortality³ and secondary hemorrhage expansion (HE)^1,4,5. Automatic intracranial hemorrhage segmentation on CT^4,6,7 is a fundamental task that assists radiologists in the quantification of hemorrhage volume and shapes analysis.

Recent developments show the deep convolutional neural network (CNN) is powerful in medical image analysis⁸. Some existing methods pay attention to the following related problem: ICH or normal detection^9,10,11, ICH sub-types detection^12,13,14 and hemorrhage region segmentation^14,15,16,17. These CNN models have significantly improved performance by providing frameworks for end-to-end representation learning and a large amount of data training. Then, a validation or test data set is used to evaluate their generalization performance. These methods benefit from the powerful feature extraction ability of convolution neural networks and achieve promising performance.

However, existing CNN-based methods tend to suffer performance degradation due to the lack of spatial relationships or a large amount of data. Beyond CNN, the capsule network extends using vector^18,19,20 or matrix^21,22,23 as a representative unit instead of scalar in CNN. Vector preserves equivariance through the whole capsule network, where its length and orientation represent the probability and property of entity. The evidence that the capsule network has a more powerful representative ability than CNN is shown in the initial experiment^18,24. While sharing the weights and improving the dynamic routing algorithm, these methods remain to have high computational overhead. How to reduce the amount of computation while maintaining the segmentation performance of the capsule network is still an open problem.

In this paper, we propose an effective approach, referred to as GroupCapsNet for intracranial hemorrhage segmentation. Firstly, inspired by group convolution²⁵, which is widely used to reduce convolution calculation, we introduce the group concept to accelerate the capsule network and reduce the memory consumption of immediate vote capsules. During the capsule forwarding procedure, the output capsule only receives a group of input capsules which is a proportion of the whole input. Then, we further propose to modify the squashing function to have a faster computing speed without diminishing the performance.

Contributions

In this paper, our contributions can be summarized as follows.

(1)
We propose the group capsule to accelerate the capsule network and reduce the memory consumption of immediate vote capsules. The modified squashing function also suggests to replace the original one.
(2)
Our GroupCapsNet achieves comparable results with the existing methods in intracerebral hemorrhage segmentation tasks. Comprehensive experiments are conducted to show the superior performance of the group capsule network compared to the non-group counterpart and the convolutional neural network.

Related work

In this section, we will briefly highlight some recent related work including ICH region segmentation, capsule network, and group concept used in CNN.

ICH region segmentation

Either segmentation or dense prediction on the voxel level of ICH is required to calculate the volume of ICH. Grad-CAM²⁶ used by Ye et al.¹² can be employed to produce a highlighted map for ICH, but this map is too coarse and just be adopted as guidance for the interpretation of neural networks by radiologists. A more accurate ICH segmentation map can be obtained by training a full convolution network (FCN) in a supervised learning manner. Chilamkurthy et al.¹⁴ used U-Net based approach to segment ICH, intraparenchymal (IPH), intraventricular (IVH), subdural (SDH), extradural (EDH) and subarachnoid (SAH) hemorrhages Manvel et al.¹⁷ explored the variant U-Net method with test time augmentation and ensembling strategies to segment ICH in three projections (i.e., axial, coronal, and sagittal). Islam et al.¹⁶ proposed ICHNet facilitating hypercolumn features from dilated convolutional layer and incorporating 3D conditional random field to smoothen the segmentation prediction. Ironside et al.¹⁵ evaluated a deeper variant U-Net in ICH volumetric analysis. These methods are based on CNN (especially U-Net⁸), although there are some inherent shortcomings in CNN¹⁸.

Capsule network

In the capsule, the feature is stored as vector¹⁸ or matrix²¹ to preserve its equivariance. The length and orientation of the vector represent the probability and property of the entity presented in the input. In forwarding capsules to the next layer, each input capsule produces a vote capsule to each capsule in the next layer by matrix transformation. This procedure requires additional computational cost and large memory consumption, resulting in the inability to apply a larger and deeper capsule network. To tackle this drawback of the capsule, trainable weight matrices within the local spatial kernel of the same type of capsule are reasonably shared^18,27. In addition, capsule-pooling layer²⁸ was proposed in generalizing capsules to higher dimensional inputs. Are the variants of dynamic routing or expectation-maximum (EM) routing based on weighted kernel density estimation²⁹ was came up with a faster and more effective procedure. These methods enable the capsule network to maintain performance while reducing computational costs. The capsule has also been extended and studied to segmentation tasks by some researchers^27,30. However, because of the heavy memory consumption of immediate vote capsules and the expansive computation of the dynamic routing algorithm, the capsule network is hard to become as deep and large as CNN is. To conquer the issue and apply it to large data set and explore the potential, many researchers focus on accelerating and simplifying the capsule network by sharing the weights^18,27,28, introducing special capsule layer^28,31, improving the dynamic routing algorithm^32,33 etc. Particularly, SegCaps^27,34,35 extend vector capsule to object segmentation including intracranial hemorrhage segmentation task. Despite the accuracy, capsules are less efficient to large computing overhead. How to maintain the accuracy while improving the computational efficiency of the capsule network is still an open problem.

Methods

In this section, we will introduce our capsule network (named GroupCapsNet) used in ICH segmentation. First, we will briefly revisit the basic capsule concept. Second, the grouped capsule on which GroupCapsNet is built will be illustrated and discussed. Finally, the whole GroupCapsNet will be described including its special layer and loss function.

Ethical approval

The study was approved by the Ethics Committee of West China Hospital, Sichuan University in 2022. The study was performed in accordance with relevant guidelines and regulations in compliance with the Declaration of Helsinki. All images were from HIS cases of the system. Informed consent was obtained from all participants and the data was treated anonymously.

Recap capsule

The capsule is a novel form of feature representation, which makes use of vector¹⁸ or matrix²¹ as its unit instead of scalar in a neural network. For clarity, we use vector capsule by default. As to forwarding propagation, it has a similar manner in both capsule and neural networks. The next layer of the capsule will be produced in the next three steps, voting, clustering, and nonlinear mapping.

Let $v^L_t \in {\textbf{R}}^{n \times 1}$ denote the type t capsule in layer L. In voting step, each capsule $v^L_t$ produces its vote capsule $u_{t'|t}^{L+1} \in {\textbf{R}}^{n' \times 1}$ for type $t'$ capsule in layer $L+1$ by matrix transformation as follows:

$$\begin{aligned} u_{t'|t}^{L+1} = W_{t'|t}^L \cdot v_t^L \end{aligned}$$

(1)

where $W_{t'|t}^L \in {\textbf{R}}^{n' \times n} $ is a trainable weight matrix.

In clustering step, all vote capsules $u_{t'|t}^{L+1}$ are aggregatd by weighted summation to produce the dominant capsule ${\hat{v}}^{L+1}_{t'}$ as follows:

$$\begin{aligned} {\hat{v}}^{L+1}_{t'} = \sum _{t} c_{t'|t}^{L+1} u_{t'|t}^{L+1} \end{aligned}$$

(2)

where $c_{t'|t}^{L+1} \in {\textbf{R}}$ is the weight and can be derived from routing algorithm¹⁸.

In nonlinear mapping step, dominant capsule ${\hat{v}}^{L+1}_{t'}$ is mapping by a squashing function to obtain the final output capsule $v^{L+1}_{t'}$. The squashing function can be formulated as follows:

$$\begin{aligned} v^{L+1}_{t'} = \frac{\left\| {\hat{v}}^{L+1}_{t'}\right\| ^2}{1+\left\| {\hat{v}}^{L+1}_{t'}\right\| ^2} \cdot \frac{{\hat{v}}^{L+1}_{t'}}{\left\| {\hat{v}}^{L+1}_{t'}\right\| } \end{aligned}$$

(3)

Convolutional capsule

For inputs like images, the capsule can take advantage of the local receptive field and weight sharing like the convolutional neural network (CNN). The convolutional capsule is obtained from different types of capsules within a spatial local kernel in the previous layer. Additionally, a trainable weight matrix of the same type of capsules is reasonably shared at a different spatial locations to further reduce the computational cost^18,27,28.

Grouped capsule

Different types of capsules are distinguishing features of different entities or parts. To produce a capsule, it does not need all types of capsules in the previous layer as input. The distinguishing feature requires relevant types of capsules rather than as many as possible types of capsules. Therefore, it is reasonable to divide different types of the capsule into groups. In addition, grouped computation in a convolutional neural network has been more widely studied recently and empirically demonstrated its potential in improving accuracy and reducing computational cost³⁶.

Based on the assumption that the relevant types of the capsule can be automatically gathered into one group by training procedure, we equally divide all types of the capsule into non-overlapping groups. Then, each group of capsule independently produce the capsule in the next layer (Fig. 1).

Formally, in grouped capsule, all types t of capsules in one layer are divided into g (i.e. 2, 4 or 8) groups denoted as $G = \{ G_1, G_2, \dots , G_g \} $, where $G_i \subset \{t\}^n $ and $ | G_i | = | G_j | $, $ G_i \cap G_j = \emptyset $ for every $i\ne j$. For the next layer, capsules with type $t'$ are also divided into $G' = \{ G'_1, G'_2, \dots , G'_g \}$ satisfying the same constraint as G, where $G'_i \subset \{t'\}^n$. Capsule with type $t' \in G'_i$ is only related to type $t \in G_i$ capsule in the previous layer, therefore Eq. (2) becomes as follows:

$$\begin{aligned} {\hat{v}}^{L+1}_{t'} = \sum _{t\in G_i} c_{t'|t}^{L+1} u_{t'|t}^{L+1} \end{aligned}$$

(4)

where $t' \in G'_i$. On the other hand, capsule with type t only produces the vote capsules for types in $G'_i$ which is a portion of all types. This ends up with a significant reduction of both the trainable weight matrix and immediate vote capsules. Theoretically, when g increases, the computational cost reduces and is roughly 1/g original without grouped capsule. This forces the grouped capsule to learn the compact representative features.

The grouped capsule can easily generalize to the convolutional version in the case of image-like inputs, where the capsule only has the connection to the previous layer capsules within a specific kernel. Furthermore, the grouped capsule can be seamlessly added to the existing capsule network.

Network architecture

Our whole network architecture named GroupCapsNet is based on a symmetrical U-shape design following U-Net⁸. GroupCapsNet contains the encoder and the decoder parts. The encoder part extracts high-level semantic features, while the decoder part restores these features in high spatial resolution.

In the encoder part, two normal convolutional layers with 16 channels, $3 \times 3$ kernel and stride 1 are firstly applied. We regard 16-channels convolutional output as 2-types, 8-dimensions capsules. Non-linear mapping is employed to the output of convolutional layers to form the initial capsules before forwarding to the next step. Then, the initial capsules are processed through four stages. The design for these stages follows three rules: capsules in the same stage have the same number of types and dimensions; capsules in the next stage double the number of types and dimensions, and halve the spatial resolution by $2 \times 2$ stride;the number of grouped capsule layer doubles in the next stage.

In the decoder part, there also exist four stages. At each stage, the deconvolutional capsule is firstly applied to double spatial resolution from the output of the previous stage or the final stage in the encoder part. All types of the capsule from the output of the deconvolutional capsule and the corresponding stage in the encoder part are collected for subsequent processing. These stages are also satisfied the similar rules as mentioned in the encoder part: capsules in the same stage have the same number of types and dimensions; the number of grouped capsule layer halves in the next stage.

The segmentation header layer is added up to the output of the decoder. In the segmentation header, one capsule layer with $1 \times 1$ kernel and $1 \times 1$ stride is employed to produce the final 1-type, 8-dimension segmentations capsules. The hemorrhage region and the background are determined by the length of segmentation capsules after thresholding.

Non-linear mapping

Non-linear mapping is designed to enhance the non-linear representation ability of the network. The squashing function (Eq. 3) normalizes the length of the capsule. The right part of the squashing makes the capsule become a unit vector, while the left part takes non-linear mapping to its length. In our GroupCapsNet, we propose to use the modified squashing function as follows:

$$\begin{aligned} v^{L+1}_{t'} = \frac{\left\| {\hat{v}}^{L+1}_{t'}\right\| }{1+\left\| {\hat{v}}^{L+1}_{t'}\right\| } \cdot \frac{{\hat{v}}^{L+1}_{t'}}{\left\| {\hat{v}}^{L+1}_{t'}\right\| } = \frac{{\hat{v}}^{L+1}_{t'}}{1+\left\| {\hat{v}}^{L+1}_{t'}\right\| } \end{aligned}$$

(5)

which is slightly different from the left part of the squashing function.

Results

In this section, we will conduct comprehensive experiment to evaluate our GroupCapsNet in ICH segmentation. First, we will describe the experimental setting including data set collection, preprocessing and metric involved in our experiment. Second, ablation study of GroupCapsNet is conducted to demonstrate the effectivness of group number and modified squashing function. Finally, we discuss the comparison of GroupCapsNet and the existing methods in ICH Segmentation.

Experimental setting

Implementation details

We implement our proposed method with PyTorch on NVIDIA Tesla V100 GPUs. We perform random flip, rotation as data augmentation and gaussian additive noise with ${\mathcal {N}}(0,0.01)$ as data perturbation. Our method is trained by Adam optimizer³⁷ with initial learning rate $5e-4$ for 100 epochs. The learning rate is reduced by factor 0.97 for each epoch. The batch size is 32 where a half is labeled images and the rest is unlabeled images. We set $\alpha =0.99$ in exponential moving average(EMA).

Dataset

The dataset used in this experiment is a non-public dataset which is came from our cooperative hospital, which consists a total of 210 Non-Contrast CT scans of patients’ brains from cooperative hospital during January 2017 to December 2019 named ICH210. Each slice of CT scans has been traced into hemorrhage and brain tissues by a master degree student who majors in neurosurgery and all the tracing result have been checked by an emergency medicine specialist working in a hospital. The size of Non-Contrast CT scans is $ {\mathcal {R}}^{512 \times 512 \times (10 - 60)} $. We employ 5-fold cross validation technology to evaluate our proposed method.

Preprocessing

Because the brain tissues and hemorrhage that we concerned have a CT value smaller than 90 Hounsfield Unit, we clip all scans CT value into the range of 0–90 and then normalize them into 0–1. We split the CT scans into slices in z-axial. Slice images have been resized into $ {\mathcal {R}}^{256 \times 256} $. For data augmentation, we flip and rotate the slice randomly.

Evaluation metric

To quantitatively evaluate the methods in ICH segmentation, we calculate four types metrics: Dice Coefficient (Dice), Intersection over Union (IOU), Sensitivity and Specificity¹⁶. They can be formulated as follows:

$$\begin{aligned} Dice= & {} \frac{2 \cdot TP}{2 \cdot TP+FN+FP} \end{aligned}$$

(6)

$$\begin{aligned} IOU= & {} \frac{TP}{TP+FN+FP} \end{aligned}$$

(7)

$$\begin{aligned} Sensitivity= & {} \frac{TP}{TP + FN} \end{aligned}$$

(8)

$$\begin{aligned} Specificity= & {} \frac{TN}{TN + FP} \end{aligned}$$

(9)

where TP, FP, TN and FN are the number of true-positive, false-positive, true-negative and false-negative respectively. Due to common data imbalance problems invovled in ICH segmentation, the performance of segmentation mainly depends on Dice Coefficient.

Ablation study

Modified squashing vs. squashing

The only difference between modified squashing and squashing function is how to take non-linear mapping on capsule length. By rewriting Eq. (3) into Eq. (5), these non-linear mapping function have the similar properties as their curves shown in Fig. 2a. However, modified squashing has a greater advantage over squashing in term of forward running time. We evaluate them by a 16-dimension capsule and their forward execution times are shown in Fig. 2b. The result is obtained by averaging over 1000 times experiments on PyTorch platform. The forward time of the modified squashing is about 30% lower than that of the squashing (from 0.0001 to 0.00007 s). Because the modified squashing reduces the number of floating point operations, in our experiment, the backward running time of the modified squashing is also reduced.

We conduct the experiment to compare the impact of different non-linear mapping function on performance. We train two types of the GroupCapsNet about 250 epoches in ICH210 dataset. As shown in Table 1, comparing to squashing, modified squashing doesn’t reduce but slightly improve the performance of the GroupCapsNet.

Table 1 Performance comparison of different non-linear mapping function.

Full size table

Grouped vs. non-grouped

We investigate the impact of grouped capsule and non-grouped capsule in terms of computational speed and performance. The number of groups is a crucial factor in our GroupCapsNet. When the group number g increases, the number of immediate vote capsules significantly reduces by 1/g. This is beneficial in computational speed of both forward and backward procedures between capsules. We note that non-grouped capsule network can be seen as a special case of grouped capsule network (i.e. $g=1$).

To empirically show the impact of the number of group in computational speed, we study four types of grouped capsules with different group number (i.e. $g=1$, $g=2$, $g=4$ and $g=8$). For different types of grouped capsules, the dimension of input and output capsules are 16 and 32 respectively, and the dimension of other capsules are all 8. The spatial resolution of different grouped capsule is $64 \times 64$ and the iteration number of routing algorithm is 3. The experiment result is shown in Fig. 3. The forward execution times of $g=2$, $g=4$ and $g=8$ grouped capsule are 59%, 45% and 38% of non-grouped capsule. This shows that grouped capsule can significantly reduce the computational cost. We note that the number of execute time reduces is not the theoretical value (i.e. 1/g). The reason may be the inefficient operation of concatenating all capsule groups to form the output capsule. When g increases, the proportion of operation time increases.

To investigate the impact of the group number on the performance, we design four types of GroupCapsNets, which have different group numbers in grouped capsule layer. When the numbers of initial capsule types and dimensions are determined, the following architecture of GroupCapsNet can be automatically configurated due to its highly modular design. We denote four types of GroupCapsNets as GroupCapsNet-G1, GroupCapsNet-G2, GroupCapsNet-G4 and GroupCapsNet-G8, which have the group number 1, 2, 4 and 8 respectively. And we set the numbers of initial capsule types and dimensions in GroupCapsNet-G$\#g$ ($\#g \in \{1, 2, 4, 8\}$) as $\#g$ and $16/\#g$ respectively. We train these types of GroupCapsNets about 250 epoches in ICH210 dataset. The experiment results are shown in Table 2. GroupCapsNet-G2 achieves the best performance and outperforms 2% than non-grouped GroupCapsNet in term of dice coefficient. The results also show that the performance decreases when $g=4, 8$. On the one hand, we think that how many groups capsules can be divided depends on specific dataset. If we force to split the relevant types capsules into independent groups, the effect of GroupCapsNet will decline. On the other hand, when $g=4,8$, the couterpart of GroupCapsNet have fewer weights to capture patterns and the capsules with fewer dimension may do not have sufficient representation capacity.

Table 2 Performance comparison of GroupCapsNets with different group numbers.

Full size table

Compare with existing methods

To demonstrate the effectiveness of GroupCapsNet, we compare it with existing methods of ICH segmentation. DPN92-Unet and ICHNet are recently proposed to focus on ICH segmentation and achieve promising results. These models are based on deep convolutional nerual network, unfortunately, their datasets and code are not public.

U-Net is a mature medical image segmentation method, which is simple but effective, and it is still state-of-the-art in some applications recently. We evaluate a smaller version of U-Net named U-Net-2M on ICH210, which has 1/4 trainable weight parameters in each layer comparing to the original U-Net. U-Net-2M is designed to have similar parameters as our GroupCapsNet. The SegCaps is also evaluated on ICH210, because it is similar to our non-group counterpart of GroupCapsNet. U-Net-2M, SegCaps and GroupCapsNet are trained using BCE loss. The optimizer is Adam with 0.001 learning rate. The learning rate is decayed by a factor of 0.9 for 60 epoches. All the methods are trained at most 250 epoches. Table 3 shows the result from 5-folds cross validation of different methods. We also conduct an ensembles of 3 GroupCapsNets.

Table 3 Performance comparison between GroupCapsNet and existing methods in intracranial hemorrhage segmentation, DPN92-Unet ensembles and GroupCapsNet ensembles methods means that training 3 models and averaging their prediction results.

Full size table

Our GroupCapsNet outperforms U-Net 2M by about 2% in term of Dice. In the case of the similar number of parameters, GroupCapsNet shows its potential on feature representation. The SegCaps is one kind of non-group capsule network. The experiment shows the effectiveness of grouped capsule. In addition, our GroupCapsNet shows the comparable performance as comparing to ICHNet in ICH segmentation. After ensembling (training 3 models and averaging their prediction results), our GroupCapsNet achieve the best dice of 0.8901 in ICH segmentation.

The quantitative output for ICH segmentation is shown in Fig. 4.

Conclusion

In this paper, we aim to reduce the burden computational cost and memory consuming of capsule network and employ it to ICH segmentation task. To solve the shortcoming of capsules, we design a capsule network named GroupCapsNet comprising the grouped capsule layer. A comprehensive experiment is conducted to show that the effectivness of grouped capsule and modified squashing function. In ICH segmentation, our GroupCapsNet achieve the comparable performence as comparing to the existing approaches.

Data availability

The datasets analysed during the current study are not publicly available due to protection of personal information but are available from the corresponding author on reasonable request.

References

Wu, X. et al. Risk factors for intracranial hemorrhage and mortality in adult patients with severe respiratory failure managed using veno-venous extracorporeal membrane oxygenation. Chin. Med. J. 135, 36–41 (2022).
Article ADS CAS Google Scholar
Seiffge, D. J. et al. Meta-analysis of haematoma volume, haematoma expansion and mortality in intracerebral haemorrhage associated with oral anticoagulant use. J. Neurol. 266, 3126–3135 (2019).
Article CAS PubMed PubMed Central Google Scholar
Broderick, J. P., Brott, T. G., Duldner, J. E., Tomsick, T. & Huster, G. Volume of intracerebral hemorrhage. A powerful and easy-to-use predictor of 30-day mortality. Stroke 24, 987–993 (1993).
Article CAS PubMed Google Scholar
Mansour, R. F. & Aljehane, N. O. An optimal segmentation with deep learning based inception network model for intracranial hemorrhage diagnosis. Neural Comput. Appl. 33, 13831–13843 (2021).
Article Google Scholar
Muschelli, J. et al. Pitchperfect: Primary intracranial hemorrhage probability estimation using random forests on ct. NeuroImage Clin. 14, 379–390 (2017).
Article PubMed PubMed Central Google Scholar
Quintas-Neves, M. et al. Noncontrast computed tomography markers of outcome in intracerebral hemorrhage patients. Neurol. Res. 41, 1083–1089 (2019).
Article PubMed Google Scholar
Kumar, I., Bhatt, C. & Singh, K. U. Entropy based automatic unsupervised brain intracranial hemorrhage segmentation using ct images. J. King Saud Univ. Comput. Inform. Sci. 34, 2589–2600 (2022).
Google Scholar
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. in International Conference on Medical Image Computing and Computer-Assisted Intervention, 234–241 (Springer, 2015).
Kuo, W., Häne, C., Yuh, E., Mukherjee, P. & Malik, J. Cost-sensitive active learning for intracranial hemorrhage detection. in International Conference on Medical Image Computing and Computer-Assisted Intervention, 715–723 (Springer, 2018).
Zhao, L., Lee, A., Fan, Y.-H., Mok, V. C. & Shi, L. Magnetic resonance imaging manifestations of cerebral small vessel disease: Automated quantification and clinical application. Chin. Med. J. 134, 151–160 (2021).
Article Google Scholar
Vidya, M., Mallya, Y., Shastry, A. & Vijayananda, J. Recurrent sub-volume analysis of head ct scans for the detection of intracranial hemorrhage. in International Conference on Medical Image Computing and Computer-Assisted Intervention, 864–872 (Springer, 2019).
Ye, H. et al. Precise diagnosis of intracranial hemorrhage and subtypes using a three-dimensional joint convolutional and recurrent neural network. Eur. Radiol. 29, 1–11 (2019).
Yu, N. et al. A robust deep learning segmentation method for hematoma volumetric detection in intracerebral hemorrhage. Stroke 53, 167–176 (2022).
Article PubMed Google Scholar
Chilamkurthy, S. et al. Development and validation of deep learning algorithms for detection of critical findings in head ct scans. arXiv preprint arXiv:1803.05854 (2018).
Ironside, N. et al. Fully automated segmentation algorithm for hematoma volumetric analysis in spontaneous intracerebral hemorrhage. Stroke 50, 3416–3423 (2019).
Article PubMed Google Scholar
Islam, M. et al. Ichnet: Intracerebral hemorrhage (ich) segmentation using deep learning. in International MICCAI Brainlesion Workshop, 456–463 (Springer, 2018).
Manvel, A., Vladimir, K., Alexander, T. & Dmitry, U. Radiologist-level stroke classification on non-contrast ct scans with deep u-net. In International Conference on Medical Image Computing and Computer-Assisted Intervention, 820–828 (Springer, 2019).
Sabour, S., Frosst, N. & Hinton, G. E. Dynamic routing between capsules. Adv. Neural Inform. Process. Syst. 9, 3856–3866 (2017).
Goceri, E. Capsnet topology to classify tumours from brain images and comparative evaluation. IET Image Process. 14, 882–889 (2020).
Article Google Scholar
Xiang, C., Zhang, L., Tang, Y., Zou, W. & Xu, C. Ms-capsnet: A novel multi-scale capsule network. IEEE Signal Process. Lett. 25, 1850–1854 (2018).
Article ADS Google Scholar
Sabour, S., Frosst, N. & Hinton, G. Matrix capsules with em routing. in 6th International Conference on Learning Representations, ICLR (2018).
Chao, H., Dong, L., Liu, Y. & Lu, B. Emotion recognition from multiband eeg signals using capsnet. Sensors 19, 2212 (2019).
Article ADS PubMed PubMed Central Google Scholar
Liu, Y., Zhang, D., Zhang, Q. & Han, J. Part-object relational visual saliency. IEEE Trans. Pattern Anal. Mach. Intell. 44, 3688–3704 (2022).
PubMed Google Scholar
Mukhometzianov, R. & Carrillo, J. Capsnet comparative performance evaluation for image classification. arXiv preprint arXiv:1805.11195 (2018).
Zhang, T., Qi, G.-J., Xiao, B. & Wang, J. Interleaved group convolutions. in Proceedings of the IEEE International Conference on Computer Vision, 4373–4382 (2017).
Selvaraju, R. R. et al. Grad-cam: Visual explanations from deep networks via gradient-based localization. in Proceedings of the IEEE International Conference on Computer Vision, 618–626 (2017).
LaLonde, R. & Bagci, U. Capsules for object segmentation. arXiv preprint arXiv:1804.04241 (2018).
Duarte, K., Rawat, Y. & Shah, M. Videocapsulenet: A simplified network for action detection. in Advances in Neural Information Processing Systems, 7610–7619 (2018).
Zhang, S., Zhou, Q. & Wu, X. Fast dynamic routing based on weighted kernel density estimation. in International Symposium on Artificial Intelligence and Robotics, 301–309 (Springer, 2018).
Bonheur, S. et al. Matwo-capsnet: A multi-label semantic segmentation capsules network. in International Conference on Medical Image Computing and Computer-Assisted Intervention, 664–672 (Springer, 2019).
Edraki, M., Rahnavard, N. & Shah, M. Subspace capsule network. Proc. AAAI Conf. Artif. Intell. 34, 10745–10753 (2020).
Google Scholar
Mobiny, A., Lu, H., Nguyen, H. V., Roysam, B. & Varadarajan, N. Automated classification of apoptosis in phase contrast microscopy using capsule network. IEEE Trans. Med. Imaging 39, 1–10 (2019).
Article PubMed Google Scholar
Chen, J. & Liu, Z. Mask dynamic routing to combined model of deep capsule network and u-net. IEEE Trans. Neural Netw. Learn. Syst. 31, 2653–2664 (2020).
PubMed Google Scholar
Koresh, H. J. D., Chacko, S. & Periyanayagi, M. A modified capsule network algorithm for oct corneal image segmentation. Pattern Recognit. Lett. 143, 104–112 (2021).
Article ADS Google Scholar
Ghosh, S., Das, N., Das, I. & Maulik, U. Understanding deep learning techniques for image segmentation. ACM Comput. Surveys (CSUR) 52, 1–35 (2019).
Article Google Scholar
Sinha, D. & El-Sharkawy, M. Thin mobilenet: An enhanced mobilenet architecture. In 2019 IEEE 10th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), 0280–0285 (IEEE, 2019).
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. in 3rd International Conference for Learning Representations (2014).

Download references

Author information

Authors and Affiliations

West China School of Nursing, Sichuan University/ West China Hospital Critical Care Medicine Department, Sichuan University, Chengdu, 610041, China
Lingying Wang & Menglin Tang
Innovation Center of Nursing Research, Nursing Key Laboratory of Sichuan Province, West China Hospital, Sichuan University, Chengdu, 610041, China
Lingying Wang & Xiuying Hu

Authors

Lingying Wang
View author publications
Search author on:PubMed Google Scholar
Menglin Tang
View author publications
Search author on:PubMed Google Scholar
Xiuying Hu
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization, LY.W. and ML.T.; methodology, LY.W. and ML.T.; collection, LY.W., and XY.H.; writing—original draft preparation, LY.W.; writing—review and editing, ML.T. and XY.H.

Corresponding author

Correspondence to Xiuying Hu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, L., Tang, M. & Hu, X. Evaluation of grouped capsule network for intracranial hemorrhage segmentation in CT scans. Sci Rep 13, 3440 (2023). https://doi.org/10.1038/s41598-023-30581-4

Download citation

Received: 27 November 2022
Accepted: 27 February 2023
Published: 01 March 2023
Version of record: 01 March 2023
DOI: https://doi.org/10.1038/s41598-023-30581-4

This article is cited by

Intracranial hemorrhage segmentation and classification framework in computer tomography images using deep learning techniques
- S. Nafees Ahmed
- P. Prakasam
Scientific Reports (2025)
STCPU-Net: advanced U-shaped deep learning architecture based on Swin transformers and capsule neural network for brain tumor segmentation
- Ilyasse Aboussaleh
- Jamal Riffi
- Hamid Tairi
Neural Computing and Applications (2024)

Subjects

Abstract

Similar content being viewed by others

Intracranial hemorrhage segmentation and classification framework in computer tomography images using deep learning techniques

A joint convolutional-recurrent neural network with an attention mechanism for detecting intracranial hemorrhage on noncontrast head CT

Noncontrast CT-based deep learning for predicting intracerebral hemorrhage expansion incorporating growth of intraventricular hemorrhage

Introduction

Contributions

Related work

ICH region segmentation

Capsule network

Methods

Ethical approval

Recap capsule

Convolutional capsule

Grouped capsule

Network architecture

Non-linear mapping

Results

Experimental setting

Implementation details

Dataset

Preprocessing

Evaluation metric

Ablation study

Modified squashing vs. squashing

Grouped vs. non-grouped

Compare with existing methods

Conclusion

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Intracranial hemorrhage segmentation and classification framework in computer tomography images using deep learning techniques

STCPU-Net: advanced U-shaped deep learning architecture based on Swin transformers and capsule neural network for brain tumor segmentation

Search

Quick links