Introduction

In disease diagnostic, pathologists perform microscopic examinations of histopathological samples to identify the sign of infection. During this analysis, primary focus is on tissue structure, cell count, and shape of the cell. Moreover, this manual examination demands expertise which makes it an expensive, one-sided, and tedious process1. Consequently, its automation is fundamental for quick and impartial finding2. To do this, histopathological images are captured through microscopic mounted cameras, which are further analyzed by computer-assisted histopathological image analysis methods. Figure 1 illustrates some sample images of different tissues, taken at 40x magnification level3. The intricate structure of histopathological images presents a challenging environment, even for classification tasks. Therefore, this paper presents a novel method for the efficient classification of histopathological images.

Fig. 1
figure 1

Representative histopathological images of different tissues, taken at 40x magnification level3.

Further, the success of any classification-based method is predominantly dependent on the quality of features4. In literature, numerous methods are presented to efficiently extract the features, which are broadly classified as traditional and learning. The first category corresponds to feature descriptors which are extracted based on statistical computation5. Scale-invariant feature transform (SIFT)6, histogram-oriented gradient (HOG)7, and speed-up robust features (SURF)8 are some of the common examples of traditional feature extraction methods. These methods are quite effective for natural image analysis. However, these methods produce a large number of irrelevant features in a complex environment. Moreover, features generated by traditional methods are generally non-transferable. On the contrary, learning-based methods employ various machine learning models to obtain features9. Deep neural networks (DNN) models and auto-encoders restricted Boltzmann machines are some of the commonly used models. The literature witnessed that DNN-based solutions are effective in analyzing complex and diverse environments10, like histopathological images. Masci et al.11 presented a feature extraction method by initializing a convolutional neural network (CNN) with a convolution auto-encoder. Xu et al.9 extracted features from stromal and epithelial tissue by employing a deep CNN. A comprehensive survey on various DNN-based feature extraction methods can be found in12,13. Generally, DNN-based feature extraction methods find global features by learning directly from low-level features, resulting in the generation of better feature descriptors. However, these feature descriptors may include redundant and irrelevant features, which will degrade the performance of the classifier14,15,16,17. Therefore, this paper presents a new feature selection method for efficiently classifying histopathological images.

In general, the selection of relevant features is a combinatorial optimization problem as there is a possibility of \(2^n\) combinations of relevant features for n features18,19. When feature space is large, identifying the correct combination of features is exhaustive. To fasten this search, researchers have employed meta-heuristic-based solutions20,21,22. Generally, the swarm-based algorithms are optimization models that stimulate nature’s behaviour mathematically23. In literature, these algorithms have successfully solved various complex and non-linear real-world problems of different domains such as biomedical image analysis, wireless sensor network, software engineering, sentiment analysis, etc.24,25,26,27,28,29. Some of the widely used meta-heuristic algorithms are differential algorithm30, genetic algorithm31, particle swarm optimization32, gravitational search algorithm33, artificial bee colony34, cuckoo search35, and military-dog-based algorithm36. Mostly, meta-heuristic algorithms consider single objective functions to do optimization37,38,39. However, this presents only a single perspective for solving the problem40,41,42. In contrast, multi-objective meta-heuristic algorithms are better as multiple criteria are considered while performing optimization43. In literature, researchers have presented several algorithms for unfolding multi-objective optimization. Some of the widely used methods for untangling multi-optimization problems are Pareto-archived evolution strategy (PAES), Non-dominated sorting GA (NSGA-II), and strength-Pareto evolution algorithm (SPEA)44. Abdollahzadeh and Gharehchopogh45 employed three optimization algorithms to select the best feature set. In their approach, in the first solution, the Harris Hawks Optimization (HHO) algorithm has been multiplied, and in the second solution, the Fruitfly Optimization Algorithm (FOA) has been multiplied, and in the third solution, these two solutions are hybridized to improve the efficacy. Li et al.46 introduced a multi-objective binary grey wolf optimization for feature selection using Pearson correlation-guided mutation, uniform initialization, and three distinct mutation strategies for efficient search. Jiao et al.47 presented a multiform framework for multiobjective feature selection, leveraging auxiliary single-objective tasks in a multitask environment to enhance evolutionary search by utilizing promising feature subsets. Xue et al.48 proposed a multi-objective evolutionary algorithm with interval-based initialization and self-adaptive crossover for large-scale feature selection, improving initial population distribution and reducing decision space similarity. Bai et al.49 introduced a joint multiobjective optimization approach for high-dimensional data classification using a neural network classifier and a non-iterative training algorithm, ensuring good performance and fast learning. Zhang et al.50 proposed a multi-objective optimization algorithm combining Equilibrium Optimizer (EO) and NSGA-III for feature selection in high-dimensional data. It uses S-shaped, V-shaped, and U-shaped transfer functions for binary coding and optimizes the binary search space through an external archive and clustering strategy. Recently, Aziz et al.51 presented Multi-objective whale optimization algorithm (MOWOA), which mimics the hunting behavior of humpback whales52. MOWOA consists of mainly three phases: prey encircling, bubble-net attacking strategy, and searching for new prey. Although MOWOA has shown a number of merits, such as optimal Pareto-fronts and balanced exploration and exploitation. However, it sometimes suffers from poor population diversity, resulting in less exploration rate. Therefore, this paper introduces a new variant of MOWOA, improved multi-objective WOA (IMOWOA), to improve population diversity.

The contribution of this paper can be analyzed in two folds;

  1. 1.

    Novel Variant of Multi-Objective WOA A novel variant of the multi-objective WOA is proposed, termed improved multi-objective WOA (IMOWOA). In the proposed IMOWOA, the archive set is partitioned into different priority sets based on non-dominated sorting and crowding distance. This prioritization ensures better management of solutions, leading to improved population diversity and enhanced exploration capabilities.

  2. 2.

    New Feature-Selection MethodA new feature-selection method is developed, which uses IMOWOA to select optimal features for the efficient classification of histopathological images. The performance of IMOWOA is validated on ten CEC2009 multi-objective benchmark problems, including bi-objective and tri-objective test problems. To compare the results, three parameters, namely maximum spread (MS), inverted generational distance (IGD), and spacing (SP) are considered. The results are validated against three existing multi-objective meta-heuristic algorithms: multi-objective evolutionary algorithm (MOEA/D), multi-objective particle swarm optimization (MOPSO), and MOWOA.

The proposed feature-selection method is investigated on a publicly available histopathological dataset with four tissue classes: epithelial, connective, muscular, and nervous. For comparison, three existing feature selection methods are considered. The extracted features by the considered methods are classified using six well-known classifiers: LDA, SVM, kNN, ZeroR, and RF. The results are compared in terms of accuracy, mean number of selected features, and computational time.

The remaining paper is organized as follows; Section “Multi-objectivewhale optimizationalgorithm (MOWOA)” briefs the multi-objective WOA. The proposed approach, along with the proposed variant, is illustrated in section “Proposed method”. Section “Results and discussion” discusses the experimental analysis of the IMOWOA and the proposed method. Finally, Section “Conclusion” presents the conclusion of the paper.

Multi-objective whale optimization algorithm (MOWOA)

MOWOA is a multi-objective version of WOA52, which is inspired by Humpback Whales. Whales are the largest creature globally; which has twice the number of body cells than humans. The optimization process is designed using the hunting analogy of the whales. There are mainly three phases of hunting: encircling the prey, bubble-net attacking strategy, and searching for new prey. The conceptual and mathematical descriptions of these phases are provided below.

  1. 1.

    Encircle the prey: This is the first phase of hunting. As the location of the prey in the search domain is initially unknown, WOA considers the current best position as the overall best solution and updates its further movement accordingly as per Eq. (1)

    $$\begin{aligned} \vec {X}(t+1)=\vec {X}_b(t) - {2\vec {a} \cdot \vec {r} - \vec {a}} \cdot \mid 2 \cdot \vec {r} \cdot \vec {X}_b(t) - \vec {X}(t)\mid \end{aligned}$$
    (1)

    where, \({X}_b(t)\) is the best position at iteration t, \(r \in (0,1)\), vector a is contain decreasing values from 2 to 0, and \({X}(t+1)\) represents the location of the whale at next step.

  2. 2.

    Bubble-net attacking strategy In this phase, along with the encircling strategy, the spiral movement of the whales is also considered to search the prey in an equally likely manner. The probability (\(p_r\)) of using spiral and shirking encircling phases is \(50\%\) each. The mathematical formulation of bubble-net attacking is provided in Eq. (2)

    $$\begin{aligned} \vec {X}(t+1)={\left\{ \begin{array}{ll} (({X}_b(t)-{X}(t))\cdot e^{bl}\cdot (2\pi l)+ {X}_b(t)) & p_r\ge 0.5\\ (Eq. (xm{4}))& p_r <0.5 \end{array}\right. } \end{aligned}$$
    (2)

    where b denotes the shape of the spiral and kept as the constant, \(l \in [-1,1]\) denotes a random number.

  3. 3.

    New prey search: The above two phases are responsible for the exploitation, whereas this phase is leveraged to explore the search domain for finding new prey. The searching behavior is purely random and can be formulated by Eq. (3).

    $$\begin{aligned} \vec {X}(t+1)=\vec {X}_{r}(t) - {2\vec {a} \cdot \vec {r} - \vec {a}} \cdot \mid 2 \cdot \vec {r} \cdot \vec {X}_r(t) - \vec {X}(t)\mid \end{aligned}$$
    (3)

    where \({X}_{r}\) is a randomly generated solution at iteration t.

The above-defined WOA can be efficiently leveraged for multi-objective problems by the inclusion of various significant constructs like finding non-dominance between solutions, maintaining the archive set, selection methods, etc. in MOWOA. In multi-objective optimization, the ranking of the solutions is defined by pareto-optimality based on non-dominated solutions. Further, to manage the repository of these solutions archive set is maintained. There are certain rules to update or modify the archive set which is described as follows:

Rule 1::

Initially the archive set is empty, so all the non-dominated solutions are inserted into it.

Rule 2::

Let the archive set consists of S solutions which are non-dominated. If the new non-dominated solutions are not better than these S and few solutions belonging to the archive set dominate, then there is no change in the archive set.

Rule 3::

If new solutions and solutions in the archive set are non-dominated then these new solutions will be added to the archive set.

Rule 4::

If any of the new non-dominated solutions is better than one or more solutions in the archive set, then the better solutions is included in place of previous ones.

Rule 5::

If the archive set is not empty, then the most crowded solutions are taken away from the archive to balance the diverse set of solutions and new solutions are given chance.

Moreover, to maintain the diversity and distribution of the solutions in the archive set roulette wheel method is used.

Proposed method

This paper introduces a novel variant of multi-objective WOA termed as improved multi-objective WOA (IMOWOA). Further, a new IMOWOA-based feature-selection method is introduced to extract relevant features for the efficient classification of images, especially histopathological images. The overall flow diagram of the proposed method is depicted in Fig. 2.

Fig. 2
figure 2

The overall flow of the propose method.

IMOWOA

In the existing MOWOA, only one archive set is maintained in each iteration, which may sometimes result in poor population diversity and less exploration. However, in the proposed IMOWOA, although there is always one archive set, it is partitioned into different priority sets. This partitioning ensures better management of the solutions, leading to improved population diversity and enhanced exploration capabilities. The archive set is updated by including non-dominated solutions from different priority sets, which allows for a more diverse and comprehensive search of the solution space. The detailed steps for updating the archive set are as follows:

  1. 1.

    Let there are N individual solutions in the population of size P. Find the non-dominated set of individual solutions from P.

  2. 2.

    Store the non-dominated individual solutions in the \(priority\_set_i\) (i=1) and delete these solutions from the P.

  3. 3.

    Again, find the non-dominated individual solutions from the remaining population P and add these solutions to the \(priority\_set_i\) where (\(i=2, 3, 4, \cdots\))

  4. 4.

    Repeat the above process until no solution is left in the population P.

Once the archive set is partitioned into the different priority sets, then the population (\(P'\)) for the next iteration is updated using the following steps:

  1. 1.

    Add the solutions of \(priority\_set_i\) (i=1) to the \(P'\)

  2. 2.

    If (\(\mid P \mid <N\)) and (N-\(\mid P\mid )<\)s number of solutions in \(priority\_set_i\) where i = 2, 3,... then add solutions of \(priority\_set_i\) to the \(\mid P'\mid\)

  3. 3.

    Otherwise, find the less crowded solutions in the \(priority\_set_i\) and add them to \(P'\)

Once the \(P'\) is filled with N solutions, the above procedure is stopped. The resulting \(P'\) is nothing but the newly generated archive set. Due to the inclusion of domination sorting and crowding distance, the new archive set is more diverse and has good exploration capability.

IMOWOA for feature selection

A novel IMOWOA-based feature-selection (IMOWOA-FS) method is proposed for the optimal selection of relevant features. First, the considered images are processed through a convolutional neural network to obtain the relevant features. The convolutional neural network’s architecture is illustrated in Fig. 3. Next, the proposed variant, an improved multi-objective whale optimization algorithm (IMOWOA), operates on the extracted features to attain the optimal feature set. In the proposed variant, the population is initialized with real values between 0 and 1. Each extracted feature represents one dimension of the data. Further, a sigmoid function maps each individual to 0 and 1. Equation (1) presents the formulation of the sigmoid function for \(x_i^j\) value of the \(i{\text {th}}\) individual at \(j{\text {th}}\) dimension.

Fig. 3
figure 3

Architecture of CNN.

$$\begin{aligned} S\left( x_i^j\right) ={\frac{1}{1+e^{-x_i^j}}} \end{aligned}$$
(4)

Furthermore, if the resultant value \(\bigg (x_i^j\bigg )\) of \(i{\text {th}}\) individual corresponds to ‘1’, then the corresponding extracted feature is selected, else the corresponding extracted feature is discarded. Based on the set of selected features, the fitness of individuals is computed by considering 2 objective functions as presented in Eqs. (2) and (3). Equation (2) corresponds to the maximization of the classification accuracy on the selected features while Eq. (3) depicts the minimization of the number of features to be selected.

$$\begin{aligned} & Maximization: \frac{1}{k}\sum _{i=1}^{k} TP \end{aligned}$$
(5)
$$\begin{aligned} & Minimization: \frac{\#SF}{\#TEF} \end{aligned}$$
(6)

In Eq. (2), ‘k’ denotes the number of classes while, TP denotes the correctly identified instances of class ‘k’. Further, Eq. (3) considered the ratio of selected features (SF) to total extracted features (TEF). The position of each individual is updated using proposed IMOWOA. Finally, the optimal feature set trains the considered classifier to infer labels for the input image.

In multi-objective optimization, it is common to treat objectives with equal importance to ensure a balanced exploration of the solution space. The IMOWOA inherently balances the maximization of accuracy and minimization of the number of selected features through its non-dominated sorting and crowding distance mechanisms. This balance ensures that solutions are not biased towards either objective, promoting a diverse set of optimal solutions that achieve a good trade-off between accuracy and feature selection. This approach is beneficial because it allows the algorithm to explore a wide range of potential solutions without prematurely converging on solutions that overly prioritize one objective over the other. This is particularly important in feature selection tasks, where both accuracy and the number of selected features significantly impact the performance and interpretability of the classifier. By maintaining this balance, IMOWOA ensures that the selected features are not only relevant and contribute to high classification accuracy but are also minimal, reducing computational complexity and enhancing the model’s generalizability.

Simultaneous optimization of maximization and minimization objectives

The proposed IMOWOA handles both maximization and minimization objectives simultaneously through the following steps:

  • Non-Dominated Sorting: During each iteration, the algorithm sorts the population based on non-domination. Solutions that are not dominated by any other solution in terms of both objectives (maximization of accuracy and minimization of the number of selected features) are considered non-dominated.

  • Pareto Front Construction: The non-dominated solutions form the Pareto front, representing the best trade-offs between the objectives. These solutions are stored in the archive set.

  • Crowding Distance Calculation: To maintain diversity among the Pareto front solutions, a crowding distance metric is calculated for each solution. This metric ensures a diverse spread of solutions by favoring those in less crowded regions of the objective space.

  • Selection for Next Iteration: The next generation of solutions is selected based on their dominance rank and crowding distance. Solutions with better trade-offs (higher accuracy and fewer selected features) are given higher priority.

By balancing these two objectives through non-dominated sorting and crowding distance, IMOWOA effectively handles the simultaneous optimization of maximization and minimization objectives, leading to a diverse set of optimal solutions.

Time complexity of IMOWOA

In the proposed algorithm, the time complexity depends on the determination of the non-dominated individuals and the updation of the individuals. The computation complexity for identifying the non-dominated solutions is O(\(N^2\)*L) wherein N represents the number of individuals and L corresponds to the number of objective functions. The updation of individuals takes O(N*D) computing cost, where D depicts the feature space of each individual. Therefore, the overall complexity of the proposed algorithm is O(\(N^2\)*L).

Results and discussion

The performance of the proposed IMWOA has been discussed in sections “Performance analysis of proposed IMOWOA” and “Performance analysis of feature selection technique”. First, the proposed IMOWOA has been validated on ten CEC-2009 multi-objective benchmark problems in section “Performance analysis of proposed IMOWOA”, then in section “Performance analysis of feature selection technique” it has been used for selecting relevant features from histopathological image datasets. All of the experiments were run on a Matlab 2017a machine with a 2.90 GHz Core i3 processor and 16 GB of RAM for a fair analysis.

Performance analysis of proposed IMOWOA

To test the efficiency of proposed IMOWOA, 7 bi-objectives (\(UF_1\)\(UF_7\)) and 3 tri-objective test benchmarks (\(UF_8\)\(UF_{10}\))53,54 have been considered. The details of the benchmark functions are presented in Figs. 4 and 5. It can be observed from the table that the considered benchmark functions contain distinct multi-objective search regions with non-convex, convex, multi-modal, and dis-continuous Pareto fronts which have been considered the hardest test problems in the literature. Maximum spread (MS), Inverted generational distance (IGD), and spacing (SP) are commonly used to validate the efficacy of multi-objective methods since they consider the standard deviation and mean values. IGD value assesses the convergence while MS and SP evaluate coverage of the considered approach44,55,56. IGD is an improved version of Coello’s generational distance (GD)55,57 which is computed using Eq. (7) whereas the values of SP and MS are computed using Eqs. (8) and (9).

$$\begin{aligned} IGD=\frac{\sqrt{\sum _{a=1}^N dist_a^2}}{N} \end{aligned}$$
(7)

where N denotes the true optimum Pareto solutions (PS), whereas \({dist_j}\) is the Euclidean distance measure between the \(a{\text {th}}\) true Pareto optimal and the reference set’s nearest computed POS.

$$\begin{aligned} SP={\sqrt{\frac{1}{N-1}\sum _{a=1}^N \bigg (dist' - dist_a\bigg )^2}} \end{aligned}$$
(8)

here, dist’ denotes the mean of all \(dist_a\), N denotes the total number of optimum PS found, and \(dist_a=\min _a\bigg (|f_1^i (\textbf{x})- f_1^a (\textbf{x})|+|f_2^i (\textbf{x})- f_2^a (\textbf{x})|\bigg )\) for all \(i, a=1,2,3, \cdots , N\).

$$\begin{aligned} MS={\sqrt{\sum _{a=1}^t \max \bigg (det(x_i,y_i)\bigg )}} \end{aligned}$$
(9)

where \({det{()}}\) calculates the Euclidean distance, \(x_j\), \(y_j\) is the maximum and minimum value in \(a{\text {th}}\) objective, and t represents the total count of objectives.

Fig. 4
figure 4

Bi-objective test problems.

Fig. 5
figure 5

Tri-objective test problems.

The optimal values SP, MS, and IGD values of all the considered approaches have been compared to assess the efficacy qualitatively. To investigate the results, the proposed IMOWOA is compared to MOPSO56, MOGWO53, and MOEA/D58. Each algorithm has been run 10 times to get the mean value of results, which have been considered to ensure a fair analysis and reduce the interference impact. The number of iterations (tit) and size of the population (N) are fixed to 1000 and 50, respectively, while the rest of the parameters are taken from the relevant literature. The mean, standard deviation, median, best, and worst values of IGD, MS, and SP of the proposed IMOWOA and other considered methods are compared to assess the efficiency of the proposed approach. The IGD, SP, and MS values produced by the proposed IMOWOA and other techniques are depicted in Tables 1, 2, 3. Table 1 shows that for more than 90% of benchmark problems, the proposed IMOWOA achieves the best IGD values. IGD values are normally used as a benchmark for comparing the convergence of different methods. So, from the results listed in Table 1, it can be envisioned that the proposed IMOWOA shows better convergence. There are a few benchmark problems for which other algorithms return the best IGD values. MOEA/D produces the best IGD values for UF3, UF6, and UF7, while for the benchmark UF8, MOPSO outperforms the proposed and other techniques. Thus, it can be vindicated from the experimental analysis that the proposed IMOWOA has a consistent performance than the other methods.

Table 1 Statistical results for IGD of the investigated techniques.

Besides, in Tables 2 and 3, the MS and SP values are also examined. As MOEA/D is not executed for the tri-objective function \(UF_8\), \(UF_9\), and \(UF_{10}\), henceforth, for these benchmark problems, the proposed IMOWOA is only validated against MOPSO and MOGWO. From the tables, it can be shown that the proposed IMOWOA has superior convergence and coverage. Though there are a few discontinuities on the Pareto optimal front achieved by the proposed IMOWOA, the entire front coverage front is wider than MOEA/D, MOPSO, and MOGWOA for a few benchmarks. However, the proposed IMOWOA’s Pareto optimal solutions are nearer to the optimal Pareto front and fairly distributed for bi and tri-objectives. Hence, from the statistical findings, the efficiency of the IMOWOA can be interpreted.

Table 2 Statistical results for SP of the investigated techniques.
Table 3 Statistical results for MS of the investigated techniques.

Additionally, a non-parametric Friedman test59 was performed to determine the best-performing method. This test ranks algorithms based on their performance across all datasets60. The highest-performing model is assigned a rank of 1, with subsequent ranks assigned to lower-performing models. If two algorithms perform equally, their ranks are averaged across different runs. The Friedman test produced a p-value of 2.54E–25, significantly below the threshold (0.05), indicating substantial differences in performance across various groups of meteorological variables. Table 4 presents the rankings obtained from the Friedman test, showing that the proposed method is significantly superior to other considered methods. Besides, the convergence plot is also plotted in Figs. 6 and 7. It can be observed from the convergence plot that the proposed method converges more quickly compared to other methods. Thus, from the above analysis efficacy of the proposed method can be observed.

Table 4 Mean ranking returned by the Friedman test.
Fig. 6
figure 6

Convergence plot for benchmark function UF1.

Fig. 7
figure 7

Convergence plot for benchmark function UF7.

Performance analysis of feature selection technique

A histological tissue image dataset containing four histopathological tissue images namely, nervous, connective, epithelial, and muscular tissues have been used to investigate the IMOWOA feature selection (FS) approach. The images in datasets have been taken from publicly accessible sources61,62. Each image category consists of 101 photographs with separate staining processes. The dataset parameters are listed in Table 5. In contrast Table 6 denotes the number of selected features for different algorithms. Stratified random sampling is employed to partition the entire dataset testing and training sets for classification. To elucidate the efficacy of the proposed IMOWOA-based feature selection, three state-of-the-art techniques, namely differential evolution (DE), Jaya algorithm (JA), and adaptive Jaya algorithm (AJA) are considered. For the same, all the considered methods was first tested on original dataset without feature selection and dataset with feature selection in Table 7. The original dataset without feature selection and the dataset with feature selection are shown in Table 2 to demonstrate the significance of the feature selection. All the considered methods was first tested on original dataset without feature selection and dataset with feature selection in Table 6. The number of features returned by the various classifiers is depicted in Table 7. Further, the mean selected features and accuracy of all the considered algorithms are computed using six well-known classifiers: KNN, LDA, ZeroR, RF, and SVM. The findings of all the methods are tabulated in Table 8. The accuracy of the ZeroR is the baseline for all the approaches. It can be vindicated from the tables that the IMOWOA-based FS approach corresponds to the least number of selected features from the set of features extracted by AlexNet. It eliminates 93.4 percent of features, followed by AJA’s 91.6 percent, JA’s 90.8 percent, and DE’s 88.5 percent. The accuracy obtained by the classifiers for selected FS has been used to assess the relevance of the chosen features. It is clearly visible in the table that all of the investigated classifiers generate the highest accuracy from the features confirmed by the novel IMOWOA. The accuracy achieved by the SVM, i.e., 56.50%, is the best among all the considered classifiers. Thus, the above analysis has observed that the IMOWOA with SVM outruns the other FS approaches.

Table 5 The various types of image datasets and associated staining techniques.
Table 6 No. of selected features.
Table 7 Comparative analysis of considered feature selection methods.
Table 8 Comparison of classification accuracy of the investigated techniques.

In addition, the computational time of considered approaches are also compared in Table 9. It is clearly envisaged from the table that the IMOWOA has the minimum execution time for all the classifiers. Thus, it can be claimed that the obtained features are non-redundant and relevant, resulting in improved accuracy without compromising a classifier’s computing speed. Therefore, it can be said from the experimental analysis that the developed IMOWOA-based feature selection method generates fewer features while generating a high level of accuracy.

Table 9 Comparative analysis of computational time.

Furthermore, the classification performance of the considered feature selection approaches using SVM has been examined independently for each of the four categories. The confusion matrices of all of the investigated categories are shown in Fig. 8. MT, NT, CT, and ET correspond to muscular, nerve, connective, and epithelial tissue respectively in the figure. The IMOWOA-based technique has greater than 50% accuracy for MT, ET, CTand MT images. However, it only achieves 48% accuracy for NT, which is the maximum among all the algorithms tested for this specific image. On the contrary, AJA, JA, and DE attain greater than 50% accuracy on CT and MT images. Besides, the performance of the confusion matrix has been investigated using F-measure, precision, recall, and specificity, as shown in Table 10. The results show that IMOWOA outperforms the other approaches. IMOWOA’s overall classification accuracy is 59.65 percent, which is the best among the considered methods.

Fig. 8
figure 8

Confusion matrices for histopathological image.

Table 10 Comparison of investigated feature selection techniques.

Moreover, for better visualization of the performance of all considered techniques, radar charts are developed for each category of the datasets separately. The radar charts depict the values of four parameters, namely recall, precision, F1-score, and G-mean. The method with the larger area is considered the best performer. Figure 9 represents the radar charts for connective, epithelial, muscle, and nervous tissues, respectively. It can be elucidated from the figures that the area of the developed IMOWOA is larger for each category than other considered techniques.

Fig. 9
figure 9

(a) Radar chart for connective tissue; (b) Radar chart for epithelial tissue; (c) Radar Chart for muscle tissue; (d) Radar chart for nervous tissue.

Comparison with other multi-objective feature selection techniques

To provide a comprehensive evaluation, we compare the proposed IMOWOA with other multi-objective feature selection techniques. We evaluate the performance of IMOWOA, NSGA-II63, SPEA244, and MODE64 on the same histopathological image dataset. The experiments are conducted using six well-known classifiers, namely LDA, SVM, kNN, ZeroR, and RF. The results are compared in terms of accuracy, mean number of selected features, and computational time.

Table 11 provides a comparative analysis of the feature selection methods, including IMOWOA, NSGA-II, SPEA2, and MODE, using the SVM classifier. The table shows that IMOWOA achieves the highest accuracy with a balanced number of selected features and computational time, indicating its effectiveness in feature selection.

Table 11 Comparative analysis of considered feature selection methods.

Table 12 shows the classification accuracy of the investigated techniques across different classifiers. IMOWOA consistently achieves higher accuracy compared to NSGA-II, SPEA2, and MODE, demonstrating its robustness and superior performance in feature selection.

Table 12 Comparison of classification accuracy of the investigated techniques.

Table 13 presents the computational time for each method across different classifiers. IMOWOA exhibits competitive computational times, highlighting its efficiency in feature selection tasks without compromising on performance.

Table 13 Comparative analysis of computational time.

Conclusion

In this paper, a novel feature selection method, improved multi-objective WOA-based feature selection (IMOWOA-FS), has been introduced for histopathological image classification. The proposed method finds the optimal features by using a novel variant, termed an improved multi-objective WOA (IMOWOA). To validate the optimization ability, the proposed IMOWOA is tested on 7 bi-objective and 3 tri-objective CEC2009 benchmarks, and the results are validated against three other state-of-the-art methods, namely, MOPSO, MOEA/D, and MOWOA. It has been vindicated from the results that the developed IMOWOA outperforms the other considered methods on 90% of the benchmarks. Furthermore, IMOWOA-FS has been employed for the classification of histopathological images on a publicly available histopathological dataset of tissue images. The results are validated against 5 classifiers, namely, SVM, LDA, RF, kNN, and ZeroR, and compared in terms of classification accuracy, mean number of selected features, and computation time. The results show that IMOWOA-FS outperforms the other considered methods. However, it is essential to acknowledge the limitations of IMOWOA, such as its computational complexity, parameter sensitivity, and challenges in escaping local optima. Additionally, while IMOWOA has shown effectiveness in feature selection for histopathological images, its performance across other domains has not been extensively tested. Addressing these drawbacks through hybrid approaches, adaptive parameter control, and parallel computing can enhance its scalability and robustness. In future, the proposed IMOWOA-FS can be explored on other datasets, such as the plant leaf disease dataset and X-ray image dataset. Moreover, some other classifiers can be explored to improve the accuracy and computation time. Recognizing these limitations provides a valuable starting point for future research, paving the way for more efficient and generalized optimization algorithms.