Introduction

Micronutrients, such as vitamins and minerals, are essential for the human body in micrograms or milligrams per day1,2. They play a crucial role in bodily functions, and deficiencies in any of these micronutrients can lead to severe, and potentially fatal, illnesses3. “Hidden hunger” or “hidden malnutrition,” which refers to micronutrient deficiencies, remains a significant global public health challenge, affecting one out of every four people worldwide, totaling more than two billion individuals4,5. Common micronutrient deficiencies include vitamin A, folate, iron, iodine, and zinc6. Low-income countries in Africa and Asia bear a disproportionate burden of micronutrient deficiencies, with vulnerable groups such as pregnant women and children being particularly affected7.

Pregnancy-related micronutrient deficiencies have implications for the health of pregnant women and the development of their children6,7. Micronutrient supplementation is essential to prevent pregnancy complications and reduce the risk of adverse outcomes8. Of particular concern is iron-folic acid deficiency, as it contributes significantly to anemia among pregnant women and is the most prevalent micronutrient deficiency worldwide during pregnancy7,9. Globally, anemia affects up to 36% of pregnant women aged 15–49 years and is estimated to be the cause of 22% of maternal deaths10. Low-income countries bear the highest burden of anemia, with East Africa accounting for 39% of global pregnant anemia cases10.

The World Health Organization (WHO), in collaboration with member states and other partners, invests significant efforts to address all forms of malnutrition. These initiatives encompass high-folic acid and iron supplementation and fortification of food with micronutrients2. As the reduction of all forms of malnutrition is a key agenda of the Sustainable Development Goals (SDGs) for 2030, WHO member states strive to achieve the target level of reducing micronutrient malnutrition11. However, despite these efforts, the risks of death due to micronutrient malnutrition remain high among pregnant women and children in Eastern Africa10,12.

Existing literature on micronutrient intake and deficiency has identified several potential factors contributing to these deficiencies14,15,16. These factors include educational level13,14,15,16,17,18,19,20,21, age14, sex22, residency23, wealth status13,15,16,18,24, number of children14, access to health facilities25, media exposure14, marital status26, working status26, and antenatal care (ANC) visits18,27,28.

Machine learning technologies have witnessed widespread adoption and achieved remarkable advancements in various fields. However, their application in the realm of public health and medicine has been relatively limited29,30. Traditional analytical techniques have predominantly been employed in prior studies investigating the supplementation status of pregnant women in East African countries31. However, utilizing machine learning models for predicting micronutrient supplementation can enhance empirical evidence. Machine learning algorithms can capture complex relationships, handle high-dimensional data, adapt to non-linear patterns, and provide robust and efficient analysis of large-scale datasets32,33. Therefore, in this study, we employed eight state-of-the-art machine learning algorithms, including association rule mining, to predict the status of micronutrient supplementation using recent DHS data from East African countries.

Method

Data source

This study utilized secondary data from the most recent DHS conducted in 12 East African countries, namely Ethiopia (2016), Kenya (2014), Uganda (2016), Tanzania (2016), Burundi (2017), Rwanda (2015), Madagascar (2009), Mozambique (2011), Zimbabwe (2015), Zambia (2018), Malawi (2016), and Comoros (2012). The data sets from these countries were extracted from the official DHS program database, which can be accessed at https://dhsprogram.com/data/available-datasets.cfm.We obtained ethical approval from the Institutional Review Board for the DHS program to access the data.

The DHS Program has conducted standardized surveys in over 90 countries, collecting representative data on population, health, human immunodeficiency virus (HIV), and nutrition. The surveys used a multi-stage stratified sampling method, selecting participants from households within clusters. Sampling strata were created based on rural and urban sectors, and enumeration areas were chosen using probability proportional to size. Within the selected enumeration areas, households were chosen using equal probability systematic sampling. The study specifically examined women aged 15 to 49 residing in East Africa, focusing on those who had been pregnant within the preceding five years of data collection. The research sample included a total of 138,426 individuals from 12 countries in East Africa. The dataset utilized in the study comprised 13 features34.

Study variables and measurements

The dependent variable in this study was micronutrient supplementation, defined as the usage of iron folic acid tablets or syrup for at least ninety days or the usage of deworming medicine during a previous pregnancy35. Pregnant women meeting these criteria were classified as “supplemented” (coded as 1), while those who did not receive this supplementation were classified as “not supplemented” (coded as 0)31. The study considered various independent variables, including place of residence, age group, religion, number of living children, ANC visit, working status, media exposure, marital status, educational status, wealth status, birth interval, and distance from health facility. The selection of these independent variables was based on a comprehensive review of previous literature13,31.

Data preprocessing

The process of machine learning begins with data pre-processing, which involves modifying or encoding the data to make it suitable for computer interpretation36. In our machine learning workflow, we employed a continuous improvement process for our models. This process included selecting and engineering relevant features, balancing the data, splitting the data, model training, model evaluation, model optimization, choosing the top performer model, and deploying the selected model for prediction. Through an iterative approach, we refined our models. Figure 1 provides a visual representation of the steps in our workflow; however, it does not encompass certain tasks that were iteratively performed throughout the process.

Fig. 1
figure 1

Study work flow diagram.

Data cleaning

During the data analysis process, we manually examined the data for redundancy and determined that no redundant data were present in our dataset. To handle missing values, we utilized the K-nearest neighbors (KNN) imputation technique41. We employed various visualization techniques such as scatter plots, box plots, and histograms to identify outliers. Additionally, we assessed multicollinearity by examining the correlation matrix and considering a correlation value above 0.8 between two pairs of variables as indicative of high correlation37,38.

Feature engineering

Feature engineering involves identifying, acquiring, and modifying the most relevant characteristics from the available data to construct machine learning models that are more accurate and efficient39. We employed one-hot encoding for nominal categorical variables and label encoding for ordinal categorical variables to encode the data40.

Dimensionality reduction

We employed various techniques for dimensionality reduction in our study, aiming to optimize model performance and reduce the complexity of our dataset. These techniques included univariate selection, recursive feature elimination (RFE), random forest feature elimination, principal component analysis (PCA), lasso regression, and a feature selection method based on Boruta41.

Through repeated experiments, we found that the Boruta-based feature selection method outperformed other techniques in terms of accuracy and robustness. The Boruta-based feature selection method assesses feature importance by comparing their performance against randomly generated shadow features that simulate noise. Features consistently outperforming the shadow features are deemed significant and incorporated into our predictive model42.

Data balancing

Data imbalance poses a common challenge in data mining and machine learning, often leading to decreased classification accuracy for instances belonging to the minority class43. To tackle this issue, we utilized four data balancing methods: under-sampling, over-sampling, adaptive synthetic sampling (ADASYN), and synthetic minority oversampling technique (SMOTE). Each of these techniques has distinct characteristics and aims to address class imbalance effectively.

Under-sampling involves reducing the number of instances from the majority class to achieve a more balanced dataset. By randomly removing instances from the majority class, under-sampling aims to align the number of instances in the minority class with that in the majority class. This approach prevents classifier bias towards the majority class, but it may result in the loss of potentially valuable information44,45,46.

On the other hand, over-sampling increases the number of instances in the minority class by replicating or generating new instances. This technique ensures a balanced dataset by ensuring that the number of instances in the minority class is comparable to that in the majority class. Over-sampling can be achieved through methods such as random duplication, bootstrapping, or synthetic data generation46,47.

ADASYN extends the SMOTE technique to address its limitation in handling datasets with varying densities within the minority class. ADASYN synthesizes new instances in the minority class by considering the distribution of instances in the feature space. It focuses on generating more synthetic examples for the minority class instances that are harder to classify, thereby adapting the sampling strategy to the local characteristics of the data48.

SMOTE, on the other hand, is a popular over-sampling technique that creates synthetic instances in the minority class by interpolating between existing instances. It randomly selects a minority class instance and identifies its k nearest neighbors. It then generates synthetic instances by randomly selecting a neighbor and creating a new instance along the line segment between the original instance and the chosen neighbor. SMOTE helps balance the dataset and introduces diversity in the minority class48,49.

To enhance the performance of our predictive model, we initially trained our machine learning algorithms using unbalanced data. Subsequently, we explored and applied the aforementioned balancing techniques to train the models using balanced datasets. To evaluate the performance of each model across each data balancing technique, we conducted a comprehensive analysis using five performance metrics: accuracy, precision, recall, F1-score, and AUC.

While accuracy is suitable for evaluating models with balanced classes, AUC becomes particularly valuable when dealing with imbalanced datasets or when the relative costs of false positives and false negatives are uncertain. However, it is advisable to consider both accuracy and AUC, along with other relevant metrics, to obtain a comprehensive evaluation of model performance and make informed comparisons between different machine learning algorithms50,51,52. Based on these considerations, we selected the data balancing technique that demonstrated superior performance for the final prediction.

Model selection and development

In our study, the dependent variable, micronutrient supplementation status, required a classification approach, as it was divided into “supplemented” and “not supplemented” categories. To make predictions, we needed to select appropriate classifiers. We utilized the Scikit-learn version 1.3.2 packages in Python, implemented within Jupyter Notebook, to employ machine learning algorithms.

To evaluate the predictive capabilities of machine learning algorithms for predicting micronutrient supplementation status, we employed eight state-of-the-art algorithms. Each algorithm has its unique approach and characteristics. Here are brief definitions for each of the algorithms we utilized:

  1. 1.

    Support Vector Machines (SVM) with Kernel Methods: SVM is a powerful algorithm used for both classification and regression tasks. It constructs a hyperplane or a set of hyperplanes in a high-dimensional space to separate different classes53.

  2. 2.

    Gaussian Naive Bayes: This algorithm is based on Bayes’ theorem, assuming that features are conditionally independent. It is particularly effective when dealing with high-dimensional data54,55.

  3. 3.

    Logistic Regression: Logistic regression is a statistical model that is used for binary classification. It estimates the probability of an instance belonging to a particular class based on the input features56,57.

  4. 4.

    Decision Tree Classifier: Decision trees are hierarchical structures that make decisions based on the values of input features. The decision tree classifier uses a tree-like model of decisions and their possible consequences to predict the class label of instances58.

  5. 5.

    Random Forest Classifier: Random forest is an ensemble learning method that combines multiple decision trees. It generates a set of decision trees and makes predictions by averaging the outputs of individual trees59.

  6. 6.

    Gradient Boosting Machines: Gradient boosting is another ensemble learning method that combines multiple weak prediction models, typically decision trees, to create a strong predictive model. It trains new models to correct the mistakes made by previous models60,61.

  7. 7.

    eXtreme Gradient Boosting (XGBoost): XGBoost is an optimized implementation of gradient boosting that provides better performance and scalability. It employs a variety of regularization techniques to prevent overfitting and enhance the overall predictive power62.

  8. 8.

    KNN: KNN is a non-parametric algorithm that classifies instances based on their similarity to neighboring instances. It assigns a class label to an instance by considering the labels of its k nearest neighbors in the feature space63.

The selection of these algorithms was based on their suitability for classification tasks and their compatibility with the characteristics of our dataset64,65,66.

Model training and evaluation

In order to construct a reliable predictive model within machine learning, it is essential to perform model training and evaluation67,68. In this particular study, a straightforward approach was employed, where the data was divided into an 80% training set and a 20% testing set. This division allowed us to assess the performance of each predictive model effectively.

To evaluate the performance of the predictive models, several metrics were utilized, including accuracy, precision, recall, F1-score, and AUC. Each of these metrics provides valuable insights into different aspects of the model’s performance.

  1. 1.

    Accuracy: Accuracy measures the overall correctness of the model’s predictions. It is calculated as the ratio of the number of correct predictions to the total number of predictions.

  2. 2.

    Precision: Precision evaluates the accuracy of positive predictions made by the model. It quantifies the proportion of true positive predictions out of the total predicted positives. The formula for precision is:

Precision = TP / (TP + FP) where TP represents true positive and FP represents false positive.

  1. 3.

    Recall: Recall, also known as sensitivity or true positive rate, assesses the model’s ability to identify all positive instances. It measures the proportion of true positive predictions out of the total actual positives. The formula for recall is:

Recall = TP / (TP + FN) where TP represents true positive and FN represents false negative.

  1. 4.

    F1-score: The F1-score provides a balanced measure of a model’s performance by considering both precision and recall. It is the harmonic mean of precision and recall, and it is calculated using the following formula:

F1-score = 2 * (Precision * Recall) / (Precision + Recall).

  1. 5.

    AUC: AUC is a metric calculated from the area under the receiver operating characteristic (ROC) curve. The ROC curve represents the true positive rate (TPR) plotted against the false positive rate (FPR) at various classification thresholds. AUC indicates the algorithm’s ability to discriminate between classes, where a higher AUC value suggests better discrimination.

In summary, by utilizing these metrics, we were able to comprehensively evaluate the performance of each predictive model in terms of overall correctness, accurate positive predictions, identification of positive instances, balanced measure, and discriminatory ability69.

In order to further evaluate the performance of the model, tenfold cross-validation techniques were employed. Prior to this, different k-fold validation techniques, including three-fold, five-fold, and ten-fold, were compared to determine the most suitable approach70.

The study also conducted a thorough analysis of hyperparameters to refine and improve the model’s performance. Grid search, random search, and Bayesian optimization were systematically explored to find the best hyperparameter settings. Comparing the outcomes from these techniques helped identify the configurations that provided the highest performance. In order to improve the accuracy and dependability of the model used in this study, we conducted model calibration. Through fine-tuning the model via calibration, we enhanced its performance in accurately predicting the desired outcome.

In our study, we conducted an extensive comparison of different kernel methods for the SVM model, with the main goal of identifying the most suitable kernel function to maximize the model’s performance. We carefully evaluated and compared various kernel functions, including linear, polynomial, radial basis function (RBF), and sigmoid. Through meticulous analysis, our objective was to select the kernel method that produced the most favorable outcomes and achieved optimal performance for the SVM model71.

Model interpretability

In our comprehensive approach to understanding the data and exploring the factors influencing the prediction of micronutrient supplementation, we employed various techniques including the Apriori algorithm. Firstly, we calculated the mean SHAP (Shapley Additive exPlanations) values to assess the average impact of each feature on the model’s predictions, providing insights into the relative significance of different variables. This allowed us to understand the individual contributions of each feature.

Additionally, we utilized the Apriori algorithm, a popular algorithm for association rule mining. By applying the Apriori algorithm, we were able to uncover hidden patterns and relationships among the variables in the dataset. The algorithm allowed us to discover frequent item sets and association rules based on measures such as lift and confidence. Lift helped us determine the strength of the associations between different variables, indicating the degree to which the presence of one variable influences the likelihood of another variable occurring. Confidence, on the other hand, provided us with a measure of reliability or certainty in the association rules, indicating how often the consequent variable appeared when the antecedent variable was present72,73,74.

By incorporating mean SHAP values and the Apriori algorithm into our analysis, we gained a deeper understanding of the dataset and the factors influencing the prediction of micronutrient supplementation. These techniques allowed us to uncover concealed patterns and relationships, leading to robust predictions and identification of influential factors. In summary, our approach involved calculating mean SHAP values to determine feature importance and utilizing the Apriori algorithm for association rule mining. This comprehensive methodology provided us with valuable insights into the dataset, enhancing model interpretability and facilitating a better understanding of the factors impacting micronutrient supplementation predictions75,76,77.

Results

Descriptive results of the background characteristics

The study encompassed a comprehensive analysis of descriptive and socio-demographic characteristics among a weighted sample of 138,426 pregnant women. Among the participants, the largest proportion, comprising 57,174 (41.30%), fell within the age group of 26 to 34 years. In terms of residence, the majority, accounting for 105,613 (76.30%) of the study participants, hailed from rural areas. Regarding employment status, a significant number of respondents, totaling 101,407 (73.26%), were employed (See Table 1 for more detailed information).

Table 1 Individual characteristics of reproductive age group women in east African countries (n = 138,426).

Micronutrient supplementation status in east African countries

According to the specified DHS dataset, the pooled prevalence of micronutrient supplementation status among pregnant women in East Africa was found to be 28.90% (95% CI: 28.68, 29.12). Ethiopia had the lowest rate of micronutrient supplementation among pregnant women, with only 7.80% receiving supplementation. On the other hand, Zambia had the highest prevalence of micronutrient supplementation, with 66.94% of pregnant women receiving supplementation (See Fig. 2 for more detailed information).

Fig. 2
figure 2

Prevalence of micronutrient supplementation status among pregnant women in East Africa countries using forest tree plot.

Machine learning analysis of micronutrient status

Feature selection

Upon evaluating various feature selection methods, we observed that the Boruta algorithm exhibited strong performance. As depicted in Fig. 3, the algorithm effectively visualized the importance of variables, with significant variables highlighted in green, unimportant variables in red, and tentative variables in yellow. Tentative variables are those that require further investigation78.

In our comprehensive analysis, we decided to exclude birth interval and marital status from consideration, as the Boruta algorithm deemed them unimportant. No tentative variables were identified. Consequently, we proceeded with utilizing the variables selected by the Boruta algorithm to predict the micronutrient supplementation status and explore data patterns through association rule mining.

Fig. 3
figure 3

Feature selection using Boruta algorithm (Note: v218 = number of children, v025 = residence).

Data balancing

In Table 2, various data balancing techniques, such as under-sampling, over-sampling, ADASYN, and SMOTE, were compared. Among the evaluated techniques, SMOTE demonstrated the highest performance, with the random forest classifier achieving an AUC of 0.878 and an accuracy of 91%. The results indicated that SMOTE outperformed other data balancing methods.

Table 2 Compression of imbalanced data handling techniques using accuracy and AUC values.

Development and performance comparisons of machine learning-based models

By utilizing these performance metrics such as accuracy, precision, recall, F1 score, and AUC, we conducted a comprehensive evaluation to determine how effectively the algorithms could predict micronutrient supplementation. Table 3 presents the performance measures of multiple selected algorithms after data balancing and calibrated tuning processes had been applied. Based on the evaluation results, the top three machine learning algorithms for classifying micronutrient supplementation status were found to be the random forest classifier, decision tree classifier, and XGBoost with excellent ROC value (See Table 3).

Table 3 Accuracy and AUC value of the selected machine algorithm after data balancing and tuning.

Figure 4 illustrates the ROC curve analysis conducted on selected machine learning algorithms. These algorithms were trained on balanced data using the SMOTE data balancing technique and underwent optimized hyperparameter tuning. Among the different hyperparameter tuning techniques experimented with, grid search proved to be the most suitable for our dataset. The application of hyperparameter tuning resulted in a significant enhancement in the performance of our model. For a detailed comparison, please refer to Figs. 4 and 5, which respectively present the ROC curve analysis before and after hyperparameter tuning.

As depicted in Fig. 5, the final ROC curve for the tuned model showcased that the random forest classifier outperformed all other machine learning algorithms, exhibiting an AUC of 0.892. The decision tree classifier and XGBoost followed closely behind with AUC values of 0.862 and 0.856, respectively, which can be considered excellent. The KNN, Gradient boosting classifier, and SVM achieved reasonably acceptable ROC values of 0.797, 0.739, and 0.721, respectively. However, the logistic regression and Gaussian Naïve Bayes algorithms displayed lower AUC values of 0.683 and 0.651, respectively, indicating suboptimal discrimination.

We also provided a comprehensive comparison of model performance using various performance metrics. In cases where the machine learning algorithms exhibited similar performance and distinguishing the superior algorithm became challenging, we used a comprehensive analysis. For further details, please refer to Fig. 6.

Fig. 4
figure 4

ROC curve analysis of selected machine learning algorithms with balanced data using SMOTE data balancing technique.

Fig. 5
figure 5

ROC curve analysis of selected machine learning algorithms after optimized Hyperparameter tuning using balanced data.

Fig. 6
figure 6

Overall model performance of selected machine learning algorithms after optimized hyperparameter tuning of a balanced data.

Model interpretability

SHAP value interpretation

Based on the findings presented in Fig. 7, the mean SHAP value report provided insights into the relative importance of different features in the classification model. ANC visits, number of living children, and media exposure emerged as the most influential factors and exhibited high mean SHAP values. This indicated that these features had high significant impact on the model’s predictions.

Additionally, religion, working status, education status, age group, and wealth status displayed minimal influence on the classification outcome, as evidenced by their low mean SHAP values. These features contribute less to the model’s decision-making process and have limited importance on the model prediction. On the other hand, residence and perceived distance were positioned at the bottom of the graph, with a mean SHAP value of zero. This suggests that these features do not contribute to the model prediction.

Fig. 7
figure 7

A mean SHAP value report.

Association rule mining

Using the Apriori algorithm, our research identified the most influential association rules based on their lift values and confidence. These rules provided valuable insights into the probability of micronutrient supplementation during pregnancy in East African countries. Remarkably, the recurring presence of factors such as maternal education level, employment status, ANC visit, media accessibility, number of living children, and religion in these association rules indicated their consistent association with the likelihood of receiving micronutrient supplementation.

The top five association rules and their corresponding lift values are as follows:

  1. 1.

    1. If the mother has a secondary education, is employed, has more than four ANC visits, and has media exposure, the probability of being supplemented with micronutrients is 86.3% (Confidence = 0.863 and lift = 2.36).

  2. 2.

    2. If the mother is in the age group of 21 to 25, has less than three children, has more than four ANC visits, and follows the catholic religion, the probability of being supplemented with micronutrients is 85.9% (Confidence = 0.859 and lift = 2.04).

  3. 3.

    3. If the mother has less than three children, access to media, follows the Catholic religion, and has higher education, the probability of being supplemented with micronutrients is 81.4% (Confidence = 0.814 and lift = 1.64).

  4. 4.

    4. If the mother has less than three children, is employed, in the age group of 26 to 34, has higher education, and follows the traditional or animist religion, the probability of being supplemented with micronutrients is 78.4% (Confidence = 0.784 and lift = 1.83).

  5. 5.

    5. If the mother has four ANC visits, is employed, has access to media, belongs to a high-income wealth status, and has a primary education, the probability of being supplemented with micronutrients is 74.3% (Confidence = 0.743 and lift = 1.25).

Discussion

The study demonstrated the potential of machine learning algorithms in accurately predicting the status of micronutrient supplementation among pregnant women in East Africa. The random forest classifier, decision tree classifier, and XGBoost were identified as the most effective models for classifying supplementation status. Specifically, the random forest classifier outperformed other algorithms, with an AUC value of 0.892 and an accuracy of 94.0%. Similar studies conducted in Rwanda33, Zambia79, Ethiopia80, Mozambique, and Nigeria81 also found the random forest model to be superior in predicting various health outcomes.

We used association rule mining to identify the top predictors in our data set. Accordingly, we built five top rules using association rule mining and implied that the mother’s education level consistently appeared in association rules and had a significant impact on the likelihood of receiving micronutrient supplementation. The justification for this impact was supported by the existing research literature. Similar to our findings studies conducted in Central Ethiopia21, Kenya19, and Bangladesh20, and elsewhere in the world13,14,15,16,17,18,19,20,21 noticed the importance of education for enhancing micronutrient supplementation status. One possible justification for this finding could be that education empowers individuals to make informed decisions regarding their health and encourages adherence to prenatal care.

Based on the finding of the association rule mining, ANC visits were a strong predictor that influenced the likelihood of receiving micronutrient supplementation. This finding was supported by global studies highlighting the significance of ANC visits in enhancing supplementation status18,27,28. The possible justification could be ANC visits may serve as a platform for healthcare providers to educate pregnant women about the importance of proper nutrition and micronutrient supplementation, addressing misconceptions and providing information on the specific benefits of supplementation through counseling sessions.

Association rule mining consistently demonstrated that religion played a significant role in determining the likelihood of receiving micronutrient supplementation. This finding underscores the importance of considering religious factors in public health interventions and highlights the need for culturally sensitive strategies to address micronutrient deficiencies in diverse religious communities82.

The association rule mining consistently demonstrated that an increase in the number of children had a negative association with the likelihood of receiving micronutrient supplementation. This association was supported by the literature, which underscores the significance of family size as a determinant of access to and utilization of health services14. The possible reason for this finding is that the added responsibilities of raising a larger family may pose challenges for parents, especially busy mothers, in prioritizing and accessing healthcare, including attending antenatal care visits and receiving the necessary micronutrient supplements.

The association rule mining consistently provided evidence that individuals with media exposure had a higher probability of receiving adequate micronutrient supplementation. This finding was further supported by the literature, which emphasized the role of media in influencing health-related behaviors14. The justification for this finding could be media platforms provide opportunities to educate individuals about the benefits of adequate nutrition and the availability of supplementation programs.

The consistent findings from association rule mining indicate that individuals who had primary, secondary, and higher levels of education are more likely to receive micronutrient supplementation. Previous studies conducted worldwide supported this relationship13,14,15,16,17,18,19,20,21. This could be attributed to the fact that individuals with higher education levels may possess a deeper understanding of the significance of proper nutrition and the specific advantages of taking micronutrient supplements during pregnancy.

The association rule mining consistently revealed the significant impact of employment status on the likelihood of receiving micronutrient supplementation. This finding was consistent with a study conducted in Northwest Ethiopia26. One possible justification is that employed women may have greater exposure to health-related information through workplace wellness programs, employee benefits, or interactions with colleagues.

Strength and limitations of the study

The study demonstrates its strength by employing a comprehensive analysis of predictive capabilities through the use of eight supervised machine-learning algorithms. This approach enhances the reliability and credibility of the findings by revealing hidden patterns and relationships within the data. However, it is important to acknowledge certain limitations associated with the study. The retrospective data collection and reliance on secondary data introduce potential drawbacks, including the possibility of incomplete or missing data, which may impact the accuracy and introduce biases in the results.

Additionally, the study only explores four data balancing techniques, which may limit the ability to fully harness the potential power of each machine learning algorithm. Another limitation is the absence of exploring the performance of each algorithm through ensembling, a technique that combines multiple models to enhance predictive accuracy. Lastly, it is worth noting that the study shares the inherent limitation of building association rules solely based on the Apriori algorithm.

Conclusion

The study highlights the effectiveness of machine learning in accurately predicting the status of micronutrient supplementation among pregnant women in East Africa. Notably, the random forest classifier demonstrated exceptional performance, achieving excellent evaluation scores with an AUC of 0.892 and an accuracy of 94.0% in predicting the supplementation status. This emphasizes the potential of machine learning algorithms as valuable tools for policymakers and healthcare providers to develop targeted strategies aimed at improving the uptake of micronutrient supplementation among pregnant women.

Based on the study’s findings, several key intervention components are recommended to enhance the utilization of micronutrient supplementation. Strengthening educational initiatives can provide vital information on the importance and benefits of supplement intake. Improving antenatal care services ensures proper screening, monitoring, and guidance for pregnant women regarding their nutritional needs. Comprehensive media campaigns can effectively raise awareness and emphasize the significance of micronutrient supplementation during pregnancy. It is also crucial to consider cultural and religious sensitivities when designing interventions to ensure their acceptance and effectiveness within the specific population.

Furthermore, researchers are encouraged to explore and experiment with various techniques to optimize algorithm performance, leading to the identification of the most effective predictors and enhanced accuracy in predicting micronutrient supplementation status.