Introduction

The coronavirus disease 2019 (COVID-19) is an emerging respiratory disease illness caused by a novel coronavirus known as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)1. Since its initial detection, it has spread very fast to more than 200 countries and has been declared as a public health emergency of international concern (PHEIC) by the World Health Organisation (WHO) and declared it a Public Health Emergency of International Concern (PHEIC) on 30th January 20202.

A review of the literature reveals that numerous studies have examined the knowledge, attitudes, and practices (KAP) related to COVID-19 across various populations3,4 including two studies from Malaysia5,6. While these studies employed various questionnaires to assess KAP, there was a notable gap in evaluating health literacy concerning COVID-19. However, none of the studies explored the health literacy on COVID-19. This is one of the most important aspects that are lacking in the published articles.

Knowledge, attitude and practice (KAP) survey data are vital to help plan, implement and evaluate a prevention and control work. Apart from identifying the knowledge gap, cultural beliefs or behavioural patterns, it can also identify the factors influencing behaviour that are known and not known to most people, reasons for their attitudes, how and why they practice the certain health behaviours. It can also deepen the understanding of commonly known information, attitudes, and factors that influence human behaviour7. Besides, it can also enable the stakeholders or policymakers to set the priorities and make strategic decisions based on the gap and essential information found from the KAP survey data8. KAP research also has been the leading educational intervention strategy for respiratory diseases control across the globe8. Various researchers had reported that an individual’s level of KAP is linked to the intelligent control and prevention of illness, response to medical treatment and advancement of one’s health9,10,11.

A questionnaire is the most used tool in collecting the survey data for research or surveillance, or general observation. However, to ensure the consistency and accuracy of the collected data, the questionnaire should be carefully constructed, have a grounded theory to develop the tools, validity and reliability to be ensured for the usage of the intended population. Validation of a questionnaire is crucial as part of establishing a standardized and valid questionnaire. The benefit of having a standardized questionnaire is it can be used to measure and compare on studied domains among different groups of people or settings. It also needs to be properly validated to ensure that the questionnaire is psychometrically sound12.

There is a critical gap for Malaysians understand and apply the health information particularly in health communication and health literacy13. Study by Okan et. a; l14. , found that more than half of the German adult population had “inadequate” or “problematic” levels of health literacy related to COVID-19. Therefore, the need for targeted public information campaigns and the promotion of population-based health literacy to help individuals navigate the infodemic, identify disinformation, and make decisions based on reliable and trustworthy information. Enhancing health literacy (HL) and digital health literacy (DHL) is crucial to improve students attitudes and behaviour during COVID-19 pandemic15.

During the COVID-19 pandemic in Malaysia, most of the information regarding COVID-19 were primarily delivered in Malay. Therefore, to the target language preference, the questionnaire need to develop in Malay for the use of general Malaysian population. Most of the COVID-19 questionnaire in Malaysia only focused on knowledge, attitude and practice about that disease. However, the questionnaire should integrate with health literacy to find trustworthy information and critically appraise misinformation about COVID-19. According to a report by Department of Statistics Malaysia16 stated that majority of the Malaysian population can speak and understand Malay. Therefore, this study aims to develop and validate the Malay language of understanding, attitude, practice and health literacy towards COVID-19 Questionnaire (MUAPHQ C-19) using exploratory factor analysis (EFA).

Materials and methods

Study design

A cross-sectional study conducted for the validation of the questionnaire (MUAPHQ C-19) including the unidimensionality, reliability and validity. The methodology in this study was adhered to the Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN) questionnaire validation guideline17 and the questionnaire translation and cultural adaptation guidelines18.

Study setting

The study was conducted in three hospitals, i.e., Pusat Perubatan Universiti Teknologi MARA, Sungai Buloh (PPUiTM Sungai Buloh), Hospital Al-Sultan Abdullah, Puncak Alam (Hospital UiTM Puncak Alam) and Hospital Kuala Lumpur.

Target population

The target population for this study was the general public, defined as Malaysian citizens aged 18 years old and above from the Klang Valley tertiary hospital, either male or female, who are not a healthcare worker. The inclusion criteria were those who are able to read and understand the Malay Language and the exclusion criteria was illiterate people.

Sample size determination

Exploratory factor analysis (EFA)

For the exploratory factor analysis, there are several rules of thumb on the minimum sample size has been discussed and summarised by Kyriazos19 i.e., 100 to 25020,21, 30022 or 500 or more23. The sample size for EFA also can be calculated based on the N: p ratio, i.e., number of participants (N) to variables (p). Traditionally, it is set as 5:1. However, studies suggest that strength of item loadings, uniformity of the communalities and number of items per factor24 are vital for the stability, reliability, and replicability of a factor solution25.

Therefore, the minimum sample size taken for getting a reliable result for EFA was taken as 100 based on one of the rules of thumb19,21. There were 100 participants recruited and analysed for the exploratory factor analysis. This sample size is based on a combination of methodological recommendation and practical constraints.

Test-retest reliability

According to the central limit theorem, a large sample size of at least 30 will yield data approaches to the normal distribution26,27. Seventy participants agreed to participate in the test-retest analysis. However, only 30 participants responded to the questionnaire after being approached within two to four weeks after their first response. The 30 participants’ data were analysed for the test-retest reliability (PPUiTM Sg Buloh, n = 7 respondents; HUiTM Puncak Alam, n = 11 respondents; HKL, n = 12 respondents).

Sampling method

Participants were selected using a systematic random sampling technique to ensure a representative and unbiased sample of hospital attendees. The steps were as follows: eligible participants who fulfilled the inclusion and exclusion criteria were listed, a sampling frame was developed, and the sampling intervals were determined. Then, a random starting point was selected from numbers 1 to 9 before systematic selection was conducted until the desired sample size of 100 participants was reached.

Instrument

The original MUAPHQ C-19 questionnaire consists of four domains: understanding, attitude, practice, and health literacy (Appendix A). The result of content validity and face validity (FVI) have been published in Dalawi et al.28.

Understanding

Initially, this domain comprises 12 items, categorized into seven subdomains: source of information (1 item), causative agent (1 item), route of transmission (2 items), symptom (1 item), risk factor (1 item), complication (1 item) and preventive measures (4 items). Each item is measured on an interval scale ranging from 1 (lowest understanding) to 10 (highest understanding).

Attitude

This domain begins with 15 items, categorized into four subdomains: perceived susceptibility (4 items), perceived severity (5 items), perceived benefits (2 items) and perceived barrier (5 items). Items in this domain are measured using an interval scale from 1 (strongly disagree) to 10 (strongly agree).

Practice

This domain comprises 11 items, divided into two subdomains: do’s (7 items) and don’t (4 items). Items are measured on an interval scale from 1 (very rare) to 10 (very frequent).

Health literacy

Originally, this domain includes 12 items, organized into four subdomains: access to health information (3 items), understand health information (4 items), appraising health information (4 items) and applying health information (2 items). Items in this domain are measured on an interval scale ranging from 1 (strongly disagree) to 10 (strongly agree).

Conduct of the study

Exploratory factor analysis

A cross-sectional study was conducted for two weeks, i.e., from 20th June 2022 to 30th June 2022. A total of 100 participants who fulfilled the inclusion and exclusion criteria were recruited from the Outpatient clinics, PPUiTM Sungai Buloh, Hospital UiTM Puncak Alam and Hospital Kuala Lumpur (HKL). Participants were recruited and asked to answer the MUAPHQ C-19. A study information sheet that explains the study’s background, purpose, benefit, study process, participation in the study, confidentiality, and the researcher’s contact numbers were given to the participants. Only respondents who were eligible and agreed to participate in the study were recruited.

Test-retest reliability

Participants were also invited to answer back the MUAPHQ C-19 in two to four weeks to test for the test-retest reliability of the MUAPHQ C-19. This invitation was made upon their participation in the study. Participants who were keen to participate in the test-retest were asked to give their contact numbers. They were re-contacted, and to return the questionnaire to the researcher within two to four weeks.

Statistical analysis

Data was analyzed using IBM SPSS (Version 29.0)29. The factor structure of every domain in the MUAPHQ C-19 was explored statistically. The results are displayed in three outcomes: descriptive statistics, exploratory factor analysis and reliability analysis.

Descriptive statistics

The socio-demographic characteristics is presented using mean and standard deviation for normally distributed continuous data or median and interquartile for non-normally distributed data. For the categorical variables, data are presented using absolute number and frequency.

Exploratory factor analysis (EFA)

All items with negative statements from the attitude and practice domain were reverse-coded in the SPSS software before further analysis were conducted. Before proceeding with EFA, several essential steps must be reviewed to fulfil the EFA assumptions30. These assumptions include checking for missing data or outliers, ensuring sample size adequacy, assessing the suitability of the data for multivariate normality, verifying the sufficiency of item correlations and addressing multicollinearity.

For factor exploration, the data suitability and sample size adequacy was tested using Kaiser-Meyer-Olkin (KMO) Measure of Sampling Adequacy and Bartlett’s Test of Sphericity. The KMO index must be more than 0.5 and the Bartlett’s Test of Sphericity must be significant (p < 0.05) to proceed for factor analysis31,32,33.

The data was checked for multivariate normal distribution, through the chi-square versus Mahalanobis distance plot. The maximum value for Mahalanobis is obtained and is compared to the critical value of degree of freedom in the chi-square distribution table at 0.05 significance level. If the maximum value is more than the critical value (p < 0.05), no multivariate normality is seen. If no multivariate normality is violated, the unweighted least squares (ULS) technique can be employed in the factor analysis process to analyse the data.

Additional measures are available to ascertain whether the items are sufficiently correlated. The anti-image denotes the proportion of variance in an item that is unrelated to another item in the analysis34,35. Anti-image correlation matrix diagonals should be greater than 0.5, which is associated with smaller off-diagonal partial pairwise correlations35,36. Other than that, the data matrix’s interrelationships should be sufficient. If the correlation coefficient is less than 0.33, no factor analysis can be performed30,36. In addition, correlation matrix that gives a very high correlation (> 0.80) to the pair item need to be eliminated by any of the pair.

Additional measures are available to ascertain whether the items are sufficiently correlated. The anti-image denotes the proportion of variance in an item that is unrelated to another item in the analysis34,35. Anti-image correlation matrix diagonals should be greater than 0.5, which is associated with smaller off-diagonal partial pairwise correlations35,36. Other than that, the data matrix’s interrelationships should be sufficient. If the correlation coefficient is less than 0.33, no factor analysis can be performed30,36. In addition, correlation matrix that gives a very high correlation (> 0.80) to the pair item need to be eliminated by any of the pair.

The principal component analysis (PCA) with the varimax rotational method was applied to examine the dimensionality and the construct validity of MUAPHQ C-19. The factor loading for every item must be 0.4 or greater to retain the item and forms the factor structure32,33. The cross-loading items are recommended to be eliminated. According to Çokluk37a cross-loading item is defined as item that has a high factor loading on more than one component and the difference between that of components are less than 0.10. Samuels35 has suggested that the item elimination process should be repeated until no cross-loading items remain.

The number of factors are determined in three ways32,38,39: by referring to eigenvalue > 1.0; or by referring to the minimum cumulative variance of 60%; or by referring to the scree plot (the data points above the point of inflection). Which items belong to which component were assessed using the rotated component matrix38,39. Other than that, communality values also were taken into consideration. The communality measured the percent of variance in each variable is explained by all the factors jointly. The communality values greater than 0.3 was taken as the cut-off value for item removal consideration. However, the communality cut-off value of greater than 0.2 is still considered acceptable40.

Reliability analysis

There are two types of reliability study which includes the internal consistency reliability test and test-retest reliability. This reliability was accounted for the scores’ agreement to the same subjects and not be regarded as randomized samples.

The internal consistency for all domains was determined based on the Cronbach’s alpha coefficient. Literatures have recommended that Cronbach’s alpha of more than 0.7 was considered acceptable39,41. There were also another literatures that supported the Cronbach’s alpha above 0.6 as acceptable and highly reliable42,43.

The test-retest reliability was analysed using the intra-class correlation coefficient (ICC). ICC values of greater than 0.75 indicate a good reliability44,45. Two-way mixed effect model and absolute agreement was applied based on average measurement of baseline data and two to four weeks after the first data collected.

Ethical consideration

The ethical approval was obtained from the Universiti Teknologi MARA (UiTM) Research Ethics Committee (REC/01/2021 (MR/01)) and Medical Research and Ethics Committee, MREC (NMRR-21-319-58243 (IIR)) before the study commencement. The research complied with the ethical principles outlined in the Declaration of Helsinki and the Malaysian Good Clinical Practice Guideline. There were no identifiers collected in the data collection form.

Informed consent process

An inform consent was obtained from the interested and eligible participants prior to their involvement in the study. Participants were approached and asked about their willingness to participate in the research. They were fully informed of the study’s background, the purpose of the study, the benefit of the study, the process of the study, and participation in the study, confidentiality, and the researcher’s contact numbers were given as a hard copy to those participants who agreed to participate in the study. This information was provided both verbally and in writing through an informed consent form in the Participant Information Sheet (PIS) and the participation was voluntary. The participants had the right to withdraw from the study at any time, including during and follow-up stages without any penalties.

Results

Sociodemographic characteristics

A total of 100 participants that fulfilled the inclusion and exclusion criteria of the study and completed the questionnaire were taken for the exploratory factor analysis (PPUiTM Sungai Buloh, n = 24 respondents; HUiTM Puncak Alam, n = 38 respondents; HKL, n = 38 respondents). The drop-out rate was 1.9%. The demographic characteristics of the respondents are presented in Table 1.

Table 1 The sociodemographic of the respondents (N = 100).

Exploratory factor analysis (EFA)

The mean and standard deviation, communalities extraction and rotated component matrix for the understanding, attitude, practice and health literacy domains are displayed in Table 2.

Table 2 The mean and standard deviation, communalities extraction and rotated component matrix for the understanding, attitude, practice and health literacy domains.

Understanding domain

The range of mean scores of the items in the understanding domain was between 8.28 and 9.06. Due to the presence of a high correlation between items U4 and U3 with the value of 0.827 and the cross-loading item of U12, the EFA for the understanding domain was re-analysed after removing the item U4 and U12.

Attitude domain

The range of mean scores of the items in the understanding domain was between 4.38 and 8.91. Item A4 was deleted due lto the ower value of anti-imge correlation than the threshold (0.500). Then, data was re-analyzed without item A4. Following this revised analysis, a cross-loading issue was identified with item A10, as it loaded onto factors 1 and 3 with values less than 0.1. Therefore, item 10 was also removed, and the data was re-analysed again by excluding both item A4 and item 10.

Practice domain

The range of mean scores of the items in the practice domain was between 5.05 and 8.97. There is no deleted item in this domain.

Health literacy

The range of mean scores of the items in the health literacy domain was between 7.04 and 9.72.

Item HL1, HL4 and HL6, was deleted due to highly correlated and Item HL7 was deleted due to lower value of anti-image correlation analysis than the threshold (0.500).

Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy and bartlett’s test

The result of KMO Measure of Sampling Adequacy and Bartlet’s Test for each domain are shown in Table 3.

Table 3 Kaiser-Meyer-Olkin (KMO) measure of sampling adequacy and bartlett’s Test.

Since the KMO is more than 0.6 and Bartlett’s test of sphericity is significant (p < 0.005) in all domains, it indicates that the sample data is adequate to proceed with the reduction procedure in EFA.

Total variance explained

The total variance explained for all domains are shown in Table 4.

Table 4 The total variance explained for all domains.

Understanding

After analyzing with 10 items (after removing U4 and U12), two components emerged from the EFA with the computed Eigenvalue greater than 1.0. The eigenvalues ranged between 2.694 and 4.131. The variance explained 41.308% for Component 1, and 26.942% for Component 2. The total variance explained for measuring this domain is 68.250%, well exceeding the cut-off value of 60% and improved after the item removal procedure.

Attitude

After analyzing with 13 items (after removing items A3 and A10) the EFA revealed three components based on Eigenvalues greater than 1.0, with the Eigenvalues ranging from 1.740 to 2.964. The variance explained for 22.802% for Component 1, 22.787% for Component 2, and 13.385% for Component 3. In total, these components explain 58.973% of the variance, which is considered an acceptable level for measuring this construct.

Practice

After analyzing with 11 items (without removing any item), the EFA revealed four components based on the computed eigenvalue greater than 1.0. The eigenvalues ranged between 1.586 and 2.184. The variance explained 19.982%, 19.543%, 19.490% and 14.414% for Component 1, 2, 3 and 4, respectively. The total variance explained for measuring this construct is 73.300%, exceeding the cut-off value of 60%.

Health literacy

After analyzing with 12 items (A = after Removing Item HL1, HL4, HL6 and HL7), the EFA revealed only one component emerged based on the computed eigenvalue greater than 1.0. The eigenvalue is 5.135. The total variance explained for 64.194% for Component 1 which is acceptable.

Reliability analysis

The reliability of the MUAPHQ C-19 was assessed using two methods: Cronbach’s alpha coefficient and the intra-class coefficient (ICC) value. The Cronbach’s alpha was computed to assess the internal consistency reliability of the domains in the MUAPHQ C-19. In contrast, the ICC was computed to assess the test-retest reliability of the domains over time.

Cronbach’s alpha

Based on Table 5, Cronbach’s Alpha of all domains in MUAPHQ C-19 (Version 3.0) was good to excellent. This indicates that the MUAPHQ C-19 is a good and reliable scale.

Table 5 Cronbach’s alpha for each domain and the total cronbach’s alpha of the MUAPHQ C-19 before and after the item removal. (N = 100)

Intraclass correlation (ICC)

Table 6 shows the intraclass coefficient (ICC) for the test-retest of every domain in the MUAPHQ C-19 (Version 3.0). There were 30 out of 100 participants completed the test-retest (30.0%). Overall, the range of ICC for all domains lies from 0.562 to 0.715. The stability of understanding, attitude, practice and health literacy domain was 0.715, 0.698, 0.759 and 0.562, respectively. The practice domain has the highest ICC value, which indicates that the reliability is good. While the understanding, attitude and health literacy domains show fair reliability. In addition, the mean score between the test and retest for all domains is not significant. This indicates that the domains are stable and reliable over time.

Table 6 Intraclass coefficient (test-retest) for all domains (N = 30).

Summary

Table 7 presents a summary of the items for all domains in the MUAPHQ C-19 before and after the EFA analysis. Initially, there were 50 items in the MUAPHQ C-19 questionnaire. However, after conducting the EFA, eight items were removed for various reasons, resulting in a total of 42 items remaining at the conclusion of the analysis.

Table 7 The summary of the items for all domains in MUAPHQ C-19 before and after EFA analyses.

Discussion

In this study 100 responses were obtained and used for the exploratory data analysis (EFA) as the preliminary data to explore the factor structure of a set of measured variables in MUAPHQ C-19. Hair et al.32 has mentioned that a EFA can be conducted as a pilot study to test the justification, straightforwardness and the suitability of the gathered information which later the usefulness of each item and their dimensionality in measuring their respective latent construct can be measured by the EFA46.

EFA is useful in determining the constructs under a given data set and the extent to which these constructs represent the original variables. Besides, EFA also can investigate the correlations between the observed variables. EFA also has the ability to combine the common variables in the dataset into descriptive categories and thus reducing the number of factors. This will lead to shrinking a relatively large set of variables to a smaller and more manageable number while preserving the original variance as much as possible47.

The dimensionality of items may change when the current study differs from the previous study in terms of the difference in the industry, the difference in culture and socioeconomic status between the two populations, and the lapse in time (duration) between the current study and the previous studies. In other words, the dimensions obtained by previous studies might not hold, especially when the current study is conducted in a different environment and different industry. Many researchers suggested that the EFA procedure is necessary for every construct before proceed to the Confirmatory Factor Analysis (CFA) to determine if the dimensionality of items has changed from a previous study where the dimensions were developed33,38,39,48.

Regarding sample size for EFA, there are debates regarding the rule of thumb or the best sample size estimation needed for EFA to be conducted. There are many ways of determining the sample size needed for an EFA, which include the minimum number of cases, samples to variables ratio (N/p) and factor loading requirement. The suggested sample size ranges from as low as less than 50 to as high as greater than 400 47. Ideally, the EFA will function better and gives a more stable solution with a larger sample size by reducing the margin of error. In this study, 100 samples have been used for EFA, which still falls within the range of the suggested sample size49. Furthermore, 100 samples can be considered adequate for the EFA as evidenced by the Kaiser-Meyer-Olkin result above 0.6 and the significant Bartlett’s test for Sphericity result.

The development of MUAPHQ C-19 was continued with several processes of adjusting and deleting items following the EFA results for every domain in the questionnaire. The emerging components from each domain were renamed accordingly, following the meaningful characteristics of its items. The deletion of the items was done following several issues on the very high or low correlation between items and the presence of an item with cross-loading, indicating the redundancies between the subdomains. Redundancies are essentially worth deletion as they ask the same things differently12,47. When developing a questionnaire, decisions to remove items during Exploratory Factor Analysis (EFA) should be guided by both statistical evidence and conceptual considerations50. Items with low factor loadings, low communalities, or significant cross-loadings should be considered for removal. From a conceptual standpoint, it is crucial to assess the relevance of each item to the underlying construct before deciding on its removal. Therefore, a balance between statistical evidence and conceptual considerations must be maintained51.

According to established guidelines, Cronbach’s alpha values above 0.7 generally indicate acceptable reliability for psychometric tools52. Therefore, it can be concluded that the questionnaire demonstrates an adequate reliability, and the items appear to measure the constructs consistently.

The MUAPHQ C-19 was developed to discuss on the association between health literacy to the knowledge, attitudes and practices on COVID-19. MUAPHQ C-19 targets health literacy as a central element, while instruments by Rahman et al.,53 primarily focus on general KAP without a strong emphasis on health literacy. This developed tool is very important for the real-world application designed to assess the understanding, attitudes, practices and health literary related to COVID-19, and can be effectively utilized by various stakeholders such as policy maker and health care providers which could be integrated into community54.

Besides, the rearrangement of the items and factor structure was needed, thus making the questionnaire development has deviated from the earlier conceptual framework proposed in the study. For example, in the Health Belief Model theory, four elements were adapted to be the questionnaire’s subdomain for the attitude domain. The four elements were Perceived Susceptibility, Perceived Severity, Perceived Benefits and Perceived Barriers. EFA has grouped all the items under the subdomain of Perceived Barriers as one component. However, the items developed under the other three subdomains were regrouped into two groups, making the researcher renamed them accordingly. This could be due to some items that the research team thought had grouped differently, but in fact, there are sharing the same elements of meaning with the respondents.

The health literacy domain was initially developed by following the European Health Literacy Survey Framework, which used four elements: access, understand, appraise and apply the information related to health. However, EFA has suggested regrouping all the items into one group only. This may imply that EFA has defined the variable and factor better with more suitable items under the suggested factor. During the COVID-19 pandemic, health literacy has emerged as a critical factor influencing how individuals interpret public health messages14. Individuals with higher health literacy were better equipped to navigate the overwhelming influx of information, discern credible sources, and make informed decisions55.

In summary for EFA, after considering the issues regarding the data adequacy and suitability for analysis, item’s correlation, factor’s extraction and retention, item’s retention and deletion, as well as item’s factor loading, the study has deleted eight items, thus, retained the 42 items under all four domains. The understanding domain retained two factors with ten items, the attitude domain retained three factors with 13 items, the practice domain retained four factors with 11 items, and the health literacy domain with one factor with eight items. The 42-item-Malay Version of the Understanding, Attitude, Practice and Health Literacy Questionnaire on COVID-19explored in the EFA proceeded for the confirmatory method in the confirmatory factor analysis afterwards. An ongoing evaluation and updating of the questionnaire are needed to ensure its continued relevance and accuracy in line with the most current information and public health recommendations.

Strength and limitation of the study

This study has a few limitations. Firstly, the sample selections were limited to populations that visited three leading hospitals near Kuala Lumpur and Selangor region. Thus, the results of the study do not represent a nationwide outcome. However, the researcher has employed systematic random sampling to minimise the selection bias and improve the generalizability of the study. This sample study was drawn from hospital attendees, which may not be fully representative of the general Malaysian population. Therefore, it affects the generalizability. Future research is needed to replicate this study in more diverse settings to better capture broader population. A cross-sectional study was chosen in this study utilized a self-reported data which also subjected to response bias which limiting the representativeness and generalizability of the results.

The validated MUAPHQ C-19 was developed in a single language which is the Malay language. Thus, it opened an unequal opportunity for Malaysian citizens with low Malay language literacy. Future translation studies can be conducted into English, Tamil and Mandarin to cater for broader study populations. Next, the utility of the MUAPHQ C-19 questionnaire could become constrained as time progresses beyond the period of lifted control measures. Nonetheless, given that the preventive measures and controls for COVID-19 are relevant to a substantial portion of airborne and droplet-mediated transmission of infections, its applicability extends to potential respiratory outbreak scenarios soon.

This study did not examine whether reliability or factor structures differ across demographic groups. It is recommended to conduct these subgroup analyses in the future research for better understand how the questionnaire performs across various demographic groups. It will help the consistency and applicability of the tool in broader public health efforts related to COVID-19.

Conclusion

This study produced an exploration valid and reliable new 42-item MAPHQ C-19 that can be used to measure the attitude, health literacy and practice towards COVID-19 among the general public in Malaysia. Future studies on the nationwide population using MAPHQ C-19 (MUAPHQ C-19 can be conducted from time to time to monitor the public’s literacy and behaviour towards COVID-19 and to assess the effectiveness of the prevention and control program implemented.

This study developed a new, valid and reliable 42-item MUAPHQ C-19 questionnaire to measure the understanding, attitude, practice and health literacy towards COVID-19 among the general public in Malaysia. The tool demonstrates preliminary evidence of validity and reliability, based on the results of EFA and internal consistency analysis. Further validation studies, including confirmatory factor analysis (CFA) in future research need to be conducted to strengthen the psychometric properties of the instrument. Additionally, future nationwide studies using the questionnaire can be conducted periodically to monitor public literacy and behaviours towards COVID-19 and assess the effectiveness of the implemented prevention and control program.