Introduction

Excessive fat accumulation alters body metabolism, influences fat distribution, and compromises health1. By 2030, an estimated 2.16 billion people will be overweight and 1.12 billion obese2. Obesity is a multifactorial condition influenced by environmental, behavioral, and genetic factors, with genetic makeup determining susceptibility by affecting metabolism, appetite, and physical activity3. Family and twin studies indicate body mass index (BMI) heritability ranges from 40 to 70%, underlining the major genetic influence4,5.

Despite decades of research, the genetic basis of obesity remains incompletely understood. Mitochondrial uncoupling proteins (UCPs) have been a focus due to their role in energy metabolism6,7. An imbalance between energy intake and fat utilization drives the obese phenotype8. Energy in fat cells is either used for ATP synthesis or dissipated as heat through thermoregulation9. UCPs, originally identified for thermogenesis10 uncouple oxidative phosphorylation from ATP generation11,12. The Human Obesity Gene Map Update identifies uncoupling protein 3 (UCP3) as a candidate gene linked to hereditary obesity susceptibility13,14. UCP3, encoded on chromosome 11q13.4 with seven exons and six introns15 is mainly expressed in brown adipose tissue and skeletal muscle10,16,17 particularly in glycolytic type 2 fibers. Its role includes facilitating fatty acid anion export from the mitochondrial matrix, protecting against lipid-induced damage18,19. UCP3 expression in skeletal muscle increases with high-fat diet, acute exercise, and fasting20,21,22,23 but decreases after endurance training, weight loss, and in oxidative type 1 fibers24,25. Reduced activity of UCP3 is linked to lower energy expenditure and increased fat storage26 while higher expression may enhance metabolic rate and lower BMI27.

Not all genetic variants increase obesity risk; some may be protective. Variants in the UCP3 promoter can modulate expression in ways that predispose to higher BMI or maintain healthy weight. Some studies show no significant association between UCP3 variants and obesity, while others suggest a link28,29 demonstrating inconsistent results.

The relationship between UCP3 variation and obesity in Pakistanis remains unknown. Using sequencing data from 400 individuals in Karachi, this study aimed to identify and annotate novel and known UCP3 variants and to explore characteristics of individuals with obesity carrying variants observed only among cases and absent in controls.

Results

Sociodemographic factors, obesity onset, and associated clinical manifestations

The ages of participants ranged from 12 to 63 years old. The study sample, consisting of a sample size of 400, encompassed 183 females and 217 males. Among the total study population (n = 400), most of the study participants fell in the category of young adults with a median (interquartile range) age of 27 (13.75). Among cases (individuals with obesity), 64 individuals (32%) had obesity since childhood. However, 55 participants reported gaining weight in their twenties, 31 participants had obesity in their thirties, while only 9 participants were observed to have obesity in their forties. In addition, 20.5% of individuals with obesity (n = 41) reported that they have had obesity since adolescence (Supplementary Table 1). Table 1 presents a comparison of categorical parameters (sociodemographic factors and obesity-related physical signs, comorbidities, and stigma) between cases and controls. A statistically significant difference in sociodemographic factors (marital status and family history of obesity) was observed between cases and controls. A greater proportion of individuals with obesity were married compared to the control group (p =corrected0.003). Among participants with obesity (cases), 45% of males (47 out of 105) and 53% of females (50 out of 95) were married; the difference was not statistically significant (p = 0.226). The population with obesity exhibits a greater proportion of individuals with a family history of obesity compared to the control group (pcorrected= 0.009). However, no substantial difference was noted regarding parental consanguinity between cases and controls (p > 0.05). The comparison of obesity-related physical signs in Table 1, which includes abdominal striae, axillary striae, acanthosis nigricans, and hyperlipidemia signs, revealed that these manifestations were substantially more prevalent in cases than in controls (pcorrected< 0.001). Similarly, a greater percentage of cases exhibited obesity-related comorbidities such as joint issues and depression compared to controls (pcorrected< 0.001). Moreover, female participants with irregular menstrual cycles were substantially more prevalent in cases than in controls (pcorrected = 0.009). Furthermore, 46.5% of cases experienced stigma for having obesity. However, after comparing the obesity-related stigma between male and female cases, significantly higher percentage (65.3%) of females were found to be stigmatized for having obesity ( pcorrected< 0.001) as compared to males (29.5%). All associations remained statistically significant even after applying corrections for multiple testing.

Table 1 Comparison of categorical variables between cases (individuals with obesity) and controls (individuals with normal BMI).

Ethnic diversity

The study sample consisted of participants of diverse ethnic backgrounds (Fig. 1A). The highest number of study participants was Urdu-speaking, with 56.8% participants. On the contrary, Balochis were observed as the smallest number of study participants (2.3%). In addition, Sindhis, Punjabis, and Pakhtuns were 7.0%, 17.8%, and 11.0%, respectively. Moreover, 5.1% participants belonged to various other ethnicities of Pakistan, such as Baltistan, Chitrali, Gilgit, Hazara, Kargili, and Kashmiri. Similarly, when compared between cases and normal-BMI controls, almost similar ethnic distribution was observed in both study groups. The order of ethnic representation in cases (Fig. 1B) and controls (Fig. 1C) from highest to lowest representation was—Urdu-speaking (cases = 57.5%, controls = 56.0%), Punjabi (cases = 17%, controls = 18.5%), Pakhtuns (cases = 12.0%, controls = 10.0%), Sindhi (cases = 6.0%, controls = 8.0%), others (cases = 4.0%, controls = 6.5%), Balochi (cases = 3.5%, controls = 1.0%).

Fig. 1
figure 1

Ethnic representation. Panel A shows the distribution of ethnic groups in the overall study sample, Panel B shows the distribution among cases (individuals with obesity), and Panel C among normal-BMI controls. Each rectangle represents a specific ethnic group, and the size of each area corresponds to its proportion (%) in the respective group. All values are shown in percentage. Color intensity reflects relative proportions, with darker blue indicating higher representation.

Significantly aberrant anthropometric and metabolic variables in cases

Comparison of anthropometric and metabolic variables between cases (individuals with obesity) and controls (individuals with normal BMI) is presented in Table 2. Almost all study variables including anthropometric measurements (weight, BMI, waist circumference, hip circumference, waist-to-height ratio, waist-to-hip ratio, biceps skinfold thickness, triceps skinfold thickness, abdomen skinfold thickness, supra-iliac skinfold thickness, thigh skinfold thickness, subscapular skinfold thickness and body fat percentage) and metabolic parameters (systolic and diastolic blood pressure, fasting blood glucose, fasting insulin levels, homeostasis model assessment of insulin resistance, total cholesterol, triglycerides, low-density lipoprotein cholesterol, very low-density lipoprotein cholesterol, coronary risk index, atherogenic index, triglycerides-to-high-density lipoprotein cholesterol ratio, visceral adiposity index, lipid accumulation product, and product of triglyceride and glucose) were found significantly aberrant in cases as compared to their corresponding normal-BMI controls (p < 0.05). All associations remained statistically significant even after applying corrections for multiple testing. However, height and high-density lipoprotein cholesterol (HCL-C) did not differ between cases and controls.

Table 2 Comparison of anthropometric and metabolic parameters between cases and controls.

Disparities in lifestyle and behavioral factors among cases and controls

The comparison of lifestyle and behavioral characteristics between cases and controls (Table 3) revealed significant differences. A notably higher proportion of individuals with obesity (cases) followed a random eating pattern compared to controls (pcorrected = 0.015). Additionally, cases showed a significantly greater tendency to consume fat-dense foods (pcorrected = 0.002). Physical activity levels were considerably lower among cases than controls (p < 0.001). The individuals deviating from the standard sleep period (8:00 PM to 12:00 midnight) and/or wake-up time (5:00 AM to 7:30 AM) are classified as having an irregular sleep-wake cycle. This pattern was significantly more prevalent in cases (individuals with obesity) than in controls (pcorrected= 0.003). Furthermore, shift work was more common among cases (pcorrected= 0.013). All associations remained statistically significant even after applying corrections for multiple testing. However, no significant differences were observed between the two groups regarding diet consciousness and insufficient sleep (< 7 h) (p > 0.05).

Table 3 Comparison of obesity-related behavioral traits between cases and controls.

Genetic variants of UCP3 identified through next-generation sequencing in the study population

Supplementary Table 2 presents a comprehensive list of genetic variants identified in the UCP3 gene through next-generation sequencing (NGS) in the study population. After removing off-target variants beyond positions 73711111 to 73720334, 130 genetic variants were identified in the UCP3 gene. The different genetic variants were specified with chromosomal position, reference allele, and alternate allele. After exclusion of monomorphic variants (genetic positions where all individuals in the study population have the same allele, showing no variation), singleton variants (appearing only once, considering as technical artifacts) and variants detected only in normal-BMI controls (non-disease-associated variants) as a part of quality control for data reliability, a total of 43 genetic variants were selected, including single nucleotide polymorphisms (SNPs), insertions, and deletions, with some being biallelic (two alleles) and others multiallelic (more than one alternate allele). Comparison of alternate allele frequencies observed in this study with those reported in the global and South Asian populations from Genome Aggregation Database (gnomAD) revealed that some variants had relatively high frequencies. However, other variants such as some insertions and deletions, listed in Table 4, were extremely rare or entirely absent in these reference datasets. The functional significance of the variants was assessed using ClinVar, where some were labelled benign, while many were not reported, indicating unknown clinical impact.

Table 4 Filtered UCP3 gene variants after quality control check.

Annotation of selected variants in the UCP3 gene

Figure 2 provides an overview of the forty-three genetic variants in the UCP3 gene. Among them, 3 (7%) were novel, while 40 (93%) were previously reported in databases. The distribution of variant types indicates that the majority (84%) were intronic, followed by 3′ untranslated region (UTR) variants (9%), coding variants (5%), and 5′ UTR variants (2%). All the identified coding sequence variants were observed to be synonymous. This suggests that mostly identified variants are likely to impact gene regulation rather than directly altering protein structure, which may influence gene expression or splicing rather than causing direct functional disruptions in the protein itself. Table 5 presents the annotation of 43 selected UCP3 gene variants to provide insights into their potential functional impact. The variants are classified based on their genomic location, including intron variants, 3′ and 5′ UTR variants, and synonymous variants. Additionally, most of the variants are previously known with existing rsIDs; however, a few variants, such as CTTT > CTTTT/CTT at position 73718482 and GA > GAA at position 73712770, were classified as “novel,” suggesting they were not previously reported in major databases.

Fig. 2
figure 2

Distribution of different types of variants identified in the UCP3 gene. The pie chart on the left illustrates the classification of 43 variants based on their genomic locations, with the majority being intronic (84%), followed by 3′ UTR (9%), coding sequences (5%), and 5′ UTR variants (2%). The table summarizes variant counts, including the number of novel variants (n = 3, 7.0%) identified in the study.

Table 5 Characterization of UCP3 gene variants predicted by variant effect predictor.

Putative functional implications of UCP3 variants

Among 43 UCP3 variants, Combined Annotation Dependent Depletion (CADD) PHRED-scaled scores were available for 37 of the identified variants, while six variants had no pre-computed scores in the database. The majority of variants (n = 36, approximately 84%) had scores < 10, suggesting low predicted deleteriousness and most likely benign or tolerated. Only a single variant (rs3741135) at a chromosomal position 73713960 with G > A transition showed a score > 10 (10.26), falling within the top 10% of potentially deleterious variants. Moreover, SpliceAI annotation revealed that the delta scores for acceptor loss and donor loss were 0.00 for all variants, indicating no predicted loss of canonical acceptor or donor splice sites. In case of delta score for acceptor gain, two variants scored 0.01 and one scored 0.02, while for delta score for donor gain, two variants scored 0.02 and one scored 0.14. All observed values were below the 0.20 threshold for predicted splice impact, indicating no strong evidence for splicing alterations among the identified variants (Supplementary Table 3). In addition, RegulomeDB annotation was available for all variants (43). Of these, 11 variants had scores between 1a–2b, indicating strong evidence of regulatory potential, including overlap with known transcription factor binding sites and DNase hypersensitive regions. The remaining variants had scores ≥ 3, suggesting less likely to affect binding with minimal or no supporting regulatory evidence (Supplementary Table 4).

Characteristics of cases carrying case-exclusive variants

Among the 43 variants identified in this study (Table 4), three (rs35517703, rs142107918, and rs928421426) were detected that showed presence of alternate allele (either in heterozygous or homozygous from) exclusively in cases (individuals with obesity) but not in any of the control samples. These case-exclusive variants were not previously listed in the ClinVar database. Table 6 presents the key characteristics of the individuals carrying these variants. The variant rs35517703 (CT > C), located within intron 6 of the UCP3 gene at chromosomal position 73712720, was found in four cases. Among them, 75% reported the onset of obesity in their twenties, while one case developed obesity in thirties. The group had an average BMI of 33.7 ± 1.7 kg/m2, and half of the individuals reported a family history of obesity. All carriers of this variant exhibited the heterozygous genotype. The variant rs142107918 (G > C), located in 4th intron of UCP3 gene at position 73716530, was identified in one Punjabi male case who was homozygous (CC) for the alternate allele. This individual had obesity onset in his thirties, a BMI of 36.5 kg/m2, and had a family history of obesity. The third variant, rs928421426, situated in intron 1 of UCP3 at position 73718482, was observed in two female cases. One reported obesity beginning in childhood, while the other reported onset in her twenties. The average BMI among these two individuals was 37.9 ± 4.5 kg/m2, and one had a family history of obesity.

Table 6 Characteristics of cases carrying case-exclusive variants.

Discussion

We assessed sociodemographic, clinical, anthropometric, metabolic, and behavioral parameters in individuals with obesity compared to normal-BMI controls and, for the first time in a Pakistani population, performed next-generation sequencing of UCP3 gene variants. The findings provide insights into environmental and genetic contributors to obesity, with both consistent and divergent perspectives from prior literature.

The early onset of obesity observed in this study aligns with global evidence that childhood obesity predicts adult obesity30. Married individuals were more likely to have obesity, consistent with reports from Polish, Iranian, and Chinese populations31,32,33 possibly due to post-marriage lifestyle changes, reduced physical activity, and richer meals33,34. A strong family history of obesity supports its heritability, as shown in twin and family studies4. Clinical manifestations and comorbidities such as acanthosis nigricans, hyperlipidemia signs, striae, joint problems, depression, and menstrual irregularities in women were significantly more prevalent among individuals with obesity, consistent with prior research linking obesity to multisystemic health burdens and reproductive dysfunctions35,36,37,38,39.

Anthropometric parameters—including weight, BMI, waist and hip circumference, waist-to-height and waist-to-hip ratios, skinfold thickness, and body fat percentage (%BF)—were consistently elevated among participants with obesity, reflecting generalized and central obesity40. Metabolic abnormalities, including elevated fasting blood glucose (FBG), insulin, homeostasis model assessment of insulin resistance (HOMA-IR), triglycerides (TGs), total cholesterol (TC), low-density lipoprotein cholesterol (LDL-C), very low-density lipoprotein cholesterol (VLDL-C), and lipid ratios (with no difference in HDL-C), indicate insulin resistance and dyslipidemia, consistent with prior South Asian and global findings41,42. An elevated TG-to-HDL-C ratio, visceral adiposity index (VAI), and lipid accumulation product (LAP) further support their predictive value for metabolic risk43,44,45,46. Lifestyle factors linked to obesity included random eating patterns, high intake of fat-dense foods, low physical activity, irregular sleep-wake cycles, and higher prevalence of shift work. These findings are consistent with global evidence connecting modern lifestyle patterns to obesity risk. In our sample, irregular meal timing and greater fat-dense food consumption paralleled observations from U.S. and international cohorts47,48. Reduced physical activity was also associated with obesity, consistent with Spanish49. Irregular sleep-wake cycles and shift work, known to disrupt circadian rhythms50,51 were significantly more common among cases. In Pakistan, economic pressures often necessitate shift-based employment that may partly explain this association. Cultural factors may influence the absence of significant differences in diet consciousness between cases and controls as these findings contrasts from the previous study conducted in Dutch population, where conscious dietary restraint is often more prevalent among overweight individuals52. The weight-related stigma was observed significantly more in female cases than male cases, aligning with prior findings of greater societal pressure on women’s physical appearance53.

In this study, 130 genetic variants were initially identified in the UCP3 gene region (chr11:73711111–73720334), reflecting substantial genetic diversity in the Pakistani population. After stringent quality control to remove off-target, monomorphic, and singleton variants, 43 remained, including SNPs, insertions, and deletions in intronic, 3′ UTR, 5′ UTR, and coding regions. Most were intronic, consistent with broader genomic research showing that functional variation in complex diseases like obesity often arises from non-coding regulatory variants rather than protein-altering mutations54. Such variants may affect transcription, splicing, or mRNA stability rather than directly altering UCP3 structure. Given UCP3’s role in mitochondrial energy metabolism, even subtle regulatory changes could influence energy balance55. Comparison with global datasets (gnomAD, Variant Effect Predictor) revealed several variants that were rare or absent globally. For example, rs35517703 (CT > C deletion) at position 73712720 is extremely rare globally but common (37%) in our population, while a novel GA > GAA insertion at 73712770 occurred at a high frequency (~ 66%) among Pakistanis but was absent in gnomAD. These findings highlight population-specific genetic architectures of obesity risk, supporting previous observations that South Asian populations have distinct risk profiles56,57.

Three novel variants were identified: GA > GAA insertion at 73712770 in the 6th intron, and CTTT > CTT deletion and CTTT > CTTTT insertion at 73718482 in the 1st intron. These were absent in Single Nucleotide Polymorphism database (dbSNP), gnomAD, and ClinVar, suggesting population-specific mutations. Given UCP3’s roles in thermogenesis, fatty acid oxidation, and lipotoxicity protection18,19,24,25 even minor regulatory changes could be phenotypically relevant. Higher alternate allele frequency in our cohort compared to their absence globally may reflect population-specific diversity or shared ancestral background.

Using multiple in silico annotation tools, we assessed the potential functional relevance of the identified variants. Most had low CADD PHRED scores (< 10), indicating limited predicted deleteriousness, except rs3741135 (score = 10.26), which merits further consideration. This pattern is consistent with reports that common, especially non-coding, variants tend to have lower CADD scores than rare ones58,59. SpliceAI predictions suggested no substantial splicing effects, with all acceptor and donor gain scores below the 0.20 threshold and a maximum donor gain of 0.14, indicative of weak prediction. While rare exceptions exist, functional validation would be needed to confirm any splicing impact60. RegulomeDB analysis highlighted some variants with ranks ≤ 2b, supported by transcription factor binding and DNase evidence, suggesting possible regulatory roles. Notably, this observation aligns with a previous comprehensive computational analysis of obesity-associated intronic variants, many of which also had RegulomeDB ranks ≤ 261. Such variants, despite low CADD scores, may still influence gene regulation.

Three intronic variants—rs35517703, rs142107918, and rs928421426—were observed exclusively in cases. Their presence in regulatory regions supports a possible effect on UCP3 expression through alternative splicing62,63. Although no functional annotations exist in ClinVar or dbSNP, their occurrence only in obesity cases, coupled with early obesity onset and elevated BMI in carriers, suggests a contributory role in obesity pathogenesis. Larger studies are needed to confirm these findings, assess their association with obesity, and examine their presence in other populations.

A key strength of this study is the deliberate selection of variants for functional annotation, population frequency assessment, and pathogenicity prediction, providing a strong foundation for replication. Inclusion of well-represented variants minimizes misinterpretation and reinforces the importance of studying population-specific variants56. Genetic studies in understudied populations, such as Pakistanis, offer opportunities to identify novel markers. Another strength is the integration of sociodemographic, clinical, lifestyle, and genetic data, providing a holistic view of obesity. Rigorous variant screening and annotation further enhance data reliability. At this point, it is important to mention the limitations of this study. One of these limitations includes the relatively moderate sample size (n = 400). Another limitation is the inclusion of only one gene (UCP3) for NGS analysis, which may overlook other relevant genes or polygenic effects. Finally, the reliance on self-reported lifestyle data may be subject to recall bias. All these limitations are attributable to financial and time constraints.

Conclusion

This study compared sociodemographic, clinical, anthropometric, metabolic, and behavioral parameters between individuals with obesity and normal-BMI control individuals in a Pakistani population. Participants with obesity showed a higher prevalence of physical signs and comorbidities such as abdominal and axillary striae, acanthosis nigricans, hyperlipidemia signs, joint issues, and depression. Among females, irregular menstrual cycles were more common in those with obesity. Anthropometric and metabolic parameters were markedly elevated in individuals with obesity, and aberrant lifestyle patterns were more prevalent in this group. The genetic analysis identified several UCP3 variants, including three found exclusively in individuals with obesity. These variants, located in regulatory regions, may influence obesity risk through modulation of gene expression rather than changes in protein structure. These findings provide population-specific insights that can inform future research on the genetic basis of obesity.

Materials and methods

Study design, work location, and ethical approval

This study used a case-control research design at Dr. Panjwani Center for Molecular Medicine and Drug Research, located at the University of Karachi in Pakistan. The study involved 400 participants, 200 of whom had obesity and 200 were individuals of normal BMI. The study was approved by the Independent Ethics Committee (IEC) of the International Center for Chemical and Biological Sciences (Ref. No ICCBS-001-BC-2014/Protocol/1.0) and the Advanced Studies and Research Board (ASRB), University of Karachi, Pakistan. A simple random sampling procedure without replacement was used to select participants. Participants with endocrine disorders and a history of medications known to affect body weight were excluded from the study. Pregnant and lactating women were also excluded from the study. Individuals with a BMI ≥ 30 kg/m2 were included in the case category, while participants with a BMI within the range of 18.5–24.9 kg/m2, along with the absence of self-reported or clinically diagnosed obesity-related conditions / metabolic diseases, encompassed the control group. Adult subjects (≥ 20 years old) had their BMI reference ranges based on WHO (World Health Organization) recommendations. However, the BMI reference ranges for individuals under 20 years old were considered according to the CDC (Center for Disease Control and Prevention) growth charts. The demographic data encompassed a broad range of information such as sex, age, marital status, ethnicity, parental consanguinity, and family history of obesity. A written informed consent was obtained from each participant prior to their participation in the study. In case of children and adolescents, consent was obtained from their parents or legal guardians. It is important to note that all study protocols were adhered to the principles outlined in the Declaration of Helsinki (1964) and its subsequent amendments or equivalent ethical standards.

Obesity-related physical signs, comorbidities and behavioral traits

A physical examination was conducted to assess for the presence of hyperlipidaemia signs, acanthosis nigricans, and abdominal and axillary striae. Weight-related stigma was assessed via concise, direct, and self-reported questions (framed as yes/no)64. The study participants with obesity (cases) were asked whether they had ever experienced unfair or negative treatment because of their weight, been teased about their weight, or had avoided social or healthcare situations due to fear of judgment for body weight. Information related to comorbidities (Joint problems, depression, and female menstrual cycle) and behavioral factors (eating pattern, diet consciousness, tendency towards fat-dense food, physical activity, lack of sleep, sleep-wake cycle, and shift work) for each participant was collected via a well-designed questionnaire.

Collection of blood samples

A 5-mL blood sample was obtained with a disposable syringe after fasting for 8 to 12 h. A 2-mL blood sample was taken into a yellow-top vacutainer tube for serum isolation. The serum was taken in a 2 mL microfuge tube for further biochemical analyses. One mL of blood was taken into a grey-top tube containing an antiglycolytic agent for determining FBG concentration. Finally, a 2 mL blood sample was collected in an EDTA-coated lavender-capped vacutainer tube for DNA extraction.

Anthropometric measurements

During the study, detailed anthropometric data was collected from each participant, including measurements of body height, weight, waist circumference (WC), hip circumference (HC), and various skinfold thicknesses to compute %BF. Measurements were recorded while wearing lightweight clothing. Participants were instructed to stand upright and remain relaxed during measurements. Body weight (in kilograms) and height (in centimeters) were measured by a weight and height machine (TCS-180 L, Capacity 180 kg×50 g, KINLee ®). The BMI of adults was calculated using the online BMI calculator of NIH (https://www.nhlbi.nih.gov/health/educational/lose_wt/BMI/bmicalc.htm). For children and adolescents, BMI was determined using percentiles. WC was measured in centimeters using a non-stretchable measuring tape by wrapping it at the midpoint between the hip bone and ribs. To ensure precision, WC was measured after expiration. HC was also measured in centimeters by placing a non-stretchable measuring tape around the widest area of the buttocks. The waist, height, and hip circumference measurements were used to calculate the waist-to-height ratio (WHtR) by dividing WC by height and the waist-to-hip ratio (WHR) by dividing WC by HC. Skinfold thickness at different body locations, such as biceps, triceps, abdomen, sub-scapular, supra iliac, and thighs, was assessed using a skinfold caliper (Model # C-120B, Slim guide, MI, USA). Skinfold thickness measurements were used to calculate %BF using equations appropriate to each sex, as outlined by Jackson and Pollock in 198565.

Metabolic parameters

The metabolic assessments comprised blood pressure, fasting insulin, FBG, and lipid profile. For accurate and consistent measurement of systolic (SBP) and diastolic blood pressure (DBP), the participant was instructed to sit comfortably and remain at rest before measuring blood pressure. The blood pressure was measured from the right arm of each participant. A digital blood pressure monitor (Model CH-453-AC, CITIZEN SYSTEMS, JAPAN) and blood pressure cuff (Model SCN-003, CITIZEN SYSTEMS, JAPAN) were used for blood pressure assessment. The fasting insulin concentration was measured by sandwich immunoassay using Elecsys Insulin assay (Catalog number 12017547 122 Roche Diagnostics Corporation, Indianapolis, IN 46250, US) on an automatic immunochemistry analyzer (Roche Hitachi cobas e 411). The levels of FBG (Catalog number 5.17604.0001, Martin Dow, France), TC (Catalog number 5.17501.0001, Martin Dow, France), TGs (Catalog number 5.17511.0001, Martin Dow, France), HDL-C (Catalog number 5.17518.0001, Martin Dow, France), and LDL-C (Catalog number 5.17519.0001, Martin Dow, France) were measured via end-point enzymatic-colorimetric-based assay kits on automatic chemistry analyzer (Selectra Pro M Lite, France). The estimated VLDL-C levels were obtained by dividing the TG concentrations by 5. Moreover, by computing the values from different anthropometric and biochemical assessments, additional parameters such as HOMA-IR, VAI (mmol L−1), LAP (mmol L−1), TyG index (product of triglyceride and glucose), coronary risk index (CRI), atherogenic index (AI), TG/HDL-C (triglyceride-to-HDL-C ratio) were calculated. HOMA-IR was calculated using the formula suggested by Matthews et al.66. To calculate VAI46 and LAP67 gender-specific formulae were used. The formula suggested by Simental-Mendia et al.68 was used to calculate TyG index. For calculating VAI and LAP, the values of TGs and HDL‐C were converted into mmol L−1 by multiplying the TG values by 0.0113 and HDL-C values by 0.0259. CRI was considered as the ratio of TC-to-HDL-C, and AI was taken as the ratio of LDL-C-to-HCL-C.

DNA extraction and NGS

DNA was isolated from total blood using the genomic DNA purification kit (ZYMO RESEARCH, catalog number D4068, CA, US) according to the manufacturer’s instructions. The horizontal agarose gel electrophoresis (Thermo Scientific, MA, USA) was conducted to evaluate the overall integrity of the genomic DNA. The isolated DNA was quantified using a NanoDrop™ spectrophotometer (Thermo Scientific, USA). The UCP3 gene sequence from the NCBI Reference Sequence (RefSeq: NC_000011.9) was used as the basis for primer design. The forward (AGCGTCCACAGCTTAAAGGAG) and reverse primers (TTGGAAGTCGGTCGTCTGG) were designed using the free online tool Primer3 (v. 0.4.0) (https://bioinfo.ut.ee/primer3-0.4.0/). The UCP3 gene is positioned at 73711322 (start) − 73720130 (end) on chromosome 11. For amplification of the UCP3 gene, the upstream and downstream regions were also considered (73711111–73720334). Thus, the final product size was found to be 9224 bp. The UCP3 was amplified by long-range PCR using 2X Invitrogen Platinum SuperFi II PCR Master Mix (Catalog number 12368010, Thermo Fisher Scientific, USA). Purification of PCR products was done using an automated Agencourt AMPure XP purification system (Kit Part number A63880, BECKMAN COULTER, MA, USA). The Qubit dsDNA HS Assay kit (Catalog number Q32854, Thermo Fisher Scientific, MA, USA) was employed for the quantification of the amplicon. The concentration of amplicon in all the samples was adjusted to a range of 0.2-0.3 ng/µL. The amplicon was sequenced by preparing a paired-end DNA library by using the Nextera XT DNA library prep kit (Illumina, San Diego, CA, US) using the reference guide. For this, 1 ng of purified PCR product was used. The pooled library was diluted for sequencing on the Illumina® MiSeq® system (Illumina, San Diego, CA, USA). For this, 10 µL of each library was added in a clean and sterile 1.5 mL microcentrifuge tube to create a multiplexed library pool.

Bioinformatic analysis

Raw sequencing reads were assessed using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) to evaluate base quality69. Low-quality reads were trimmed using Trimmomatic70 and high-quality paired-end reads were aligned to the human reference genome (GRCh37/hg19) using Burrows-Wheeler Aligner with the maximum exact matches71. Resulting SAM files were converted to BAM format and processed with SAMtools version 0.1.1972 and PICARD (version 1.109; http://picard.sourceforge.net)73 for sorting, duplicate removal, and read group assignment. Variant calling was performed using the HaplotypeCaller tool in Genome Analysis Toolkit (GATK v3.2) applying a quality score threshold of 30, following realignment around insertions and deletions and base quality score recalibration74. Individual GVCF files were merged using GenotypeGVCFs, and variants were filtered based on depth (DP < 20), genotype quality (GQ < 20), and overall quality (QUAL < 50). Variant Quality Score Recalibration was applied to improve variant call accuracy using established training datasets75. Variants outside the target region (chr11:73711111–73720334) were excluded, along with monomorphic sites, singletons, and variants exclusive to controls (non-disease-associated variants). Final variant sets were used for downstream analyses including functional annotation, population frequency assessment, and pathogenicity prediction.

Variant annotation, population frequency comparison, and functional impact assessment

To interpret the NGS results, all variants types (single nucleotide polymorphisms and indels) were considered, including those in 3’ and 5’ untranslated regions (UTRs), intronic regions, exonic regions (missense, synonymous, and frameshift), and variants near splice sites76. The variant annotation was performed using the ANNOVAR tool (v2014)77. For further analysis, the annotated VCF file was converted to an MS Excel-compatible format using tab-delimited settings. For maximum accuracy and comprehensive funactional annotations, the variants were analyzed using the web-based human VEP (Ensembl GRCh37 release 113) (https://www.ensembl.org/info/docs/tools/vep/index.html). The VEP output showed the consequence, impact, the intron or exon number, protein position, and amino acid. It also showed whether the variant is novel or already existing. The scores of CADD and delta scores of SpliceAI (v1.3) prediction from the VEP tool were also used to explore putative functional implications. The CADD PHRED-scaled scores were considered to estimate the potential deleteriousness of each variant78. However, SpliceAI predictions were extracted to assess the predicted effect of variant on splicing through acceptor gain/loss or donor gain/loss79,80. Moreover, RegulomeDB scores (v2.0) were used to assess the likelihood of a variant being located within a regulatory region81. The variants of interest (VOIs) were further examined using publicly accessible genetic databases. To evaluate the allele frequencies of the variants, comparisons were made with genetic data from South Asian and global population available in the gnomAD v2.1.1 (https://gnomad.broadinstitute.org). ClinVar (a curated repository of human genetic variant-phenotype associations; https://www.ncbi.nlm.nih.gov/clinvar/) was also used to check the pathogenicity (pathogenic, likely pathogenic, variant of uncertain significance, likely benign or benign) of the VOIs if they are present on the database82. The variants which were not reported in ClinVar were searched in dbSNP. All databases and tools were accessed in March, 2025.

Sample size calculation

The Calculator.net sample size tool (available at: https://www.calculator.net/sample-size-calculator.html) was used for estimating sample size of the study. A minimum sample of 375 participants was required based on a population size of 235,001,746 (Pakistan’s estimated population in 2020, according to Worldometer; available at: https://www.worldometers.info/world-population/pakistan-population/) and a generalized obesity prevalence of 57.9% as reported in a prior study83 with a 95% confidence level and 5% margin of error. Thus, a total of 400 participants were included in the study, exceeding this requirement.

Statistical analysis

Age categories and obesity age onset were presented as counts with percentages. All categorical variables were analysed using the chi-square test. Normality of continuous parameters was assessed using the Shapiro-Wilk test, with a p-value < 0.05 indicating non-normal distribution. Non-normally distributed continuous variables were expressed as median with interquartile range, while normally distributed parameters were represented as mean ± standard deviation (SD). Comparative analysis of continuous parameters between cases (individuals with obesity) and normal-BMI controls was performed by the Mann-Whitney U test for non-normally distributed data, providing mean ranks, and p-values, whereas for parameters with normal distribution t-test was employed. All association analyses were corrected for multiple comparisons using the Benjamini-Hochberg method to control for false discovery rates84. Statistical significance was determined by a p-value < 0.05. All statistical analyses were conducted using SPSS version 25. The treemaps illustrating the distribution of ethnicities were generated using RAWGraphs 2.0, an open-source web-based tool for visual data representation (https://app.rawgraphs.io/)85.