Abstract
The efficacy and effectiveness of antibiotics and neuropathic drugs are essentially guided by their physicochemical properties governing stability, bioavailability, and therapeutic activity. This work utilises mathematical modelling and quantitative structure-property relationship (QSPR) analysis for predicting important physicochemical properties such as boiling point, enthalpy of vaporisation, flash point, and molar refraction of chosen antibiotics and neuropathic drugs. Modified degree-based topological indices are utilised as molecular descriptors for correlations between physicochemical functionality and molecular structure. Linear and quadratic forms are various forms of regression models employed for improved predictions. The findings exhibit excellent performance of quadratic models across all but one property compared to linear models, highlighted by significant statistical markers like high \(R^{2}\) values and low error margins. These results highlight the potential use of topological descriptors in combination with sound mathematical frameworks for drug optimisation and early-stage screening.
Similar content being viewed by others
Introduction
Over recent years, QSPR modeling has been established as a highly viable technique in pharmaceutical chemistry for its potential in predicting major physicochemical properties from molecular structure directly1. This predictive property is important for accelerating drug discovery since early evaluation of molecular performance can be conducted without requiring long-term laboratory experiments2. Boiling points, enthalpy for vaporization, flash points, and molar refraction are particularly significant due to their impact on influential pharmaceutical parameters such as stability, availability, and formulation safety3. Such considerations are particularly significant when developing antibiotics, where therapeutic efficacy is highly dependent upon molecular traits. A QSPR model provides standardized means for connecting molecular descriptors and such properties, making the rationale-based lead optimization and drugs design possible4. For an elaborate review of QSPR methods applied to drug molecules, see5. Kumar and Das6 compared Generalized Quadratic (GQ) and Quadratic Generalized (QG) indices as molecular descriptors. Their work shows how these indices are effective in modeling structural aspects pertinent for quantum chemical properties. The analysis indicates that these two descriptors hold potential for improved predictive modeling for QSPR research work.
Here, important physicochemical traits are studied for a group of drugs of pharmacological interest, such as Norfloxacin, Ciprofloxacin, Gemifloxacin, Chlorpyrifos, Dimethoate, Phorate, Monocrotophos, Thiram, Azoxystrobin, Atrazine, Chlorothalonil, and Warfarin. Most of these drugs, which are either antibiotics or structurally analogous bioactive molecules, are extensively employed for various therapeutic and agricultural applications and share diverse and pertinent physicochemical requirements7. For example, vaporization enthalpy informs about the amount of energy needed for phase boundary transitions, which has direct bearing on drug formulation and storage functions. In a similar vein, data for boiling points for drugs such as Norfloxacin and doxycycline are important for gauging thermal stability for various temperature and pressure conditions8. Kumar and Das9 explored the structural sensitivity and chemical applicability of numerous newly developed degree-based topological indices. Their analysis proved how these indices react under different structural changes in molecules. The analysis reaffirmed their potential applicability in QSPR/QSAR modeling by proving effective for correlation with important chemical parameters. In turn, molar refraction is a gauge of the polarizability of a molecule and has predictive value for drug biological environment interactions, whereas flash point measurements are important for determining flammability and for proper handling of volatile drugs10.
The present research utilizes degree-based topological indices, which are mathematical descriptors of molecular structure11,12. In graph theory, vertex degree noted \(\widehat{\lambda }_{a}\) refers to the number of connected edges linking a given vertex a13. Degree-based topological indices are highly effective for predicting physicochemical, thermodynamic, and biological characteristics because they capture essential structural information such as molecular branching and network connectivity14. Kumar and Das15 compared the performance of Nirmala and Sombor indices in detail, i.e., comparing their applicability, degeneracy, and smoothness. This work compared how each index functions in differentiating between molecular structures. The results underscore the efficacy and limitations of these two indices and offer insights into their possible use for modeling molecules using graphs. Within QSAR and QSPR analysis, degree-based topological indices constitute an important subdomain because vertex degree-based information allows for high correlation between molecular structure and experimentally tangible properties or bioactivities16.
Jacob and Clement17 elucidated the structural complexity of zeolite EDI through the application of topological entropy and proved the efficacy of the model to predict the interaction of molecules inside zeolite structures. The method introduced a new quantitative means to interpret host-guest behavior in porous compounds. Gayathri, Santiago, and Govardhan18 investigated hexagonal fractals via the application of topological indices and the use of fractal dimensions to describe structure-property relationships. Their work showed the capability of the aforementioned mathematical measures to predict the behavior of molecules and predict material properties. Peter et al.19 created a predictive model of interaction energies between molecules through the application of topological and spectral entropies of zeolite AWW. The article stressed the significance of descriptors of the entropic kind to predict the interaction of molecules inside porous compounds. Shenoy et al.20 carried out statistical analysis of the application of Revan topological indices to drug compounds for the treatment of tuberculosis. Their analysis showed the promise of the indices to analyze and discriminate molecular structures for application to pharmaceutical research. Jacob et al.21 carried out network topology and entropy analysis of tetragonal farneseite zeolites in order to study their structural complexity. Their research emphasized the use of measures of entropy to identify complex topological properties of the zeolite structures.
The application of topological indices as structural descriptors has a history dating back a long time in predicting molecular properties. The first contribution was made by Wiener22, who proved that paraffin boiling points could be well estimated through topological indices, setting a precursor to the relationship between molecular structure and thermodynamic property. This was further developed upon by Gutman and Trinajstic23 when the first Zagreb indices were introduced degree-based topological indices that have been an integral part in investigating the electronic aspect of molecules thereafter. This framework was expanded upon later by Kulli24 when he introduced reverse Zagreb indices that capture reverse vertex degrees and present additional structural insights. Further building upon this was done by Khabyah et al.25, who applied reverse degree-based indices toward extracting knowledge about coronene structures. Being symmetric, two-dimensional, and highly conjugated in nature, coronene is a perfect system for studying molecular symmetry and stability using topological analysis. For this work, a diversity of modified degree based topological indices are applied toward capturing the antibiotic and related molecules’ molecular structure. The indices are capable of encoding essential structural features size, branching, and connectivity effectively, enabling quantitative correlation between architectural building blocks and physicochemical behavior26.
Quantitative Structure-Property Relationship (QSPR) models have been widely developed for predicting chemical and physical properties of pharmacologically active compounds, illustrated by comparison analysis between regression models depicted in Figs. 1, 2, 3, and 4. Amongst these studied compounds, Norfloxacin, a synthetic fluoroquinolone antibiotic, is extensively utilized in clinical practice for treating urinary tract infection (UTI), prostatitis, and various gastrointestinal infections caused by susceptible bacteria. Norfloxacin inhibits bacterial DNA gyrase and ultimately causes cell division and DNA replication inhibition, which leads ultimately to cell death in bacteria. This drug is mainly effective in gram-negative organisms like Escherichia coli and Klebsiella pneumoniae and is administered orally. Nevertheless, in clinical use, its activity is sometimes hindered because of side effects like nausea, dizziness, and uncommon but serious tendon issues, particularly in elderly patients. Norfloxacin is contraindicated in infancy and pregnant individuals because of possible adverse cartilage and joint development effects, as depicted in Fig. 1a (source: http://www.chemspider.com).
Ciprofloxacin Ciprofloxacin is a fluoroquinolone antibiotic agent that exerts broad-spectrum efficacy against bacterial infections, including respiratory, urinary, gastrointestinal, and skin infections. This agent exerts its antibacterial effects by inhibiting bacterial DNA gyrase and topoisomerase IV, enzymes critical for DNA replication in bacteria27,28. This triggers the death of bacterial cells. Side effects may include nausea, diarrhea, and photosensitivity, risk of tendon ruptures, particularly in the elderly. Because of these possible adverse effects, it is usually avoided in children as well as pregnant women see Fig. 1b.
Gemifloxacin Gemifloxacin is a fluoroquinolone antimicrobial agent indicated for respiratory tract infections, including acute bacterial exacerbation’s of chronic bronchitis and mild to moderate forms of community-acquired pneumonia29,30. The drug exerts its antibacterial activity by inhibiting DNA gyrase and topoisomerase IV-bacterial enzymes critical for DNA replication. The common side effects include rash, diarrhea, and nausea, while serious ones might include tendonitis and tendon rupture. This is a cautious prescription that should not be dispensed to children or pregnant women see Fig. 1c.
Chlorpyrifos Chlorpyrifos is not an antibiotic but an organophosphate pesticide in common use for the control of insect pests both in agriculture and residentially. It works through the inhibition of acetyl cholinesterase enzyme action in insects, an action which causes overstimulation of the nervous system and eventually death31,32. It is due to these reasons that it has been restricted or banned in many countries, with studies linking exposures to developmental and neurological problems. see Fig. 2a.
Dimethoate It is not an antibiotic; it is an organophosphate insecticide, widely used in agriculture to control a wide range of pests on crops. It works by inhibiting acetyl cholinesterase, an enzyme essential for insect nerve function, thus disrupting their nervous system and eventually killing them33,34. While effectively applied for pest control, dimethoate had raised health and environmental concerns because of its toxicity, affecting not only non-target species, including beneficial insects, but also humans upon exposure. Due to its associated risks, certain countries restrict its use; recommended guidelines on safety are advised upon application to reduce the impact on the environment and human health see Fig. 2b .
Phorate Phorate is a highly toxic organophosphate insecticide and acaricide. It is not an antibiotic, and its application is only for the safety of crops such as corn, potatoes, and cotton. Phorate acts through the inhibition of acetyl cholinesterase, an enzyme critical to the normal functioning of the nervous system; the insects are then paralyzed and dead35,36. Phorate is extremely toxic and, therefore, particularly dangerous to humans, wild animals, and the environment through exposure. Because of this, its use has been seriously circumscribed when this insecticide is used, and certain countries have banned it. Recommended safety precautions during handling and application are rigorous see Fig. 2c.
Monocrotophos Monocrotophos is not an antibiotic, but it contains a highly toxic organophosphorus insecticide used in large-scale control of various pests in a variety of crops such as cotton, rice, and sugar cane. It acts through the inhibition of acetyl cholinesterase that overstimulates the nervous system and eventually leads to paralysis and death in insects. Monocrotophos is highly toxic, and exposure has a severe threat to human beings, birds, and aquatic life37,38. For this reason, several countries have banned or restricted the use of the chemical, due to which only a High degree of care can be afforded to those areas where the use of the chemical is not prohibited for agricultural purposes see Fig. 3a.
Thiram Thiram is not an antibiotic. It is a chemical type of fungicide, and it finds wide application in agriculture to prevent fungal infections in crops such as fruits, vegetables, and ornamental plants. This effect is through the inhibition of spore germination of fungi, thus protecting the plants from diseases such as blight, rust, and mold39,40. It is injurious to health if inhaled and ingested and can cause irritation to the skin of human beings too. As it is hazardous to the environment, its use has some prescription on the basis of which it has to be handled and applied see Fig. 3b.
Azoxystrobin In fact, it is not an antibiotic but a broad-spectrum fungicide. It finds wide application in agriculture for the protection of various crops against very different fungal diseases such as blight, rust, and powdery mildew. The mode of action consists of the inhibition of mitochondrial respiration in fungi, which halts the growth of fungal cells and stops the proliferation of the infection41,42. This product is taken to be generally safe while being used in the prescribed manner; however, it does have a potential threat to the environment, particularly aquatic life, and should be carried out very carefully near water systems to minimize ecological damage see Fig. 3c.
Atrazine Atrazine is not an antibiotic, but it is one of the most widely used herbicides in the world. It is applied for the control of broadleaf and grassy weeds in a number of crops, including corn, sugarcane, and sorghum43,44. A chemical kill of plants, it inhibits photosynthesis in plants, thus causing death to the weeds and increasing crop yields. Due to the possible influence of the chemical on fauna and suspected endocrine-disrupting activity, its use is either limited or prohibited in some countries, while it remains one of the most widely used herbicides in other countries see Fig. 4a.
Chlorothalonil Chlorothalonil is not an antibiotic. It is a broad-spectrum fungicide. It finds wide application in agriculture and horticulture for the protection of crops such as peanuts, potatoes, tomatoes, and turf grass against fungal diseases like mold, blight, and leaf spot. Where chlorothalonil is effective, there is also a risk to the environment, aquatic organisms in particular, and irritation in man in the case of exposure45. Its use is thus regulated in many regions with specific view to minimizing ecological impacts, and safety measures are advised during application with a view to reducing health risks see Fig. 4b.
Warfarin Warfarin is not an antibiotic; it is an anticoagulant or blood thinner used to prevent and treat blood clots that form in the veins of the legs, in the lungs, and with atrial fibrillation. Warfarin requires close monitoring since its efficacy depends on dietary factors, concurrent medications, and patient variables. Blood sampling is frequent to maintain it within a narrow window of safety and efficacy46. Adverse effects include bleeding complications, which may well require a patient to be advised about diet and the use of medications cautiously to avoid undesirable interactions see Fig. 4c.
Degree based topological indices
In this section, we provide the initial literature review for topological indices.
Gutman and Polansky introduced the first and second Zagreb index47 as:
Martinez-Martinez et al.48 defined the Harmonic index as follows:
The forgotten index was introduced by Furtula and Gutman49 as:
The Shilpa-Shanmukha Index was defined by Zhao et al.50 as follows:
The atom bond connectivity index was introduced by Estrada et al.51 as:
The Randic index was defined by Randic et al.52as:
The sum connectivity index and the geometric arithmetic index was defined by Vukicevic et al.53 as:
The Hyper Zagreb Index was introduced by Rajasekharaiah et al.54 as:
Main results
In this section, we presented antibiotic drug molecules’ degree-based bond partitions, see Table 1. Moreover using this edge partition for all antibiotic drugs, we have computed degree-based topological indices.
For the drug molecule Norfloxacin \(\left( G_1\right)\), By using first and second Zagreb index and edge partition of \(G_1\).
By using Harmonic index and edge partition of \(G_1\)
By using forgotten index and edge partition of \(G_1\)
By using Shilpa-Shanmukha Index and edge partition of \(G_1\)
By using atom bond connectivity index and edge partition of \(G_1\)
By using Randic index and edge partition of \(G_1\)
By using sum connectivity index and edge partition of \(G_1\)
By using geometric arithmetic index and edge partition of \(G_1\)
By using Hyper Zagreb Index and edge partition of \(G_1\)
Similarly, in accordance with what was described above, this computation process was applied to all of the entire dataset, and the calculated values correspondingly for all 12 antibiotic drugs. The ERC’s governing instruments are compiled with Table 2.
Exploring topological indices and physicochemical properties via regression models
Quantitative Structure-property Relationship (QSPR) models allow for predictions of a compound’s various physical and chemical properties solely from its molecular structure55. Quantitative Structure-Activity Relationship (QSAR) models provide a mathematical approach for investigating complex interdependencies between a molecule’s structure and its physicochemical or biochemical activities56. Topological descriptors are particularly significant in QSPR investigations, as they contain useful connections with a variety of physicochemical properties pertinent to drugs employed for treating various medical conditions57. Both linear and quadratic regression models were employed in this study for analyzing antibiotic drugs’ physicochemical properties. The experimental property values for these antibiotics are given in Table 3.
Table 4 comparison between linear and quadratic regression models against various physicochemical properties prediction based on the model \(M_1(G)\). Quadratic models generally outperform the linear one with a higher value of \(R^2\) and lower SE: properties like BP, ENP, and HAC provide the best fitting with the quadratic model with \(R^2 = 0.962\). The quadratic model for HAC is the most accurate in predicting the property with \((R = 0.980)\), \((F = 115.763)\). Both models perform badly for molar volume and MV since \(R^2 \le 0.7\). Overall, a quadratic model provides the best predictions for most properties in this dataset. The graphical prediction is shown in Fig. 5.
Table 5 compares the performance of linear and quadratic regression models for the prediction of physicochemical properties using \(M_2 (G)\). A quadratic model tends to make better predictions, higher \(R^2\) values and lower standard errors (SE). For properties like boiling point (BP), enthalpy of vaporization (ENP), and heavy atom count (HAC), quadratic models show excellent performance, with . The best prediction is observed for BP, with \(R^2 = 0.957\) and high statistical significance \(p<10^{-6}\). The flash point (FP) and molar refraction (MR) are also well described by quadratic models. Molar volume (MV) yields poorer correlations, with \(R^2< 0.7\) In general, both models normally fit below 0.7, while the quadratic models are the best for most of the properties. The graphical prediction is shown in Fig. 6.
Table 6 presents a comparison of linear versus quadratic regression models against each other for the prediction of physicochemical properties using H(G). The quadratic model gives better performance with considerably high values of \(R^2\) and lower values of standard error (SE) in the case of most of the properties. The boiling point and heavy atom count are best fitted by quadratic models with \(R^2 = 0.917\) and \(R^2 =0.972\), respectively, with high statistical significance \(p< 10^{-5}\). Quadratic equations also provide very good predictions for properties such as molar refraction and polarization. The opposite holds for the enthalpy of vaporization, ENP; both have fair correlations, \(R^2 =0.831\). Globally, quadratic models are the best predictors for the data. The graphical explanation of the comparison is shown in Fig. 7.
This Table 7 collects some statistical parameters and also the regression model in the different properties as functions of F(G). Each property is modeled by a linear and quadratic model. The best model between these two is the one that realizes higher \(R^2\) and the lowest p-values. For the most important properties, the quadratic model resulted in better fitted (higher R and \(R^2\)), indicating a nonlinear relation. Properties like BP, ENP, MR, POL, and Com give good \(( R^2 )\) values greater than 0.9 for a quadratic model, indicating that the predictive accuracy is fairly good. The P-values of such models are also very low, hence statistically significant. In the scatter plot of highlighted properties, the quadratic model comes out to be the best for many properties, something as HAC. The graphical explanation of the comparison is shown in Fig. 8.
The Table 8 below shows the statistical parameters and regression models for different properties as functions of SS(G), for which both linear and quadratic models have been fitted, considering that the best fit always correlates with higher \(R^2\) values with lowerp-values. Most of the properties have a better fit, as by \(R^2\) values closer to 1, with quadratic models like BP, FP, POL, and MW. On the other hand, for properties such as HIAC, ENP, and MR, the quadratic model is highly predictive with large p-values, indicating thereby how appropriate this approach is. In general, the quadratic models do better for this data set. The graphical comparison is shown in Fig. 9.
Table 9 above shows, for a number of properties, regression models as functions of the ABC index. In general, quadratic models are better fits, as measured by the higher p-value. The quadratic model for BP has an \(R^2=0.947\) with a \(p=1.68 \times 10^{-6}\), where as the quadratic model for ENP has an \(R^2=0.863\) with a \(p=1.82\times 10^{-5}\). The quadratic model systematically demonstrates an outstanding fit for HIAC with \(R^2=0.960\) and \(p=5.25\times 10^{-7}\); hence, it was the best model for this property. Most of the other properties, such as FP, POL, and MW, also presented high \(R^2\) values in their respective quadratic models, confirming their accuracy and suitability. The graphical comparison is shown in Fig. 10.
The Table 10 below depicts the quality of fit tests utilizing linear and quadratic regression models for several properties with RI(G). From the rather higher values, it is obvious that in most cases the quadratic model fits best. Most importantly, the best fit is manifested by HAC with \(R^2=0.948\), followed by BP with \(R^2=0.920\), and POL with \(R^2=0.868\). The results point to how well the quadratic model fits to catch nonlinear relationships and thus serves as the best predictive model across the data. The graphical comparison is shown in Fig. 11.
The Table 11 shows quadratic regression models fitting best for most properties of SS(G), as evidenced by high values of \(R^2\). The best fittings are for HAC, with \(R^2= 0.957\), MW, with \(R^2= 0.977\), and BP, with \(R^2= 0.984\). This suggests that the quadratic model does a better job of capturing nonlinear trends in the data than the linear model. The graphical comparison is shown in Fig. 12.
Table 12 presents the regression models together with their statistical parameters for different properties versus GA(G). Most of the properties were tested both for linear and quadratic models, and the Table 12 gives values of the equation, correlation coefficient R, determination coefficient \(R^2\), standard error \(S_E\), and p-values. Quadratic models generally exhibit better \(R^2\) values compared to linear models, hence a better fit of the model. For example, the quadratic model for HAC has a high value of \(R^2=0.978\), compared to \(R^2=0.897\) for the linear model. All the models have very low values of p, hence they are statistically significant. Notably, properties such as ENP and POL also exhibit excellent fits with \(R^2\) values surpassing 0.9. The graphical comparison is shown in Fig. 13.
Table 13 gives statistical parameters and regression models of different properties concerning relation HZ(G). Quadratic models generally provide higher values \(R^2\) than linear models, which express better fittings. For example, a quadratic model of the HAC property yields an \(R^2\) of 0.951 against that of the linear model, 0.917, indicating that the performance is better. The computed values in all the models were much less, indicating that they are statistically significant. In general, the quadratic models provide a much better explanation for the properties with much higher \(R^2\) values for most properties and smaller standard errors. In other words, the quadratic models can normally explain property HZ(G) more effectively. The graphical comparison is shown in Fig. 14.
Comparison of actual and prediction values of drugs properties
Table 14 compares the predicted boiling point, enthalpy of vaporization and Flash point from various models linear and quadratic of different types of drug molecules to real values. In each case, the quadratic model provided the closest yield on boiling point and enthalpy of vaporization to real values, especially for complex drug molecules . This means that the quadratic model handles the representation of drug molecular complexity better than the other models. The graphical comparison is shown in Fig. 15.
The Table 15 below shows the comparison of the predicted and experimental values of Molar refractivity, Polarizability and Molar weight of various drug molecules using different models: linear, and quadratic. The quadratic model generally shows closer alignment with the actual values of both properties, especially for the most complex molecules. This trend in general points toward the better performance of the quadratic model in correct property prediction, although some deviations remain, more precisely concerning molar refraction. The graphical comparison is shown in Fig. 16.
Table 16 shows that, overall, the quadratic model gives the closest predictions to real values in respect to polarization and molar weight for different drug molecules. While the model generally predicts polarization very well for most drugs considered, significant deviations become apparent for Abamectin, which documents the limitations of the model for some compounds. The linear and logarithmic models tend to underestimate polarization, especially if high values are considered. This is so true, especially for molar weight, since all models’ results keep close to the actual values, therefore indicating simpler predictive needs versus polarization. Lastly, according to both properties in this dataset, the quadratic model gives more accurate results. The graphical comparison is shown in Fig. 17.
Conclusion
Our study presents a systematic QSPR analysis of antibiotic drugs, utilizing degree-based topological indices to predict critical physicochemical properties. The findings highlight the effectiveness of reverse degree-based indices in capturing molecular structural attributes and their correlation with drug characteristics. The application of regression models, particularly quadratic models, demonstrates high predictive accuracy, indicating the potential for structural tailoring in drug development. Future research can extend this approach to a broader range of pharmaceuticals, incorporating machine learning techniques for enhanced predictive performance. These findings highlight the application of the use of QSPR models as a valuable drug development aid that enables scientists to design the required drug candidates at an early development phase. This computational approach not only saves the effort and the cost of laboratory screening but also provides a superior insight into the molecular factors affecting the behavior of the drug.
Future work
For applied drug discovery purposes, merging bioactivity information (QSAR) into QSPR descriptors has the potential for producing multifaceted predictive models enabling lead optimization early in drug development. Further, affiliating these topological descriptors with ADMET (absorption, distribution, metabolism, excretion, and toxicity) traits would lead to a comprehensive predictive system, enhancing new drug candidates’ pharmacokinetic profiling. Finally, building an automation pipeline or an analogous software package using this methodology would help medicinal chemists screen drug-like molecules virtually in high-throughput, speeding up drug development.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Qin, H. et al. On QSPR analysis of pulmonary cancer drugs using python-driven topological modeling. Sci. Rep. 15(1), 3965 (2025).
Kanabur, R. & Shigehalli, V. QSPR analysis of degree-based topological indices with physical properties of benzenoid hydrocarbons. Gen. Lett. Math. 2(3), 150–169 (2017).
Zaman, S., Yaqoob, H. S. A., Ullah, A. & Sheikh, M. QSPR analysis of some novel drugs used in blood cancer treatment via degree based topological indices and regression models. Polycycl. Aromat. Compd. 44(4), 2458–2474 (2024).
Das, P., Mondal, S., Some, B. & Pal, A. Extension of adjacency matrix in QSPR analysis. Chemom. Intell. Lab. Syst. 243(105024), 15–25 (2023).
Qin, H. et al. A python approach for prediction of physicochemical properties of anti-arrhythmia drugs using topological descriptors. Sci. Rep. 15(1), 1742 (2025).
Kumar, V. & Das, S. Comparative study of GQ and QG indices as potentially favorable molecular descriptors. Int. J. Quantum Chem. 124(3), e27334 (2024).
Mahboob, A., Rasheed, M. W., Dhiaa, A. M., Hanif, I. & Amin, L. On quantitative structure-property relationship (QSPR) analysis of physicochemical properties and anti-hepatitis prescription drugs using a linear regression model. Heliyon 10(4), 1–18 (2024).
Huang, R., Mahboob, A., Rasheed, M. W., Alam, S. M. & Siddiqui, M. K. On molecular modeling and QSPR analysis of lyme disease medicines via topological indices. Eur. Phys. J. Plus 138(3), 243–249 (2023).
Kumar, V. & Das, S. On structure sensitivity and chemical applicability of some novel degree-based topological indices. MATCH Commun. Math. Comput. Chem. 92(1), 165–203 (2024).
Zhou, H. et al. On QSPR analysis of molecular descriptor and thermodynamic features of narcotic drugs. Polycycl. Aromat. Compd. 44(5), 3079–3099 (2024).
Ghazwani, H., Koam, A. N., Nadeem, M. F. & Ahmad, A. Topological Insights into nanostar Dendrimers by computing the augmented Zagreb index. Comb. Chem. High Throughput Screen. 23, 14–24 (2024).
Alhulwah, K. H., Hussain, M., Almohanna, N. E., Hanif, M. F. & Siddiqui, M. K. On physical analysis of cadmium bismuth sulfide using quadratic regression approach. Chem. Pap. 79, 1–20 (2025).
Wei, J., Hanif, M. F., Mahmood, H., Siddiqui, M. K. & Hussain, M. QSPR analysis of diverse drugs using linear regression for predicting physical properties. Polycycl. Aromat. Compd. 44(7), 4850–4870 (2024).
Nadeem, M. F. et al. Topological aspects of metal-organic structure with the help of underlying networks. Arab. J. Chem. 14(6), 103–123 (2021).
Kumar, V. & Das, S. Comparative study between Nirmala and Sombor indices based on their applicability, degeneracy and smoothness. Eur. Phys. J. Plus 139(11), 1–22 (2024).
Delahi, K., Ahmad, A., Asim, M. A. & Hasni, R. Computation of topological indices of binary and ternary trees using algorithmic approach. Iran. J. Math. Chem. 15(2), 107–115 (2024).
Jacob, K. & Clement, J. Topological entropy characterization of zeolite EDI and its application in predicting molecular interactions. Eur. Phys. J. Plus 139(2), 161 (2024).
Gayathri, K. B., Santiago, R. & Govardhan, S. Hexagonal fractals: Topological indices, fractal dimensions, structure-property modeling and its applications. Curr. Org. Synth. 15(4), 112 (2025).
Peter, P., Clement, J., Arockiaraj, M. & Jacob, K. Predictive modeling of molecular interaction energies using topological and spectral entropies of zeolite AWW. Front. Chem. 13, 1543588 (2025).
Shenoy, B. G., Kumar, V., Acharya, D., HM, N. & Poojary, P. Statistical analysis of Revan topological indices for drug compounds used in treatment of tuberculosis. Phys. Scr. 100(3), 035203 (2025).
Jacob, K., Clement, J., Arockiaraj, M., Peter, P. & Balasubramanian, K. Network topology and entropy analysis of tetragonal farneseite zeolites. Sci. Rep. 15(1), 14896 (2025).
Wiener, H. Structural determination of paraffin boiling points. J. Am. Chem. Soc. 69(1), 17–20 (1947).
Gutman, I. & Trinajstic, N. Graph theory and molecular orbitals. Total electron energy of alternant hydrocarbons. Chem. Phys. Lett. 17(4), 535–538 (1972).
Kulli, V. R. Reverse Zagreb and reverse hyper-Zagreb indices and their polynomials of rhombus silicate networks. Ann. Pure Appl. Math. 16(1), 47–51 (2018).
Huang, L., Alhulwah, K. H., Hanif, M. F., Siddiqui, M. K. & Ikram, A. S. On QSPR analysis of glaucoma drugs using machine learning with XGBoost and regression models. Comput. Biol. Med. 187, 109731 (2025).
Mufti, Z. S., AlQadi, H., Tabraiz, A., Hanif, M. F. & Fiidow, M. A. Fuzzy and crisp computational analysis of certain graphs structures via machine learning techniques. Sci. Rep. 15(1), 5995 (2025).
Gao, M. et al. Removal of ciprofloxacin by PAA-PAM hydrogel: Adsorption performance and mechanism studies. J. Water Process Eng. 71, 107361 (2025).
Feng, X., Ma, Z., Yu, C. & Xin, R. MRNDR: Multihead attention-based recommendation network for drug repurposing. J. Chem. Inf. Model. 64(7), 2654–2669 (2024).
Li, X. et al. Screening for primary aldosteronism on and off interfering medications. Endocrine 83(1), 178–187 (2024).
Zhang, R. et al. MvMRL: A multi-view molecular representation learning method for molecular property prediction. Brief. Bioinform. 25(4), bbae298 (2024).
Huang, C., Liu, X. Wang, W. & Guo, Z. Exploring the mechanism of centipeda minima in treating nasopharyngeal carcinoma based on network pharmacology. Curr. Comput. Aided Drug Des. 48(1) (2024).
Hui, Z. et al. Mechanisms and therapeutic potential of chinonin in nervous system diseases. J. Asian Nat. Prod. Res. 26(12), 1405–1420 (2024).
Saleem, N. et al. Hematological changes in the blood of experimental male and female albino rats on exposure to pesticide, dimethoate. PLoS One 20(5), e0321848 (2025).
Lou, Y. et al. Effects of the CYP3A inhibitors, voriconazole, itraconazole, and fluconazole on the pharmacokinetics of osimertinib in rats. PeerJ 11, e15844 (2023).
Yang, J., Li, G. & Yang, Y. Identification of indigenous phorate-degrading pathways during natural attenuation of contaminated soils. Environ. Res. 278, 121754 (2025).
Li, W. et al. Puerarin-loaded PEG-PE micelles with enhanced anti-apoptotic effect and better pharmacokinetic profile. Drug Deliv. 25(1), 827–837 (2018).
Renu, V. V. & Vardhanan, Y. S. Transcriptomic insights into the effects of monocrotophos on Chrysomya megacephala: Forensic implications of altered pupation time. Mol. Biol. Rep. 52(1), 378 (2025).
Lin, S. et al. A single-dose, randomized, open-label, four-period, crossover equivalence trial comparing the clinical similarity of the proposed biosimilar rupatadine fumarate to reference Wystamm in healthy Chinese subjects. Front. Pharmacol. 15, 1328142 (2024).
Ge, Z. et al. Three-dimensional urchin-like K2Ti8O17/Ag NPs composite as a SERS substrate for detecting folic acid and thiram. Talanta 292, 127926 (2025).
Zhao, X. et al. Comparison of the efficacy and safety of low-dose antihypertensive combinations in patients with hypertension: Protocol for a systematic review and network meta-analysis. BMJ Open 14(10), e086323 (2024).
Wan, C. et al. Distribution and accumulation dynamics of fungicide azoxystrobin in the soil-plant system. Environ. Res. 274, 121287 (2025).
Feng, C. et al. Precisely tailoring molecular structure of doxorubicin prodrugs to enable stable nanoassembly, rapid activation, and potent antitumor effect. Pharmaceutics 16(12), 1582 (2024).
Ahmad, F., Tahir, S., Wali, A., Khan, M. I. & Shanableh, A. A review article on the photocatalytic degradation of atrazine by potential catalysts. Next Mater. 8, 100534 (2025).
Zhou, X. et al. Cascade reaction of isocyanides with carboxylic acid and CD3SSO3Na: Toward S-CD3 thiocarbamates. Org. Lett. 27(18), 4742–4746 (2025).
Rinkevich, F. D., Dodge, D. & Egnew, N. Minimal toxicological impact of chlorothalonil on adult honey bees (Apis mellifera L.). Pestic. Biochem. Physiol. 208, 106300 (2025).
Lassoued, A. et al. An experimental study to assess the ecotoxicity of warfarin and tinzaparin on meiobenthic amphipods: Original taxonomic data from Saudi Arabia and computational modeling. Toxics 13(4), 264 (2025).
Gutman, I. & Polansky, O. E. Mathematical concepts in organic chemistry (Springer, UK, 2012).
Martinez-Martinez, C. T., Mendez-Bermudez, J. A., Rodriguez, J. M. & Sigarreta, J. M. Computational and analytical studies of the harmonic index on Erdos–Renyi models. MATCH-Commun. Math. Comput. Chem. 85(2), 395 (2021).
Furtula, B. & Gutman, I. A forgotten topological index. J. Math. Chem. 53(4), 1184–1190 (2015).
Zhao, W., Shanmukha, M. C., Usha, A., Farahani, M. R. & Shilpa, K. C. Computing SS index of certain dendrimers. J. Math. 2021(1), 7483508 (2021).
Estrada, E., Torres, L., Rodriguez, L. & Gutman, I. An atom-bond connectivity index: Modelling the enthalpy of formation of alkanes (1998).
Randic, M. Characterization of molecular branching. J. Am. Chem. Soc. 97(23), 6609–6615 (1975).
Vujoevic, S., Popivoda, G., Vukicevic, K., Furtula, B. & Krekovski, R. Arithmetic geometric index and its relations with geometric arithmetic index. Appl. Math. Comput. 391, 125706 (2021).
Rajasekharaiah, G. V. & Murthy, U. P. Hyper-Zagreb indices of graphs and its applications. J. Algebra Comb. Discret. Struct. Appl. 8(1), 9–22 (2021).
Hosamani, S., Perigidad, D., Jamagoud, S., Maled, Y. & Gavade, S. QSPR analysis of certain degree based topological indices. J. Stat. Appl. Prob. 6(2), 361–371 (2017).
Bokhary, S. A. U. H., Adnan, Siddiqui, M. K. & Cancan, M. On topological indices and QSPR analysis of drugs used for the treatment of breast cancer. Polycycl. Aromat. Compd. 42(9), 6233–6253 (2022).
Havare, Q. C. Topological indices and QSPR modeling of some novel drugs used in the cancer treatment. Int. J. Quantum Chem. 121(24), e26813 (2021).
Acknowledgements
This work was supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University (IMSIU) (grant number IMSIU-DDRSP2502).
Author information
Authors and Affiliations
Contributions
Ibrahim Al-Dayel worked on carrying out investigations, analyzing the data, designing tests and developing them. Meraj Ali Khan deals with financing sources, computation, analysis of the data and verifying calculations. Muhammad Faisal Hanif acquired funding and performed the formal analysis of the study. Aside from project overseeing, Muhammad Kamran Siddiqui planned the approach, scheduled it, found the sources, and wrote the introduction of the paper. Saba Hanif contributed to the Matlab and Maple graph elevation of the project. Brima Gegbe handled the development of the software and validation of the study. The final report of the project is reviewed and approved by each of the authors.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Al-Dayel, I., Khan, M.A., Hanif, M.F. et al. A graph-based computational approach for modeling physicochemical properties in drug design. Sci Rep 15, 21170 (2025). https://doi.org/10.1038/s41598-025-06624-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-06624-3