Introduction

Statistical models have been extensively applied in industries such as survival, accounting, actuarial sciences, insurance, and engineering analysis to analyze data patterns. The success of statistical inference is strongly dependent on how well the chosen probability distribution matches the underlying data patterns. However, standard distributions frequently lack the versatility and accuracy required to effectively describe complex information. This constraint has culminated in the development of more adaptive and complex distributional models that better explain real-world data behavior.

Based on existing studies and findings, we can observe that engineering and environmental data sets most often possess a right-tailed distribution. For such types of data sets, probability distributions with heavy-tailed behavior are very competitive models. In this context, many authors have tried to find new statistical models, extending known ones, and better fitting various types of data sets. This flexibility is obtained by adding novel parameters or by some modifications to the classical distribution, increasing its ability to handle complicated scenarios. For more information, one may refer to some recently generated family of distributions, namely, Marshall and Olkin1, progressive type-II censoring Marshall-Olkin distributions2, \(\alpha\)-power transformation3, a new two-parameter mixture family of generalized distributions4, inverse Weibull-G model5, Kumaraswamy-G family6, A new modified cubic transmuted-G family of distributions7, a new power G-family of distributions8, novel odd Type-X family9, Tangent exponential-G family of distributions10, transformed-transformer class of distributions11, exponentiated generalized alpha power family12, Topp-Leone Weibull generated family13, Novel family of probability generating distributions14, mixture Models of the New Topp-Leone-G family15, generalized odd linear exponential family of distribution16, Lomax tangent generalized family of distributions17, new flexible power-X family18, novel compound two components of exponentiated class19, Power Lindley-G family20, sine \(\pi\)-power odd-G family of distributions21, new logarithmic tangent-U family22, new family of upper-truncated distributions23, transformation of the Chen distribution24, modified Weibull distribution25, modified tangent Weibull26, arc cosine-G family27, new logarithmic-G family28, transformed Weibull distribution29, the Fréchet–Power Rayleigh30, and new Topp Leone-G family31.

It is well documented that no particular probability distribution is available to handle every phenomenon, as real-world phenomena are complex, which is the main reason for developing more flexible probability distributions with greater distributional flexibility. Due to this,32 suggested a new methodology for creating a new class of distributions known as the Ramos-Louzada (RL) class of distributions. The derivation of the distributions can be described as follows:

$$\begin{aligned} M(x)= & \displaystyle \int _{0}^{-\log [1-K(x)]} w(t) \;dt \nonumber \\ \nonumber \\= & \left\{ W(t)\right\} _{0}^{-\log [1-K(x)]}. \end{aligned}$$
(1)

Here, K(x) and k(x) are respectively the cumulative distribution function (CDF) and the probability distribution function (PDF) of the standard distribution. Further, by taking respectively w(t) and W(t) as the PDF and CDF of the RL distribution with the forms

$$\begin{aligned} w(t)=\dfrac{(\beta ^2+t-2\beta ) }{\beta ^2(\beta -1)} \ e^{-t/\beta }, \; \; t>0, \beta \ge 2 \end{aligned}$$
(2)

and

$$\begin{aligned} W(t)=1-\dfrac{(\beta ^2+t-\beta ) }{\beta (\beta -1)} \ e^{-t/\beta }, \; \; t>0, \beta \ge 2. \end{aligned}$$
(3)

The CDF of the RL family of distributions is

$$\begin{aligned} M(x)=1-\left( 1-\dfrac{\log (1-K(x))}{\beta (\beta -1)}\right) \left[ 1-K(x)\right] ^{\tfrac{1}{\beta }} \; \; \; x\in \mathbb {R}, \; \; \beta \ge 2. \end{aligned}$$
(4)

Considering the computation of \(\frac{d}{dx} M\left( x\right)\), brings us to the mathematical form of the PDF, denoted as \(m\left( x\right)\), which is represented as follows:

$$\begin{aligned} m(x)=\dfrac{k(x) \ \left[ 1-K(x)\right] ^{\tfrac{1}{\beta }-1}}{\beta ^2(\beta -1)} \ \left[ \beta ^2-2\beta -\log (1-K(x)\right] . \end{aligned}$$
(5)

The RL procedure is a novel tool for generalizing a given model, which adds an additional parameter to a distribution. The proposed method has many applications that extend to fitting, especially engineering and environmental areas, and it has an ability to model different kinds of data sets. However, recent works considering the RL technique include33 who proposed the discrete RL distribution,34 defined the generalization of the RL model using the exponentiated method, and35 introduced the inverse power RL with medicine and geology applications.

To our knowledge, the exponential (Exp) distribution is a flexible one-parameter distribution. The Exp model can be utilized in various sectors, including survival, hydrology, insurance, and energy theory. Let Y follows the Exp distribution. The corresponding CDF and PDF are, respectively, given by

$$\begin{aligned} \Xi (y)&=1-e^{-\alpha y},~0<y,~\alpha >0 \end{aligned}$$
(6)

and

$$\begin{aligned} \xi (y)&=\alpha e^{-\alpha y}. \end{aligned}$$
(7)

In research, the Exp distribution has emerged as a well-known survival distribution. In engineering data modeling, this distribution has received a lot of attention, for example, see45,46,47. While there are several generators to attain adaptability, this can make estimation and interpretation difficult. To address this, we are motivated in this study to offer a new form of the Exp distribution based on the approach discussed in Eq. (1), which is intended to give increased flexibility by introducing an additional parameter. The resultant distribution is named the Ramos Louzada Exponential (RL-Exp) model. It is interesting because it offers a broad range of density shapes, including symmetric, skewed, and heavy-tailed forms while also allowing for a variety of failure rate behaviors, and it blends simplicity in definitions, enabling accurate mathematical understanding and effective data modeling. Such versatility renders it appropriate for a variety of statistical and reliability applications. Further, the n parameter offers increased versatility in analyzing the tail behavior of the specified density function. The following objectives of this study can be described as follows:

  • Defined a novel two-parameter generalization of the Exp model by applying the RL tool, which can be enhanced in flexibility, robustness, and applicability to different data scenarios, and established its mathematical properties.

  • Two estimation methods, namely, the maximum likelihood estimator (MLE) and Bayesian estimators based on:

    1. 1.

      Square error (SE) loss function proposed by38, and it measures the discrepancy between predicted and true values by taking the square of their difference.

    2. 2.

      Linex (LI) loss function presented by39, and it is an asymmetric function.

    3. 3.

      General entropy (GE) defined by40, and is also an asymmetric loss function

    These loss functions can be apply to compute the cost of errors in parameter estimates.

  • Three real data sets are applied from engineering and environmental fields to demonstrate the utility of the proposed RL-Exp model in application. For more reading see36,37,48

The further study is organized as follows: The recommended RL-Exp is defined in “Ramos Louzada-exponential distribution” with some distributional properties. In “Characterization of the RL-Exp model”, various mathematical properties of the new model are established. Section “Statistical inference” demonstrates the estimation of the model parameters based on two different proposed methods. Section “Simulation study and algorithms” delineates the efficacy of the estimations using a Monte Carlo simulation, and three distinct real data sets taken from the engineering and environmental fields are applied to show the results of the application of the RL-Exp model. Section 7 offers final observations.

Ramos Louzada-exponential distribution

A random variable Z is said to follow the RL-Exp model with two parameters \(\alpha\) and \(\beta\) if its CDF and PDF are of the form

$$\begin{aligned} \Psi (z)&=1-\left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha }{\beta }z}, \ \ 0<z, \ \ \ \beta \ge 2, \alpha >0 \end{aligned}$$
(8)

and

$$\begin{aligned} \psi (z)&=\dfrac{\alpha \ e^{-\tfrac{\alpha }{\beta }z} \ (\beta ^2-2\beta +\alpha z)}{\beta ^2(\beta -1)}, \ { 0<z, \ \ \ \beta \ge 2, \alpha >0}. \end{aligned}$$
(9)

The plots of the CDF and PDF functions under some selected parameter values are shown in Fig. 1, which might have a declining trend, unimodal, and right-skewed PDF shapes. Further, when \(\alpha =1\), we obtained the Ramos Lozuada (RL) distribution.

Figure 1
Figure 1
Full size image

The CDF and PDF curves of the RL-Exp model are based on some choices of parameters.

The survival function (SF) and hazard rate function (HRF) are formulated, respectively, as:

$$\begin{aligned} S(z)=\left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha }{\beta }z} \end{aligned}$$

and

$$\begin{aligned} h(z)= \dfrac{\alpha \beta (\beta -1)(\beta ^2-2\beta +\alpha z)}{\beta (\beta -1)+\alpha z}. \end{aligned}$$

The graphical behavior of the SF and HRF for several selected parameters is sketched in Fig. 2 of the RL-Exp model. For the HRF, it has decreasing and unimodal shapes according to the parameter values.

Figure 2
Figure 2
Full size image

The SF and HRF curves of RL-Exp model are based on some choices of parameters.

Characterization of the RL-Exp model

Quantile function and and random number generation

The quantile function serves a significant purpose in producing random numbers from a probability model, which are essential for executing a simulation study. The quantile function \(z_p\) associated with the RL-EXP MODEL is written in the following manner:

$$\begin{aligned} z_u=-\dfrac{1}{\alpha }\log \left\{ \dfrac{W(-(\beta -1)(1-u) \ e^{-(\beta -1)})}{-(\beta -1)(1-u)}\right\} ^{-\beta }, \; \; 0<u< 1, \end{aligned}$$
(10)

where W(.) represents the Lambert function with the positive branch.

Proof:  For \(0<u<1\), we can define the quantile function of Z as follows:

$$\begin{aligned} 1-\left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha }{\beta }z}= & u\\ \left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha }{\beta }z}= & 1-u\\ \left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha z}{\beta (\beta -1)}}= & (1-u)e^{-(\beta -1)}\\ -(\beta -1)(1-u)e^{-\tfrac{\alpha z}{\beta }}= & W(-(\beta -1)(1-u)e^{-(\beta -1)})\\ z= & -\dfrac{1}{\alpha }\log \left\{ \dfrac{W(-(\beta -1)(1-u) \ e^{-(\beta -1)})}{-(\beta -1)(1-u)}\right\} ^{-\beta } \end{aligned}$$

From Eq. (10), we can be obtain the median value of the proposed RL-Exp model by setting \(u=0.5\), and it is given as

$$\begin{aligned} z_{0.5}=-\dfrac{1}{\alpha }\log \left\{ \dfrac{W(-0.5(\beta -1) \ e^{-(\beta -1)})}{-0.5(\beta -1)}\right\} ^{-\beta }. \end{aligned}$$

Concerning the RL-Exp distribution, the skewness (\(\mathcal {SKW}\)) and kurtosis (\(\mathcal {KUR}\)) metrics are, receptively, given in the following way:

$$\begin{aligned} \mathcal {SKW}=\dfrac{z_{0.25}+z_{0.75}-2z_{0.5}}{z_{0.75}-z_{0.25}} \end{aligned}$$

and

$$\begin{aligned} \mathcal {KUR}=\dfrac{z_{0.875}-z_{0.625}+z_{0.375}-z_{0.125}}{z_{0.75}-z_{0.25}}. \end{aligned}$$

Ordinary moments

Assume that Z is a random variable following the RL-Exp distribution. The \(k^{th}\) moment of Z can be obtained as

$$\begin{aligned} u_{k}'=\dfrac{\beta ^k \ \Gamma (k+1)}{\alpha ^k(\beta -1)} \ \left( k+\beta -1\right) , \end{aligned}$$
(11)

where \(\Gamma (n+1)=n!\), \(n=0,1,2,\ldots\).

The mean and variance of Z can be obtained in special case from Eq. (11), and they are in the following form:

$$\mu '_{1}= \dfrac{\beta ^2}{\alpha (\beta -1)}$$

and

$$V_{Z}=\dfrac{\beta ^2(\beta ^2-4\beta +2)}{\alpha ^2(\beta -1)^2}.$$

The index of dispersion of Z is derived as bellow

$$I_Z=\dfrac{(\beta ^2-4\beta +2)^{\tfrac{1}{2}}}{\beta }.$$

Moment generating function

Let the random variable Z has a RL-Exp model. Hence, the associated moment generating function (MGF) can be provided in the following form:

$$\begin{aligned} M_Z(t)= \displaystyle \sum _{k=0}^{\infty }\dfrac{t^k\beta ^k \ \Gamma (k+1)}{\alpha ^k(\beta -1)k!} \ \left( k+\beta -1\right) . \end{aligned}$$
(12)

Table 1 demonstrates different mathematical properties of the recommended RL-Exp model using various selected parameter values, and Fig. 3 is the 3-dimensional plots of the proposed statistical properties of the new distribution. It can be seen that as \(\beta\) increase and for fixed values of \(\alpha\), the values of \(\mu '_{1}\), \(V_{Z}\), \(I_Z\), \(\mathcal {SKW}\), and \(\mathcal {KUR}\) increase. Further, for fixed values of \(\beta\) and as \(\alpha\) increase the values of \(\mu '_{1}\) and \(V_{Z}\) decrease, where the values of \(I_Z\), \(\mathcal {SKW}\) and \(\mathcal {KUR}\) are constant. In addition, multiple non-monotonic forms are seen, which confirms that the explored RL-Exp model is an adaptable model that accommodates larger data sets.

Table 1 Various computational metrics of the RL-exp model.
Figure 3
Figure 3
Full size image

The plots of mean, variance, skewness and kurtosis for the RL-Exp.

Order statistics

Let a random sample \(z_1,z_2\ldots z_m\) represent the RL-Exp model. The PDF of the \(j^{th}\) order statistic can be expressed as follows

$$\begin{aligned} f_{(j:m)}(z)= & \dfrac{m!}{(j-1)!(m-j)!} \ \psi (z) \ [\Psi (z)]^{j-1} \ [1-\Psi (z)]^{m-j}\nonumber \\ \nonumber \\= & \dfrac{\alpha \ m! \ e^{-\tfrac{\alpha }{\beta }z} \ (\beta ^2-2\beta +\alpha z)}{(j-1)!\beta ^2(\beta -1)(m-j)!} \ \left[ 1-\left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha }{\beta }z}\right] ^{j-1}\nonumber \\ \nonumber \\\times & \left[ \left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha }{\beta }z}\right] ^{m-j}. \end{aligned}$$

Specially, the PDF of the highest order statistics \(f_{(m:m)}(z)\) is obtained as setting \(j=m\) in Eq. () to get:

$$\begin{aligned} f_{(m:m)}(z)= \dfrac{\alpha \ m \ e^{-\tfrac{\alpha }{\beta }z} \ (\beta ^2-2\beta +\alpha z)}{\beta ^2(\beta -1)} \ \left[ 1-\left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha }{\beta }z}\right] ^{m-1}. \end{aligned}$$

Similarly, the PDF of the lowest order statistics \(f_{(1:m)}(z)\) is obtained as setting \(j=1\) in Eq. ()

$$\begin{aligned} f_{(1:m)}(z)= & \dfrac{\alpha \ m \ e^{-\tfrac{\alpha }{\beta }z} \ (\beta ^2-2\beta +\alpha z)}{\beta ^2(\beta -1)} \ \left[ \left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha }{\beta }z}\right] ^{m-1}. \end{aligned}$$

Furthermore, the \(j^{th}\) CDF of the order statistic of the RL-Exp model can be given by

$$\begin{aligned} F_{(j:m)}(z)= & \displaystyle \sum _{i=j}^{m}\Psi ^i(z) \ [1-\Psi (z)]^{m-i}\\= & \displaystyle \sum _{i=j}^{m} \left[ 1-\left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha }{\beta }z}\right] ^{i} \ \left[ \left( 1+\dfrac{\alpha z}{\beta (\beta -1)}\right) \ e^{-\tfrac{\alpha }{\beta }z}\right] ^{m-i}. \end{aligned}$$

Statistical inference

Maximum likelihood estimation

In this section, we estimate model parameters \(\rho =(\alpha , \beta )\) using MLE technique based on complete data. Assume that \(\{z_1, z_2, \ldots , z_m\}\) is a random sample follows the equation given in Eq. (9). The likelihood function can be represented as

$$\begin{aligned} L(\rho )= \prod _{i=1}^{m}\psi (z_i). \end{aligned}$$
(13)

Using Eqs. (9) in (13), we get that:

$$\begin{aligned} L(\rho )&= \dfrac{1}{\beta ^{2m}(\beta -1)^m}\prod _{i=1}^{m}\alpha \ e^{-\tfrac{\alpha }{\beta }z_i} \ (\beta ^2-2\beta +\alpha z_i). \end{aligned}$$
(14)

The corresponding log-likelihood function is given by

$$\begin{aligned} \log L(\rho )&= -2m\log (\beta )-m\log (\beta -1)+m\log \alpha -\dfrac{\alpha }{\beta }\displaystyle \sum _{i=1}^{m}z_i+\displaystyle \sum _{i=1}^{m}\log (\beta ^2-2\beta +\alpha z_i). \end{aligned}$$
(15)

The first partial derivatives of log-likelihood function in Eq. (15) with respect to \(\alpha\) and \(\beta\) provide likelihood equations given as

$$\begin{aligned} \frac{\partial \log L}{\partial \alpha }&=\dfrac{m}{\alpha }-\dfrac{1}{\beta }\displaystyle \sum _{i=1}^{m}z_i+\displaystyle \sum _{i=1}^{m}\dfrac{z_i}{\beta ^2-2\beta +\alpha z_i}=0, \end{aligned}$$
(16)
$$\begin{aligned} \frac{\partial \log L}{\partial \beta }&= -\dfrac{2m}{\beta }-\dfrac{m}{\beta -1}+\dfrac{\alpha }{\beta ^2}\displaystyle \sum _{i=1}^{m}z_i+(2\beta -2)\displaystyle \sum _{i=1}^{m}\dfrac{1}{\beta ^2-2\beta +\alpha z_i}=0. \end{aligned}$$
(17)

The maximum likelihood estimates of unknown parameters \(\rho =(\alpha , \beta )\) can now be obtained by solving the above two equations. We adopt a numerical method like the Newton-Raphson or secant methods to evaluate these estimates, as the above equations are nonlinear in nature. The well-known statistical programs for maximizing the log-likelihood function are the R package (Adequacy- Model), the Ox program (subroutine-Max-BFGS), and SAS (PROC-NLMIXED). In this study, we applied the R script with the function (optim), employing the method = L-BFGS-B.

Bayesian estimation

In contrast to the maximum likelihood estimation approach, Bayesian estimation is a more efficient approximation. Taking into consideration past data and samples, it calculates the interest parameters that have not been stated. From a Bayesian standpoint, there is no hard and fast method for choosing priors because one cannot clearly assert that one prior is superior to another. When there is insufficient interpretive information about the unknown parameter, a non-informative prior is frequently adopted. In contrast, if considerable information about the parameter(s) is available, an informative prior is generally more effective. Let the parameters \(\alpha\) and \(\beta\) have, respectively, the prior gamma distributions (Gamma\((a_1, b_1)\) and Gamma\((a_2, b_2)\). The main motivation for the Gamma prior is usually to constrain the random variables to positive values. This is especially useful when dealing with random variables that represent the variance of another distribution. Next, let define the joint priors for the parameters \(\rho =(\alpha , \beta )\) as follows:

$$\begin{aligned} \Pi (\rho ) \propto \alpha ^{a_1-1} \ \beta ^{a_2-1} e^{-\frac{\alpha }{b_1}-\frac{\beta }{b_2}}, \quad \alpha , ~\beta >0. \end{aligned}$$
(18)

The posterior density of \(\rho\) has the following form

$$\begin{aligned} \Pi ^{*}(\alpha , \beta )&= L(\alpha , \beta ) \propto \Pi (\alpha , \beta ) = \dfrac{\alpha ^{m+a_1-1} \ \beta ^{a_2-1-2m}}{(\beta -1)^m} \ e^{-\frac{\alpha }{b_1}-\frac{\beta }{b_2}} \ e^{-\tfrac{\alpha }{\beta }\sum _{i=1}^{m}z_i} \prod _{i=1}^{m} (\beta ^2-2\beta +\alpha z_i). \end{aligned}$$
(19)

The joint posterior distribution of \(\rho\) is given by

$$\begin{aligned} \Pi (\alpha , \beta \mid z)= & \dfrac{L(\alpha , \beta ) \propto \Pi (\alpha , \beta )}{\displaystyle \int _{0}^{\infty }\displaystyle \int _{0}^{\infty }L(\alpha , \beta ) \propto \Pi (\alpha , \beta ) \; d\alpha \ d\beta }\nonumber \\= & \dfrac{\dfrac{\alpha ^{m+a_1-1} \ \beta ^{a_2-1-2m}}{(\beta -1)^m} \ e^{-\frac{\alpha }{b_1}-\frac{\beta }{b_2}} \ \ e^{-\tfrac{\alpha }{\beta }\sum _{i=1}^{m}z_i} \displaystyle \prod _{i=1}^{m} (\beta ^2-2\beta +\alpha z_i)}{\displaystyle \int _{0}^{\infty }\displaystyle \int _{0}^{\infty }\dfrac{\alpha ^{m+a_1-1} \ \beta ^{a_2-1-2m}}{(\beta -1)^m} \ e^{-\frac{\alpha }{b_1}-\frac{\beta }{b_2}} \ \ e^{-\tfrac{\alpha }{\beta }\sum _{i=1}^{m}z_i}\prod _{i=1}^{m} (\beta ^2-2\beta +\alpha z_i)\; d\alpha \ d\beta }. \end{aligned}$$
(20)

The marginal posterior of the \(\alpha\) and \(\beta\) are obtained to be

$$\begin{aligned} \Pi (\alpha \mid \beta , z)=\alpha ^{m+a_1-1} \ e^{-\alpha \left( \tfrac{1}{b_1}+\tfrac{1}{\beta }\right) \sum _{i=1}^{m}z_i} \ \prod _{i=1}^{m}(\beta ^2-2\beta +\alpha z_i), \end{aligned}$$
(21)

and

$$\begin{aligned} \Pi (\beta \mid \alpha , z)=\beta ^{a_2-2m-1} \ (\beta -1)^{-m}\ e^{-\beta \left( \tfrac{1}{b_2}+\tfrac{\alpha }{\beta ^2}\right) \sum _{i=1}^{m}z_i} \ \prod _{i=1}^{m}(\beta ^2-2\beta +\alpha z_i) \end{aligned}$$
(22)

Hence, the Bayes \(\phi (\rho )\) based on SE loss function is

$$\begin{aligned} \tilde{\phi }_{SE}(\rho )=\displaystyle \int _{\rho } (\rho -\hat{\rho })^2 \ \Pi (\rho \mid z) \; d\rho . \end{aligned}$$
(23)

Next, we provide other loss functions, including LI and GE functions. The Bayes estimators based on these loss functions can be formulated, respectively, as \(\tilde{\phi }_{LI}(\rho )=-\frac{1}{q}\text {log}\left( E_{\rho }\left( e^{-q\Pi (\rho )} \mid z\right) \right)\) and \(\tilde{\phi }_{GE}(\rho )=\left( E_{\rho }\left( (\Pi (\rho ))^{-q} \mid z\right) \right) ^{-\frac{1}{q}}\).

By obtaining the joint prior, the posterior function, which is not from a well-known distribution may be determined. Due to this, we are unable to create samples from it; hence, we will employ the Metropolis-Hastening method.

Metropolis hasting method

  1. 1.

    Start with initial values \(\alpha ^{(0)}=\hat{\alpha },\beta ^{(0)}=\hat{\beta }\).

  2. 2.

    Let \(m=1\) define the first iteration.

  3. 3.

    obtain \(\alpha ^*,\beta ^*\), respectively, from a proposal distribution.

  4. 4.

    Compute \(H_\alpha =\min \left[ 1, \dfrac{\pi ^{*}(\alpha ^*\mid \beta ^{m-1})}{\pi ^{*}(\alpha ^{m-1}\mid \beta ^{m-1})}\right]\), \(H_\beta =\min \left[ 1, \dfrac{\pi ^{*}(\beta ^*\mid \alpha ^{m-1})}{\pi ^{*}(\beta ^{m-1}\mid \alpha ^{m-1})}\right]\).

  5. 5.

    For \(1\ge j \ge 2\), generate random samples \(Q_j\), where \(Q_j\sim U(0, 1)\).

  6. 6.

    If \(Q_1\le H_\alpha , \ Q_2\le H_\beta\), then \(\alpha ^{(m)}=\alpha ^{*}, \ \beta ^{(m)}=\beta ^{*}\), else \(\alpha ^{(m)}=\alpha ^{m-1}, \ \beta ^{(m)}=\beta ^{m-1}\).

  7. 7.

    \(m=m+1\).

  8. 8.

    For M repeat of process, the steps (3) to (7) are repeated, and finally, we obtain \(\alpha ^{(m)}\), \(\beta ^{(m)}\) for \(m=1,\ldots , B\).

At the end, the Bayes estimators of \(\alpha\) and \(\beta\) using the Metropolis-Hastening algorithm under the proposed loss error functions can be given as

$$\begin{aligned} \hat{\rho }_{SE}=\dfrac{1}{D-B}\displaystyle \sum _{l=B+1}^{D}\rho (\alpha ^{(l)}, \beta ^{(l)}), \end{aligned}$$
(24)
$$\begin{aligned} \hat{\rho }_{LI}=-\frac{1}{q}\log \left( \dfrac{1}{D-B}\displaystyle \sum _{l=B+1}^{D} \ e^{-q\rho (\alpha ^{(l)}, \beta ^{(l)})}\right) \end{aligned}$$
(25)

and

$$\begin{aligned} \hat{\rho }_{GE}=\left( \dfrac{1}{D-B}\displaystyle \sum _{l=B+1}^{D}(\rho (\alpha ^{(l)}, \beta ^{(l)}))^{-q}\right) ^{-1/q}, \end{aligned}$$
(26)

where B denotes the burn-in period.

Interval estimation

Credible interval

In this subsection, we construct the credible confidence intervals of parameters of \(\rho = (\alpha , \beta )\) using the Metropolis-Hastening technique. To cancel the issue of the initial guess, we delete B initial samples (burn-in period). Furthermore, applying the idea of Chen and Shao41, and generating posterior samples, we can compute the credible intervals. Consequently, the \(100(1-p)\%\) Cred Int of \(\Xi =(\delta , \eta )\) is defined by

$$\begin{aligned} \frac{1}{M^*}\sum _{j=i_1^*}^{M^*}\rho _{(j)}\le 1-p \le \frac{1}{M^*}\sum _{j=i_1^*}^{M^*}\rho _{(j)}, \end{aligned}$$
(27)

where, \(i_1^*<i_2^*,\ i_1^*,i_2^*\in \{1,2,\dots ,D\}\) and \(M^*=D-B\).

Bootstrap confidence intervals

In this section, we discuss the construction of bootstrap-percentile (boot-p) and bootstrap-t (boot-t ) confidence intervals. Step-by-step algorithms are provided below for the two methods:

(A) Boot-p

  1. 1.

    Obtain the estimate \(\tilde{\rho }\) based on complete data generated by considering the true parameter as \(\rho\).

  2. 2.

    Obtain the updated estimate \(\tilde{\rho }^{*}\) using \(\tilde{\rho }\) as the true parameter.

  3. 3.

    Repeat step 2, desired \(N_{B}\) number of times.

  4. 4.

    Let \(\tilde{F}_{1}(z) = P(\tilde{\rho }^{*}\le z)\) is the CDF of \(\tilde{\rho }^{*}\), then obtain \(\tilde{\rho }_{B_{p}}(z) = \tilde{F}_{1}^{-1}(z)\). An approximate \(100(1-p)\%\) confidence interval for \(\eta\) is now given by

    $$\begin{aligned} \left( \tilde{\rho }^{*}_{B_{p}}\left( \frac{p}{2}\right) ,\tilde{\rho }^{*}_{B_{p}}\left( 1-\frac{p}{2}\right) \right) . \end{aligned}$$

(B) Boot-t

  1. 1.

    Obtain \(\tilde{\rho }^{*}\) using the boot-p method as discussed in the above algorithm.

  2. 2.

    Compute the statistic \(T = \frac{\tilde{\rho }^{*}-\tilde{\rho }}{\sqrt{\tilde{V}(\tilde{\rho }^{*})}}.\)

  3. 3.

    Repeat step 2, \(N_{B}\) number of times.

  4. 4.

    Let \(\tilde{F}_{2}(z) = P(\tilde{T}\le z)\) is the CDF of \(\tilde{T}\), then obtain \(\tilde{\rho }_{B_{t}}(z) = \tilde{\rho } + \sqrt{\tilde{V}(\tilde{\rho })} \tilde{F}_{2}^{-1}(z)\). Further, the approximate \(100(1-p)\%\) confidence interval for \(\eta\) is evaluated as

    $$\begin{aligned} \left( \tilde{\rho }^{*}_{B_{p}}\left( \frac{p}{2}\right) ,\tilde{\rho }^{*}_{B_{p}}\left( 1-\frac{p}{2}\right) \right) . \end{aligned}$$

Simulation study and algorithms

In this subsection, we examine the performance of MLE and Bayes under the three loss functions estimators considering a finite number of samples. In the context of the RL-Exp distribution, sampling is performed by selecting random samples such as \(n = 20, 40, 70, 100\). The samples are obtained through the application of the quantile function, given by

$$\begin{aligned} z_u=-\dfrac{1}{\alpha }\log \left\{ \dfrac{W(-(\beta -1)(1-u) \ e^{-(\beta -1)})}{-(\beta -1)(1-u)}\right\} ^{-\beta }, \; \; 0<u< 1. \end{aligned}$$
(28)

The simulation study is conducted for two different sets of \(\rho =(\alpha ,\beta )\). The values pertaining to these combinations are indicated as follows: \(\rho =(0.5,2)\), and Set2: \(\rho =(1.75, 2.3).\) multiple repetitions were performed and yielded consistent results. \(N=1000\) random samples from the RL-Exp distribution are generated and different criteria including average estimates (Mean) and mean square error (MSE) are applied to asses the simulation results. Multiple repetitions were performedsuch as (\(\rho =(2.5, 3.75)\) and \(\rho =(4, 4.5)\)) and yielded consistent results. The proposed criteria can be defined as below

$$\begin{aligned} \hbox {Mean}=\frac{1}{N}\displaystyle \sum _{i=1}^{N}\hat{\rho }_i, \end{aligned}$$
$$\begin{aligned} \hbox {MSE}=\frac{1}{N}\displaystyle \sum _{i=1}^{N}(\hat{\rho }_i-\rho )^2. \end{aligned}$$

The simulation process is conducted by using the R language software with the optim function. It is well noted that for the Bayes estimator using the MCMC procedure, the obtained estimators are computed with conditions \(q=0.5\), \(D=10000\), the burn-in period \(B=2000\), and the values of prior distribution \(a_1=b_1=a_2=b_2=1\). Figs. 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 and 19 plotted the histogram, trace, the auto-correlation function (ACF), cumuplots function, and Gelman-Rubin diagnostic curves of the unknown parameters \(\alpha\) and \(\beta\) to assess MCMC convergence.

On the other hand, the CIs are constructed using the three proposed techniques with 95% confidence level. We computed the average interval length (ALs) and coverage probabilities (CPs) for each method. The results are presented in numerical form in Tables 4 and 5.

After that, to compare the performance of the two estimation approaches, we have assigned the values of the number of iterations (NI) and MSEs on the basis of their value in decreasing order for the two estimation methods corresponding to the same sample size. From these results obtained in Tables 2 and 3, we can observe that for all recommended estimators, the average parameter estimations approach the actual parameter values. Further, the MSEs decrease in size as m increases, which ensures that the obtained estimated parameter values are consistent and asymptotically unbiased. Moreover, we observed that Bayes estimation under the SE loss function appears as the best estimation method for the RL-Exp model with minimum MSE as compared to other methods mentioned. Figures 4, 5, 6 and 7 confirm this result. In addition, as the value of m continues to rise, the ALs of \(\rho\) ultimately decrease for all suggested techniques. Also, the Boot-t method has greater CPs among credible and Boot-p techniques.

Table 2 Numerical values of the RL-Exp model simulation for Set 1.
Table 3 Numerical values of the RL-Exp model simulation for Set 2.
Table 4 The ALs and the CPs for the RL-Exp model for Set 1.
Table 5 The ALs and the CPs for the RL-Exp model for Set 2.
Figure 4
Figure 4
Full size image

Box plots for the MSEs based on proposed estimators using Set 1.

Figure 5
Figure 5
Full size image

Box plots for the MSEs based on proposed estimators using Set 2.

Figure 6
Figure 6
Full size image

Box plots for the MSEs based on proposed estimators at \(\rho =(2.5, 3.75)\).

Figure 7
Figure 7
Full size image

Box plots for the MSEs based on proposed estimators at \(\rho =(4, 4.5)\).

Figure 8
Figure 8
Full size image

MCMC convergence plots of RL-Exp model for Table 2 at \(m=100\).

Figure 9
Figure 9
Full size image

MCMC convergence plots of RL-Exp model for Table 3 at \(m=100\).

Figure 10
Figure 10
Full size image

MCMC convergence plots of RL-Exp model for \(\rho =(2.5, 3.75)\).

Figure 11
Figure 11
Full size image

MCMC convergence plots of RL-Exp model for \(\rho =(4, 4.5)\).

Figure 12
Figure 12
Full size image

Gelman-Rubin diagnostic plots of \(\alpha\) and \(\beta\) for Table 2 at \(m=100\).

Figure 13
Figure 13
Full size image

Cumuplot function of \(\alpha\) and \(\beta\) for Table 2 at \(m=100\).

Figure 14
Figure 14
Full size image

Gelman-Rubin diagnostic plots of \(\alpha\) and \(\beta\) for Table 3 at \(m=100\).

Figure 15
Figure 15
Full size image

Cumuplot function of \(\alpha\) and \(\beta\) for Table 3 at \(m=100\).

Figure 16
Figure 16
Full size image

Gelman-Rubin diagnostic plots of \(\alpha\) and \(\beta\) for \(\rho =(2.5, 3.75)\).

Figure 17
Figure 17
Full size image

Cumuplot function of \(\alpha\) and \(\beta\) for \(\rho =(2.5, 3.75)\).

Figure 18
Figure 18
Full size image

Gelman-Rubin diagnostic plots of \(\alpha\) and \(\beta\) for \(\rho =(4, 4.5)\).

Figure 19
Figure 19
Full size image

Cumuplot function of \(\alpha\) and \(\beta\) for \(\rho =(4, 4.5)\).

Application of the RL-Exp model to life events

This section demonstrates the RL-Exp distribution as a practical tool, demonstrating its efficiency through the application of real data sets, with a specific focus on case studies in engineering and environmental fields.

First data

The first data set defined the failure times of product reliability during the design phase for 50 elements. It is studied by42 and the values are presented in Table 6.

Table 6 The values of data set I.

Second data

The second data set provided 56 yearly records of the highest amounts of rainfall in Karachi, Pakistan, from 1953 to 2009, which offers a comprehensive view of the city’s precipitation patterns. This data set was previously considered43. The values of the data set are summarized in Table 7.

Table 7 The values of data set II.

Third data

The third data consisted of the length of the intervals between the periods in which cars pass a spot on a road. The proposed data was sourced from Jorgensen44, and its values were reported in Table 8.

Table 8 Values of the data set III.

Table 9 summarizes the statistical description of all the proposed data sets, and visual comparisons, including scaled total time on the test (TTT), probability-probability (P-P), and box plots are sketched in Fig. 20 drawn from the three recommended data sets. These representations help in studying the historical performance of the engineering and environmental sectors.

Table 9 Descriptive statistics for the recommended data sets.
Figure 20
Figure 20
Full size image

Several descriptive plots of the proposed data set.

Note that the RL-Exp model is applied for the three data sets, with the following initial parameters:

  1. 1.

    Data I: \(\alpha =0.2, \ \ \beta =3.8\).

  2. 2.

    Data II: \(\alpha =0.05, \ \ \beta =2.1\).

  3. 3.

    Data III: \(\alpha =0.95, \ \ \beta =18\).

Now, the proposed model is compared with some renowned distributions, generalized Rayleigh (GR), zero-truncated Poisson exponential ( ZTP-Exp), alpha power transformed exponential (APT-Exp), Weibull (Wei), extended exponential (Ex-Exp), moment exponential (M-Exp), Gamma, XLindley, and two parameters Mira (TPM) distributions for fitting the three data sets.

Hence, several goodness-of-fit metrics like log likelihood function (\(\mathcal {LLF}\)), Akaike Information Criterion (\(\mathcal {AIC}\)), Correction Akaike Information Criterion (\(\mathcal {CAIC}\)), Hannan-Quin Information Criterion(\(\mathcal {HQIC}\)), Bayesian Information Criterion (\(\mathcal {BIC}\)), Kolmogorov-Smirnov (\(\mathcal{K}\mathcal{S}\)) statistics with its associated P-values, Anderson Darling (\(\mathcal {A}\)), and Cramer-Von Mise (\(\mathcal {W}\)) to select the best appropriate model for the two data sets. For a particular probability model, the distribution with the least value of these measures and a greater P-value offers the best fit for modeling a given real-life data set. The parameter estimates of all distributions with goodness-of-fit metrics for the proposed data are given in Tables 10 and 11. To verify whether the obtained estimate actually converges to the MLEs, we have tried the grid search algorithm with some other initial estimates also, and it converges to the same set of estimates. Moreover, we have conducted an extensive grid search method with a grid size of 0.001, and it is noticed that they coincide, which guarantees that the final estimates converge to the MLEs. The statistical software utilizes the R package (AdquacyModel) and the “BFGS” algorithm to perform all of these calculations. Obviously, the numerical values of Tables 10 and 11 demonstrate that the RL-Exp model fits the three data sets better than other fitted distributions, as indicated by the p value. Figures 21, 22, 23, 24, 25 and 26 present the fitted functions for the RL-Exp with the fitting models, including the PDF, CDF, SF, and P-P plots for the applied data sets. These plots further support the findings from Tables 10 and 11, offering a very close fit to each data set as compared to the other contender distributions.

Table 10 Estimation and goodness-of-fit comparison measures for the fitting models by applying the three recommended data sets.
Table 11 Various statistical metrics for the fitting using the suggested data sets.
Figure 21
Figure 21
Full size image

Fitted PDF, CDF, and SF plots of RL-Exp model for the first data set.

Figure 22
Figure 22
Full size image

Fitted PDF, CDF, and SF plots of RL-Exp model for the second data set.

Figure 23
Figure 23
Full size image

Fitted PDF, CDF, and SF plots of RL-Exp model for the third data set.

Figure 24
Figure 24
Full size image

P–P plots of the fitting distributions using data I.

Figure 25
Figure 25
Full size image

P–P plots of the fitting distributions using data II.

Figure 26
Figure 26
Full size image

P–P plots of the fitting distributions using data III.

Finally, the three recommended data sets are evaluated by applying the Bayes estimators under SE, LI, and GE loss functions. The obtained results are summarized in Table 12. In addition, the histogram, trace, and ACF plots of the RL-Exp distribution for the MCMC technique are illustrated in Figs. 27, 28 and 29. All the codes used exist in Sect. 8.

Table 12 Result of Bayesian estimate of the RL-Exp model using SE, LI and GE loss functions for the three data sets.
Figure 27
Figure 27
Full size image

MCMC Convergence results using data I.

Figure 28
Figure 28
Full size image

MCMC Convergence results using data II.

Figure 29
Figure 29
Full size image

MCMC Convergence results using data III.

Conclusion

The article introduces a new probabilistic model, the Ramos Louzada exponential model, specifically designed for engineering and environmental analysis applications. The proposed model can fit real-world data with ambiguity; our exploration of the RL-Exp model has demonstrated its ability to handle ambiguity and data ambiguity. The proposed model provides insight into reliability properties, moment functions, and order statistics and demonstrates its versatility as a powerful tool for decoding complex, skewed, and symmetric data sets. Using a wide range of methods, we can handle statistical analysis of the RL-Exp model and its unknown parameters. Therefore, we concluded that the Bayesian approach under the SE loss function is superior to the conventional estimating method since it consistently produces lower values for the MSE and number of iterations. We applied the proposed distribution to real-world data sets to verify its practical importance, comparing it directly with well-known classical distributions. The analysis results showed unequivocally that the RL-Exp model outperformed its classical counterpart in capturing the complex nuances of the underlying data, indicating its potential as a superior modeling tool in various fields. Henceforth, to best-fitting capability, the RL-Exp mode has certain disadvantages. For example, it has a non-closed form for the quantile function, and the cost of Bayesian estimates is generally high to compute. For future studies, we can apply the RL-Exp model in other cases, such as bivariate and multivariate versions, regression frameworks, and other statistical inferences, like censored data. Also, the proposed model may be utilized in different areas that require flexible and tractable probabilistic modeling.

Codes used in the paper

figure a