Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Exponentiated Odd Lomax Exponential distribution with application to COVID-19 death cases of Nepal

Abstract

This study suggested a new four-parameter Exponentiated Odd Lomax Exponential (EOLE) distribution by compounding an exponentiated odd function with Lomax distribution as a generator. The proposed model is unimodal and positively skewed whereas the hazard rate function is monotonically increasing and inverted bathtubs. Some important properties of the new distribution are derived such as quintile function and median; asymptotic properties and mode; moments; mean residual life, mean path time; mean deviation; order statistics; and Bonferroni & Lorenz curve. The value of the parameters is obtained from the maximum likelihood estimation, least-square estimation, and Cramér-Von-Mises methods. Here, a simulation study and two real data sets, “the number of deaths per day due to COVID-19 of the first wave in Nepal" and ‘‘failure stresses (In Gpa) of single carbon fibers of lengths 50 mm", have been applied to validate the different theoretical findings. The finding of an order of COVID-19 deaths in 153 days in Nepal obey the proposed distribution, it has a significantly positive relationship between the predictive test positive rate and the predictive number of deaths per day. Therefore, the intended model is an alternative model for survival data and lifetime data analysis.

Introduction

Probability distributions have been used extensively not only in statistics and mathematics, but also in applied sciences, engineering, and life sciences. Thus, the advancement of probability distributions always continues to grow at a fast pace to simulate real-life conditions and analyze real-life data more efficiently. While doing so, this past decade, many generalized distributions being proposed based on different modification methods with more parameters and flexibility than the existing one. However, there are numerous problems to solve and analyze in real data because any classical or standard probability distributions do not address the different data characteristics [1]. Thus, a new family of distributions or distributions has been proposed to generalize several distributions by compounding well-known distributions which provide greater flexibility in modeling as a practical viewpoint [2].

In the literature, a new parametric distribution has been derived by adding a parameter in exponential and Weibull distribution, yielding a new two-parameter exponential and three parameters Weibull distribution [3]. Marshall–Olkin extended Lomax distribution has been derived by extending the Marshall and Olkin family of distributions based on the Lomax distribution [4]. A five-parameter McDonald Lomax distribution has been derived from the Lomax distribution [5]. Likewise, the new sub-models have been formed by using a Lomax distribution as a generator with two additional positive parameters. In this paper, some special models, such as Lomax-normal, Lomax-Weibull, Lomax-logistic, and Lomax-Pareto distributions have been derived [6, 7]. A new distribution has been generalized, then it became the Kumaraswamy-G Poisson distribution, which has three extra positive parameters [8]. The three-parameter power Lomax distribution, which is more flexible than previous Lomax distributions, and it has been derived with decreasing and inverted bathtub hazard rate functions [9]. Moreover, a new two-parameter half Logistic Poisson distribution has been derived, and it expanded into generalized half-logistic Poisson distribution with three parameters. The proposed distribution is increasing, decreasing, upside-down, and bathtub-shaped hazard rate function [10, 11]. Similarly, a three-parameter Kumaraswamy half logistic distribution has been derived from the Kumaraswamy-G family by compounding with half logistic distribution as a baseline distribution [2].

Furthermore, exponentiated Weibull Lomax distribution has been derived from the exponentiated Weibull-G family [12]. The alpha power inverted exponential distribution has been derived from the inverted exponential distribution with alpha as a power. The proposed distribution is more versatile in numerous real data analyses [13]. An odd generalized exponential family has been compounding with inverted Lomax distribution in modeling, formed four-parameter model is an odd generalized exponentiated inverse Lomax distribution [14]. Likewise, the odd Lomax-exponential (type III) distribution has been derived from the Lomax random variable as a generator [15]. Lomax exponential distribution has been formed after the new modification of the Lomax distribution which is very flexible in life data modeling with decreasing and increasing hazard shapes (non-monotonic) [16]. Similarly, inverse Lomax as a generator has been used in continuous distributions and formed the inverse Lomax-exponentiated-G family [17]. Moreover, a new Poisson inverted exponential distribution is derived from the Poisson-G family [18]. A three-parameter half logistic Nadarajah-Haghighi extension of exponential (NHE) distribution has been derived by compounding a continuous distribution NHE with half logistic-G family [19], and compounding Rayleigh distribution with exponentiated-G Poisson family by power transformation technique formed exponentiated Rayleigh Poisson distribution [20].

In literature, different distributions have been derived and estimated the parameters by different techniques like as; maximum likelihood estimators, least squares estimators, weighted least squares estimators, percentile estimators, the maximum product spacing estimators, the minimum spacing absolute distance estimators, the minimum spacing absolute log-distance estimators, Cramér von Mises estimators, Anderson Darling estimators, right-tailed Anderson Darling estimators, method of moments estimators and Bayes estimators [2125].

Corona Virus Disease 2019 (COVID-19) pandemic has devastated the world and is accompanied by economic, social, and behavioral challenges and responses. More than 1.5 million people have died worldwide and more than 1,800 people have died in Nepal by the end of December 2020 [26, 27]. Already, several mathematical and statistical models have been proposed to explain the path of the pandemic. However, it is important to note that the characteristics of the data fluctuate which may lead to classical probability distributions that may not be able to be captured in all cases. For example, the data are highly skewed, either to the right or to the left, with the possibility of some outlying observations, and therefore a classical distribution such as the normal distribution cannot be used to fit them. Therefore, flexible distribution is required to capture such data. As a result, we have proposed an Exponentiated Odd Lomax Exponential (EOLE) distribution to analyze the deaths cases of COVID-19 first wave in Nepal. It is more flexible, with four parameters, better equipped to handle complex data, and thus achieves our goal.

In this study, the cumulative distribution function, probability density function, reliability/survival, hazard rate functions, reverse hazard rate function, and cumulative hazard rate function are explicitly presented in section material and methods. Likewise, we derive some important statistical properties such as quintile function and median, asymptotic properties and mode, moments, mean residual life, mean path time, mean deviation, order statistics, and Bonferroni & Lorenz curve. In an estimation technique, we have to employ three well-known estimation methods to estimate the model parameters namely, the Maximum Likelihood Estimation (MLE), Least-Square Estimation (LSE), and Cramér-Von-Mises (CVM). We conducted a simulation study in the result and discussion section, and two real data sets were used to verify the theoretical findings in various aspects. Finally, derive our conclusion of this study with further discussion.

Materials and methods

Exponentiated odd Lomax exponential distribution

Exponential distribution plays a significant role in statistics and probability theory. In this distribution, events occur continuously and independently at a constant average rate. It is a special case of gamma, Weibull, Rayleigh, and Erlang distribution. It is a continuous analog of the geometric distribution, which has the main property of being memoryless. As a result, the exponential distribution is used as a baseline probability distribution having a cumulative distribution function (1)

We have, and .

The distribution is extended by an auxiliary parameter, it forms an exponentiated function [28, 29]. Let, θ > 0 is an auxiliary parameter on odd function, called the exponentiated odd function, which is . Similarly, the T-X family of distribution is an extended form of beta generated distribution by taking any non-negative continuous random variable T as a generator instead of beta random variable [30], which is (2)

The r(t) as a generator that has used the probability density function of the Lomax distribution. The Lomax distribution (also known as Pareto type II distribution) is a widespread distribution with applications in the field of actuarial science, reliability modeling, life testing, economics, network analysis, and operations research [31]. Therefore, The PDF of Lomax distribution as a generator is

We compound the PDF of the Lomax distribution as a generator and exponentiated odd [W(x)] function because the exponential distribution has a single scale parameter and the Lomax distribution has one of each scale and shape parameter. When both functions are compounded, it becomes two of each scale and shape parameter, making it is more robust and flexible distribution. As a result, it captures different types of data such as; skewed, truncated, non-truncated, and others. Therefore, the CDF of an exponentiated odd Lomax exponential distribution is (3)

The corresponding PDF of the proposed distribution is (4)

Here, α > 0, δ > 0 are scale parameters and λ > 0, θ > 0 are shape parameters. The shape of PDF (4) is platykurtic and positively skewed at α = 1.0, λ = 1.0 and α = 1.5, λ = 1.5, symmetrical at α = 2.0, λ = 2.0 and it is leptokurtic after increase α and λ whereas θ = 2.5 and δ = 2.0 are fixed [Fig 1, (left panel)].

thumbnail
Fig 1. Probability density function (left panel), hazard rate function (right panel) with different parameter’s value.

https://doi.org/10.1371/journal.pone.0269450.g001

Likewise, the survival function is complementary to the CDF which gives the chance to live just before during ‘x’. Mathematically, R(x) = 1 − F(x). Hence, the survival function of the proposed distribution is (5)

The hazard rate function is the conditional density given that the event has not yet occurred before time x. Mathematically, let x be a survival time of a component or item and we want to calculate the probability that it will not survive for an additional time Δx, then hazard rate function is, Therefore, the hazard rate function of the proposed model is (6)

Likewise, the shape of hazard function (6) a is monotonic increase at (α = 1.0, λ = 1.0), (α = 1.5, λ = 1.5) and (α = 2.0, λ = 2.0). After increasing the value of α and λ then it change monotonic increase and inverted bathtub shaped at (α = 2.5, λ = 2.5) and (α = 3.0, λ = 3.0) whereas θ = 2.5 and δ = 2.0 are fixed [Fig 1, (right panel)].

Similarly, the reversed hazard rate function is the ratio of density to the distribution function which is useful in reliability analysis. It is (7)

Likewise, the cumulative hazard rate function is not the probability function, however, it measures the risk. Therefore, it is defined as (8)

Statistical properties

In this section, some properties of the EOLE distribution have been derived.

Useful expansions

Distribution is derived from the generalized binomial series. For, |Z| < 1, n > 0; we have, (9)

Quantile and median

The quantile functions are used in theoretical aspects of a probability distribution. It is an alternative to PDF and CDF, which is used to obtain statistical measures like median, skewness, and kurtosis. It has been also used to generate random numbers. The quantile function is given by Q(u) = F−1(u). Therefore, the corresponding quantile function of the proposed distribution is (10)

Where, u ~ U(0,1). In particular, the median is derived by setting in Eq (10), we get;

Asymptotic behavior and mode

To examine the asymptotic behavior, we have to check, . If both limits are converging into zero, then the proposed model satisfied the properties of asymptotic behavior and it existed the mode value.

Therefore,

Further, we have to calculate the mode by taking the logarithmic in Eq (4), we get; (11)

Now, differentiate concerning in Eq (11) and apply the condition f(x) ≠ 0 and f′(x) = 0, the mode of proposed distribution is (12)

Eq (12) is a nonlinear equation that cannot be solved analytically. It can be solved numerically by using the Newton-Raphson method.

Moments

The moments of probability distribution suggest the characteristics of the distribution like mean, standard deviation, skewness, and kurtosis. Let, X be a random variable following the EOLE distribution, then the moment of the proposed distribution is (13)

Alternatively, we define the moments of proposed distribution from the quantile function [32, 33]. The rth raw moment of the proposed distribution is (14)

Where, QG(u) is the quantile function (10), then Eq (14) is (15)

By simplification, we get rth raw moments of proposed distribution is (16)

Where, αp(r) is the coefficient of in the expansion of [33, 34].

In particular, the first four moments of X obtained by substituting the value of r = 1, 2, 3 and 4 in Eq (16).

Conditional moments

The conditional moment is also of interesting for increasing the failure rate model. Conditional moment is (17)

Alternatively, we can define the conditional moments from the quantile function, which is (18)

Where, u = F(x) is CDF and, R(x) is survival function of the proposed model, then conditional moments is

In particular, and

Mean residual life

The Mean Residual Life (MRL) is the average outstanding life, Xx given that the item has survived to time x. Thus, the expected additional lifetime given that a component has survived until the time x is called the MRL. It is defined as, (19)

Alternatively, we can define the MRL of proposed distribution from the quantile function is (20)

Where, F(x) is CDF and, R(x) is survival function of the proposed distribution.

Mean past lifetime

The mean Past Lifetime (MPL) is the conditional random variable xX/Xx. This showed that the time elapsed from the failure of the component given that its lifetime is less or equal to x. It can be calculated as, (21)

It can be alternatively defined from the quantile function, which is (22)

Mean deviation

The Mean Deviation (MD) from mean and median measures the scatter from the center value either mean or median. The MD is defined as, (23)

We obtained MD(μ) and MD(md) using the following relationships: (24)

Likewise, (25)

We have to calculate in terms of quantile function such as .

Likewise,

Finally, the Eqs (24) and (25) becomes, and

Order statistics

Order statistics have been extensively applied in many fields of statistics such as reliability and life testing. Let, X1, X2, …, Xn random sample from (4) and X1:nX2:n ≤ … ≤ Xn:n corresponding order statistics. The probability density function of rth order statistics say Xr:n; 1 ≤ rn [33] is given by; (26)

We apply the preposition of (1) and (2) in Eq (26) then the equation becomes, (27)

When, r = n then from Eq (27), the pdf of the largest order statistics Xn:n is given by

Similarly, r = 1, then from Eq (27), the pdf of smallest order statistics x1:n is given by

Bonferroni and Lorenz curve

Bonferroni and Lorenz curve has been proposed by Bonferroni [33]. To measure poverty and income, Bonferroni and Lorenz curves are widely used. Also, such types of curves are widely used in other fields like demography, medicine, reliability, insurance and many others.

Methods of estimation

We have to estimate the value of unknown parameters of the proposed model by maximum likelihood estimation, method of least square, weighted least square, and Cramér von miss technique.

Maximum Likelihood Estimation (MLE)

Let, x1, x2, …, xn are random sample from EOLE distribution with parameters (α, θ, λ and δ), then likelihood function of proposed distribution is the product of nth time of sample PDF which is . Where, is the parameter space which belongs to (α, θ, λ and δ). Therefore, the log-likelihood function of the proposed distribution is (28)

The parameters are obtained from maximum likelihood estimation by partial differentiate (28) with respect to corresponding parameters. Let, and we have; (29) (30) (31) (32)

Finally, solve non-linear equations , , and for α, θ, λ and δ. We get the maximum likelihood estimate value (, , and ) of the parameters (α, θ, λ and δ). Likewise, for the interval estimation of parameters (α, θ, λ and δ), we have to calculate the observed information matrix. The observed information matrix is (33)

The elements of the observed information matrix are in Appendix B of S1 Appendix. Let denote the parameter space and the corresponding MLE of as , then follows the asymptotic multivariate normal distribution, where is the Fisher’s information matrix. For practical proposed, we directly calculate the observed information matrix from Eq (28) and convert it into Hussain matrix. Finally, we calculate the variance-covariance matrix from the inverse of the Hussain matrix is (34)

Furthermore, the asymptotic normality of MLEs, approximate 100(1 − γ)% confidence intervals of α, θ, λ and δ can be constructed as; where zγ/2 is the upper percentile of standard normal variate.

Method of Least-Square Estimation (LSE)

Initially, the least square estimation and weighted least square estimate were introduced to estimate the parameters of beta distribution [35, 36]. This technique has been used to estimate unknown parameters of proposed distribution by minimizing the concerning parameters α, θ, λ and δ, which is (35)

The parameter’s values are obtained from the least square method by partial differentiation in Eq (35) concerning corresponding parameters.

Let, and , and , then Eq (35) becomes; (36) (37) (38) (39)

We solve non-linear equations , , and to estimate the unknown parameters of the proposed distribution by minimizing the function concerning parameters α, θ, λ and δ.

Weighted least-square estimation

The weighted least-squares estimation is a technique to determine the unknown parameters by minimizing concerning parameters α, θ, λ and δ is (40)

Where, is the weight for the proposed model. Hence, the weighted least-square estimators of α, θ, λ and δ respectively can be obtained by partial differentiate with respect to corresponding parameters in Eq (40) and set the result equal to zero (41) (42) (43) (44)

We solve non-linear equations , , and to estimate unknown parameters of proposed distribution by minimizing function concerning parameters α, θ, λ and δ.

Method of Cramér-Von-Mises (CVM)

Cramér-von-Mises is minimum distance estimators [36]. It provides empirical evidence that the bias of the estimator is smaller than the other minimum distance estimators. The CVM estimators are achieved and the function has minimized C(α, θ, λ, δ) (45)

Cramér-Von-Mises estimators of α, θ, λ and δ respectively can be obtained by partial differentiate with respect to corresponding parameters in Eq (45) and set the result equal to zero (46) (47) (48) (49)

We solve non-linear equations , , and to estimate unknown parameters of proposed distribution by minimizing the function concerning parameters α, θ, λ and δ.

Results and discussion

Data analysis has been done in two-phase. Firstly, we have done a simulation study and secondly, we have done real data analysis. In real data analysis, two data sets have been used to validate the proposed model: (i) The first data set is the number of deaths per day due to the COVID-19 first wave in Nepal. (ii) The second data set is failure stresses (in GPa) of single carbon fibers of lengths 50 mm.

Simulation study

In a simulation study, we estimate the parameters of the proposed distribution by maximum likelihood estimation. The performance of ML estimators is assessed through their average bias and Mean Square Error (MSEs) for different sample sizes. For the estimation purpose, 10000 random samples of sizes 50, 200, 500, 750 are generated with different combinations of (α, θ, λ and δ). The iterative technique is used to estimate the ML parameters of each sample size. We observed that average bias and MSEs for individual parameters fall to zero when sample size increases as our expectation, which provides the consistency of the estimators. (Table 1).

thumbnail
Table 1. MLE, average bias and MSEs of EOLE distribution.

https://doi.org/10.1371/journal.pone.0269450.t001

Real data analysis

I. Number of deaths per day due to COVID-19 in Nepal.

The COVID-19 is a worldwide pandemic of coronavirus disease in 2019 including Nepal. The first COVID case was confirmed on 23 January 2020 and the first death was on 14 May in Nepal. Due to the COVID-19 pandemic, the government has emphasized a nationwide lockdown from March 24, 2020, to July 21, 2020. Following that, the government concentrated its efforts on the PCR test and other health-related initiatives. Every day, the ministry of health and population have been provided the data regarding COVID-19 issues, such as test positive rate, the number of deaths, the number of infected, and many others. During the research period, researchers collected the data daily from 23 January 2019 to 24 December 2019 all over the country. Every day, the ministry of health and the population of Nepal (MOHP) has been reported the data [26]. Among these data, we select the number of deaths to validate the proposed model. A total of 1,808 deaths were recorded in Nepal at the end of 24 December 2020 due to COVID-19 first wave. Every day, on average, 5.4 ≈ 6 people were died due to COVID-19 (from 23 January to 24 December). The summary finding of daily deaths has been presented in the following table (Table 2).

thumbnail
Table 2. Descriptive statistic of the number of death due to COVID-19.

https://doi.org/10.1371/journal.pone.0269450.t002

To validate the proposed model, at least two deaths occurred every day as consideration for sample data. In the last 153 days, every day, at least two people have died, as reported below [26].

2, 2, 2, 2, 2, 2, 3, 2, 3, 3, 4, 2, 5, 5, 3, 2, 4, 4, 8, 4, 4, 3, 2, 3, 7, 6, 6, 11, 9, 3, 8, 7, 11, 8, 12, 12, 14, 7, 11, 12, 6, 14, 9, 9, 11, 6, 6, 5, 5, 14, 9, 15, 11, 8, 4, 7, 11, 10, 16, 2, 7, 17, 6, 8, 10, 4, 10, 7, 11, 11, 8, 7, 19, 9, 15, 12, 10, 14, 22, 9, 18, 12, 19, 21, 12, 12, 18, 8, 26, 21, 17, 13, 5, 15, 14, 11, 17, 16, 17, 23, 24, 20, 30, 18, 18, 17, 21, 18, 22, 26, 15, 13, 13, 6, 9, 17, 12, 17, 22, 7, 16, 16, 24, 28, 23, 23,19, 25, 29, 21, 9, 13, 16, 10, 17, 20, 23, 14, 12, 11, 15, 9, 18, 14, 13, 6, 16, 12, 11, 7, 3, 5, 5.

To fit the data, we have to check our data set by graphical representation like TTT plot and box plot.

Total time test plot

TTT plot is an important graphical method for checking whether or not our data set can be applied in a particular model. Plots can be easily obtained by using the TTT function of adequacy model package on R software. It is used to validate the hazard rate function [37, 39]. The empirical version of the TTT plot is where, yr:n (r = 1, 2, …, n) and yi:n (i = 1, 2, …, n) are the order statistics of the sample. The shape of the TTT plot is either convex for decreasing failure rate, concave for increasing failure rate or bathtub shaped. Here, the TTT plot of the illustrative data set is concave for increasing failure rate. It indicates that the data set is valid for further analysis [Fig 2 (left panel)] [37].

Box plot

The summary finding of the data set is present by using the box plot. It provides a clear picture of the descriptive characteristics of the illustrative data set [Fig 2 (right panel)].

Parameter estimation

We computed the value of the parameter by maximizing the log-likelihood function in Eq (28), minimizing the least square method in Eq (35), weighted least square Eq (40), and the Cramér Von Mises method in Eq (45) directly by using maxLik () function on R software [38, 39]. Finally, we have to present the estimated value of ; which were computed by different methods (Table 3).

thumbnail
Table 3. Estimated parameters’ value from four different methods.

https://doi.org/10.1371/journal.pone.0269450.t003

Distribution characteristics

After estimating the value of the parameter, we determined the characteristics of the proposed distribution from the illustrative data set. The finding of descriptive statistics showed that the mean is greater than the median, which is also higher than the mode, and value of skewness is positive, which shows that the proposed model is positively skewed. In the case of kurtosis, the distribution is approximately symmetrical, but towards platykurtic (Table 4).

thumbnail
Table 4. Descriptive characteristic of the proposed model.

https://doi.org/10.1371/journal.pone.0269450.t004

Validation of estimation methods

Various methods have been used in the literature to estimate unknown parameters. Among them, we used four methods named: MLE, LSE, WLSE, and CVM. Again, we have to check the validation of the different methods by using different goodness of fit criteria. The well-known criteria are Kolmogorov-Simnorov (KS) test, Anderson’s darling (A2) test, and Cramér Von Mises (W) test. The p-values of the KS test, A2 test, and W test are insignificant with finding of MLE, but significant with the finding of LSE, WLSE, and CVM. Therefore, MLE has satisfied the good behavior of goodness of fit (Table 5).

thumbnail
Table 5. Comparison with a p-value of KS, A2 and W statistics in different methods.

https://doi.org/10.1371/journal.pone.0269450.t005

Furthermore, we compared the empirical distribution and theoretical cumulative distribution of the proposed model, indicating that the curve of empirical distribution is closer with the finding of MLE but does not closer with other findings (LSE, WLSE, and CVM) in the illustrative data set [Fig 3 (left panel)]. Also, we plot the theoretical PDF of the intended model by using different estimated values [Fig 3 (right panel)]. In both graphical demonstrations, the estimated value of MLE is more appropriately fitted than others.

thumbnail
Fig 3. Plot the empirical distribution function with estimated CDF (left panel), histogram with estimated PDF (right panel).

https://doi.org/10.1371/journal.pone.0269450.g003

Relationship between the predictive probability of number of deaths and test Positive rate

Again, we have to estimate the parameter value of EOLE distribution by using the test positive rate per day from the MLE technique. The estimated parameter’s value of EOLE distribution with Standard Error (SE) are ; ; and, .

Furthermore, we have to predict the probability of test positive rate and probability of number of deaths per day. Finally, we have to determine the relationship among these variables. The finding revealed that, there is a positive relationship among these variables, which is statistically significant (r = 0.2762, p-value = 0.00054) with a 95% confidence interval (0.12291–0.41662).

The finding concludes that the test positive rate will increase; the death rate should be increased [Fig 4].

thumbnail
Fig 4. Relationship between predictive prob-ability of number of death and test positive rate.

https://doi.org/10.1371/journal.pone.0269450.g004

Model comparisons/selections

Model selection is an important and integral part of data analysis. It is important to increase computing power to fit more realistic, flexible, and complex models. We compared our proposed model with eleven competitive models namely; exponentiated half logistic exponential (EHLE) [40], Marshall-Olkin logistic exponential (MOLE) [41], Lomax exponential Weibull (LEW) [42], exponentiated generalized inverted exponential (EGIE) [43], generalized inverted generalized exponential (GIGE) [44], generalized odd inverted exponential exponential (GOIEE) [45], Marshall–Olkin power generalized Weibull (MOPGW) [46], odd Lomax exponential (OLE) [47], type I half-logistic Fréchet (TIHLF) [48], Lindley inverse Weibull (LIW) [36] and half logistic Nadarajah Haghighi extension of exponential (HLNHE) [19]. To compare the proposed models with other competitive models, firstly we determine the value of parameters by maxlik function () from R software by solving the nonlinear equation [38, 39]. The estimated parameter value of each distribution along with standard error are present in the following table (Table 6). The PDF of each competitive model is in Appendix C of S1 Appendix.

thumbnail
Table 6. Estimated value of parameters: Proposed as well as competitive models.

https://doi.org/10.1371/journal.pone.0269450.t006

We have compared different goodness of fit criteria like as; (i) values of log-likelihood, (ii) Akaike’s information criterion, (iii) Bayesian information criterion, (iv) corrected Akaike’s information criterion, and (v) Hannan-Quinn information criterion. Each criteria can be calculated as following relation; ; ; ; and . Where, p is the number of parameters in the model and n is the total sample under consideration.

According to -2LL, AIC, BIC, CIAC and HQIC, the least value among the competitive models is superior to others. The finding reveals that the value of the intended model has smaller as compared to all other eleven competitive models. Therefore, the proposed model is superior than others followed by MOPGW. The model GIGE is the least fitted model in the given illustrative data set (Table 7).

thumbnail
Table 7. Calculated value of -2LL, AIC, BIC, CIAC, and HQIC of different models.

https://doi.org/10.1371/journal.pone.0269450.t007

Furthermore, we have compared the empirical distribution and theoretical cumulative distribution of the proposed model, indicating that both curves are closer in the illustrative data set. Likewise, the theoretical CDF of nine competitive models namely, EHLE, MOLE, LEW, EGIE, GIGE, MOPGW, OLE, LIW, and HLNHE compared to the theoretical CDF the proposed model [Fig 5 (left panel)]. Also, the theoretical PDF of the intended model is compared with all other competitive models [Fig 5 (right panel)]. The finding suggests that the proposed model is adequately fit in illustrative data set than all other competitive models.

thumbnail
Fig 5. Estimated fitted CDF (left panel), Estimated fitted densities (right panel).

https://doi.org/10.1371/journal.pone.0269450.g005

II. Failure stresses (In Gpa) of single carbon fibers of lengths 50 mm data set.

The second data set “on failure stresses (in GPa) of single carbon fibers of lengths 50 mm” [49] has been used to validate the proposed distribution. The illustrative data set were used by different authors to validate other distributions like, a new extension of the generalized half logistic distribution [50] and weighted Lindley distribution [51].

1.339, 1.434, 1.549, 1.574, 1.589, 1.613, 1.746, 1.753, 1.764, 1.807, 1.812, 1.84, 1.852, 1.852, 1.862, 1.864,1.931, 1.952, 1.974, 2.019, 2.051, 2.055, 2.058, 2.088, 2.125, 2.162, 2.171, 2.172, 2.18, 2.194, 2.211, 2.27, 2.272,2.28, 2.299, 2.308, 2.335, 2.349, 2.356, 2.386, 2.39, 2.41, 2.43, 2.431, 2.458, 2.471, 2.497, 2.514, 2.558, 2.577, 2.593, 2.601, 2.604, 2.62, 2.633, 2.67, 2.682, 2.699, 2.705, 2.735, 2.785, 3.02, 3.042, 3.116, 3.174.

Now, we have used an illustrative data set to estimate the parameters value of the proposed model. The estimated value of the parameters are (0.120098, 6.208652, 3.391613 and 0.003138) respectively. Furthermore, we used the KS test, Anderson’s darling test (A2), and Cramér Von Mises test (W) to assess the goodness of fit. The test values for each statistic are 0.038742 (p-value = 0.8227), 0.21115 (p-value = 0.9871), and 0.025715 (p-value = 0.9886), respectively. The p-values of each statistic support the null hypothesis, indicating that the proposed model has a better fit in the recommended data set. Similarly, we compared the proposed model to other competitive models using -2LL, AIC, BIC, CIAC, and HQIC. Firstly, we estimate the values of the model’s parameters and present them in the table (Table 8).

thumbnail
Table 8. Estimated parameter values of all competitive models.

https://doi.org/10.1371/journal.pone.0269450.t008

The lowest value of -2LL, AIC, BIC, CIAC, and HQIC in the proposed model, among all competitive models, indicates that the proposed model is superior to others (Table 9).

thumbnail
Table 9. Comparison of -2LL, AIC, BIC, CIAC, and HQIC value among models.

https://doi.org/10.1371/journal.pone.0269450.t009

Similarly, the built model is appropriately fit in terms of graphical appearance than other competitive models [Fig 6].

thumbnail
Fig 6. Estimated fitted CDF (left panel), Estimated fitted densities (right panel) of carbon fiber data set.

https://doi.org/10.1371/journal.pone.0269450.g006

Conclusion

This study suggested a new four-parameter Exponentiated Odd Lomax Exponential (EOLE) distribution by compounding an exponentiated odd function with Lomax distribution as a generator. Some important properties of the new distribution are investigated such as quintile function and median; asymptotic properties and mode; moments; mean residual life, mean path time; mean deviation; order statistics; and Bonferroni & Lorenz curve. Further, we have employed three well-known estimation methods to estimate the model parameters namely, the maximum likelihood estimation, least-square estimation, and Cramér-Von-Mises methods. To verified the different theoretical finding we have applied a simulation study and two real data sets, ‘‘Number of deaths per day due to COVID-19 first wave in Nepal” and ‘‘failure stresses (in GPa) of single carbon fibers of lengths 50 mm”. It has a significantly positive relationship between predicted test positive rate and the predicted number of deaths per day. Finally, we analyzed the illustrative data set and found that the proposed model provides a reasonably better fit as compared to some other well-known models. Therefore, the EOLE distribution can be used as an alternative model in the future to analyze survival and lifetime data.

Acknowledgments

We would like to be grateful to the referees for their valuable comments and suggestions which help improve the original manuscript.

References

  1. 1. Merovci F. Transmuted generalized Rayleigh distribution. Journal of Statistics Applications & Probability. 2014 Mar 1; 3(1):9.
  2. 2. Usman RM, Haq M, Talib J. Kumaraswamy half-logistic distribution: properties and applications. J Stat Appl Probab. 2017; 6:597–609.
  3. 3. Marshall AW, Olkin I. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika. 1997 Sep 1; 84(3):641–52.
  4. 4. Ghitany ME, Al-Awadhi FA, Alkhalfan L. Marshall–Olkin extended Lomax distribution and its application to censored data. Communications in Statistics—Theory and Methods. 2007 Aug 7; 36(10):1855–66.
  5. 5. Lemonte AJ, Cordeiro GM. An extended Lomax distribution. Statistics. 2013 Aug 1; 47(4):800–16.
  6. 6. Gómez YM, Bolfarine H, Gómez HW. A new extension of the exponential distribution. Revista Colombiana de Estadística. 2014 Jun; 37(1):25–34.
  7. 7. Cordeiro GM, Ortega EM, Popović BV, Pescim RR. The Lomax generator of distributions: Properties, minification process and regression model. Applied Mathematics and Computation. 2014 Nov 15; 247:465–86.
  8. 8. Ramos MW, Marinho PR, Cordeiro GM, da Silva RV, Hamedani G. The Kumaraswamy-G Poisson family of distributions. Journal of Statistical Theory and Applications. 2015.
  9. 9. Rady EH, Hassanein WA, Elhaddad TA. The power Lomax distribution with an application to bladder cancer data. SpringerPlus. 2016 Dec; 5(1):1–22. pmid:27818876
  10. 10. Muhammad M, Yahaya MA. The half logistic-Poisson distribution. Asian Journal of Mathematics and Applications. 2017 Jun 26; 2017.
  11. 11. Muhammad M. Generalized half-logistic Poisson distributions. Communications for Statistical Applications and Methods. 2017; 24(4):353–65.
  12. 12. Hassan AS, Abd-Allah M. Exponentiated Weibull-Lomax distribution: properties and estimation. Journal of Data Science. 2018 Apr 1; 16(2):277–98.
  13. 13. Ceren ÜN, Cakmakyapan S, Gamze ÖZ. Alpha power inverted exponential distribution: Properties and application. Gazi University Journal of Science. 2018; 31(3):954–65.
  14. 14. Maxwell O, Chukwudike NC, Bright OC. Modeling lifetime data with the odd generalized exponentiated inverse Lomax distribution. Biom Biostat Int J. 2019; 8(2):39–42.
  15. 15. Ogunsanya AS, Sanni OO, Yahya WB. Exploring some properties of odd Lomax-exponential distribution. Annals of Statistical Theory and Applications (ASTA). 2019 May; 1:21–30.
  16. 16. Ijaz M, Asim SM. Lomax exponential distribution with an application to real-life data. PloS one. 2019 Dec 11; 14(12):e0225827. pmid:31826022
  17. 17. Falgore JY, Doguwa SI. Inverse Lomax-Exponentiated G (IL-EG) Family of Distributions: Properties and Applications. Asian Journal of Probability and Statistics. 2020 Nov 30:48–64.
  18. 18. Dhungana GP. A New Poisson Inverted Exponential Distribution: Model, Properties and Application. Prithvi Academic Journal. 2020 Sep 16:136–46.
  19. 19. Joshi R.K., & Kumar V. Half Logistic NHE: Properties and Application. International Journal for Research in Applied Science & Engineering Technology.2020 Dec 14; 8(IX), 742–753.
  20. 20. Joshi RK, Dhungana GP. Exponentiated Rayleigh Poisson Distribution: Model, Properties and Applications. American Journal of Theoretical and Applied Statistics. 2020 Nov 4; 9(6):272–82.
  21. 21. Sajid AL, Sanku DE, Tahir MH, Mansoor M. A comparison of different methods of estimation for the flexible Weibull distribution. Communications Faculty of Sciences University of Ankara Series A1 Mathematics and Statistics. 2020;69(1):794–814.
  22. 22. Dey S, Ali S, Park C. Weighted exponential distribution: properties and different methods of estimation. Journal of Statistical Computation and Simulation. 2015 Dec 12;85(18):3641–61.
  23. 23. Dey S, Dey T, Ali S, Mulekar MS. Two-parameter Maxwell distribution: Properties and different methods of estimation. Journal of Statistical Theory and Practice. 2016 Jun;10(2):291–310.
  24. 24. Shafqat M, Ali S, Shah I, Dey S. Univariate discrete Nadarajah and Haghighi distribution: Properties and different methods of estimation. Statistica. 2020;80(3):301–30.
  25. 25. Ali S, Dey S, Tahir MH, Mansoor M. The Poisson Nadarajah-Haghighi Distribution: Different Methods of Estimation. Journal of Reliability and Statistical Studies. 2021 Aug 30:415–50.
  26. 26. Government of Nepal Ministry of Health and Population. Health sector response to COVID-19. 2020 Dec 21; SitRep#319.
  27. 27. World Health Organization. Weekly Operational Update on COVID-19. 2020 Dec 21; World Health Organization.
  28. 28. Tahir MH, Nadarajah S. Parameter induction in continuous univariate distributions: Well-established G families. Anais da Academia Brasileira de Ciências. 2015 Apr; 87:539–68. pmid:26131628
  29. 29. de Brito CR, Rêgo LC, de Oliveira WR, Gomes-Silva F. Method for generating distributions and classes of probability distributions: The univariate case. Hacettepe Journal of Mathematics and Statistics. 2019 Jan 1; 48(3):897–930.
  30. 30. Alzaatreh A, Lee C, Famoye F. A new method for generating families of continuous distributions. Metron. 2013 Jun 1; 71(1):63–79.
  31. 31. Chakrabortya, T. A note on the Lomax distribution. 2019. ArXiv.1911.12612v1 [Math. Stat].
  32. 32. Anzagra L, Sarpong S, Nasiru S. Odd chen-g family of distributions. Annals of Data Science. 2020 Mar 16:1–23.
  33. 33. Dey S, Kumar D, Ramos PL, Louzada F. Exponentiated Chen distribution: Properties and estimation. Communications in Statistics-Simulation and Computation. 2017 Nov 26; 46(10):8118–39.
  34. 34. Balakrishnan N. Order statistics from the half logistic distribution. Journal of Statistical Computation and Simulation. 1985 Jan 1; 20(4):287–309.
  35. 35. Swain JJ, Venkatraman S, Wilson JR. Least-squares estimation of distribution functions in Johnson’s translation system. Journal of Statistical Computation and Simulation. 1988 Jun 1;29(4):271–97.
  36. 36. Joshi RK, Kumar VI. Lindley inverse Weibull distribution: Theory and Applications. Bull. Math. & Stat. Res. 2020; 8(3):32–46.
  37. 37. Aarset MV. How to identify a bathtub hazard rate. IEEE Transactions on Reliability. 1987 Apr; 36(1):106–8.
  38. 38. Henningsen A, Toomet O. maxLik: A package for maximum likelihood estimation in R. Computational Statistics. 2011 Sep; 26(3):443–58.
  39. 39. R Core Team, R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2020; URL https://www.R-project.org/.
  40. 40. Almarashi AM, Elgarhy M, Elsehetry MM, Kibria BM, Algarni A. A new extension of exponential distribution with statistical properties and applications. Journal of Nonlinear Sciences & Applications (JNSA). 2019 Mar 1; 12(3).
  41. 41. Mansoor M, Tahir MH, Cordeiro GM, Provost SB, Alzaatreh A. The Marshall-Olkin logistic-exponential distribution. Communications in Statistics-Theory and Methods. 2019 Jan 17; 48(2):220–34.
  42. 42. Ansari SI, Nofal ZM. The lomax exponentiated weibull model. Japanese Journal of Statistics and Data Science. 2021 Jul; 4(1):21–39.
  43. 43. Oguntunde PE, Adejumo A, Balogun OS. Statistical properties of the exponentiated generalized inverted exponential distribution. Applied Mathematics. 2014; 4(2):47–55.
  44. 44. Oguntunde PE, Adejumo AO. The generalized inverted generalized exponential distribution with an application to a censored data. Journal of Statistics Applications & Probability. 2015; 4(2):223–30.
  45. 45. Chesneau C, Djibrila S. The generalized odd inverted exponential-G family of distributions: properties and applications. Eurasian Bullet. Math. 2019.2(3);86–110.
  46. 46. Afify AZ, Kumar D, Elbatal I. Marshall–Olkin power generalized Weibull distribution with applications in engineering and medicine. Journal of Statistical Theory and Applications. 2020 May; 19(2):223–37.
  47. 47. Cordeiro GM, Afify AZ, Ortega EM, Suzuki AK, Mead ME. The odd Lomax generator of distributions: Properties, estimation and applications. Journal of Computational and Applied Mathematics. 2019 Feb 1; 347:222–37.
  48. 48. Cordeiro GM, Alizadeh M, Diniz Marinho PR. The type I half-logistic family of distributions. Journal of Statistical Computation and Simulation. 2016 Mar 3; 86(4):707–28.
  49. 49. Bader MG, Priest AM. Statistical aspects of fibre and bundle strength in hybrid composites. Progress in science and engineering of composites. 1982:1129–36.
  50. 50. Muhammad M, Liu L. A new extension of the generalized half logistic distribution with applications to real data. Entropy. 2019 Apr; 21(4):339. pmid:33267053
  51. 51. Al-Mutairi DK, Ghitany ME, Kundu D. Inferences on stress-strength reliability from weighted Lindley distributions. Communications in Statistics-Theory and Methods. 2015 Oct 2;44(19):4096–113.