Abstract

A new five-parameter transmuted generalization of the Lomax distribution (TGL) is introduced in this study which is more flexible than current distributions and has become the latest distribution theory trend. Transmuted generalization of Lomax distribution is the name given to the new model. This model includes some previously unknown distributions. The proposed distribution's structural features, closed forms for an rth moment and incomplete moments, quantile, and Rényi entropy, among other things, are deduced. Maximum likelihood estimate based on complete and Type-II censored data is used to derive the new distribution's parameter estimators. The percentile bootstrap and bootstrap-t confidence intervals for unknown parameters are introduced. Monte Carlo simulation research is discussed in order to estimate the characteristics of the proposed distribution using point and interval estimation. Other competitive models are compared to a novel TGL. The utility of the new model is demonstrated using two COVID-19 real-world data sets from France and the United Kingdom.

1. Introduction

Many generators have been studied in recent years by expanding some effective classical distributions. Many applied fields, including dependability, demographics, engineering, economics, actuarial sciences, biological research, hydrology, insurance, medicine, and finance, have employed such created families of distributions for modeling and evaluating lifetime data. However, there are still a lot of real-world data occurrences that do not fit into any of the statistical distributions. Shaw and Buckley [1] introduced a new class of distributions known as transmuted distributions with cumulative distribution function (CDF) as

By differentiating equation (1), we get the probability density function (pdf) as follows:where and are the base distribution's pdf and CDF.

There are various transmuted distributions suggested. Aryal and Tsokos [2] proposed the transmuted Weibull distribution as a new generalization of the Weibull distribution. Merovci [3] devised and explored the varied structural properties of the transmuted Rayleigh distribution. Khan and King [4] obtained the transmuted modified Weibull distribution. The transmuted Lomax distribution was presented by Ashour and Eltehiwy [5]. Transmuted Pareto distribution is introduced by Merovci and Puka [6]. The transmuted generalized linear exponential distribution was introduced by Elbatal et al. [7]; among others. Poboková and Michalková [8] proposed a transmuted Weibull distribution. Ali and Athar [9] have created a new generalized transmuted family of distributions (TD). They utilized Weibull distribution to generalized transmuted families of distributions (TWDn).

The Lomax distribution is a heavy-tail pdf popular in business, economics, and actuarial modeling. In some cases, it is also known as the Pareto Type-II distribution. In the event of a business failure, Lomax used it to fit data. It is essentially a Pareto distribution with a 0-support level. The pdf is as follows:

The CDF for (3) iswhere and are the shape and scale parameters, respectively.

The CDF and pdf of Gompertz-generalized G-family of distribution are given by Alizadeh et al. [10] aswhere and are extra form parameters that change the tail weights. and are the parent (or baseline) distribution's pdf and CDF, respectively.

Now, if the density from (3) and (4) is replaced into (5) and (6), Oguntunde et al. [11] introduced a novel generalization of the Lomax distribution known as the Gompertz Lomax distribution (GoLom) with vector parameters where ; the CDF and pdf are

The corresponding pdf is created by inserting the densities from (3) and (4) into (6) in the following order:

The transmuted generalized Lomax (TGL) distribution is a new five-parameter transmuted generalization of the Lomax distribution presented in this article. This generalization stems from a fundamental motivation as follows:(i)Providing a very flexible life distribution that includes several new existing distributions as submodels(ii)Making a significant difference in data modeling

One of the advantages of this distribution is that it works on modeling COVID-19 data. In COVID-19, a new coronavirus disease has expanded worldwide since December 2019, producing over 218 million cases and over 4.5 million deaths, reported by the World Health Organization (WHO). There have been about 6.5 million cases of COVID-19 in France as of September 2021, with over 112850 deaths. There have been about 6.8 million cases of COVID-19 in the United Kingdom as of September 2021, with over 132740 deaths. Therefore, we decided to find the best mathematical-statistical model for modeling the data of the countries of France and the United Kingdom. There were also many researchers who worked on finding a model for these data, such as Almetwally [12], Almetwally [13], Almetwally [14], and others.

The following is a representation of how this article is structured. In Section 2, we define the new distribution. The new distribution's structural features are discussed in Section 3. The maximum likelihood estimators (MLEs) of parameters under complete and Type-II censored samples are investigated in Section 4. Section 5 describes the various bootstrap confidence intervals. Section 6 describes a Monte-Carlo simulation analysis using entire sample sizes and Type-II censored samples to estimate point and interval estimation of TGL distribution parameters. In Section 7, two real-world data sets are introduced, and at the end of the article, there is a conclusion.

2. Transmuted Generalized Lomax Model

The TGL distribution and its submodels are shown here. The CDF of the TGL distribution with vector parameters where can be derived by substituting (7) and (8) in (1) and (2) asand its pdf is as follows:

As a result, the pdf (10) is defined as X ∼ TGL . Table 1 lists the TGL distribution's special submodels.

The survival (reliability) function and the hazard rate function have the following definitions:

For specific parameters selections, the pdf of TGL model is shown in Figure 1.

We can deduct from Figure 1 plots of the TGL distribution's pdf can be unimodal, normal, or right-skewed.

We may derive from Figure 2 that the TGL distribution's hazard function can take the form of a decreasing, increasing, or upside-down shape.

Lemma 1. The TGL density function's limit is provided as

Proof. The density function's conclusion is easy to illustrate (10).
Furthermore, the TGL hazard function's limit as x⟶0 is 0 and x⟶ ∞ is ∞ as shown as follows:This statement is simple to demonstrate.

3. Statistical Properties

The statistical aspects of the TGL distribution are examined in the following subsections, including moments, mode, quantile function, Rényi entropy, and order statistics.

3.1. Moments

The instant near 0 is calculated. We can write as follows from (10).

First, to obtain I1, as a result, we use binomial expansion.

So, I1 is given by

In a similar way, I2 is as follows:

Then, can be expressed as

3.2. Mode

The mode of the TGL distribution was obtained in this subsection. The is as follows:

. The mode is the solution of the following equation:

3.3. Quantile and Median

By inverting cdf (9) as follows, the TGL distribution can be easily simulated: on (0, 1), if U follows a uniform distribution, then

Special Case :

3.4. Rényi Entropy

The variance of the uncertainty is measured by the Rényi entropy of a pdf with random variable of TGL distribution. The Rényi entropy is defined as for any real parameter :

We can extract the integrated component using the density function (10) as follows:

The binomial expansion is then used as follows:

As a result, the TGL distribution's Rényi entropy is

3.5. Order Statistics

In this part, we will look at single-order statistics for the TGL distribution. Let us say ; there are TGL random variables that are both independent and identically distributed. Let stand for the order statistics derived from these variables. The pdf of the order statistic, say , is then calculated aswhere The binomial expansion is used in this case; then, order statistic of TGL distribution is

For the TGL distribution, the moments of order statistics are

By using the binomial expansion,

For the TGL distribution, the moments of order statistics are

4. Parameter Estimation

The MLEs of the parameters under complete and Type-II censored samples are investigated in this section. Approximate confidence intervals (ACIs) for unknown values are also calculated using the Fisher information matrix.

4.1. MLEs under Complete Sample

In the statistical literature, various approaches for parameter estimation have been given, with the MLEs method being the most extensively employed. We explore applying MLEs to estimate the parameters of the TGL distribution with a complete sample. If is a random sample of this distribution of size with a set of parameter vectors , then the log-likelihood function, say , may be stated aswhere . The partial differential equations are calculated as follows:

The nonlinear equations are numerically solved to determine ML estimators as , , , and using an iterative technique.

4.2. MLEs under Type-II Censored Sample

The MLEs of parameters for TGL distribution based on Type-II censored samples are investigated in this subsection. The Fisher information matrix for Type-II censored model is also used to calculate the approximate confidence intervals for the unknown parameters . Let be a random sample of size n, and we only look at the first k-th order statistics based on the Type-II censored sample. Likelihood function in this scenario is of the kindwhere is a constant and is the data that has been censored. The log-likelihood function is possibly written as follows without constant term from (15).where denotes the time of the k-th failure and denotes the time of the k-th failure. The MLEs are the solutions to the following five equations:

It is to be noted that equation (16) cannot be solved explicitly. To obtain the MLEs , a numerical approach is required, and a numerical technique is needed. We obtain the observed Fisher information matrix since its expectation requires numerical integration. The observed information or Hessian matrix is

The Fisher information matrix is given by the negative expected of second partial derivatives of (15) for the unknown parameters locally at given in (16). Under some regularity conditions, is approximately normal with mean and covariance matrix . Practically, we estimate by; then

The elements of the observed Hessian matrix are computed using an iterative numerical solution method. Now, the ACIs can be obtained as follows:where is the percentile of a standard normal distribution.

5. Bootstrap Confidence Interval

We create two parametric bootstrap confidence intervals (CI) in this section as follows.

5.1. Percentile Bootstrap (Boot-P)

(1)Compute the MLE of based on complete and censored samples(2)Generate a bootstrap sample using to obtain the bootstrap estimate of say using the bootstrap sample(3)Repeat step (2) times to get (4)Arrange in order of ascending as (5)A two-side Boot-P CI for the unknown parameters is set by

5.2. Bootstrap-t (Boot-t)
(1)The same steps as (1-2) in Boot-p(2)Compute the t-statistic of where asymptotic variances of and it can be obtained using the Fisher information matrix(3)Repeat steps 2-3 times and obtain (4)Arrange in ascending order as (5)A two-side Boot-t CI for the unknown parameters is given by

6. Simulation Study

In this section, we discuss the Monte Carlo simulation study to estimate point and interval estimation of parameters of TGL distribution based on complete sample sizes and Type-II censored samples. The simulation results are in Tables 25, and concluding remarks of simulation results are obtained in this section. A Monte Carlo simulation is an initial task for studying different properties of parameters of the TGL model based on different sample schemes; we can use these steps:(1)Generate random sample from a uniform distribution with intervals 0 and 1 and sample size n as 30, 50, and 100(2)Determine different actuals values of parameters of TGL distribution asCase 1: Case 2:  Case 3:  Case 4: (3)We use the inverse CDF method to transform the CDF in terms of u and get the sample of TGL distribution(4)Sort sample and select the first m failures as 20 and 25 where n = 30, 30, and 40 where n = 50 and 70 and 90 where n = 100(5)By using different programs as Mathcad, R-software, Mable, and Mathematica, we can obtain the results of the simulation(6)We use 10000 iterations in the summation generator(7)In point estimation, we obtain bias and the mean squared error (MSE) of parameters of TGL distribution(8)In intervals estimation, we obtain a length of CI for ACI denoted as L.ACI, length of percentile bootstrap CI can be denoted as L.BPCI, length of bootstrap-t CI can be denoted as L.BTCI, and coverage probability (CP) of ACI

7. Concluding Remarks of Simulation Results

Tables 25 show the simulation results of point and interval estimates of TGL distribution parameters using Type-II censored samples and entire sample sizes. Based on these Tables, the following concluding remakes are noticed.

(i)As sample size increases and fixed other values of the model, the various measures for the parameter of TGL distribution estimates decrease(ii)As the number of units to be failed increases and fixed other values of the model, the various measures for the parameter of TGL distribution Type-II censored samples estimates decrease(iii)The bootstrap is the shortest length of CI for interval estimation of parameter TGL distribution

8. Real Data Analysis

The relevance and potentiality of the TGL distribution are demonstrated in this section through the application of two real data sets.

8.1. Data Set (1): COVID-19 of France

The COVID-19 data in question is from France, and it covers a period of 108 days, from March 1 to June 16, 2021. This data was formed by using daily new deaths (ND), daily cumulative cases (CC), and daily cumulative deaths (CD) as follows:

The data are as follows: 0.0024 0.0025 0.0027 0.0045 0.0045 0.0062 0.0109 0.0113 0.0123 0.0123 0.0126 0.0129 0.0130 0.0139 0.0152 0.0160 0.0160 0.0161 0.0164 0.0169 0.0173 0.0174 0.0188 0.0219 0.0225 0.0226 0.0248 0.0260 0.0284 0.0303 0.0315 0.0318 0.0320 0.0323 0.0327 0.0329 0.0332 0.0343 0.0345 0.0346 0.0346 0.0347 0.0347 0.0352 0.0359 0.0365 0.0366 0.0370 0.0371 0.0376 0.0384 0.0392 0.0396 0.0419 0.0421 0.0443 0.0445 0.0462 0.0492 0.0506 0.0525 0.0536 0.0539 0.0540 0.0568 0.0569 0.0592 0.0600 0.0603 0.0606 0.0617 0.0619 0.0621 0.0629 0.0632 0.0657 0.0665 0.0675 0.0678 0.0683 0.0684 0.0691 0.0693 0.0707 0.0708 0.0719 0.0722 0.0727 0.0771 0.0773 0.0773 0.0774 0.0791 0.0832 0.0837 0.0845 0.0865 0.0897 0.0907 0.0910 0.0946 0.0961 0.0972 0.1010 0.1056 0.1123 0.1149 0.2153.

In Table 6, the TGL distribution is fitted to COVID-19 of France country. The TGL model is compared with other competitive models as Mead and Afify [16] proposed the Burr-XII model (KEBXII) with Kumaraswamy exponentiated, Weibull-Lomax (WL) distribution, Odds Exponential-Pareto IV (OEPIV) distribution proposed by Baharith et al. [17], Marshall–Olkin Alpha power Weibull (MOAPW) by Almetwally et al. [18], Marshall–Olkin Alpha power extended Weibull (MOAPEW) by Almetwally [19], Marshall–Olkin alpha power inverse Weibull (MOAPIW) by Basheer et al. [20], Marshall–Olkin alpha power Lomax (MOAPL) by Almongy et al. [21], and Gompertz Lomax (GOLOM) distribution by Oguntunde et al. [11]. According to this result, we note that the estimate of TGL has the best measure where it has the smallest value of Cramer-von Mises (), Anderson-Darling (), and Kolmogorov- Smirnov (KS) statistic along with its value. The fitted TGL, pdf, CDF, and PP-plot of the data set are displayed in Figure 3.

The COVID-19 data in question is from the United Kingdom and spans 82 days, from May 1 to July 16, 2021. This data is formed by using daily ND, daily CC, and daily CD as follows:

0.0023, 0.0023, 0.0023, 0.0046, 0.0065, 0.0067, 0.0069, 0.0069, 0.0091, 0.0093, 0.0093, 0.0093, 0.0111, 0.0115, 0.0116, 0.0116, 0.0119, 0.0133, 0.0136, 0.0138, 0.0138, 0.0159, 0.0161, 0.0162, 0.0162, 0.0162, 0.0163, 0.0180, 0.0187, 0.0202, 0.0207, 0.0208, 0.0225, 0.0230, 0.0230, 0.0239, 0.0245, 0.0251, 0.0255, 0.0255, 0.0271, 0.0275, 0.0295, 0.0297, 0.0300, 0.0302, 0.0312, 0.0314, 0.0326, 0.0346, 0.0349, 0.0350, 0.0355, 0.0379, 0.0384, 0.0394, 0.0394, 0.0412, 0.0419, 0.0425, 0.0461, 0.0464, 0.0468, 0.0471, 0.0495, 0.0501, 0.0521, 0.0571, 0.0588, 0.0597, 0.0628, 0.0679, 0.0685, 0.0715, 0.0766, 0.0780, 0.0942, 0.0960, 0.0988, 0.1223, 0.1343, and 0.1781.

In Table 7, the TGL distribution is fitted to COVID-19 of The United Kingdom country. The TGL model is compared with other competitive models as, KEBXII, WL, OEPIV, MOAPW, MOAPEW, and GOLOM distributions. According to this result, we note that the estimate of TGL has the best measure where it has the smallest value of , , and KS statistic along with its value. The fitted TGL, pdf, CDF, and PP-plot of the data set are displayed in Figure 4.

9. Conclusion

We investigate the so-called five-parameter transmuted generalized Lomax distribution in this study. Lomax and Gompertz Lomax (GoLom) distributions are included in the TGL model. The TGL distribution's structural properties are deduced. The maximum likelihood approach is used to estimate the population parameters based on complete and Type-II censored samples. We discussed the Monte Carlo simulation study to estimate point and interval estimation of parameters of TGL distribution based on complete sample sizes and Type-II censored samples. The proposed distribution was applied to two COVID-19 real-world data sets from France and United Kingdom. We compared a new transmuted generalization of the Lomax distribution (TGL) with KEBXII, WL, OEPIV, MOAPW, MOAPEW, and GOLOM distributions. It was shown to provide a better fit than several other models. We hope that the presented model will be used in a variety of fields, including engineering, survival and lifetime data, meteorology, biology, hydrology, economics (income disparity), and others.

Data Availability

All data used to support the findings of the study are available within the article.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research was funded by the Deanship of Scientific Research at Princess Nourah bint Abdulrahman University through the Fast-track Research Funding Program.