Next Article in Journal
Stone Duality for Kolmogorov Locally Small Spaces
Next Article in Special Issue
Editorial of the Special Issue “Skewed (Asymmetrical) Probability Distributions and Applications across Disciplines”
Previous Article in Journal
Research and Application of Improved Clustering Algorithm in Retail Customer Classification
Previous Article in Special Issue
Experimental Design for the Lifetime Performance Index of Weibull Products Based on the Progressive Type I Interval Censored Sample
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Probability Mass Function for Various Shapes of the Failure Rates, Asymmetric and Dispersed Data with Applications to Coronavirus and Kidney Dysmorphogenesis

1
Department of Mathematics, College of Science and Humanities in Al-Kharj, Prince Sattam bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia
2
Department of Mathematics, Faculty of Science, Mansoura University, Mansoura 35516, Egypt
3
Department of Statistics, Persian Gulf University, Bushehr 75169, Iran
*
Author to whom correspondence should be addressed.
Symmetry 2021, 13(10), 1790; https://doi.org/10.3390/sym13101790
Submission received: 28 July 2021 / Revised: 9 September 2021 / Accepted: 14 September 2021 / Published: 26 September 2021

Abstract

:
In this article, a discrete analogue of an extension to a two-parameter half-logistic model is proposed for modeling count data. The probability mass function of the new model can be expressed as a mixture representation of a geometric model. Some of its statistical properties, including hazard rate function, moments, moment generating function, conditional moments, stress-strength analysis, residual entropy, cumulative residual entropy and order statistics with its moments, are derived. It is found that the new distribution can be utilized to model positive skewed data, and it can be used for analyzing equi- and over-dispersed data. Furthermore, the hazard rate function can be either decreasing, increasing or bathtub. The parameter estimation through the classical point of view has been performed using the method of maximum likelihood. A detailed simulation study is carried out to examine the outcomes of the estimators. Finally, two distinctive real data sets are analyzed to prove the flexibility of the proposed discrete distribution.

1. Introduction

In probability theory, the half-logistic (HLo) distribution is a continuous probability (CPr) model for nonnegative-valued random variables (RVs). The HLo model is a random distribution reported by folding at zero the logistic (Lo) distribution centered around the origin. Due to the flexibility of the HLo model, several authors aimed to propose and study many extensions and generalizations for this model with its applications in various fields, for instance, Balakrishnan [1], Balakrishnan and Wong [2], Torabi and Bagheri [3], Olapade [4,5,6], Kantam et al. [7], Jose and Manoharan [8], Krishnarani [9], Oliveira et al. [10], Usman et al. [11], Cordeiro et al. [12], Muhammad and Liu [13], Samuel and Kehinde [14], Xavier and Jose [15], Eliwa et al. [16] and El-Morshedy et al. [17], among others. Among all these generalizations, we take in our consideration the HLo model with two parameters (see Hashempour, [18]). An RV X is said to have the continuous two-parameter half-logistic (HLo-II) distribution if its cumulative distribution function (CDF) is given as
G Y ( y ; δ , λ ) = 1 e δ y 1 + e λ y ; y 0 ,
where δ > 0 and λ > 0 .
In several cases, lifetimes need to be recorded on a discrete scale rather than on a continuous analogue. Thus, discretizing CPr distributions has received noticeable attention in recent years. See, for example, Pillai and Jayakumar [19], Kemp [20], Roy [21], Johnson et al. [22], Jazi et al. [23], Nekoukhou et al. [24], Bakouch et al. [25], Almalki and Nadarajah [26], Chakraborty and Chakravarty [27], Shanker and Fesshaye [28], Inusah and Kozubowski [29], Barbiero and Hitaj [30], Eliwa et al. [31,32], Nezampour and Hamedani [33], El-Morshedy et al. [34,35,36] and Eliwa and El-Morshedy [37], among others. Although there are a number of discrete models in the statistical literature, there is still a lot of space left to build a new discretized model that is proper under various conditions. In this article, we propose and study the discrete analogue of the HLo-II (DHLo-II) model. Some characteristics of the proposed distribution can be summarized as follows: its CDF and probability mass function (PMF) can be expressed as closed-forms; its hazard rate function (HRF) can be bathtub-, increasing- and decreasing-shaped; it can be utilized for modelling equi- and over-dispersion phenomena; and it provides the best fit for various types of data in several fields in spite of having only two parameters, especially for over-dispersion data.
The article is organized as follows. In Section 2, we introduce the DHLo-II model based on the survival discretization approach; for more detail around this technique see Roy and Ghosh [38] and Chakraborti et al. [39]. Different statistical properties are discussed in Section 3. In Section 4, the model parameters are estimated by using the maximum likelihood method. A simulation study is presented in Section 5. Two distinctive data sets are analyzed to show the flexibility of the DHLo-II distribution in Section 6. Finally, Section 7 provides some conclusions.

2. Synthesis of the DHLo-II Model

In this Section, the new discrete model can be generated by utilizing the survival discretization technique. Thus, the CDF of the DHLo-II distribution can be expressed as
F X ( x ; α , β ) = 1 α x + 1 1 + β x + 1 ; x = 0 , 1 , 2 , ,
where F ( 1 ; α , β ) = 0 , 0 < α < 1 and 0 < β < 1 . The corresponding PMF to Equation (2) can be listed as
Pr ( X = x ; α , β ) = ( 1 α ) α x + ( 1 β ) β x + ( β α ) ( α β ) x ( 1 + β x ) ( 1 + β x + 1 ) ; x = 0 , 1 , 2 , .
Using generalized binomial expansion, Equation (3) can be proposed as a mixture representation of geometric (Geo) model as follows
Pr ( X = x ; α , β ) = i , j = 0 ( 1 ) i + j β j ( β i + j ) x ( ( 1 α ) α x + ( 1 β ) β x + ( β α ) ( α β ) x ) = i , j = 0 ( 1 ) i + j β j ( 1 α ) ( α β i + j ) x + ( 1 β ) ( β i + j + 1 ) + ( β α ) ( α β i + j + 1 ) x = i , j = 0 w i , j ( 1 ) g ( x ; 1 α β i + j ) + i , j = 0 w i , j ( 2 ) g ( x ; 1 β i + j + 1 ) + i , j = 0 w i , j ( 3 ) g ( x ; 1 α β i + j + 1 ) ,
where
w i , j ( 1 ) = ( 1 ) i + j β j ( 1 α ) 1 α β i + j , w i , j ( 2 ) = ( 1 ) i + j β j ( 1 β ) 1 β i + j + 1 , w i , j ( 3 ) = ( 1 ) i + j β j ( 1 α ) 1 α β i + j + 1 ,
and g ( x ; ν ) = ν ( 1 ν ) x denote the PMF of Geo distribution with parameter ν . The HRF can be expressed as
H X ( x ; α , β ) = ( 1 α ) α x + ( 1 β ) β x + ( β α ) ( α β ) x 1 α x 1 + β x ( 1 + β x ) ( 1 + β x + 1 ) ( 1 α x ) ; x = 0 , 1 , 2 , ,
where H X ( x ; α , β ) = Pr ( X = x ; α , β ) 1 F X ( x 1 ; α , β ) . Figure 1 shows the PMF and HRF plots for various values of the DHLo-II parameters.
It is noted that the shape of the PMF is always unimodal. Further, the DHLo-II distribution can be used to model asymmetric data. Regarding the HRF, it is found that the proposed model has several shapes including bathtub, increasing and decreasing, which means this model can be utilized to analyze various types of data in different fields, especially in medicine, insurance and engineering.

3. Statistical Properties

3.1. Moments and Generating Functions

Assume X to be a DHLo-II RV, then the probability generating the function (PrGF) can be listed as
E ( s X ) = x = 0 s x Pr ( X = x ; α , β ) = i , j = 0 w i , j ( 1 ) 1 α β i + j 1 s α β i + j + i , j = 0 w i , j ( 2 ) 1 β i + j + 1 1 s β i + j + 1 + i , j = 0 w i , j ( 3 ) 1 α β i + j + 1 1 s α β i + j + 1 .
Onreplacing s by e s in Equation (6), the moment generating function (MGF) can be derived. Thus, the first moment of the DHLo-II distribution is
E ( X ) = x = 0 α x + 2 + β x + 2 1 + β x + 2 = x , i = 0 ( 1 ) i ( β x + 2 ) i ( α x + 2 + β x + 2 ) = i = 0 ( 1 ) i β 2 i + 1 1 β i + 1 + i = 0 α 2 ( 1 ) i β 2 i 1 ( α β ) i .
Similarly, the other moments can be derived. Based on the MGF, the mean, variance, index of dispersion (IOD), skewness and kurtosis can be listed in Table 1, Table 2, Table 3, Table 4 and Table 5 as numerical computations (NuCo).
From Table 1, Table 2, Table 3, Table 4 and Table 5 it is clear that: the mean, variance and IOD increase for constant values of β with α 1 ; the proposed model is appropriate only for modelling equi- and over-dispersed data, because the IOD always is greater than or equal one; and this distribution is capable of modeling positively skewed and leptokurtic data sets.

3.2. Conditional Moments

This section lists results of the conditional moments (CoMos) for the DHLo-II distribution. The CoMos can be utilized to derive the mean deviations, Bonferroni and Lorenz curves. The n th CoMo of the DHLo-II model under X n | X x and X n | X > x can be expressed as
E ( X n | X x ) = 1 F ( x ; α , β ) t = 0 x t n Pr ( X = t ; α , β ) = 1 F ( x ; α , β ) i , j = 0 [ w i , j ( 1 ) t = 0 x t n g ( t ; 1 α β i + j ) + w i , j ( 2 ) t = 0 x t n g ( t ; 1 β i + j + 1 ) + w i , j ( 3 ) t = 0 x t n g ( t ; 1 α β i + j + 1 ) ]
and
E ( X n | X > x ) = 1 1 F ( x + 1 ; α , β ) t = x + 1 t n Pr ( X = t ; α , β ) = 1 1 F ( x + 1 ; α , β ) i , j = 0 [ w i , j ( 1 ) t = x + 1 t n g ( t ; 1 α β i + j ) + w i , j ( 2 ) t = x + 1 t n g ( t ; 1 β i + j + 1 ) + w i , j ( 3 ) t = x + 1 t n g ( t ; 1 α β i + j + 1 ) ] ,
respectively. The mean residual life function is given by E ( X x | X > x ) = E ( X | X > x ) x = V F ( x ; α , β ) x , where V F ( x ; α , β ) is referred to as the vitality function of the distribution function F.

3.3. Stress-Strength Analysis

Stress-strength analysis has been utilized in mechanical component design. Let X 1 DHLo-II ( α 1 , β 1 ) and X 2 DHLo-II ( α 2 , β 2 ) be two independent RVs, then
Pr ( X 1 X 2 ) = x = 0 F X 1 ( x ; α 1 , β 1 ) Pr ( X 2 = x ; α 2 , β 2 ) = i , j , k = 0 ( 1 ) i + j + k β 2 k [ 1 α 2 1 α 2 β 1 i β 2 j + k + 1 β 2 1 β 1 i β 2 j + k + 1 + β 2 α 2 1 α 2 β 1 i β 2 j + k + 1 + α 2 1 1 α 1 α 2 β 1 i β 2 j + k + β 2 1 1 α 1 β 1 i β 2 j + k + 1 + α 2 β 2 1 α 1 α 2 β 1 i β 2 j + k + 1 ] .
It is noted that the value of stress-strength depends on the model parameters only.

3.4. Residual Entropy and Cumulative Residual Entropy

Residual entropy (RE) and cumulative residual entropy (CRE) are two important measures of information theory. The RE of the RV X is given by
E ( X ) = x = 0 F ( x ; α , β ) log ( F ( x ; α , β ) ) = k , j = 0 i = 0 k + 1 ( 1 ) k k 2 j k + 1 i k + 1 α i β 2 k + 1 i 1 α i β 2 k + 1 i α i + 1 β 2 k + 1 i 1 α i + 1 β 2 k + 1 i ,
whereas the CRE can be listed as
CE ( X ) = x = 0 F ¯ ( x ; α , β ) log ( F ¯ ( x ; α , β ) ) = k , j = 0 i = 0 k + 1 ( 1 ) k + i k 2 j k + 1 i k + 1 α i + 1 β j 1 α i + 1 β j + α i β j + 1 1 α i β j + 1 ,
where F ¯ ( x ; α , β ) represents the survival function of the distribution. The previous two equations can be derived by using geometric expansion and generalized binomial expansion (simple algebra).

3.5. Order Statistics

Order statistics (OrSt) play an important role in different fields of statistical theory. Suppose X 1 , X 2 , , X n to be a random sample (RS) from the DHLo-II, and let X 1 : n , X 2 : n ,…, X n : n be their corresponding OrSt. Then, the CDF of the ith OrSt X i : n for an integer value of x is proposed as
F i : n ( x ; α , β ) = j = 0 n k = 0 n j ( 1 ) k n j n j k F ( x ) j + k = j = 0 n k = 0 n j ( 1 ) k n j n j k ( 1 α x + 1 1 + β x + 1 ) j + k = j = 0 n k = 0 n j r = 0 l = 0 j + k Ω ( j , r , l ) ( n , k ) α l β r ( α l β r ) x ,
where
Ω ( j , r , l ) ( n , k ) = ( 1 ) k + l n j n j k j k r j + k l .
The PMF of the ith OrSt can be formulated as
Pr ( X i : n = x ; α , β ) = j = 0 n k = 0 n j r = 0 l = 0 j + k Ω ( j , r , l ) ( n , k ) α l β r ( 1 α l β r ) ( α l β r ) x 1 ,
where Pr ( X i : n = x ; α , β ) = F i : n ( x ; α , β ) F i : n ( x 1 ; α , β ) . The v th moments of X i : n can be proposed as
E ( X i : n v ) = x = 0 b = i n j = 0 n b ( 1 ) j n b n b j x v Pr i ( X = x ; α , β , b + j ) .
Based on Equation (14), L-moments can be listed as
Δ w = 1 w j = 0 w 1 ( 1 ) j w 1 j E X w j : w ,
which can be utilized to discuss some descriptive statistics.

4. Maximum Likelihood Estimation (MLE)

In this section, we determine the MLE of the DHLo-II parameters according to a complete sample. Assume X 1 , X 2 , , X n to be an RS of size n from the DHLo-II distribution. The log-likelihood function (L) can be listed as follows
L ( x ; α , β ) = i = 1 n log 1 α α x i + 1 β β x i + β α α β x i i = 1 n log 1 + β x i i = 1 n log 1 + β x i + 1 .
To estimate the model parameters α and β , the first partial derivatives L ( x ; α , β ) α and L ( x ; α , β ) β should be obtained, and then equating the resulted equations to zero “normal equations”. These two equations cannot be solved analytically. Thus, an iterative procedure such as Newton–Raphson is required to solve it numerically.

5. Simulation

In this section, we assess the performance of the maximum likelihood estimators (MLEs) with respect to sample size n using R software. The assessment is based on a simulation study: generate 10,000 samples of size n = 10 , 12 , 14 , , 60 from DHLo-II α = 0.5 β = 0.5 and DHLo-II α = 0.8 β = 0.3 , respectively; compute the MLEs for the 10,000 samples, say ϱ ^ l for l = 1 , 2 , , 10,000 ; and compute the biases and mean-squared errors (MSEs), where bias ( ϱ ) = 1 10 , 000 l = 1 10 , 000 ϱ j ^ ϱ and MSE ( ϱ ) = 1 10 , 000 l = 1 10 , 000 ϱ j ^ ϱ 2 . The empirical results are given in Figure 2 and Figure 3, respectively.
From Figure 2 and Figure 3, it is noted that the magnitude of bias and MSE always decrease to zero as n grows. This shows the consistency of the MLEs. We can say that the maximum likelihood approach works quite well in estimating the model parameters, and consequently, it can be used effectively for analyzing the count data.

6. Applications

In this section, we illustrate the importance and the flexibility of the DHLo-II distribution by utilizing data from different fields. We shall compare the fits of the DHLo-II distribution with some competitive models such as discrete inverse Weibull (DIW), discrete gamma Lindley (DGL), discrete Burr II (DB-II), discrete log-logistic (DLL), discrete inverse Rayleigh (DIR), discrete Burr-Hatke (DBH), discrete Lindley and discrete Pareto (DP). The fitted models are compared using some criteria, namely, the maximized log-likelihood (L), Akaike information criterion (Aic) and its corrected (Caic), Hannan–Quinn information criterion (Hqic), Bayesian information criterion (Bic), and Chi-square (Chi 2 ) test with its corresponding P-value (Pv).

6.1. Data set I: COVID-19 in Armenia

The data are listed in (https://www.worldometers.info/coronavirus/country/armenia/, accessed on 20 July 2021) and represent the daily new deaths in Armenia for COVID-19 from 15 February to 4 October 2020. The initial mass shape for these data are explored utilizing the nonparametric kernel mass estimation (Kme) technique in Figure 4, and it is observed that the mass is asymmetric function. The normality condition (Nc) is checked by the quantile-quantile (Qu-Qu) plot in Figure 4. The extreme observations (ExOb) are spotted from the box plot in Figure 4, and it is observed that some ExOb were listed.
The MLEs with their corresponding standard errors (Se), confidence intervals (CI) for the parameter(s) and goodness of fit tests for data set I are listed in Table 6 and Table 7.
The abbreviations “Of” and “Df” represent the observed frequency and degree of freedom, respectively. From Table 7, it is noted that the DGL distribution works quite well in addition to the DHLo-II distribution. However, the DHLo-II model is the best among all tested distributions. Figure 5 shows that the MLEs are unique because the L profiles have only unimodal shapes.
Figure 6 supports our empirical results where the DHLo-II is more fit to analyze these data, whereas Figure 7 shows the probability–probability (Pr–Pr) plot for data set I, which proves that the data set plausibly came from the DHLo-II distribution.
According to the MLEs, the empirical descriptive statistics (EDS) for mean, variance, IOD, skewness and kurtosis are 4.16215 , 23.66536 , 5.68584 , 2.10584 and 9.45986 , respectively. The data exhibit over-dispersion. Moreover, they are moderately skewed to the right and leptokurtic.

6.2. Data Set II: Kidney Dysmorphogenesis

This data set is taken from the study of Chan et al. [40]. Initial mass shape for the kidney data is explored using the nonparametric Kme approach in Figure 8, and it is noted that the mass is asymmetric and multimodal functions. The Nc is checked via the Qu-Qu plot in Figure 8. The ExOb are spotted from the box plot in Figure 8, and it is noted that some ExOb were reported.
Here, we examine the fitting capability of the DHLo-II distribution with some other competitive distributions. The MLEs, Se and CI for the parameter(s) as well as goodness of fit test for this data are reported in Table 8 and Table 9.
It is noted that the DIW, DB-II, DLL, DBH and DP distributions work quite well in addition to the DHLo-II distribution. However, the DHLo-II distribution is the best model among all tested models. Figure 9 shows that the MLEs are unique.
Figure 10 supports our empirical results where the DHLo-II is more fit to analyze data set II, whereas Figure 11 shows the Pr–Pr plot for the same data.
According to the MLEs, the EDS for mean, variance, IOD, skewness and kurtosis are 1.45414 , 5.87716 , 4.04167 , 2.33510 and 10.05033 , respectively. The data are over-dispersed, skewed to the right and leptokurtic.

7. Conclusions

In this paper, we proposed a flexible discrete probability model with two parameters, in the so-called discrete half-logistic (DHLo-II) distribution. Various statistical properties of the proposed model have been derived. It was found that the DHLo-II model is convenient for modelling skewed data sets, especially those which have very extreme observations. Furthermore, it can be used as a flexible model to analyze equi- and over-dispersed phenomena, especially in medicine, insurance and engineering fields. More advantages of the proposed model are that it provides a wide variation in the shape of the HRF, including decreasing, increasing and bathtub, and consequently this distribution can be used in modelling various kinds of data. The DHLo-II parameters have been estimated via the MLE approach. A simulation has been performed based on different sample sizes, and it was found that the MLE method works quite effectively in estimating the DHLo-II parameters due to the consistency property. Finally, two distinctive data sets “COVID-19 and kidney dysmorphogenesis” have been analyzed to illustrate the flexibility of the DHLo-II model. In our future work, the bivariate and multivariate extensions will be derived for the DHLo-II distribution with its applications in medicine and engineering fields.

Author Contributions

M.E.-M.: Data curation; Methodology; Resources; Software; Writing—review and editing; M.A.: Methodology; Resources; Validation; A.A.-B.: Writing—review and editing; Funding acquisition; Methodology; M.S.E.: Conceptualization; Formal analysis; Methodology; Software; Writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Balakrishnan, N. Order statistics from the half logistic distribution. J. Stat. Comput. Simul. 1985, 20, 287–309. [Google Scholar] [CrossRef]
  2. Balakrishnan, N.; Wong, K.H.T. Approximate MLEs for the location and scale parameters of the half-logistic distribution with Type-II right-censoring. IEEE Trans. Reliab. 1991, 40, 140–145. [Google Scholar] [CrossRef]
  3. Torabi, H.; Bagheri, F.L. Estimation of Parameters for an Extended Generalized Half Logistic Distribution Based on Complete and Censored Data. JIRSS 2010, 9, 171–195. [Google Scholar]
  4. Olapade, A.K. On Characterizations of the Half Logistic Distribution; InterStat: Johannesburg, South Africa, 2003; Issue 2. [Google Scholar]
  5. Olapade, A.K. On a four-parameter type I generalized half logistic distribution. Proc. Jangjeon Math. 2011, 2, 189–198. [Google Scholar]
  6. Olapade, A.K. The type I generalized half logistic distribution. J. Iran. Stat. Soc. 2014, 13, 69–82. [Google Scholar]
  7. Kantam, R.R.L.; Ramakrishna, V.; Ravikumar, M.S. Estimation and testing in type I generalized half logistic distribution. J. Mod. Appl. Stat. Methods 2013, 12, 22. [Google Scholar] [CrossRef]
  8. Jose, J.K.; Manoharan, M. Beta half-logistic distribution: A new probability model for lifetime data. J. Stat. Manag. Syst. 2016, 19, 587–604. [Google Scholar] [CrossRef]
  9. Krishnarani, S.D. On a power transformation of half-logistic distribution. J. Probab. Stat. 2016, 2016, 2084236. [Google Scholar] [CrossRef] [Green Version]
  10. Oliveira, J.; Santos, J.; Xavier, C.; Trindade, D.; Cordeiro, G.M. The McDonald half-logistic distribution: Theory and practice. Commun. Stat.-Theory Methods 2016, 45, 2005–2022. [Google Scholar] [CrossRef]
  11. Usman, R.M.; Haq, M.; Talib, J. Kumaraswamy half-logistic distribution: Properties and applications. J. Statist. Appl. Probab. 2017, 6, 597–609. [Google Scholar] [CrossRef]
  12. Cordeiro, G.M.; de Andrade, T.A.; Bourguignon, M.; Gomes-Silva, F. The exponentiated generalized standardized half-logistic distribution. Int. J. Stat. Probab. 2017, 6, 24–42. [Google Scholar] [CrossRef] [Green Version]
  13. Muhammad, M.; Liu, L. A new extension of the generalized half logistic distribution with applications to real data. Entropy 2019, 21, 339. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Samuel, A.F.; Kehinde, O.A. A study on transmuted half logistic distribution: Properties and application. Int. Stat. Distrib. Appl. 2019, 5, 54. [Google Scholar]
  15. Xavier, T.; Jose, J.K. A study of stress-strength reliability using a generalization of power transformed half-logistic distribution. Commun. Stat.-Theory Methods 2020, 50, 4335–4351. [Google Scholar] [CrossRef]
  16. Eliwa, M.S.; Alshammari, F.S.S.; Abualnaja, K.M.; El-Morshedy, M. A flexible extension to an extreme distribution. Symmetry 2021, 13, 745. [Google Scholar] [CrossRef]
  17. El-Morshedy, E.M.; Alshammari, F.S.; Tyagi, A.; Elbatal, I.; Hamed, Y.S.; Eliwa, M.S. Bayesian and frequentist inferences on a type I half-logistic odd Weibull generator with applications in engineering. Entropy 2021, 23, 446. [Google Scholar] [CrossRef]
  18. Hashempour, M. A new two-parameter lifetime distribution with flexible hazard rate function: Properties, applications and different method of estimations. Math. Slovaca 2021, 71, 983–1004. [Google Scholar] [CrossRef]
  19. Pillai, R.N.; Jayakumar, K. Discrete Mittag–Leffler distributions. Stat. Probab. Lett. 1995, 23, 271–274. [Google Scholar] [CrossRef]
  20. Kemp, A.W. Characterizations of a discrete normal distribution. J. Stat. Plan. Inference 1997, 63, 223–229. [Google Scholar] [CrossRef]
  21. Roy, D. Discrete Rayleigh distribution. IEEE Trans. Reliab. 2004, 53, 255–260. [Google Scholar] [CrossRef]
  22. Johnson, N.L.; Kemp, A.W.; Kotz, S. Univariate Discrete Distributions; John Wiley & Sons: Hoboken, NJ, USA, 2005; Volume 444. [Google Scholar]
  23. Jazi, M.A.; Lai, C.D.; Alamatsaz, M.H. A discrete inverse Weibull distribution and estimation of its parameters. Stat. Methodol. 2010, 7, 121–132. [Google Scholar] [CrossRef]
  24. Nekoukhou, V.; Alamatsaz, M.H.; Bidram, H. Discrete generalized exponential distribution of a second type. Statistics 2013, 47, 876–887. [Google Scholar] [CrossRef]
  25. Bakouch, H.S.; Jazi, M.A.; Nadarajah, S. A new discrete distribution. Statistics 2014, 48, 200–240. [Google Scholar] [CrossRef]
  26. Almalki, S.J.; Nadarajah, S. A new discrete modified Weibull distribution. IEEE Trans. Reliab. 2014, 63, 68–80. [Google Scholar] [CrossRef]
  27. Chakraborty, S.; Chakravarty, D. A new discrete probability distribution with integer support on (-,). Commun. Stat.-Theory Methods 2016, 45, 492–505. [Google Scholar] [CrossRef]
  28. Shanker, R.; Fesshaye, H. On discrete Poisson-Shanker distribution and its applications. Biom. Biostat. J. 2017, 5, 00121. [Google Scholar] [CrossRef] [Green Version]
  29. Inusah, S.; Kozubowski, T.J. A discrete analogue of the Laplace distribution. J. Stat. Plan. Inference 2006, 136, 1090–1102. [Google Scholar] [CrossRef]
  30. Barbiero, A.; Hitaj, A. A discrete analogue of the half-logistic distribution. In Proceedings of the 2020 International Conference on Decision Aid Sciences and Application (DASA), Sakheer, Bahrain, 8–9 November 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 64–67. [Google Scholar]
  31. Eliwa, M.S.; Alhussain, Z.A.; El-Morshedy, M. Discrete Gompertz-G family of distributions for over-and under-dispersed data with properties, estimation, and applications. Mathematics 2020, 8, 358. [Google Scholar] [CrossRef] [Green Version]
  32. Eliwa, M.S.; Tyagi, A.; Alshammari, F.S.; El-Morshedy, M. Modelling coronavirus and larvae Pyrausta data: A discrete binomial exponential II distribution with properties, classical and Bayesian estimation. J. Appl. Stat. 2021. [Google Scholar]
  33. Nezampour, S.; Hamedani, G.G. Characterizations of three 2020 introduced discrete distributions. Pak. J. Stat. Oper. Res. 2020, 16, 609–616. [Google Scholar] [CrossRef]
  34. El-Morshedy, M.; Altun, E.; Eliwa, M.S. A new statistical approach to model the counts of novel coronavirus cases. Math. Sci. 2021, 1–14. [Google Scholar]
  35. El-Morshedy, M.; Eliwa, M.S.; Tyagi, A. A discrete analogue of odd Weibull-G family of distributions: Properties, classical and Bayesian estimation with applications to count data. J. Appl. Stat. 2021, 1–25. [Google Scholar] [CrossRef]
  36. El-Morshedy, M.; Eliwa, M.S.; Nagy, H. A new two-parameter exponentiated discrete Lindley distribution: Properties, estimation and applications. J. Appl. Stat. 2020, 47, 354–375. [Google Scholar] [CrossRef]
  37. Eliwa, M.S.; El-Morshedy, M. A one-parameter discrete distribution for over-dispersed data: Statistical and reliability properties with applications. J. Appl. Stat. 2021. [Google Scholar] [CrossRef]
  38. Roy, D.; Ghosh, T. A new discretization approach with application in reliability estimation. IEEE Trans. Reliab. 2009, 58, 456–461. [Google Scholar] [CrossRef]
  39. Chakraborti, S.; Jardim, F.; Epprecht, E. Higher-order moments using the survival function: The alternative expectation formula. Am. Stat. 2018, 73. [Google Scholar] [CrossRef]
  40. Chan, S.K.; Riley, P.R.; Price, K.L.; McElduff, F.; Winyard, P.J.; Welham, S.J.; Woolf, A.S.; Long, D.A. Corticosteroid-induced kidney dysmorphogenesis is associated with deregulated expression of known cystogenic molecules, as well as Indian hedgehog. Am. J. Physiol.-Ren. Physiol. 2010, 298, F346–F356. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. The PMF and HRF plots.
Figure 1. The PMF and HRF plots.
Symmetry 13 01790 g001
Figure 2. The bias and MSE for the DHLo-II α = 0.5 β = 0.5 .
Figure 2. The bias and MSE for the DHLo-II α = 0.5 β = 0.5 .
Symmetry 13 01790 g002
Figure 3. The bias and MSE for the DHLo-II α = 0.8 β = 0.3 .
Figure 3. The bias and MSE for the DHLo-II α = 0.8 β = 0.3 .
Symmetry 13 01790 g003
Figure 4. The Kme, Qu-Qu, and box plots for data set I.
Figure 4. The Kme, Qu-Qu, and box plots for data set I.
Symmetry 13 01790 g004
Figure 5. The L profiles for the DHLo-II parameters based on data set I.
Figure 5. The L profiles for the DHLo-II parameters based on data set I.
Symmetry 13 01790 g005
Figure 6. The fitted PMFs for data set I.
Figure 6. The fitted PMFs for data set I.
Symmetry 13 01790 g006
Figure 7. The Pr–Pr plots for data set I.
Figure 7. The Pr–Pr plots for data set I.
Symmetry 13 01790 g007
Figure 8. The Kme, Qu-Qu and box plots for data set II.
Figure 8. The Kme, Qu-Qu and box plots for data set II.
Symmetry 13 01790 g008
Figure 9. The L profiles for the DHLo-II parameters based on dataset II.
Figure 9. The L profiles for the DHLo-II parameters based on dataset II.
Symmetry 13 01790 g009
Figure 10. The estimated PMFs for data set II.
Figure 10. The estimated PMFs for data set II.
Symmetry 13 01790 g010
Figure 11. The Pr-Pr plots for data set II.
Figure 11. The Pr-Pr plots for data set II.
Symmetry 13 01790 g011
Table 1. Some NuCo of the DHLo-II distribution for β = 0.001.
Table 1. Some NuCo of the DHLo-II distribution for β = 0.001.
α
Measure 0.001 0.01 0.1 0.3 0.5 0.7 0.9
Mean 0.00200 0.01109 0.11201 0.42927 1.00050 2.33363 9.00010
Variance 0.00200 0.01117 0.12415 0.61234 1.99950 7.77667 89.9982
IOD 1.00000 1.00748 1.10844 1.42647 1.99850 3.33243 9.99969
Skewness 22.3608 9.60244 3.45601 2.37058 2.12123 2.03213 2.00282
Kurtosis 503.015 96.5834 16.9092 10.6215 9.50161 9.13014 9.01133
Table 2. Some NuCo of the DHLo-II distribution for β = 0.01.
Table 2. Some NuCo of the DHLo-II distribution for β = 0.01.
α
Measure 0.001 0.01 0.1 0.3 0.5 0.7 0.9
Mean 0.01099 0.02000 0.12012 0.43559 1.00502 2.33635 9.00100
Variance 0.01107 0.02001 0.13058 0.61338 1.99510 7.76679 89.9828
IOD 1.00775 1.00039 1.08711 1.40815 1.98512 3.32432 9.99697
Skewness 9.64876 7.07593 3.26866 2.34450 2.12024 2.03437 2.00328
Kurtosis 97.5325 53.1472 15.3539 10.5121 9.51522 9.14427 9.01338
Table 3. Some NuCo of the DHLo-II distribution for β = 0.1.
Table 3. Some NuCo of the DHLo-II distribution for β = 0.1.
α
Measure 0.001 0.01 0.1 0.3 0.5 0.7 0.9
Mean 0.10283 0.11111 0.20384 0.50229 1.05385 2.36639 9.01128
Variance 0.11674 0.12345 0.21126 0.63993 1.96223 7.66870 89.8131
IOD 1.13534 1.11111 1.03643 1.27401 1.86195 3.24066 9.96674
Skewness 3.69573 3.47850 2.36398 2.07340 2.08971 2.05342 2.00819
Kurtosis 19.0106 17.1000 9.21101 9.18706 9.53455 9.27863 9.19153
Table 4. Some NuCo of the DHLo-II distribution for β = 0.3.
Table 4. Some NuCo of the DHLo-II distribution for β = 0.3.
α
Measure 0.001 0.01 0.1 0.3 0.5 0.7 0.9
Mean 0.35189 0.35891 0.43830 0.70225 1.21122 2.47100 9.05015
Variance 0.57726 0.57948 0.61832 0.90749 2.03517 7.47027 89.2348
IOD 1.64044 1.61455 1.41070 1.29225 1.68026 3.02317 9.86003
Skewness 2.76593 2.72830 2.32845 1.70329 1.84145 2.05212 2.02320
Kurtosis 12.7583 12.5653 10.3780 6.97051 8.37013 9.43783 9.11141
Table 5. Some NuCo of the DHLo-II distribution for β = 0.5.
Table 5. Some NuCo of the DHLo-II distribution for β = 0.5.
α
Measure 0.001 0.01 0.1 0.3 0.5 0.7 0.9
Mean 0.76516 0.77124 0.84016 1.07150 1.52899 2.70637 9.15007
Variance 1.99106 1.98797 1.96588 2.05052 2.81392 7.57726 88.0408
IOD 2.60213 2.57760 2.33988 1.91367 1.84036 2.79978 9.62186
Skewness 2.48246 2.47637 2.39012 1.99815 1.58944 1.87366 2.04677
Kurtosis 10.9062 10.8970 10.6659 8.86802 6.69554 8.67895 8.23654
Table 6. The MLEs, Se and CI for data set I.
Table 6. The MLEs, Se and CI for data set I.
Parameter → α β
ModelMLESeCIMLESeCI
DHLo-II 0.620 0.048 [ 0.527, 0.714] 0.815 0.014 [ 0.788, 0.842]
DIW 0.201 0.026 [ 0.149, 0.252] 0.958 0.060 [ 0.839, 1.076]
DGL 0.784 0.037 [ 0.712, 0.855] 0.228 0.089 [ 0.053, 0.404]
DB-II 0.643 0.034 [ 0.576, 0.711] 1.811 0.210 [ 1.399, 2.223]
DLL 2.871 0.2426 [ 2.395, 3.346] 1.388 0.086 [ 1.219, 1.557]
DIR 0.112 0.019 [ 0.075, 0.149]
DBH 0.976 0.01136 [ 0.953, 0.998]
DL 0.692 0.012 [ 0.668, 0.716]
DP 0.493 0.0229 [ 0.448, 0.538]
Table 7. The goodness-of-fit test for data set I.
Table 7. The goodness-of-fit test for data set I.
Expected Frequency
XOfDHLo-IIDIWDGLDB-IIDLLDIRDBHDLDP
0 56 48.53 46.60 43.85 61.12 43.59 25.96 118.83 28.24 89.88
1 31 37.23 54.93 35.74 51.49 43.91 108.22 39.57 32.85 35.42
2 22 28.83 30.93 29.08 28.24 32.05 47.70 19.75 31.94 19.64
3 25 22.55 19.14 23.63 17.09 22.69 20.44 11.83 28.47 12.70
4 11 17.83 12.94 19.18 11.39 16.34 10.22 7.865 24.12 8.99
5 14 14.23 9.31 15.55 8.15 12.08 5.76 5.59 19.74 6.75
6 14 11.46 7.02 12.59 6.12 9.15 3.55 4.18 15.77 5.28
7 10 9.29 5.49 10.18 4.78 7.12 2.34 3.24 12.38 4.26
8 11 7.57 4.41 8.23 3.84 5.65 1.62 2.58 9.58 3.52
9 3 6.19 3.62 6.64 3.16 4.55 1.17 2.10 7.33 2.97
10 10 5.08 3.02 5.36 2.65 3.74 0.86 1.74 5.56 2.54
11 7 4.17 2.57 4.32 2.26 3.13 0.66 1.47 4.18 2.20
12 4 3.43 2.20 3.48 1.94 2.62 0.51 1.25 3.13 1.93
13 5 2.82 1.91 2.80 1.69 2.23 0.41 1.07 2.32 1.71
14 2 2.32 1.68 2.25 1.49 1.92 0.33 0.93 1.72 1.52
15 2 1.90 1.49 1.81 1.33 1.68 0.27 0.82 1.27 1.37
16 6 8.57 24.74 7.31 25.26 19.55 1.98 9.19 3.40 31.32
Total 232 232 232 232 232 232 232 232 232 232
L 590.63 625.48 590.86 629.89 609.58 719.92 657.92 604.57 644.98
Aic 1185.26 1254.96 1185.72 1263.77 1223.16 1441.84 1317.85 1211.13 1291.96
Caic 1185.31 1255.01 1185.77 1263.83 1223.22 1441.86 1317.87 1211.15 1291.98
Bic 1192.15 1261.85 1192.61 1270.67 1230.06 1445.29 1321.29 1214.58 1295.41
Hqic 1188.04 1257.74 1188.49 1266.55 1225.94 1443.23 1319.24 1212.52 1293.35
Chi 2 18.22 75.53 19.03 82.63 39.97 397.61 185.04 53.71 113.54
Df 1191181068119
Pv 0.08 0.001 0.06 0.001 0.001 0.001 0.001 0.001 0.001
Table 8. The MLEs, Se and CI for data set II.
Table 8. The MLEs, Se and CI for data set II.
Parameter ⟶ α β
ModelMLESeCIMLESeCI
DHLo-II 0.052 0.056 [ 0 , 0.162] 0.659 0.035 [ 0.589, 0.728]
DIW 0.581 0.048 [ 0.488, 0.675] 1.049 0.146 [ 0.763, 1.335]
DGL 0.582 0.045 [ 0.493 , 0.671] 0.351 0.0654 [ 0.223, 0.479]
DB-II 0.278 0.045 [ 0.189, 0.366] 1.053 0.167 [ 0.725, 1.381]
DLL 0.780 0.136 [ 0.514, 1.046] 1.208 0.159 [ 0.895, 1.520]
DIR 0.554 0.049 [ 0.458, 0.649]
DBH 0.874 0.041 [ 0.794, 0.954]
DL 0.436 0.026 [ 0.385, 0.488]
DP 0.268 0.034 [ 0.201 , 0.336]
Table 9. The goodness-of-fit test for data set II.
Table 9. The goodness-of-fit test for data set II.
Expected Frequency
X OfDHLo-IIDIWDGLDB-IIDLLDIRDBHDLDP
0 64 62.88 63.91 46.01 64.74 63.19 60.89 61.94 40.29 65.84
1 14 13.64 20.69 26.76 19.18 20.10 33.99 20.06 29.83 18.27
2 10 9.03 8.05 15.57 8.48 8.64 8.12 9.65 18.36 8.16
3 6 7.03 4.23 9.06 4.63 4.66 3.00 5.52 10.34 4.51
4 4 5.29 2.59 5.27 2.86 2.86 1.42 3.49 5.52 2.82
5 2 3.83 1.75 3.07 1.92 1.92 0.78 2.35 2.85 1.91
6 2 2.69 1.26 1.78 1.37 1.37 0.47 1.65 1.44 1.37
7 2 1.85 0.95 1.04 1.01 1.02 0.31 1.19 0.71 1.02
8 1 1.25 0.74 0.60 0.78 0.79 0.21 0.89 0.35 0.79
9 1 0.84 0.59 0.35 0.61 0.62 0.15 0.67 0.17 0.63
10 1 0.56 0.49 0.20 0.49 0.50 0.11 0.52 0.08 0.51
11 2 1.11 4.75 0.29 3.93 4.33 0.55 2.07 0.06 4.17
Total 110 110 110 110 110 110 110 110 110 110
L 167.52 172.94 178.77 171.14 171.72 186.55 169.89 189.11 171.19
Aic 339.03 349.87 361.53 346.28 347.43 375.09 341.78 380.22 344.38
Caic 339.15 349.98 361.65 346.39 347.55 375.13 341.82 380.26 344.42
Bic 344.44 355.28 366.94 351.68 352.84 377.80 344.48 382.92 347.08
Hqic 341.22 352.06 363.72 348.47 349.62 376.19 342.88 381.32 345.48
Chi 2 1.97 6.45 19.09 2.59 4.03 40.46 2.61 34.64 3.43
Df 433232444
Pv 0.74 0.09 0.001 0.27 0.26 0.001 0.63 0.001 0.49
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

El-Morshedy, M.; Alizadeh, M.; Al-Bossly, A.; Eliwa, M.S. A Probability Mass Function for Various Shapes of the Failure Rates, Asymmetric and Dispersed Data with Applications to Coronavirus and Kidney Dysmorphogenesis. Symmetry 2021, 13, 1790. https://doi.org/10.3390/sym13101790

AMA Style

El-Morshedy M, Alizadeh M, Al-Bossly A, Eliwa MS. A Probability Mass Function for Various Shapes of the Failure Rates, Asymmetric and Dispersed Data with Applications to Coronavirus and Kidney Dysmorphogenesis. Symmetry. 2021; 13(10):1790. https://doi.org/10.3390/sym13101790

Chicago/Turabian Style

El-Morshedy, Mahmoud, Morad Alizadeh, Afrah Al-Bossly, and Mohamed S. Eliwa. 2021. "A Probability Mass Function for Various Shapes of the Failure Rates, Asymmetric and Dispersed Data with Applications to Coronavirus and Kidney Dysmorphogenesis" Symmetry 13, no. 10: 1790. https://doi.org/10.3390/sym13101790

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop