Meta-analysis with zero-event studies: a comparative study with application to COVID-19 data

Abstract

Background

Meta-analysis is a statistical method for synthesizing evidence from a number of independent studies, including clinical studies with binary outcomes. In practice, when there are zero events in one or both groups of a study, statistical problems may arise in the subsequent analysis.

Methods

In this paper, taking the relative risk as the effect size, we conduct a comparative study of four continuity correction methods and a state-of-the-art approach without the continuity correction, namely the generalized linear mixed models (GLMMs). To further advance the literature, we also introduce a new continuity correction method for estimating the relative risk.

Results

From the simulation studies, the new method performs well in terms of mean squared error when there are few studies. In contrast, the generalized linear mixed model performs the best when the number of studies is large. In addition, by reanalyzing recent coronavirus disease 2019 (COVID-19) data, it is evident that the double-zero-event studies impact the estimate of the mean effect size.

Conclusions

We recommend the new method to handle the zero-event studies when there are few studies in a meta-analysis, or instead use the GLMM when the number of studies is large. The double-zero-event studies may be informative, and so we suggest not excluding them.

Background

Meta-analysis is a statistical method to synthesize evidence from a number of independent studies that address the same scientific question [1, 2]. In clinical studies, experimental data are commonly composed of binary outcomes, and consequently, meta-analyses of binary data have attracted increasing attention in evidence-based medicine [3, 4]. For each study, an effect size is reported to quantify the treatment effect by comparing the event probabilities between the treatment group and the control group, with common choices including the odds ratio (OR), the relative risk (RR), and the risk difference (RD). In meta-analysis, when the study-specific effect size is estimated from a two-by-two contingency table, zero events in one or both groups frequently occur, which may cause computational complications in the statistical inference of the effect size. If a study involves zero events in one group, we refer to it as a single-zero-event study; and if a study involves zero events in both groups, we refer to it as a double-zero-event study [5]. Vandermeer et al. [6] and Kuss [7] applied random sampling techniques and found that 30% of meta-analyses from the 500 sampled Cochrane reviews included one or more single-zero-event studies, while 34% of the reviews involved at least one meta-analysis with a double-zero-event study.

As a recent example, Chu et al. [8] conducted several meta-analyses to evaluate the effectiveness of physical distancing, face masks, and eye protection against the spread of three coronaviruses, which cause severe acute respiratory syndrome (SARS), Middle East respiratory syndrome (MERS) or coronavirus disease 2019 (COVID-19) [9, 10]. Specifically, they considered RR as the effect size and applied the random-effects model to pool the observed effect sizes with an inverse-variance weight assigned to each study [11, 12]. For their meta-analysis on physical distancing, they concluded that the risk of infection would be significantly decreased with greater physical distance. We note, however, that there are 8 single-zero-event studies and 7 double-zero-event studies among the total of 32 studies. In particular, among the 7 studies on COVID-19, 4 are single-zero-event studies and 2 are double-zero-event studies. To avoid the zero-event problem, Chu et al. [8] excluded the double-zero-event studies from their meta-analyses, which, however, may introduce an estimation bias to the overall effect size [7]. More recently, Xu et al. [13] revisited 442 meta-analyses with or without the double-zero-event studies and, by a comparative study, concluded that the double-zero-event studies do contain valuable information and should not be excluded from the meta-analysis.

Inspired by the aforementioned examples, we provide a selective review of the existing methods for meta-analysis that can handle the zero-event studies. For ease of presentation, we mainly focus on the random-effects model with RR as the effect size, whereas the same comparison also applies to OR and RD. For more details on meta-analysis of OR and RD with the zero-event studies, one may refer to [7] and the references therein, in which the author discussed the methods applicable to all three effect sizes as well as some methods applicable to only one of them. For a given study, we let n1 be the number of samples in the treatment group with X1 being the number of events, and n2 be the number of samples in the control group with X2 being the number of events. We let X1 follow a binomial distribution with parameters n1 and p1>0, and X2 follow a binomial distribution with parameters n2 and p2>0, and further assume that X1 and X2 are independent of each other. Then to estimate RR=p1/p2, the maximum likelihood estimator is

$$\begin{array}{@{}rcl@{}} \widehat {\text{RR}} = {X_{1}/n_{1} \over X_{2}/n_{2}} = \frac{X_{1}n_{2}}{X_{2}n_{1}} \end{array} $$
(1)

Note that \(\widehat{\text{RR}}\) is often right-skewed. To derive statistical inference on RR, researchers frequently work on the log scale so that the resulting estimator is closer to normally distributed. Specifically, by Agresti [14], the approximate variance of \(\text{ln}\left(\widehat{\text{RR}}\right)\) is

$$\begin{array}{@{}rcl@{}} \text{var}\left[\text{ln}\left(\widehat{\text{RR}}\right)\right]\approx \frac{1}{X_{1}} - \frac{1}{n_{1}} + \frac{1}{X_{2}} - \frac{1}{n_{2}} \end{array} $$
(2)

By (1) and (2), when there are zero events in one or both groups, the classic method for estimating RR suffers from the zero-event problem and will no longer be applicable.
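To make formulas (1) and (2) concrete, the following R snippet is a minimal illustration with hypothetical counts (not taken from any study in the paper); it computes the point estimate and the Wald-type 95% CI on the log scale, and shows where a zero cell breaks the calculation.

```r
# Hypothetical 2x2 counts: 5/20 events in the treatment group, 10/25 in the control group
x1 <- 5; n1 <- 20
x2 <- 10; n2 <- 25

rr_hat <- (x1 / n1) / (x2 / n2)                      # formula (1)
se_log <- sqrt(1/x1 - 1/n1 + 1/x2 - 1/n2)            # square root of formula (2)
ci     <- exp(log(rr_hat) + c(-1, 1) * 1.96 * se_log)

rr_hat; ci
# Setting x1 or x2 to 0 breaks the estimate and its standard error,
# which is the zero-event problem discussed below.
```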

To obtain a valid estimate of RR, a common recommendation, originating from Haldane [15], is to add 0.5 to the counts of events and non-events when some count is zero [16, 17]. This method is referred to as a continuity correction and has been extensively used in meta-analysis to deal with the zero-event studies. For further developments on the continuity correction, one may refer to Sweeting et al. [18], Carter et al. [19], and the references therein. On the other hand, there are also statistical models that handle meta-analysis with the zero-event studies without the continuity correction, such as the generalized linear mixed models [4, 20, 21].

The remainder of this paper is organized as follows. In the “Methods with the continuity correction” section, we first review the random-effects model and the existing methods with the continuity correction, and then propose a new continuity correction method for estimating RR. In “The generalized linear mixed models” section, we review the generalized linear mixed models for meta-analysis. In the “Simulation studies” section, we conduct simulation studies to evaluate the performance of the reviewed methods and our new method. In the “Application to COVID-19 data” section, we apply all the well-performing methods to a recent meta-analysis on COVID-19 data for further evaluation of their performance. We then conclude the paper in the “Discussion” and “Conclusions” sections with some interesting findings, and provide the supplementary materials in the Appendix.

Methods

Methods with the continuity correction

Suppose that there are k studies in the meta-analysis, and yi for \(i=1, \dots,k\) are the observed effect sizes for each study. By DerSimonian and Laird [22], the random-effects model can be expressed as

$$\begin{array}{@{}rcl@{}} y_{i} = \theta +\zeta_{i} + \epsilon_{i} \end{array} $$
(3)

where θ is the mean effect size, ζi are the deviations of each study from θ, and εi are the sampling errors. We further assume that ζi are independent and identically distributed random variables from N(0,τ2),εi are independent random errors from \(N(0,\sigma _{i}^{2})\), and that they are independent of each other. In addition, τ2 is referred to as the between-study variance, and \(\sigma _{i}^{2}\) are referred to as the within-study variances.

For the random-effects model in (3), by the inverse-variance method the mean effect size θ can be estimated by

$$\begin{array}{@{}rcl@{}} \hat{\theta} = \frac{\sum_{i} w^{*}_{i} y_{i}}{\sum_{i} w^{*}_{i}} \end{array} $$
(4)

where \(w^{*}_{i} = 1/\left (\sigma _{i}^{2}+ \tau ^{2}\right)\) are the weights assigned to each individual study [23]. In meta-analysis, the within-study variances \(\sigma _{i}^{2}\) are routinely estimated by the variances of the observed effect sizes, denoted by var(yi). While for the between-study variance, DerSimonian and Laird [22] proposed the method of moments estimator as

$$\begin{array}{@{}rcl@{}} T^{2} = \frac{Q-k+1}{C} \end{array} $$
(5)

where \(Q = \sum _{i} w_{i} \left (y_{i} - \sum _{i} w_{i} y_{i} / \sum _{i} w_{i}\right)^{2}\) is known as the Q statistic, and \(C = \sum _{i} w_{i} - \sum _{i} w_{i}^{2} / \sum _{i} w_{i}\) with \(w_{i} = 1/\sigma _{i}^{2}\) for \(i=1,\dots,k\).
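As a concrete companion to (3)-(5), the following R sketch is our own minimal implementation (not the authors' code); it pools hypothetical observed ln(RR) values y with within-study variances v by the inverse-variance method with the DerSimonian-Laird estimate of the between-study variance.

```r
# y: observed ln(RR) values; v: within-study variances (both hypothetical inputs)
dl_pool <- function(y, v) {
  k     <- length(y)
  w     <- 1 / v                                   # fixed-effect weights w_i = 1/sigma_i^2
  Q     <- sum(w * (y - sum(w * y) / sum(w))^2)    # Q statistic
  C     <- sum(w) - sum(w^2) / sum(w)
  tau2  <- max(0, (Q - (k - 1)) / C)               # formula (5), truncated at 0 as is customary
  wstar <- 1 / (v + tau2)                          # random-effects weights w_i^*
  theta <- sum(wstar * y) / sum(wstar)             # formula (4)
  se    <- sqrt(1 / sum(wstar))
  c(theta = theta, lower = theta - 1.96 * se, upper = theta + 1.96 * se, tau2 = tau2)
}

dl_pool(y = c(-0.4, 0.1, -0.7), v = c(0.20, 0.15, 0.30))
```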

We note, however, that the random-effects model may suffer from the zero-event problem. Taking RR as an example, if we apply the random-effects model for meta-analysis, then the effect sizes yi are the observed ln(RR) values. Now for estimating ln(RR), if we plug in \(\widehat {\text {RR}}\) from formula (1) directly, then ln\(\left (\widehat {\text {RR}}\right)\) will not be well defined when a study involves zero events, and neither will the variance estimate of \(\text {ln}\left (\widehat {\text {RR}}\right)\) in formula (2). Consequently, without a valid estimate of the effect size and of its within-study variance, the random-effects model cannot be applied to estimate the mean effect size by the inverse-variance method. This shows that a correction on \(\widehat {\text {RR}}\) is often desired in meta-analysis when some studies involve zero events.

Existing methods with the continuity correction

Let c1>0 and c2>0 be two values for the continuity correction. To overcome the zero-event problem, one common approach is to estimate p1 by (X1+c1)/(n1+2c1) and estimate p2 by (X2+c2)/(n2+2c2). Plugging them into (1) and (2), we have

$$\begin{array}{@{}rcl@{}} \widetilde{\text{RR}}\left(c_{1},c_{2}\right) = {X_{1}+c_{1} \over n_{1}+2c_{1}}\cdot {n_{2}+2c_{2} \over X_{2}+c_{2}} \end{array} $$
(6)

Accordingly, the 95% confidence interval (CI) of RR is

$$ \begin{aligned} \text{exp} \left\{{\text{ln}\left(\widetilde{\text{RR}}\left(c_{1},c_{2}\right)\right)} \!\pm\! 1.96\sqrt{ \frac{1}{X_{1}+c_{1}} - \frac{1}{n_{1}+2c_{1}} + \frac{1}{X_{2}+c_{2}} - \frac{1}{n_{2}+2c_{2}}} \right\} \end{aligned} $$
(7)

For the values of c1 and c2 in (6), there are mainly three suggestions in the literature that are widely used for the random-effects meta-analysis.

  (i)

    When c1=c2=0.5, it yields the Haldane estimator [15] as

    $$ \begin{aligned} \widetilde{\text{RR}}_{\text{Haldane}} = \left\{ \begin{array}{ll} \frac{X_{1}+0.5}{n_{1}+1}\cdot \frac{n_{2}+1}{X_{2}+0.5} & ~~~~~~~~ X_{1}= 0~\text{or}~n_{1}, X_{2}=0~\text{or}~n_{2}, \\ \frac{X_{1}n_{2}}{n_{1}X_{2}} & ~~~~~~~~ \text{otherwise} \end{array} \right. \end{aligned} $$
    (8)
  (ii)

    When c1=n1/(n1+n2) and c2=n2/(n1+n2), it yields the TACC estimator [18] as

    $$ \begin{aligned} \widetilde{\text{RR}}_{\text{TACC}} = \left\{ \begin{array}{ll} \frac{X_{1}+c_{1}}{n_{1}+2c_{1}}\cdot \frac{n_{2}+2c_{2}}{X_{2}+c_{2}} & ~~~~~~~~ X_{1}= 0~\text{or}~n_{1}, X_{2}=0~\text{or}~n_{2}, \\ \frac{X_{1}n_{2}}{X_{2}n_{1}} & ~~~~~~~~ \text{otherwise} \end{array} \right. \end{aligned} $$
    (9)

    For the balanced case with n1=n2, the TACC estimator is equivalent to the Haldane estimator. To implement this estimator, one may apply the function metabin in the R package “meta” with the setting incr=“TACC” [24]; see the short R sketch after this list.

  (iii)

    When c1=c2=1, it yields the Carter estimator [19] as

    $$\begin{array}{@{}rcl@{}} \widetilde{\text{RR}}_{\text{Carter}} = \frac{X_{1}+1}{n_{1}+2}\cdot \frac{n_{2}+2}{X_{2}+1} \end{array} $$
    (10)
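
For readers who prefer an off-the-shelf implementation, the sketch below assumes a hypothetical data frame `dat` holding columns x1, n1, x2, n2 (one row per study) and illustrates how Haldane-type and TACC corrections can be requested from metabin in the R package “meta”; package defaults may differ across versions.

```r
library(meta)

# dat is a hypothetical data frame with one row per study:
# x1/n1 = events/size in the treatment group, x2/n2 = events/size in the control group

# add 0.5 to the cells of zero-event studies (Haldane-type correction)
m_haldane <- metabin(x1, n1, x2, n2, data = dat, sm = "RR",
                     method = "Inverse", incr = 0.5)

# treatment arm continuity correction (TACC), as in (9)
m_tacc <- metabin(x1, n1, x2, n2, data = dat, sm = "RR",
                  method = "Inverse", incr = "TACC")

summary(m_tacc)
```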

Besides the continuity correction methods in family (6), another alternative is to estimate p1 by (X1+c1)/(n1+c1) and estimate p2 by (X2+c2)/(n2+c2). Then with c1=c2=0.5, it yields the Pettigrew estimator [25] as

$$\begin{array}{@{}rcl@{}} \widetilde{\text{RR}}_{\text{Pettigrew}} = {X_{1}+0.5 \over n_{1}+0.5}\cdot {n_{2}+0.5 \over X_{2}+0.5} \end{array} $$

and the 95% CI of RR as

$$ \begin{aligned} \text{exp} \left\{\text{ln}\left(\widetilde{\text{RR}}_{\text{Pettigrew}}\right) \pm 1.96\sqrt{ \frac{1}{X_{1}+0.5} - \frac{1}{n_{1}+0.5} + \frac{1}{X_{2}+0.5} - \frac{1}{n_{2}+0.5}} \right\} \end{aligned} $$

Moreover, to avoid a zero standard error, Hartung and Knapp [26] suggested not to correct X1 and X2 when X1=n1 and X2=n2.

A hybrid method with the continuity correction

Note that the existing methods are all constructed to first estimate p1 and p2, and then take their ratio as an estimate of RR=p1/p2. Nevertheless, since p2 appears in the denominator rather than in the numerator, inverting an optimal estimate of p2 does not necessarily yield an optimal estimate of 1/p2. In this section, we propose a hybrid method that estimates p1 and 1/p2 directly, and then takes their product to estimate RR.

For the estimation of p1, we show in Appendix 1 that the mean squared error (MSE) of (X1+c1)/(n1+2c1) is smaller than the MSE of (X1+c1)/(n1+c1) in most settings. We thus apply (X1+c1)/(n1+2c1) to estimate p1 in RR. To estimate the reciprocal of p2, one may use (n2+2c2)/(X2+c2) as in (6), or alternatively (n2+c2)/(X2+c2) as in the Pettigrew estimator; see also [27] and [28] for more discussion. Taking the latter, a hybrid estimator of RR can be constructed as

$$\begin{array}{@{}rcl@{}} \widehat{\text{RR}}\left(c_{1},c_{2}\right) = {X_{1}+c_{1} \over n_{1}+2c_{1}}\cdot {n_{2}+c_{2} \over X_{2}+c_{2}} \end{array} $$
(11)

For the optimal values of c1 and c2 in (11), our simulation studies in Appendices 2 and 3 show that c1=0.5 and c2=0.5 are among the best options. In view of this, our new hybrid estimator is taken as follows:

$$\begin{array}{@{}rcl@{}} \widehat{\text{RR}}(0.5,0.5) = {X_{1}+0.5 \over n_{1}+1}\cdot {n_{2}+0.5 \over X_{2}+0.5} \end{array} $$
(12)

whereas the 95% CI of RR is given as

$$ \begin{aligned} \text{exp} \left\{\text{ln}\left(\widehat{\text{RR}}(0.5,0.5)\right) \!\pm\! 1.96\sqrt{ \frac{1}{X_{1}+0.5} - \frac{1}{n_{1}+1} + \frac{1}{X_{2}+0.5} - \frac{1}{n_{2}+0.5}} \right\} \end{aligned} $$
(13)
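
The hybrid estimator (12) and its interval (13) are simple enough to compute directly; the short function below is a minimal sketch of ours (not code from the paper), applied to a hypothetical single-zero-event study.

```r
rr_hybrid <- function(x1, n1, x2, n2) {
  est <- (x1 + 0.5) / (n1 + 1) * (n2 + 0.5) / (x2 + 0.5)   # formula (12)
  se  <- sqrt(1/(x1 + 0.5) - 1/(n1 + 1) +
              1/(x2 + 0.5) - 1/(n2 + 0.5))                  # from formula (13)
  c(RR = est, lower = exp(log(est) - 1.96 * se),
    upper = exp(log(est) + 1.96 * se))
}

rr_hybrid(x1 = 0, n1 = 12, x2 = 3, n2 = 15)   # remains well defined with a zero cell
```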

Comparison of the continuity correction methods

In this section, we conduct a numerical study to compare the finite-sample performance of the existing and new methods. For ease of presentation, we refer to the confidence intervals associated with (8), (9), (10), the Pettigrew estimator, and (12) as the Haldane interval, the TACC interval, the Carter interval, the Pettigrew interval, and the hybrid interval, respectively.

To generate the data, we let p2=0.05, 0.15, 0.85 or 0.95, and p1=p2×RR with RR ranging from 0.2 to min{5,1/p2}. We also consider different combinations of the sample sizes. For the sake of brevity, only the results for balanced samples with n1=n2=10 or 50 are presented, whereas the results for the unbalanced samples are postponed to Appendix 4. Recall that the Haldane and TACC intervals are the same when n1=n2, and we thus present the results for the Haldane interval only. With N=100,000 repetitions for each setting, we generate random numbers from the binomial distributions with parameters (p1,n1) and (p2,n2) to yield the estimates of RR and their CIs. We then compute the frequencies of the true RR falling in the CIs as the coverage probability estimates. Moreover, the expected lengths of the CIs on the log scale are computed by \(N^{-1}\sum _{s=1}^{N}\left (\text {ln(UL}_{\text {s}}) - \text {ln(LL}_{\text {s}})\right)\), where ULs and LLs are the upper and lower limits of the sth CI.
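For illustration, the sketch below reproduces the coverage and expected-length computation for one parameter configuration and one interval (the hybrid interval); the other intervals are handled analogously. It is a simplified re-implementation under the stated design, not the authors' simulation code.

```r
set.seed(1)
N  <- 1e5
p2 <- 0.15; RR <- 2; p1 <- p2 * RR
n1 <- n2 <- 10

x1 <- rbinom(N, n1, p1)
x2 <- rbinom(N, n2, p2)

est <- (x1 + 0.5) / (n1 + 1) * (n2 + 0.5) / (x2 + 0.5)
se  <- sqrt(1/(x1 + 0.5) - 1/(n1 + 1) + 1/(x2 + 0.5) - 1/(n2 + 0.5))
low <- exp(log(est) - 1.96 * se)
upp <- exp(log(est) + 1.96 * se)

mean(low <= RR & RR <= upp)    # estimated coverage probability
mean(log(upp) - log(low))      # expected length on the log scale
```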

For p2=0.05 or 0.15, the top four panels of Figs. 1 and 2 show that the Haldane interval is the most conservative interval in most settings, and it yields the longest expected lengths compared to the other three intervals. The Carter interval may have downward spikes in the left or right tail, although it leads to the shortest expected lengths. We also note that the simulation results of the Pettigrew interval and the hybrid interval are nearly the same. Their coverage probabilities and expected lengths are intermediate between those of the other two intervals in most settings.

Fig. 1

Comparison of the four CIs of RR with p2=0.05, 0.15, 0.85 or 0.95, and n1=n2=10. The dot-dashed lines represent the simulation results of the Haldane interval, the dashed lines represent the simulation results of the Carter interval, the dotted lines represent the simulation results of the Pettigrew interval, and the solid lines represent the simulation results of the hybrid interval. CI: Confidence interval, RR: Relative risk

Fig. 2

Comparison of the four CIs of RR with p2=0.05, 0.15, 0.85 or 0.95, and n1=n2=50. The dot-dashed lines represent the simulation results of the Haldane interval, the dashed lines represent the simulation results of the Carter interval, the dotted lines represent the simulation results of the Pettigrew interval, and the solid lines represent the simulation results of the hybrid interval. CI: Confidence interval, RR: Relative risk

From the bottom four panels of Figs. 1 and 2 with p2=0.85 or 0.95, it is evident that the Haldane interval has a satisfactory performance in most settings, with coverage probabilities around the nominal level. In contrast, the Carter interval fails to provide sufficiently large coverage probabilities in most settings, as does the Pettigrew interval when n1 and n2 are small. Note also that the coverage probabilities of the hybrid interval are comparable to those of the Haldane interval as long as p2 is not extremely large. Moreover, the hybrid interval yields shorter expected lengths than the Haldane interval.

To sum up, when p2 is small, the Pettigrew interval and the hybrid interval are less conservative than the Haldane interval in most settings. When p2 is large, the Haldane interval and the hybrid interval perform better than the Pettigrew interval in terms of coverage probability. In addition, the expected lengths of the hybrid interval are always shorter than those of the Haldane interval. This shows that the hybrid interval can serve as a good alternative for the interval estimation of RR.

The generalized linear mixed models

The generalized linear mixed models (GLMMs) are extensions of the generalized linear model that include both fixed and random effects in the linear predictor [14]. Different types of GLMMs have been proposed in the literature, together with several reviews and comparison studies [4, 29]. Among the existing models, the bivariate GLMM has been well recognized and recommended for estimating RR in meta-analysis [20].

Let pi1 and pi2 be the event probabilities in the treatment and control groups of the ith study, respectively. The bivariate GLMM is represented as

$$\begin{array}{@{}rcl@{}} &&g(p_{i1}) = \Omega_{1} + \zeta_{i1} \\ &&g(p_{i2}) =\Omega_{2} + \zeta_{i2} \end{array} $$
(14)

where g(·) is the link function, Ω1 and Ω2 are the fixed effects, and the random effects are given by

$$\begin{array}{@{}rcl@{}} { \left(\begin{array}{c} \zeta_{i1} \\ \zeta_{i2} \end{array} \right)} \overset{\text{ind}}{\sim} { N\left[ \left(\begin{array}{c} 0 \\ 0 \end{array} \right), \left(\begin{array}{cc} \tau_{1}^{2} & \rho \tau_{1} \tau_{2} \\ \rho \tau_{1} \tau_{2} & \tau_{2}^{2} \end{array} \right) \right ]} \end{array} $$

The mean effect size based on model (14) is defined as

$$\begin{array}{@{}rcl@{}} {}{\text{RR}}_{\text{GLMM}} =\frac{E\left(p_{1}\right)}{E\left(p_{2}\right)} = \frac{\int_{-\infty}^{\infty}g^{-1}\left(\Omega_{1}+t\right)\tau_{1}^{-1} \phi\left(t/\tau_{1}\right) \mathrm{d}t}{\int_{-\infty}^{\infty}g^{-1}\left(\Omega_{2}+t\right)\tau_{2}^{-1} \phi\left(t/\tau_{2}\right) \mathrm{d}t} \end{array} $$
(15)

where E(p1) and E(p2) are the mean event probabilities in the treatment and control groups, respectively, g−1(·) is the inverse of the link function, and ϕ(·) is the probability density function of the standard normal distribution [30]. For the logit link, Zeger et al. [31] proposed the approximate formula \(E\left (p_{j}\right)\approx \text {expit}\left (\Omega _{j} /\sqrt {1+C^{2}\tau _{j}^{2}}\right)\) with \(C = 16\sqrt {3}/(15\pi)\). For the probit link, \(E\left (p_{j}\right)=\Phi \left (\Omega _{j} /\sqrt {1+\tau _{j}^{2}}\right)\), where j=1 or 2, and Φ(·) is the cumulative distribution function of the standard normal distribution. For other links, formula (15) has no closed form, so a numerical approximation is often needed [32].

For the parameter estimation in model (14), Jackson et al. [4] provided a detailed introduction to the implementation of their model 6 based on the R package “lme4”. Alternatively, one may apply the function meta.biv in the R package “altmeta” maintained by Lin and Chu [33], in which the 95% CI of RR can be derived by bootstrap resampling.
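
As an illustration of how model (14) can be fitted in practice, the following is a minimal sketch of ours based on “lme4” with the logit link, in the spirit of model 6 of Jackson et al. [4]; it is not the authors' code, and it assumes a hypothetical data frame `dat` with columns x1, n1, x2, n2. The marginal RR is then approximated with the formula of Zeger et al. [31].

```r
library(lme4)

# one row per study arm, with a separate fixed intercept per arm and
# correlated study-level random effects for the two arms, as in model (14)
long <- data.frame(
  study = factor(rep(seq_len(nrow(dat)), 2)),
  arm   = factor(rep(c("ctrl", "treat"), each = nrow(dat))),
  x     = c(dat$x2, dat$x1),
  n     = c(dat$n2, dat$n1)
)

fit <- glmer(cbind(x, n - x) ~ 0 + arm + (0 + arm | study),
             family = binomial(link = "logit"), data = long)

# approximate marginal event probabilities E(p_j) via Zeger et al.'s formula
omega <- fixef(fit)                          # Omega for the control and treatment arms
tau   <- sqrt(diag(VarCorr(fit)$study))      # tau for the control and treatment arms
Cz    <- 16 * sqrt(3) / (15 * pi)
Ep    <- plogis(omega / sqrt(1 + Cz^2 * tau^2))
unname(Ep["armtreat"] / Ep["armctrl"])       # approximate RR_GLMM as in (15)
```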

Results

Simulation studies

In this section, we compare the performance of the reviewed methods in handling meta-analysis with the zero-event studies, including the continuity correction methods and the generalized linear mixed models. Among the existing continuity correction methods, we note that the Haldane and TACC estimators are comparable and among the best for estimating the mean effect size, in contrast to the Carter and Pettigrew estimators. Hence, for the sake of brevity, we only present the results of the Haldane and TACC estimators in the main text and provide the simulation results for all four methods in Appendix 5. Besides the Haldane and TACC estimators, we also consider the newly introduced hybrid estimator and the GLMM with the logit link for further comparison.

To conduct the meta-analysis, we consider k=3, 6 and 12 as three different numbers of studies. By (3), we let θ=ln(RR) be the mean effect size, ranging from ln(0.2) to ln(5), and generate the random effects ζi from N(0,τ2) with τ2=0.25 or 1. Next, we randomly generate ni2 from the log-normal distribution under the assumption that \({\ln }(n_{i2}) \overset {\text {ind}}{\sim } N(3.35, 1.00)\) [34]. Following [34], the ratios between ni1 and ni2 are drawn from the uniform distribution on the interval from 0.84 to 2.04. In addition, we generate the event probabilities of the control group pi2 from the uniform distribution on the interval from 0.01 to min{0.99,1/exp(θ)}. Accordingly, the event probabilities of the treatment group are given by pi1=exp(θ+ζi)pi2, where values with exp(θ+ζi)pi2≥1 are discarded. We then generate Xi1 and Xi2 from the binomial distributions with parameters (ni1,pi1) and (ni2,pi2), respectively. Note that the whole dataset is re-generated if the total number of events or non-events in either group is zero. Finally, with N=10,000 repetitions for each setting, we compute the mean squared errors (MSEs) between the estimated RR and the true RR to evaluate the accuracy of the methods.
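
To make the simulation design above concrete, the sketch below generates a single meta-analysis dataset under that design. It is our own simplified version; for instance, values with exp(θ+ζi)pi2 ≥ 1 are truncated here for brevity rather than redrawn.

```r
set.seed(1)
k <- 6; theta <- log(1.5); tau2 <- 0.25

zeta <- rnorm(k, mean = 0, sd = sqrt(tau2))
n2   <- pmax(2, round(rlnorm(k, meanlog = 3.35, sdlog = 1)))   # ln(n_i2) ~ N(3.35, 1.00)
n1   <- pmax(2, round(n2 * runif(k, 0.84, 2.04)))              # ratio n_i1/n_i2 ~ U(0.84, 2.04)
p2   <- runif(k, 0.01, min(0.99, 1 / exp(theta)))
p1   <- pmin(exp(theta + zeta) * p2, 0.99)                     # truncation used here for brevity

x1 <- rbinom(k, n1, p1)
x2 <- rbinom(k, n2, p2)
data.frame(x1, n1, x2, n2)   # some studies may well be single- or double-zero-event studies
```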

From the top two panels of Fig. 3, it is evident that the three continuity correction methods perform much better than the GLMM in nearly all settings when k is small. Moreover, the hybrid estimator is consistently better than the Haldane and TACC estimators. The middle two panels show that, when k is moderate, the three continuity correction methods still perform better than the GLMM in most settings. Finally, the bottom two panels indicate that the GLMM performs the best in most settings when k is large. To conclude, the accuracy of the different methods depends on the number of studies. In particular, for meta-analysis with few studies, the random-effects model with the hybrid estimator is more reliable for handling the zero-event studies than the other methods; and for meta-analysis with a large number of studies, we recommend the GLMM for the random-effects meta-analysis.

Fig. 3

Comparison of the four methods with k=3, 6 or 12, τ2=0.25 or 1. “1” represents the results of the random-effects model with the Haldane estimator, “2” represents the results of the random-effects model with the TACC estimator, “3” represents the results of the random-effects model with the hybrid estimator, and “4” represents the results of the GLMM. TACC: Treatment arm continuity correction, GLMM: Generalized linear mixed model, MSE: Mean squared error

Application to COVID-19 data

As mentioned earlier, Chu et al. [8] conducted a systematic review that revealed the connections of physical distancing, face masks, and eye protection with the transmission of SARS, MERS, and COVID-19. Their analytical results have attracted considerable attention; as evidence, their paper had received a total of 1236 citations on Google Scholar as of 16 March 2021. In this section, we reanalyze the COVID-19 data and compare the performance of the different methods with or without the double-zero-event studies, including the Haldane estimator, the TACC estimator, the hybrid estimator, and the GLMMs.

Note that the treatment group represents a greater physical distance and the control group represents a shorter physical distance. As shown in the top panel of Fig. 4, Chu et al. [8] applied the random-effects model with the Haldane estimator and removed the double-zero-event studies from their meta-analysis. The overall effect size of 0.15 with the 95% CI of [0.03,0.73] indicates that the infection risk is significantly reduced with a greater physical distance. The middle panel of Fig. 4 shows that the random-effects model with the TACC estimator yields an overall effect size of 0.12 with the 95% CI of [0.03,0.50]. Moreover, the bottom panel of Fig. 4 shows that the random-effects model with the hybrid estimator yields an overall effect size of 0.13 with the 95% CI of [0.03,0.72]. Note also that the study-specific CIs here are always narrower than the CIs in the top panel, which coincides with the simulation results that the expected lengths of the CI associated with the hybrid estimator are shorter than those of the Haldane estimator. In addition, the GLMM in (14) does not provide estimates of the study-specific effect sizes, so its results are reported as follows. By bootstrap resampling with 1000 replicates, the GLMM with the logit link yields an overall effect size of 0.20 with the 95% bootstrap CI of [0.05,0.55]. Also, the GLMM with the probit link yields an overall effect size of 0.18 with the 95% CI of [0.04,0.55].

Fig. 4

Meta-analyses of COVID-19 data without the double-zero-event studies by applying the Haldane estimator (top), the TACC estimator (middle), and the hybrid estimator (bottom). COVID-19: Coronavirus disease 2019, TACC: Treatment arm continuity correction, RR: Relative risk, CI: Confidence interval

To reanalyze the COVID-19 data, we now include the double-zero-event studies. The top panel of Fig. 5 shows that the random-effects model with the Haldane estimator yields an overall effect size of 0.22 with the 95% CI of [0.06,0.82]. The middle panel of Fig. 5 shows that the random-effects model with the TACC estimator provides an overall effect size of 0.18 with the 95% CI of [0.06,0.57]. For the hybrid estimator, the bottom panel shows an overall effect size of 0.21 with the 95% CI of [0.05,0.81]. Lastly, the GLMM with the logit link provides an overall effect size of 0.29 with the 95% CI of [0.10,0.64], and the GLMM with the probit link provides an overall effect size of 0.28 with the 95% CI of [0.10,0.56].

Fig. 5

Meta-analyses of COVID-19 data with the double-zero-event studies by applying the Haldane estimator (top), the TACC estimator (middle), and the hybrid estimator (bottom). COVID-19: Coronavirus disease 2019, TACC: Treatment arm continuity correction, RR: Relative risk, CI: Confidence interval

Discussion

To handle the zero-event studies in meta-analysis of binary data, researchers often apply the random-effects model with the continuity correction, or instead, the GLMMs. From the simulation results, we note that the performance of the different methods depends on the number of studies. For meta-analysis with few studies, the random-effects model with the continuity correction is able to perform better than the GLMM, especially with the hybrid continuity correction. We also note that the hybrid continuity correction can yield a reliable confidence interval for a single RR. Although the continuity correction does show some advantages, it should be used with caution since an arbitrary correction may introduce bias or even reverse the result of a meta-analysis, especially when the sample sizes in the two groups are fairly unbalanced [7, 13]. When the number of studies is large, the GLMM is preferable to the random-effects model with the continuity correction. In other words, the performance of the GLMM relies on a sufficient number of studies [35]. In addition, as shown in Ju et al. [34], the GLMM requires enough total events in the two groups, e.g., more than 10.

Besides the random-effects model we have compared, it is noteworthy that there are other models for meta-analysis that can handle the zero-event studies, including, for example, the beta-binomial model [36–38]. Most meta-analyses with rare events have a small degree of heterogeneity, and so the common-effect model may be more suitable than the random-effects model [39]. In addition, Li and Rice [40] showed that the fixed-effects model can also provide an accurate CI for meta-analysis of OR with the zero-event studies. Apart from that, it is also noteworthy that the fixed-effects model can serve as a convincing model for meta-analysis with few studies [12, 41–43]. As future work, it would be interesting to investigate the best model for meta-analysis with few studies that also include the zero-event studies.

For the double-zero-event studies in meta-analysis, we have shown by reanalyzing COVID-19 data that they do impact the estimate of the mean effect size, and so they may not be uninformative. As noted by Friedrich et al. [44], including the double-zero-event studies moves the mean effect size estimate toward the direction of the null hypothesis. If one arbitrarily excluded the informative double-zero-event studies, there would be a risk of overstating the treatment effect such that the conclusion would be less reliable. As recommended by the literature [7, 13] and the references therein, we suggest including the double-zero-event studies in meta-analysis.

Apart from model comparison, the selection of effect sizes has attracted increasing attention in the literature. In particular, there is a recent debate on the choice of RR or OR in clinical epidemiology, in which a number of important properties of RR and OR, together with their pros and cons, were discussed, including, for example, portability and collapsibility [45–47]. In view of this, we have also analyzed the COVID-19 data with OR as the effect size and present the results in Appendix 6, with R code in Appendix 7. To handle the zero-event studies, we apply four methods that have been reviewed in this paper, including Haldane’s continuity correction, the TACC, the GLMM, and the empirical continuity correction proposed by Sweeting et al. [18]. For more techniques on meta-analysis of OR with the zero-event studies, one may refer to [4, 7, 18, 29, 34] and the references therein.

Conclusions

In this paper, we revisited the existing methods that are widely used to handle the zero-event problem in meta-analysis of binary data, in particular with RR, also known as the risk ratio, as the effect size. For the methods with the continuity correction, we reviewed four existing estimators of RR and also introduced a new hybrid estimator, with their applications to the random-effects model. Apart from these, we also included the GLMM, a state-of-the-art method without the continuity correction. By a comparative study and a real data analysis on COVID-19 data, we found that the random-effects model with the hybrid estimator can serve as a more reliable method for handling the zero-event studies when there are few studies in a meta-analysis, and we recommend using the GLMM when the number of studies is large. This paper also provides a useful addition to Chu et al. [8], and meanwhile calls for further observational studies in this field.

Availability of data and materials

Not applicable.

Abbreviations

OR:

Odds ratio

RR:

Relative risk

RD:

Risk difference

SARS:

Severe acute respiratory syndrome

MERS:

Middle East respiratory syndrome

COVID-19:

Coronavirus disease 2019

CI:

Confidence interval

MSE:

Mean squared error

GLMMs:

Generalized linear mixed models

TACC:

Treatment arm continuity correction

References

  1. Borenstein M, Hedges LV, Higgins JPT, Rothstein HR. Introduction to Meta-Analysis. Chichester, UK: John Wiley & Son; 2011.

  2. Ma LL, Wang YY, Yang ZH, Huang D, Weng H, Zeng XT. Methodological quality (risk of bias) assessment tools for primary and secondary medical studies: what are they and which is better? Mil Med Res. 2020; 7:7.

  3. Davey J, Turner RM, Clarke MJ, Higgins JPT. Characteristics of meta-analyses and their component studies in the cochrane database of systematic reviews: a cross-sectional, descriptive analysis. BMC Med Res Methodol. 2011; 11:160.

  4. Jackson D, Law M, Stijnen T, Viechtbauer W, White IR. A comparison of seven random–effects models for meta-analyses that estimate the summary odds ratio. Stat Med. 2018; 37(7):1059–85.

  5. Ren Y, Lin L, Lian Q, Zou H, Chu H. Real-world performance of meta-analysis methods for double-zero-event studies with dichotomous outcomes using the cochrane database of systematic reviews. J Gen Intern Med. 2019; 34(6):960–8.

  6. Vandermeer B, Bialy L, Hooton N, Hartling L, Klassen TP, Johnston BC, Wiebe N. Meta-analyses of safety data: a comparison of exact versus asymptotic methods. Stat Methods Med Res. 2009; 18(4):421–32.

  7. Kuss O. Statistical methods for meta-analyses including information from studies without any events–add nothing to nothing and succeed nevertheless. Stat Med. 2015; 34(7):1097–116.

  8. Chu DK, Akl EA, Duda S, Solo K, Yaacoub S, Schünemann HJ, study authors C-SURGES. Physical distancing, face masks, and eye protection to prevent person-to-person transmission of SARS-CoV-2 and COVID-19: a systematic review and meta-analysis. Lancet. 2020; 395(10242):1973–87.

  9. Jin YH, Cai L, Cheng ZS, Cheng H, Deng T, Fan YP, et al. A rapid advice guideline for the diagnosis and treatment of 2019 novel coronavirus (2019-ncov) infected pneumonia (standard version). Mil Med Res. 2020; 7:4.

  10. Jin YH, Zhan QY, Peng ZY, Ren XQ, Yin XT, Cai L, et al. Chemoprophylaxis, diagnosis, treatments, and discharge management of COVID-19: An evidence-based clinical practice guideline (updated version). Mil Med Res. 2020; 7:41.

  11. Borenstein M, Hedges LV, Higgins JP, Rothstein HR. A basic introduction to fixed-effect and random-effects models for meta–analysis. Res Synth Methods. 2010; 1(2):97–111.

  12. Lin E, Tong T, Chen Y, Wang Y. Fixed-effects model: the most convincing model for meta-analysis with few studies. Preprint at https://arxiv.org/abs/2002.04211. 2020.

  13. Xu C, Li L, Lin L, Chu H, Thabane L, Zou K, Sun X. Exclusion of studies with no events in both arms in meta-analysis impacted the conclusions. J Clin Epidemiol. 2020; 123:91–9.

  14. Agresti A. Categorical Data Analysis, 2nd Edition. Hoboken: John Wiley & Son; 2003.

  15. Haldane JB. The estimation and significance of the logarithm of a ratio of frequencies. Ann Hum Genet. 1956; 20(4):309–11.

  16. Schwarzer G. meta: An r package for meta-analysis. R News. 2007; 7:40–5.

  17. Weber F, Knapp G, Ickstadt K, Kundt G, Glass Ä. Zero–cell corrections in random–effects meta–analyses. Res Synth Methods. 2020; 11(6):913–9.

  18. Sweeting MJ, Sutton AJ, Lambert PC. What to add to nothing? use and avoidance of continuity corrections in meta–analysis of sparse data. Stat Med. 2004; 23(9):1351–75.

  19. Carter RE, Lin Y, Lipsitz SR, Newcombe RG, Hermayer KL. Relative risk estimated from the ratio of two median unbiased estimates. J Royal Stat Soc: Ser C Appl Stat. 2010; 59(4):657–71.

  20. Chu H, Nie L, Chen Y, Huang Y, Sun W. Bivariate random effects models for meta-analysis of comparative studies with binary outcomes: methods for the absolute risk difference and relative risk. Stat Methods Med Res. 2012; 21(6):621–33.

  21. Chen Y, Hong C, Ning Y, Su X. Meta–analysis of studies with bivariate binary outcomes: a marginal beta–binomial model approach. Stat Med. 2016; 35(1):21–40.

  22. DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials. 1986; 7(3):177–88.

  23. Laird NM, Mosteller F. Some statistical methods for combining experimental results. Int J Technol Assess Heal Care. 1990; 6(1):5–30.

  24. Balduzzi S, Rücker G, Schwarzer G. How to perform a meta-analysis with R: a practical tutorial. Evid Based Ment Health. 2019; 22(4):153–60.

  25. Pettigrew HM, Gart JJ, Thomas DG. The bias and higher cumulants of the logarithm of a binomial variate. Biometrika. 1986; 73(2):425–35.

  26. Hartung J, Knapp G. A refined method for the meta–analysis of controlled clinical trials with binary outcome. Stat Med. 2001; 20(24):3875–89.

  27. Fattorini L. Applying the Horvitz-Thompson criterion in complex designs: a computer-intensive perspective for estimating inclusion probabilities. Biometrika. 2006; 93(2):269–78.

  28. Seber GAF. Statistical Models for Proportions and Probabilities. Heidelberg: Springer; 2013.

  29. Bakbergenuly I, Kulinskaya E. Meta-analysis of binary outcomes via generalized linear mixed models: a simulation study. BMC Med Res Methodol. 2018; 18(1):70.

  30. McCullagh P. Sampling bias and logistic models. J R Stat Soc Ser B Stat Methodol. 2008; 70(4):643–77.

  31. Zeger SL, Liang KY, Albert PS. Models for longitudinal data: a generalized estimating equation approach. Biometrics. 1988; 44(4):1049–60.

  32. Lin L, Chu H. Meta-analysis of proportions using generalized linear mixed models. Epidemiology. 2020; 31(5):713–7.

  33. Lin L, Chu H. altmeta: Alternative Meta-Analysis Methods. 2020. https://CRAN.R-project.org/package=altmeta.

  34. Ju J, Lin L, Chu H, Cheng LL, Xu C. Laplace approximation, penalized quasi-likelihood, and adaptive gauss-hermite quadrature for generalized linear mixed models: towards meta-analysis of binary outcome with sparse data. BMC Med Res Methodol. 2020; 20(1):152.

  35. Gronsbell J, Hong C, Nie L, Lu Y, Tian L. Exact inference for the random–effect model for meta–analyses with rare events. Stat Med. 2020; 39(3):252–64.

  36. Sarmanov O. Generalized normal correlation and two-dimensional fréchet classes. Sov Math Dokl. 1966; 7:596–9.

  37. Chen Y, Luo S, Chu H, Su X, Nie L. An empirical Bayes method for multivariate meta-analysis with an application in clinical trials. Commun Stat Theory Methods. 2014; 43(16):3536–51.

  38. Luo S, Chen Y, Su X, Chu H. mmeta: an R package for multivariate meta-analysis. J Stat Softw. 2014; 56(11):11.

  39. Jia P, Lin L, Kwong JSW, Xu C. Many meta-analyses of rare events in the cochrane database of systematic reviews were underpowered. J Clin Epidemiol. 2021; 131:113–22.

  40. Li QK, Rice K. Improved inference for fixed–effects meta–analysis of 2×2 tables. Res Synth Methods. 2020; 11(3):387–96.

  41. Bender R, Friede T, Koch A, Kuss O, Schlattmann P, Schwarzer G, Skipka G. Methods for evidence synthesis in the case of very few studies. Res Synth Methods. 2018; 9(3):382–92.

  42. Rice K, Higgins JP, Lumley T. A re–evaluation of fixed effect(s) meta–analysis. J R Stat Soc Ser A Stat Methodol. 2018; 181(1):205–27.

  43. Yang K, Kwan HY, Yu Z, Tong T. Model selection between the fixed-effects model and the random-effects model in meta-analysis. Stat Interface. 2020; 13(4):501–10.

  44. Friedrich JO, Adhikari NK, Beyene J. Inclusion of zero total event trials in meta-analyses maintains analytic consistency and incorporates all available data. BMC Med Res Methodol. 2007; 7:5.

  45. Doi SA, Furuya-Kanamori L, Xu C, Lin L, Chivese T, Thalib L. Questionable utility of the relative risk in clinical research: a call for change to practice. J Clin Epidemiol. 2020. https://doi.org/10.1016/j.jclinepi.2020.08.019.

  46. Xiao M, Chen Y, Cole SR, MacLehose R, Richardson D, Chu H. Is OR “portable” in meta-analysis? Time to consider bivariate generalized linear mixed model. 2020. Preprint at https://www.medrxiv.org/content/10.1101/2020.11.05.20226811v1.

  47. Doi SA, Furuya-Kanamori L, Xu C, Chivese T, Lin L, Musa OA, Hindy G, Thalib L, Harrell Jr FE. The OR is “portable” but not the RR: time to do away with the log link in binomial regression. J Clin Epidemiol. 2021. https://doi.org/10.13140/RG.2.2.31631.10407.

Acknowledgements

The authors sincerely thank the Editor, Associate Editor, and two anonymous reviewers for their insightful comments and suggestions.

Funding

This study was supported by grants awarded to Tie-Jun Tong from the General Research Fund (HKBU12303918), the National Natural Science Foundation of China (1207010822), and the Initiation Grants for Faculty Niche Research Areas (RC-IG-FNRA/17-18/13, RC-FNRA-IG/20-21/SCI/03) of Hong Kong Baptist University.

Author information

Authors and Affiliations

Authors

Contributions

TJT, JJW, EXL, and XTZ reviewed the literature and designed the methods. TJT, JJW, and JDS conducted the simulation studies. TJT, JJW, KY, and ZLH conducted the experiments and analyzed the real data. All authors contributed to the manuscript preparation. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Tie-Jun Tong.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

Supplementary information: Appendix

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Wei, JJ., Lin, EX., Shi, JD. et al. Meta-analysis with zero-event studies: a comparative study with application to COVID-19 data. Military Med Res 8, 41 (2021). https://doi.org/10.1186/s40779-021-00331-6

Keywords