Introduction

The high level of contagiousness of SARS-CoV-2 has overwhelmed many health systems in few months, especially intensive care units (ICU). COVID-19 mainly affects the respiratory system and can cause serious lung damage leading to hypoxemic acute respiratory failure and even acute respiratory distress syndrome [1, 2]. Current treatment is based on corticosteroids and respiratory support, ranging from basic oxygen therapy to extracorporeal oxygenation. The abrupt clinical worsening that occurs about a week from the onset of symptoms can have an outcome ranging from complete recovery to death. Risk factors for poor prognosis, especially the need for hospital admission and ventilatory support, have been analyzed in multiple studies [3, 4]. But it remains unclear what factors determine these differences in the course of the disease and, above all, what markers we can use to anticipate irreversible clinical deterioration. Research related to radiological prognostic markers has mainly focused on the first chest radiograph (CXR) findings [5,6,7,8]. However, there are few studies have used interval chest radiographs for prognostication of the disease [9].

Our objective is to analyze if rapid worsening of COVID-19 pneumonia in the initial days has predictive value for the need of ventilatory support (VS), regardless of it consisted in non-invasive ventilation or endotracheal intubation, given the overlap between these two forms of therapy based on physicians’ experience. Our hypothesis is this evolution may provide a more robust prognostic information than that provided by the initial radiograph alone.

Methods

Observational ambispective single-center study approved by our institutional ethics committee (internal code EST: 38/20), following the STROBE Statement. Patient informed consent was waived.

Setting and sample selection

We recruited a sample of COVID-19 inpatients from March 14th to April 17th, 2020 (sample 1, S1). If our hypothesis was confirmed in S1, it would be validated in a second sample of COVID-19 outpatients, recruited from October 27th, 2020, to January 25th, 2021 (sample 2, S2).

The eligibility criteria for both samples were as follows: patients aged 18–80 years; SARS-CoV-2 infection confirmed by RT-PCR, antibody, or antigen testing; unfavorable respiratory course; and CXR lung opacities by probable or indeterminate COVID-19 pneumonia [10, 11]. For S1, when eligible, patients were admitted and included. Additionally, patients were also admitted when clinical suspicion of pneumonia was high and they had risk criteria for pneumonia progression (i.e., age ≥ 60 or comorbidities; Suppl-Fig. 1a), even when a first CXR did not show signs of pneumonia. For S2, patients were consecutive included when they did not have admission criteria at the time of their first CXR (i.e., no dyspnea, respiratory rate < 22, oxygen saturation ≥ 95%, BS < 14, and no risk of progression). However, they needed further assessment and a second CXR due to an unfavorable clinical course while isolated at home the month after the first CXR (hereinafter, “reconsultation”) (Suppl-Fig. 1b).

SARS-CoV-2 infection was confirmed by the primary care physician before requesting the CXR1 or by the emergency physician shortly after the CXR1. ICU admission was determined by the severity of the respiratory failure and the need for VS (Suppl-Table 1). Patients older than 80 years were not eligible because they could not be considered candidates for VS in extreme resource-constrained situations.

Patients (a) with non-diagnostic radiological images, due to poor technical quality, and (b) referrals from other hospitals admitted to the ICU, due to the lack of previous data were excluded (Suppl-Fig. 1).

Radiological variables

To determine the radiological evolution, data from CXR1 and CXR2 were collected. CXR1 was considered the 1st CXR performed during hospital admission for COVID-19 for sample S1, or the 1st CXR showing lung opacities probable or indeterminate for COVID-19 pneumonia [10] for sample S2. CXR2 was the first follow-up CXR after CXR1 in both samples.

CXRs were always reported by a radiologist or radiology resident.

For CXR1 in both samples and for CXR2 in S2, standard posteroanterior thorax view was assessed, and for non-mobile patients, bedside portable radiography.

For CXR2 in S1, portable radiography was always used to reduce the in-hospital contagion risk.

The severity of pulmonary involvement was quantified using the Brixia score (BS) [11] on the basis of the posteroanterior or anteroposterior CXR projection. The BS was chosen because it is specific to COVID-19, and, in our opinion, more straightforward to calculate than other scores (e.g., RALE) as it does not measure the extent per lung zone BS ranges between 0 (absence of involvement) and 18 points (maximum total involvement, 3 points maximum per lung zone, 6 zones, Fig. 1). BS scores on CXR1 and CXR2 were denominated Brixia score 1 (BS1) and 2 (BS2), respectively. Severity and evolution of lung involvement in each lung zone were assessed by the BS mean value and its percentage of change ((BS2mean-BS1mean) *100/3).

Fig. 1
figure 1

Example Brixia score (BS) measurement in supine chest X-ray (CXR), in patient with dyspnea, SARS-CoV-2 infection confirmed by RT-PCR and pneumonia. a Original radiograph. b Original radiograph with reference marks for BS calculation: the lung fields are divided by drawing a horizontal line below the aortic arch (hollow arrowhead) and another dividing the visible lung parenchyma into two parts of equal height down to the lowest diaphragm below the first line (black arrowhead). Ground glass opacities (GGO) have been drawn with a dotted margin area while black arrows point to small consolidations. In BS, opacities are quantified as 0: normal; 1: GGO; 2: GGO and consolidations with GGO predominance; 3: consolidations with or without GGO, with consolidation predominance. In the example a, 2 points were assigned to all zones, excepting the left superior zone with 1 point, with a BS of 11 points

Two radiologists (observer 1, 8-year experience in thoracic and emergency radiology and observer 2, 8-year experience in emergency radiology) and a third-year resident (observer 3) retrospectively calculated the BS1 in S1, blinded to observer 1 and 2, the CXR2 or any clinical data. Previous radiological studies could be consulted to detect preexisting abnormalities. To analyze the concordance of BS in supine radiography studies, observers 1 and 3 calculated BS2 in a sample obtained by simple randomization of 50 patients from among all S1 patients undergoing a second portable radiograph as CXR2. Once tested the BS reproducibility, data obtained for observer 1 was used for S1 analysis; after workshops had been conducted for all the radiologists involved, the BS was prospectively calculated and used for decision-making and for analysis in S2 (Suppl-Table 1). The degree of experience in reading CXR among radiologists and residents in-training for S2 ranged between 2 and 22 years. Although experience is variable among radiologists, all CXRs are reported in our hospital during on-call duty, so the radiologist is used to reading it.

For standardizing the BS worsening, we defined the worsening radiological day rate (WRDR) as (BS2—BS1)/difference of days between CXR1 and CXR2, resulting in the daily increase in BS. A positive WRDR value indicated worsening, and a negative one, radiological improvement.

Non-radiological variables

The main endpoint was the VS need (request criteria in Suppl-Table 1). We also collected sex, age, comorbidities, chronic treatment with immunosuppressants and antihypertensive drugs, symptoms, need and length for hospitalization and ICU, days from onset of the symptoms (d-OS), corticosteroid treatment, SpO2/FiO2 and inflammatory markers levels related to the episode of COVID-19.

To determine whether early radiological follow-up would be feasible in patients most likely to require VS, we defined “vulnerable population,” in the absence of specific published criteria, as those patients with at least one comorbidity and older than 55 years.

Statistical methods

S1 size corresponded to all eligible first wave COVID-19 inpatients. Based on S1 results, assuming a proportion of VS requirement of 17%, a precision of 6.5%, and a need for replacement rate of 10%, the S2 size considering 95% confidence was estimated to be 109 patients.

Quantitative variables were expressed as means ± standard deviation or median (interquartile range, IQR) according to the normal or non-normal distribution, and qualitative variables using absolute and relative frequencies. Chi2, Mann–Whitney U, Kruskal–Wallis, Student T, and Spearman correlation tests were used for bivariate comparison. The reading agreement was evaluated by intraclass correlation coefficient (ICC) and Bland–Altman tests.

Radiological variables performance was determined using the maximum value of the Youden index in the receiver operating characteristic curve, with values of area under the curve (AUC), sensitivity, specificity, and positive (PLR) and negative (NLR) likelihood ratios with the 95% confidence interval (95% CI).

The multivariate analysis by binary logistic regression was performed to determine the association between VS need and the main effects of non-radiological variables.

Differences were considered statistically significant at p < 0.05. IBM-SPSS-Statistics version 20, Med-Calc version 12.7.0, JASP 0.14.0.0, and Excel-Microsoft-Office-365 software were used. Analysis was performed by JPMP (radiologist) and MBR (statistician).

Results

Sample description

S1 consisted of 120 hospitalized patients (55 ± 14 years, 68 men), 26 (21.7%) of them requiring VS (Fig. 2a, Table 1). The most frequent symptoms were fever (temperature > 37.5ºC, 74.2%), cough (70%), and dyspnea (40%).

Fig. 2
figure 2

Flow chart of patients included and excluded in samples 1 (a) and 2 (b). Abbreviations: CXR, chest X-ray; ED, emergency department; ICU, intensive care unit; WRDR, worsening radiological day rate

Table 1 Characteristics of sample 1

The median time between the onset of the symptoms and CXR1 was 7 days (5–11).

S2 consisted of 112 patients who reconsulted (56 ± 13 years, 61 men), 65 (58%) of whom required hospitalization and 14 (12.5%) VS (Fig. 2b, Table 2).

Table 2 Characteristics of sample 2

Radiological variables

The average ICC between observers 1–2, 1–3, and 2–3 for the BS1 was 0.906 (95% CI 0.844–0.940), 0.812 (95% CI 0.745–0.878), and 0.878 (95% CI 0.823–0.916), respectively. The mean of the differences in the Bland–Altman test was 0.7 (95% CI − 3.0; 4.5), 0.8 (95% CI − 4.4; 6), and 0.1 (95% CI − 4.5; 4.7), respectively. The average ICC between observers 1–3 for the BS2, assessed from portable radiography, was 0.905 (95% CI 0.833–0.946), and the mean of the differences in the Bland–Altman test was 0.3 (95% CI − 4.7; 5.3). The medians of BS1, BS2, and WRDR were 5 (3–8), 5 (2–9), and 0.18 (− 1; 1) in S1, and 4 (3–6), 7 (4–0), and 0.5 (0; 1) in S2. The days between CXRs varied less in S1 (2 (2–2) vs 5 (3–6)), as it was established by protocol only for S1 (Suppl-Table 1; Tables 1 and 2). The first follow-up CXR and the first VS device were required 3 and 4 d-OS, and at 3 and 6 d-OS for S1 and S2, respectively.

In S1, lung opacities were present in the right middle (RMZ 82%), right lower zone (RLZ 72%), left lower (65%) and left middle (63%) zones in CXR1. Severity followed the same distribution (mean BS1 = 1.23, 1.17, 1.04, and 1.01). Radiological severity peaked in the lower zones and the RMZ within 5 d-OS, and in the left middle zone, between 5 and 10 d-OS. Upper zone involvement was less frequent and milder, especially on the left (54% and 45%, mean BS1 = 0.72 and 0.63, respectively) and occurred later (at 10–15 d-OS, Suppl-Table 2a).

In patients with VS requirement, BS1 was significantly greater only in RMZ (mean BS = 1.55 vs 1.12, p = 0.043); however, BS2 was greater in all zones (p ≤ 0.042), mainly in RLZ (33.33%) and RMZ (26.67%), excepting left upper zone (p = 0.05). When VS was not required, BS2 decrease respecting to BS1 in all zones between 0.39 and 7.36%, although it was only significant for lower zones and right upper zone (p ≤ 0.048, Suppl-Table 2b).

In S1, radiological severity was higher in men in CXR1 (BS1 of 6.5 ± 3.9 vs 4.9 ± 3.4; p = 0.02), and it tended to be so in CXR2 (BS2 of 6.5 ± 4.7 vs 4.8 ± 4.0; p = 0.07). Patients with rapid radiological worsening were admitted earlier (rho between d-OS to admission with BS2 and WRDR =  − 0.226 and − 0.243; p < 0.005). However, the initial radiological severity was not decisive (rho with BS1 =  − 0.088).

BS2 correlated better with SpO2/FiO2 and systemic inflammatory markers levels (rho range = 0.174 and − 0.576) than BS1 (rho range = 0.125 and − 0.493), except for ferritin and D-dimer levels on admission, with better correlation for BS1 (Suppl-Table 3).

According to the multivariate analysis (Table 3), the predictive ability for VS need was always worse for BS1 than for BS2 and WRDR. A two-unit increase of BS1 within 10 d-OS raised the risk of VS [95% CI] in 0.9 times ([0.7–1.1], p = 0.20, AUC = 0.635 [0.502–0.754]) in S1, and 1.7 times ([1.2–2.6]; p = 0.005, AUC = 0.557 [0.506–0.608]) in S2. However, a two-unit increase of BS2 incremented the risk 2.4 times ([1.7–3.4], p < 0.001, AUC = 0.931 [0.835–0.980]) in S1, and 2.4 times ([1.7–3.3], p < 0.001, AUC = 0.812 [0.688–0.903], Table 4) in S2. Patients who required VS always showed higher BS2, and BS1 was higher only for the bivariant analysis in S1 (Fig. 3). Patients with the highest BS1 (rho =  − 0.567, p = 0.003) were admitted earlier to ICU, and this correlation was weaker for BS2 (rho =  − 0.446, p = 0.06).

Table 3 Binary logistic regression models for Brixia score 1 (model 1), Brixia score 2 (model 2), and worsening radiological day rate (WRDR) (model 3) for patient treated at hospital (sample 1) and at home (sample 2)
Table 4 Diagnosis performance of Brixia score 1 and 2 for ventilatory support
Fig. 3
figure 3

Differences in Brixia score 1 and Brixia score 2. Worsening radiological day rate in patients according to the healthcare requirements in patients treated in hospital (sample 1, upper row), and at home (sample 2, lower row). For each variable, the median (interquartile range) is shown; n is shown in Tables 1 and 2. The p-value has been calculated by Mann–Whitney U or Kruskal–Wallis test

The increase in radiological severity in conjunction with the early request for a second CXR (Tables 1 and 2) in patients with VS requirement was highlighted by the WRDR. A WRDR > 1.3 within 10 d-OS multiplied the probability of needing VS [95% CI] by 2.0 ([1.3–3.2], p < 0.001) in S1 and by 5.0 ([2.2–11.4], p < 0.001, Table 3) in S2, with sensitivity, specificity, PLR, NLR, and AUC values of 94% [70–100], 93% [82–99], 14.06 [4.7–42.3], 0.067 [0.01–0.4], and 0.903 [0.800–0.964] in S1, and of 83% [52–98], 87% [74–95], 6.39 [2.9–14.0], 0.19 [0.05–0.7], and 0.867 [0.752–0.942] in S2 (Table 4). In 24/26 S1 patients (92%) with VS requirement, WRDR was > 1 (median 2.5, IQR 1.68–4.88) in the 0–10 d-OS. In 92/94 patients (98%) without VS requirement, WRDR was < 1 in S1 during the entire follow-up period. These patterns were replicated in S2. The WRDR differences were evident as early as 0–5 d-OS, with WRDR median (IQR) of 2.5 (1.75; 5.5) vs − 0.50 (− 1.5; 0.0; p < 0.001) for VS requirement in S1, and of 2.33 (0.67; 3.00), 0.7 (0.17; 1.22), and 0.33 (− 0.11; 0.62) for VS requirement, hospitalization vs discharge in S2 (p < 0.001, Fig. 4).

Fig. 4
figure 4

Differences in worsening radiological day rate in patients with and without respiratory support requirement compared to the days from the onset of the symptoms to the 1st CXR in samples 1 (a) and 2 (b). The values for each variable are shown in the text and in Tables 1 and 2. The p-value has been calculated by Mann–Whitney U test or Kruskal–Wallis test. Abbreviations: CXR, chest X-ray; d, days; VS, ventilatory support

Non-radiological variables

In S2, patients older (64 ± 9, 57 ± 12 vs 53 ± 14 years; p = 0.02) and with at least one comorbidity (7 (50%), 12 (23.5%), vs 8 (16.7%); p = 0.04, Table 2) required more support (VS, hospitalization vs discharge). In S1, patients treated with antihypertensive drugs (13 (50%) vs 21 (22%) p = 0.006), with diabetes mellitus (7 (26.6%) vs 11 (11.7%); p = 0.05), previous cardiovascular or cerebrovascular disease (6 (23.1%) vs 10 (10.6%); p = 0.09), and males (19 (73.1%) vs 49 (52.1%); p = 0.06, Table 1) tending to need more VS.

If early radiological follow-up had been performed in vulnerable patients or with pneumonia on the first CXR with a BS > 4 (since this is the threshold determined by the Youden point on the ROC curve that best defines the need for VS on S1), 212/456 (46.5%) patients older than 55, 76/456 (16.7%) with some comorbidity and 273/456 (59.9%) with a BS1 > 4 would have undergone follow-up CXR among the total number of patients presenting with pneumonia between October 27th, 2020, and January 25th, 2021 (Fig. 2). Considering the day with the most reconsultations (13/01/21), 19 follow-up CXRs would have been requested: 14 (over 55 years) plus 1 (some comorbidity) plus 4 (BS1 > 4).

Discussion

The early changes of pulmonary opacities assessed in the CXR are a determining factor in the clinical evolution of patients with COVID-19 pneumonia. A worsening of Brixia score > 1.3 points per day within 10 days after onset symptoms increases the VS need twofold in inpatients, and five times in outpatients.

The prognostic role of imaging techniques in patients with SARS-CoV-2 pneumonia has been focused on the first CXR [6,7,8, 12]. However, the predictive ability of the lung involvement assessed by BS on the second CXR was better in our study, probably because this follow-up better depicts the enormous capacity of COVID-19 for a rapid clinical worsening. BS evaluated in the second CXR was better correlated with systemic inflammatory markers and SpO2/FIO2 than in the first CXR. The worsening radiological per day rate, calculated by dividing the increase of the BS between two consecutive CXR by the number of days between both exams, is our best parameter for characterizing the SARS-CoV-2 pneumonia progression. Therefore, not only radiological involvement, but also the speed at which changes occur, standardizing per day, prevents the influence that the different time elapsed between the first and second CXR may have on the radiological changes. For example, an increase in BS greater than 4 points between two radiographs 3 days apart would mean a daily increase of Brixia score > 1.3. We could also extrapolate this worsening rate by dividing the BS in whatever CXR performed within the first 10 days of symptoms by the days from symptom onset to CXR performing date. For example, a BS of 10 points in a CXR on the fourth day since symptom onset, assuming pneumonia onset on the first day of symptoms, would mean a daily increase in BS of 2.5 points (> 1.3 per day), or even higher if pneumonia onset were later. Both scenarios would imply a high probability of requiring VS. However, given the high reproducibility for the CXR findings [13], the radiological changes between radiographs are probably a more objective measure of evolution than the patient’s perception of the days since the onset of symptoms and the assumption of the day that the pneumonia begins. A follow-up with a second CXR can provide an inexpensive and more simple marker of rapid disease progression risk than other complex prognostic models or algorithms [7, 14], warning of the need of ICU admission, preventing delays establishing VS treatment to stabilize respiratory failure, and possibly avoiding endotracheal intubation and death. Awaiting evidence-based guidelines, our recommendation for patients with COVID-19 pneumonia and BS > 4 is to perform a follow-up CXR [9, 15] after 48–72 h, given that VS may be required as quickly as 4 days from the onset of symptoms, at least in patients older than 55 or with some comorbidity. This following-up workload had been sustainable in our environment (19 CXRs in the busiest day). Males, elderly, and patients with some comorbidity (requiring antihypertensive therapy, diabetics, and with cardio- or cerebrovascular disease) were our most vulnerable patients, like previous studies also suggest [16].

COVID-19 pneumonia evolves in extensiveness and severity following a distribution, starting in the right middle and lower zones, with progression towards the left middle zone, and finally the upper zones, at 5, 5–10, and 10–15 day of onset symptoms, respectively, as has been previously reported [17].

Having analyzed two samples could be considered a limitation; however, it has allowed us to validate our hypothesis, making the findings more robust and generalizable. The exclusion of patients older than 80 years constitutes another limitation; however, having included patients in this age range could have possibly entailed a more relevant bias, as they would not have received VS in a resource-scarce scenario even when clinically indicated. Future studies will be required to replicate the results in this population. The variable obesity was not collected in the part of the sample collected retrospectively; however, this limitation has no impact on the most relevant findings of the work. Finally, although a beneficial effect of corticosteroid treatment on VS requirements has not been highlighted, specifically focused studies on this point are required. Nonetheless, we believe that the radiological prognostic ability is not affected by this factor.

Summarizing, a daily increase of the Brixia score > 1.3 points at onset of SARS-CoV-2 pneumonia raises the possibility of requiring VS. We recommend radiological follow-up 48–72 h after the first CXR with findings of pneumonia, at least in elderly patients with some comorbidity.