Take-home message

LUS provides risk stratification and prediction of outcomes in COVID-19, and may guide management strategies, triage and resource allocation during a pandemic.

Introduction

The main manifestation of coronavirus disease 2019 (COVID-19) is viral pneumonia, which may evolve into severe acute respiratory distress syndrome (ARDS) [1, 2]. Severe cases require intensive care treatment and prolonged mechanical ventilation, and often manifest multi-organ involvement such as hemodynamic instability, myocardial injury, renal dysfunction and coagulopathy [3]. Parameters reported to correlate with poor outcome are older age, comorbidities, high sequential organ failure assessment (SOFA) score, lymphopenia, elevated troponin and D-dimer greater than 1 mg/L [4].

Bilateral lung infiltrates on computed tomography (CT) are the hallmark of severe disease, but they can also appear in asymptomatic patients or precede respiratory symptoms by days [5]. The use of lung ultrasound (LUS) as a diagnostic tool in critically ill patients, for assessment of response to treatment as well as for follow-up, has become common practice [6,7,8,9,10,11,12]. Moreover, its use has been recommended as standard of care [13]. Findings on LUS correlate with clinical course similarly to findings on high-resolution CT [14, 15] in various patient populations. Combining this powerful tool with bedside echocardiography allows rapid and thorough assessment of the cardiovascular and respiratory status of the patient, and thus guidance of further treatment [16,17,18]. The cardiac manifestations of COVID-19 using bedside echocardiography were recently published [19]. Yet, although the outbreak of COVID-19 started months ago, systematic LUS evaluation of patients for risk stratification and management guidance has not been introduced into routine practice, perhaps because of the risk of infection spreading. To this end, we performed comprehensive LUS exams in consecutive hospitalized COVID-19 patients.

Methods

We studied 120 consecutive adult patients with COVID-19 admitted to the medical ward or intensive care unit (ICU) at the Tel Aviv Medical Center, between 21/03/2020 and 04/05/2020. All patients had a diagnosis of COVID-19 confirmed by a positive reverse-transcriptase polymerase chain reaction assay for SARS-CoV-2 in a respiratory tract sample. Demographic data, comorbid conditions, medications, physical examination, and laboratory findings were systematically recorded. Patients were risk stratified according to their COVID-19 modified early warning score (COVID-19 MEWS, Supplemental Table I) and SOFA score [20, 21]. At the beginning of the COVID-19 pandemic, we initiated a prospective program of performance of LUS on admission and on deterioration for all patients presenting with respiratory illness due to COVID-19 infection, using a pre-defined step-by-step protocol, as part of a routine patient care protocol. All patients underwent comprehensive LUS combined with bedside echocardiography within 24 h of admission. Patients who then experienced clinical deterioration underwent a repeated exam. Clinical deterioration was defined as either respiratory (acute new onset hypoxemia requiring mechanical ventilation, veno-venous extracorporeal membrane oxygenation, or both), or hemodynamic (persistent hypotension requiring vasopressors to maintain mean arterial pressure ≥ 65 mmHg or having serum lactate level > 2 mmol/L despite adequate volume resuscitation). This is a retrospective study of the prospectively and systematically collected data on the lung ultrasound exams performed. The ethics committee of the Tel Aviv Medical Center approved the study, IRB number 0196-20-TLV.

Follow-up and outcomes

Clinical follow-up was obtained by daily review of all medical records. Outcome analysis started at time of baseline LUS exam. Endpoints studied were: all-cause mortality and composite endpoint comprised of death or new need for invasive mechanical ventilation. The data that support the findings of this study will be available from the corresponding author upon reasonable request.

Lung ultrasound

We performed LUS on all patients with COVID-19 using a six-zone method for each lung that included a scan of the anterior, antero-lateral, and postero-lateral aspects of the thorax. Examinations were performed by three cardiologists with expertise in LUS recording and interpretation using the same equipment (CX 50, Philips Medical Systems, Bothell, WA), with the same phased-array probe used for echocardiography. Each LUS exam lasted 2–3 min, with the patient supine or semi-supine, obviating the need for position change during the examination. A point scoring system was employed for each region and ultrasound pattern: A-lines (normal reverberation artifacts of the pleural line that, when accompanied by lung sliding, correspond to normal aeration of the lung) were assigned 0 points; B-lines (hyperechoic lines vertical to the pleural line, arising from it and reaching the edge of the screen, erasing A-lines, which represent reverberation artifact through edematous interlobular septa or alveoli) were divided into B1 (separated B-lines that correspond to moderate lung aeration loss), assigned 1 point, and B2 (coalescent B-lines that correspond to severe lung aeration loss), assigned 2 points; lung consolidation was assigned 3 points. Thus, an LUS score of 0 was normal, and 36 was worst [7]. Examples of the different patterns are shown in Fig. 1. We also documented the presence of pleural thickening and defined a homogenous vs. patchy pattern of each examination. Pleural thickening was qualitatively determined, indicating an irregular pleural line either in cases of sub-pleural consolidations or in cases of B-lines accompanied by an irregular pleural line. In accordance with present guidelines [22], the following measures were undertaken to minimize the risk of inadvertent infection: all studies were performed bedside at the designated COVID-19 wards using dedicated scanners that were tagged and set aside in each ward.
Full personal protection equipment was used and LUS measurements were performed offline to reduce exposure time. Inter-observer variability for LUS score was determined by a second independent blinded and experienced observer, who measured the LUS score in 20 randomly selected patients. Inter-observer variability was assessed using the Bland–Altman method and the within-subject coefficient of variation. The within-subject coefficient of variation (calculated as the ratio of the standard deviation of the measurement difference to the mean value of all measurements) provides a scale-free, unitless estimate of variation expressed as a percentage.
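The per-zone point scoring described in this section can be illustrated with a minimal sketch. It assumes each of the six zones per lung is recorded as one pattern label; the labels "A", "B1", "B2" and "C" (for consolidation) and the function name are hypothetical and are not taken from the study protocol.

```python
# Points per ultrasound pattern, following the scoring described in the text.
# The string labels themselves are illustrative assumptions.
PATTERN_POINTS = {"A": 0, "B1": 1, "B2": 2, "C": 3}

ZONES_PER_LUNG = 6  # six-zone method for each lung


def total_lus_score(right_lung, left_lung):
    """Sum the per-zone points over both lungs (0 = normal, 36 = worst)."""
    for side in (right_lung, left_lung):
        if len(side) != ZONES_PER_LUNG:
            raise ValueError("expected six zone patterns per lung")
    return sum(PATTERN_POINTS[p] for p in list(right_lung) + list(left_lung))


# Hypothetical patient: mixed findings on the right, separated B-lines on the left.
example = total_lus_score(["A", "B1", "B2", "C", "A", "A"], ["B1"] * 6)
```

With twelve zones and a maximum of 3 points per zone, the score ranges from 0 to 36, matching the range stated above.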

Fig. 1 Examples of different patterns of lung ultrasound findings. a A-lines, normal reverberation artifacts of the pleural line that correspond to normal aeration of the lung. b A single B-line that represents reverberation artifact through mildly edematous interlobular septa or alveoli and corresponds to moderate aeration loss. c Multiple coalescent B-lines that correspond to severe lung aeration loss. d Lung consolidation that corresponds to complete aeration loss

Statistical analysis

Continuous normally distributed variables were presented as means ± SD and compared using the Student’s t test. Normality was assessed using the Shapiro–Wilk test and visual inspection of quantile–quantile plots. Non-normally distributed data were presented as median with 1st and 3rd quartiles and compared using the Wilcoxon rank sum test. Categorical data were compared between groups using the χ2 test or Fisher's exact test. LUS parameters in consecutive exams were compared using the Wilcoxon signed-rank test. Correlation between change in positive end-expiratory pressure (PEEP) and change in LUS score was examined using Spearman's rank correlation coefficient. Receiver-operating characteristic (ROC) curve analysis was used to determine optimal cutoff values of LUS score for 30-day events. The best cutoff value was defined by Youden's index calculation. Cox proportional hazards models for mortality or clinical deterioration as endpoints allowed for calculation of hazard ratios (HR) of baseline LUS parameters. p values of less than 0.05 were considered to indicate statistical significance. All data were analyzed with the JMP System software version 12.0 (SAS Institute, Inc, Cary, NC). All authors participated in designing the study, collecting and analyzing data, and drafting and revising the manuscript.
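As an illustration of how a cutoff can be chosen with Youden's index (J = sensitivity + specificity − 1), the following sketch scans candidate thresholds over a binary outcome. It is not the JMP procedure used in the study: the function name is hypothetical, the data are synthetic, and the outcome is treated as a simple 0/1 event without censoring.

```python
def youden_cutoff(scores, events):
    """Return (cutoff, J, sensitivity, specificity) maximizing Youden's J.

    A patient counts as test-positive when score > cutoff.
    `events` holds 1 for an event and 0 for no event.
    """
    positives = sum(events)
    negatives = len(events) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one event and one non-event")
    best = None
    for cutoff in sorted(set(scores)):
        # True positives: events with a score above the cutoff.
        tp = sum(1 for s, e in zip(scores, events) if e and s > cutoff)
        # True negatives: non-events with a score at or below the cutoff.
        tn = sum(1 for s, e in zip(scores, events) if not e and s <= cutoff)
        sens = tp / positives
        spec = tn / negatives
        j = sens + spec - 1
        if best is None or j > best[1]:  # ties keep the lowest cutoff
            best = (cutoff, j, sens, spec)
    return best


# Synthetic, perfectly separable example (not study data).
demo_scores = [5, 8, 12, 15, 19, 22, 25, 30]
demo_events = [0, 0, 0, 0, 1, 1, 1, 1]
demo_result = youden_cutoff(demo_scores, demo_events)
```

In real data the classes overlap, so the maximal J is below 1 and the chosen cutoff trades sensitivity against specificity.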

Results

During the study period, clinical data were collected for 135 consecutive patients hospitalized with COVID-19. Fifteen patients were excluded because they did not undergo LUS due to hospital discharge ≤ 24 h (8 patients), patient refusal (1 patient) or a “do not resuscitate/intubate” status (6 patients). Thus, the study group included 120 COVID-19 patients who underwent LUS evaluation. Table 1 shows baseline characteristics and LUS assessments of all patients, stratified by LUS score tertiles. Eighty patients (67%) had a baseline LUS score of 0–18, and 40 (33%) had an LUS score of 19–36. Mean age was 64.7 ± 18 years, and 62% of patients were male. Comorbidities were present in 81% of patients, with hypertension being the most common, followed by diabetes and obesity. The most common symptoms on admission were respiratory, followed by fever alone, chest pain and fatigue. C-reactive protein (CRP), troponin-I, brain natriuretic peptide (BNP) and D-dimer were elevated at baseline in 88%, 28%, 37% and 69% of patients, respectively. Patients in the upper tertile of LUS score, compared to those in the lower tertiles, were older and had lower levels of hemoglobin, lymphocytes and albumin, with higher levels of CRP, troponin, D-dimer and fibrinogen (p < 0.05 for all). They had lower ambient O2 saturation and higher SOFA score and MEWS (p < 0.001 for all). Baseline mean left ventricular ejection fraction was 57.7 ± 5%, mean E/e′ was 10.3 ± 6.3, and none of the echocardiographic parameters was significantly different between the groups (p > 0.2 for all, Supplemental Table II). Bilateral infiltrates were the most common chest X-ray manifestation, found in 39% of patients. Pleural effusion and lobar infiltrates were rare (< 15% each).

Table 1 Baseline characteristics

None of the patients had normal LUS (A-lines accompanied by lung sliding in all zones), or homogenous B-lines in all zones. Most patients had patchy pleural thickening (n = 100; 83%), or patchy subpleural consolidations (n = 93; 78%) in at least one zone. Pleural effusion was rare (n = 9, 8%). The median total lung score was 15, IQR [7–20]. Comparison of inter-observer variability for LUS score showed good agreement between measurements: mean difference 0.1 ± 0.05 points, r = 0.92, p = 0.36. The Bland–Altman plot showed a random scatter of points around 0, indicating no systematic bias or measurement error proportional to the measurement value. Measurement variability (within-subject coefficient of variation) for measurements of inter-observer differences was 3.1%.
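The inter-observer agreement statistics reported here can be sketched as follows. This is a minimal illustration using synthetic paired readings: the function names are hypothetical, the 1.96 × SD limits of agreement are the conventional Bland–Altman choice rather than something stated in the text, and the coefficient of variation follows the definition given in the Methods (standard deviation of the differences divided by the mean of all measurements).

```python
from statistics import mean, stdev


def bland_altman(obs1, obs2):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [a - b for a, b in zip(obs1, obs2)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


def within_subject_cv(obs1, obs2):
    """SD of the paired differences over the mean of all readings, in percent."""
    diffs = [a - b for a, b in zip(obs1, obs2)]
    return 100 * stdev(diffs) / mean(list(obs1) + list(obs2))


# Synthetic LUS scores from two hypothetical observers (not study data).
reader_a = [10, 12, 14, 16]
reader_b = [11, 11, 15, 15]
bias, lower, upper = bland_altman(reader_a, reader_b)
cv_percent = within_subject_cv(reader_a, reader_b)
```

A bias near 0 with differences scattered evenly inside the limits corresponds to the absence of systematic bias described above.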

LUS and clinical severity grade

On admission (baseline LUS evaluation), 75 patients were stratified as having clinically mild disease (oxygen saturation ≥ 94% at room air), 31 as moderate disease (need for non-invasive oxygen) and 14 as severe disease (need for invasive mechanical ventilation). When compared to patients with mild disease, patients with severe or moderate disease were more hypoxemic (O2 saturation of 86 ± 7%, 88.7 ± 6% and 96.2 ± 3% in severe, moderate and mild disease, respectively, p < 0.0001 for trend), more tachycardic and more pyretic, required more vasopressor support and had higher levels of CRP, D-dimer and cardiac biomarkers (troponin-I, BNP). Results of LUS evaluation stratified by severity of disease are shown in Table 2. The prevalence of pleural thickening and subpleural consolidations, and the total LUS score, were higher with worsening disease.

Table 2 Patients stratified by clinical presentation at baseline lung ultrasound

LUS and clinical deterioration

In 20 patients, sequential LUS exams were performed due to clinical deterioration (hemodynamic instability n = 4, respiratory deterioration n = 16). In this group of patients, total LUS score worsened mostly due to deterioration in anterior segments grade (16/20, 80%) with amplification of B-lines and consolidations (Supplemental Table III). In seven patients, who were invasively ventilated during baseline LUS and underwent a repeated LUS because of further deterioration, a significant positive correlation was found between the change in LUS score and the change in PEEP requirements (ρ = 0.87; p = 0.03).

An example of LUS in a patient at baseline and after clinical deterioration is shown in Supplemental Fig. 1.

LUS and survival

There were 23 deaths during follow-up [mean follow-up period 31 days, IQR (20–40) days]. Presence of pleural effusion, pleural thickening and high total LUS score at baseline examination were each significantly associated with increased mortality (Supplemental Table IV).

The optimal cutoff point for LUS score was 18—using the highest Youden's index in the ROC analysis for 30-day mortality (AUC 0.76; sensitivity = 62%, specificity = 74%). Survival was reduced with total LUS score > 18 vs. LUS score ≤ 18 (66 ± 20% vs. 88 ± 11% for 30-day survival; p = 0.01). Kaplan–Meier survival curve (Fig. 2a) shows lower survival with total LUS score > 18 compared to lower LUS score. Unadjusted hazard ratio of death for total LUS score was 1.08 [1.02–1.16] per point, p = 0.008. The unadjusted hazard ratio of death for high risk LUS score (> 18) was 2.65 [1.14–6.3], p = 0.02, suggesting a 2.6-fold increase in mortality with high risk, compared to low risk, LUS score (Supplemental Table IV). The only chest X-ray finding associated with mortality was the presence of bilateral infiltrates, and its addition to the model showed that total LUS score is independently associated with mortality when accounting for chest X-ray findings. The only physical finding associated with mortality was ambient O2 saturation. Although total LUS score remained significantly associated with mortality when adjusted for bilateral infiltrates in chest X-ray or age, its association with mortality was lost when adjusted for ambient O2 saturation and MEWS (Table 3).
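To help read the per-point hazard ratio reported above, the following sketch shows how a Cox model with a log-linear effect scales across a larger score difference. This is an arithmetic illustration under that modelling assumption, not an additional result of the study; the function name is hypothetical.

```python
import math


def hazard_ratio_for_difference(hr_per_point, points):
    """Hazard ratio implied by a `points`-unit difference in a covariate,
    assuming the log hazard is linear in that covariate."""
    return math.exp(points * math.log(hr_per_point))


# A 10-point higher LUS score at the reported per-point HR of 1.08.
hr_10_points = hazard_ratio_for_difference(1.08, 10)  # about 2.16
```

Because the dichotomized hazard ratio (score > 18 vs. ≤ 18) comes from a separate model, it need not equal any particular power of the per-point estimate.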

Fig. 2 a Kaplan–Meier curve for mortality according to lung ultrasound severity. b Kaplan–Meier curve for the combination of need for invasive mechanical ventilation or mortality according to lung ultrasound severity

Table 3 Multivariable analyses of baseline predictors of clinical deterioration and death

LUS and composite events

Following baseline LUS, 30 composite events occurred. Presence of pleural thickening and total LUS score were significantly associated with the composite event (Supplemental Table IV).

The rate of the composite event was higher with total LUS score > 18 vs. LUS score ≤ 18 (43 ± 9% vs. 10.6 ± 3% at 30 days; p = 0.0004). The Kaplan–Meier curve (Fig. 2b) shows a higher rate of the composite event with total LUS score > 18 compared to lower score. Unadjusted hazard ratio of the composite event for total LUS score was 1.12 per point [1.05–1.2], p = 0.0008. Unadjusted hazard ratio of the composite event for high risk LUS score (> 18) was 4.24 [2.06–9.1], p < 0.0001, suggesting a 4.2-fold increase in the composite event with high risk versus low risk LUS score (Supplemental Table IV).

Adding the presence of bilateral infiltrates on chest X-ray to the model showed that total LUS score is independently associated with the composite event when accounting for chest X-ray findings. The only physical finding associated with the composite event was ambient O2 saturation. Adding ambient O2 saturation to the model showed that total LUS score is independently associated with the composite event when accounting for ambient O2 saturation. Although total LUS score remained significantly associated with the composite event when adjusted for bilateral infiltrates on chest X-ray, age or ambient O2 saturation, its association with the composite event was lost when adjusted for MEWS (Table 3).

Discussion

COVID-19 primarily affects the lungs, and pneumonia appears to be the most frequent serious manifestation of infection [1]. During the COVID-19 pandemic, LUS was sporadically used in several centers to identify disease severity and to assist in treatment decisions [23, 24]. The results of the present study, which used protocol-guided, systematic LUS in 120 consecutive COVID-19 patients admitted to the Tel Aviv Medical Center, show that: 1. All admitted patients, even those with mild disease, have abnormal LUS at presentation; 2. The most common finding on LUS was patchy pleural thickening or patchy subpleural consolidations in at least one zone; 3. Increased LUS score is associated with worsening disease; 4. In deteriorating patients, LUS pathology worsens mostly in the anterior lung segments and correlates with PEEP requirements; 5. Baseline LUS predicts death and/or clinical deterioration and may aid risk stratification and clinical decision making.

Ultrasonographic features of COVID-19

None of the patients had normal LUS, suggesting a possible role of LUS to rule out COVID-19 infection in symptomatic hospitalized patients. However, because fewer than 10% of symptomatic patients with COVID-19 infection are admitted to the hospital in Israel, these results are susceptible to selection bias. We believe that our results should serve as an incentive to explore the role of LUS in ruling out COVID-19 infection in larger series, including asymptomatic as well as ambulatory patients. The most common findings were pleural thickening and subpleural consolidations, whereas no homogenous diffuse B-lines were seen. Moreover, bedside echocardiography did not reveal findings suggestive of elevated left atrial pressure in the majority of patients. Such features correlate with previous high-resolution CT descriptions of patchy subpleural lung infiltrates in COVID-19 [25] and rule out the etiology of cardiogenic pulmonary edema [15].

LUS findings in relation with disease severity

LUS scores in patients with severe disease were significantly higher compared with patients with mild or moderate disease. With worsening disease, more pleural thickening and subpleural consolidations were recorded. The relation between clinical severity and LUS findings is in line with previously published data using LUS and chest CT scores [26, 27], as well as with previously described patterns in swine (H1N1) and avian (H7N9) influenza [28, 29]. Interestingly, the main contributor to the worsening LUS score was new, or greater, involvement of anterior segments, a finding that may be used clinically to warn of imminent deterioration. Furthermore, in patients who were mechanically ventilated during baseline LUS and later underwent a second examination due to clinical deterioration, LUS score and PEEP requirements were significantly correlated. Recent publications have shown that with respiratory distress from COVID-19, patients initially may retain relatively good lung compliance despite very poor oxygenation [30,31,32]. In these patients, CT shows limited ground-glass infiltrates, signifying interstitial rather than alveolar edema [33, 34]. These patients have a low response to PEEP and tolerate larger tidal volumes (7–8 mL/kg ideal body weight). In some patients, the disease progressively develops into the "classic" type ARDS, with CT showing extensive consolidations associated with low lung compliance, and the need for higher PEEP, low tidal volume and early consideration of prone positioning [35, 36].
When challenged by such a dynamic disease, a quick bedside imaging exam such as LUS may become extremely helpful for distinguishing between these phenotypes, following patients' clinical status and directing therapy accordingly. In patients with increasing LUS scores and a decreasing number of normal segments, which suggest rapidly decreasing compliance, this allows timely changes in respiratory support to higher PEEP, low tidal volume and early consideration of prone positioning. Furthermore, our data show that in the final stages of clinical deterioration, even the anterior lung segments can become consolidated. This finding can predict a poor response to prone positioning [37].

LUS as a predictive tool of clinical course and outcome

Our data show that a higher LUS score, appearance of pleural thickening and pleural effusion predict the need for mechanical ventilation, mortality and the combination of both. Survival drops significantly with an LUS score above 18. This prediction is independent of chest X-ray findings, making it a stand-alone superior alternative. For the composite outcome of need for invasive mechanical ventilation or death, the predictive ability of LUS score is even superior to that of chest X-ray and O2 saturation. This is in concordance with previously described evidence in patients with decompensated heart failure, in which semi-quantitative B-line assessment was shown to be a prognostic indicator of adverse outcomes and mortality [9]. Moreover, our results are in line with a publication regarding chest CT in COVID-19 patients, in which the total burden of lung involvement and anterior segment involvement at admission were associated with higher rates of adverse clinical composite endpoints of ICU admission, respiratory failure and shock [38]. The peripheral distribution of lung infiltrates in COVID-19 makes LUS a reliable imaging study, and can reduce the number of CT scans performed [17, 39], with their associated risks of infection spread, radiation exposure and the need to disinfect the CT room [22]. Moreover, transporting critical patients to CT is challenging and complex, while LUS can be easily performed at the bedside.

Our study identified patients without any pleural thickening or subpleural consolidations who did not experience clinical deterioration, showing the ability of a straightforward baseline LUS to also predict a good clinical outcome and serve as a means of triage, especially in case of widespread infection and emergency room over-crowding. It could also serve as an adjunct in discharge decisions for hospitalized patients.

Limitations

First, our study is a single-center study, which included only patients with COVID-19 who were hospitalized for at least 24 h. The fact that only ≈ 7% of patients diagnosed with COVID-19 in Israel are admitted to the hospital probably led to over-estimation of the severity of LUS findings in COVID-19. Fifteen patients (11.1%) were excluded. Six of these patients were excluded due to "Do Not Resuscitate/Intubate" orders. These patients received only palliative care and died shortly after their admission. This fact may have created an opposite bias, resulting in underestimation of LUS severity in patients with COVID-19 infection. Using phased-array transducers is acceptable when performing LUS, but their low frequency and high penetrance can compromise pleural evaluation. Nevertheless, placing the focus at the pleural level enabled reasonable assessment of the pleural line and subpleural consolidations. The fact that LUS measurements were calculated by the cardiologist caring for the patient may have led to over-estimation of the severity of LUS findings. Outcome analyses in our study should be interpreted with caution due to the small number of patients.

Conclusions and clinical implications

In patients with COVID-19, LUS rapidly identifies pulmonary involvement and provides risk stratification, including prediction of need for mechanical ventilation and mortality, above routine radiographic assessment. Its use may guide patients’ management strategies, as well as resource allocation in case of surge capacity.