FormalPara Take-home message

In patients suspected for COVID-19, lung ultrasound patterns of probability integrated with patients’ characteristics allow to rule in or rule out COVID-19 pneumonia at bedside with high accuracy, according to the different clinical phenotypes. This approach could support and expedite patients’ management during a pandemic surge.

Introduction

Since the beginning of the SARS-CoV-2 pandemic, frontline physicians with experience in lung ultrasound (LUS) have brought this technique into the spotlight as an efficient tool to diagnose interstitial pneumonia associated with coronavirus disease 2019 (COVID-19) [1,2,3]. The main advantages of a systematic point-of-care LUS approach during the surge are both a quick and reliable diagnosis of lung involvement at bedside, and a reduced exposure of other healthcare workers outside designated COVID-19 hospital areas. LUS has a great potential in the early identification of pneumonia [4]; however, LUS signs of COVID-19 pneumonia are non-specific and can be present in other respiratory conditions that commonly present to the Emergency Department (ED) [5, 6]. Recently, a simple approach based on the presence and distribution of a few basic LUS signs, determining patterns with different degrees of probability for COVID-19 pneumonia, has been proposed [7,8,9]. This approach integrates clinical symptoms and the patient’s medical history at presentation (clinical phenotypes) to provide context to the LUS exam.

The primary aim of our study was to describe the diagnostic accuracy of the combination of LUS probability patterns and clinical phenotypes in the prediction of the results of the reverse transcription-polymerase chain reaction (RT-PCR) test in a large international multicenter population suspected of COVID-19 during the pandemic surge. A second aim was to test an online training to implement a standardized LUS approach for COVID-19 in centers with different levels of LUS expertise. The study hypothesis is that a standardized LUS exam is feasible and reproducible, and can be useful for the early prediction of RT-PCR results in patients suspected of COVID-19.

Methods

Study design and population

This is a multicenter prospective observational study. From March 1st, 2020 until May 30th, 2020, patients from 20 different hospitals were evaluated (Online Resource 1). In each hospital, the main local LUS expert was nominated as the site principal investigator and was responsible for patient enrollment and LUS image acquisition.

Inclusion criteria were: (1) clinical suspicion of SARS-CoV-2 infection leading to presentation to the ED, or admission to COVID Unit or Intensive Care Unit (ICU) of the enrolling Hospitals. Clinical suspicion was based on the following symptoms lasting for at least 3 days in the absence of an alternative more probable diagnosis: major criteria (at least one): fever > 37.5 °C, cough, dyspnea, anosmia and/or ageusia; minor criteria (at least two): sore throat, bilateral conjunctivitis, generalized weakness, rhinorrhea, headache, diffuse musculoskeletal pain, gastrointestinal symptoms (diarrhea, nausea, vomiting); (2) age ≥ 18 years. Exclusion criteria were: (1) a previous diagnosis of COVID-19 pneumonia; (2) a previous pneumonectomy, pleurodesis, or history of fibrothorax.

All the enrolled patients were examined by LUS and received a nasopharyngeal swab for SARS-CoV-2 RT-PCR. Oxygen saturation and/or arterial blood gas analysis were measured in all patients.

Ethical consideration

The local Ethical Committee Boards approved the study, and the study was conducted following the ethical standards of local guidelines for good clinical practice. The list of ethical approvals is added as Online Resource 2. Informed consent was obtained in line with security protocols in place in the hospitals during the emergency, and depending on local rules and institutional regulations.

Clinical characteristics

Each patient was characterized according to the presence of dyspnea, either objective or self-reported to the attending physician at admission. Desaturation and respiratory failure were defined as either a PaO2/FiO2 < 300 mmHg or Sat.O2/FiO2 < 357 [3]. Cardiopulmonary comorbidities were defined as a history of chronic respiratory and/or cardiac disease, including severe chronic obstructive pulmonary disease, interstitial lung disease, primitive or metastatic lung cancer, heart failure, and cor pulmonale. These conditions are those that may create confounding LUS signs [4].

Three main different clinical phenotypes were identified and assigned to each patient at presentation, as previously described [8, 9]. These phenotypes were defined as (1) mild phenotype in patients without dyspnea and/or desaturation (mild non-respiratory symptoms and no signs of respiratory failure); (2) severe phenotype in patients with dyspnea and/or desaturation (dyspnea and/or signs of respiratory failure); or (3) mixed phenotype in patients with cardiopulmonary comorbidities, irrespective of the clinical condition. From the severe group, we selected a subgroup with objective desaturation, as described above.

Lung ultrasound

A complete LUS examination was performed at presentation on the antero-lateral and posterior chest, as previously described [8, 9]. Commercially available ultrasound equipments with convex transducers (3.5–6.0 MHz) were used. The focus was placed on the pleural line. Depth was set at about 8–10 cm, according to patient’s size. Gain was regulated to optimize the image. The sonographers at each hospital were either ICU or ED clinicians, with documented previous experience in LUS in emergency and critical care. The number of operators was variable in each hospital; they were under the responsibility of the local principal investigator. Each LUS operator was blind to the result of RT-PCR test and any other imaging test or blood exam, but not to the patients’ clinical condition. LUS was performed immediately at presentation whenever possible, always before the result of the RT-PCR test.

Each LUS examination was classified according to the following standardized, mutually exclusive, patterns [8, 9]:

  • Low probability (LowLUS): a normal or near-normal LUS pattern characterized by bilateral A-lines with lung sliding and without significant B-lines.

  • High probability (HighLUS): typical LUS pattern of COVID-19 pneumonia with bilateral and multifocal clusters of separated or coalescent B-lines, large hyperechoic bands (light beams), multifocal peripheral consolidations, regular and irregular pleural line, with or without large consolidations. These clusters should appear in patchy distribution, abruptly alternating with normal A-lines patterns (“spared areas”).

  • Intermediate probability (IntLUS): less typical pattern including unilateral isolated clusters of B-lines and light beams or focal multiple B-lines, with or without small peripheral consolidations.

  • Alternative probability (AltLUS): a pattern more consistent with an alternative diagnosis such as an isolated large consolidation with dynamic air bronchograms (suggesting bacterial pneumonia) or without bronchograms (suggesting obstructive atelectasis), a large pleural effusion (either hydrostatic or inflammatory), diffuse homogeneously distributed B-lines (suggesting cardiogenic edema or diffuse fibrosis).

Standardization of the LUS approach and inter-operator variability

Before starting the enrollment, dedicated web meetings were organized to illustrate the protocol. Real cases were discussed to standardize LUS approach and pattern recognition. At the end of this preparatory phase, a video tutorial was sent to each site’s principal investigator (Online Resource 3).

To assess LUS inter-operator variability in the interpretation of LUS patterns and signs, an online training exercise was set up, with a total of 53 clips including the whole range of significant LUS signs (separated B-lines, coalescent B-lines, light beams, large consolidations, small peripheral consolidations, pleural line alterations, and no alterations). Additionally, a group of eight clinical cases showing all 4 LUS patterns were presented. All the principal investigators were asked to recognize the main abnormalities in each video and establish the LUS pattern probability, blinded to any clinical information. If intraclass correlation coefficient (ICC) was > 0.8, the sonographer was considered independent in LUS evaluation.

Statistical analysis

Continuous variables are expressed as mean ± standard deviation. Two-sample comparisons were performed using the Mann–Whitney U test, and the Chi-squared test for categorical data. Sensitivity, specificity, positive and negative predictive values, and accuracy were calculated using RT-PCR as the gold standard, and either the HighLUS as positive LUS exam and LowLUS/IntLUS/AltLUS as negative exam (Model 1), or HighLUS and IntLUS as positive exam and LowLUS/AltLUS as negative exam (Model 2).

The association of selected variables with the positivity of RT-PCR was then assessed by logistic regression model using multivariable procedures (forward selection). Variables were selected for the univariate analysis according to the clinical probability of being associated with the positivity of RT-PCR; all variables with an associated p value < 0.10 at univariate analysis were included in the final multivariate analysis. Variance inflation factor (VIF) > 5 was used to exclude variables due to multicollinearity. The interobserver variability in LUS assessment was examined by ICC and calculated in the whole series of paired measurements made by the participating Center principal investigators and the main expert sonographer. A p value of < 0.05 was considered statistically significant. All statistical analyses were performed using the Statistical Package for the Social Sciences (SPSS Inc., Chicago, Illinois, USA, version 20) and R software (R Foundation for Statistical Computing software).

Results

An initial number of 2035 patients were screened, and a final total number of 1462 patients were enrolled: 1022 patients had a positive RT-PCR and 440 a negative RT-PCR (prevalence of COVID-19 at presentation of 69.8%). The flowchart of patients is shown in Online Resource 4. Patients’ characteristics are listed in Table 1. Seventy-four percent of the population was enrolled in EDs, 16% in COVID units, and 10% in ICUs.

Table 1 Patients’ characteristics according to RT-PCR positivity

The prevalence of the four different LUS patterns are shown according to RT-PCR positivity in the overall population and in the different clinical phenotypes (Online Resource 5). Table 2 reports LUS diagnostic accuracy in all subgroups, considering as positive LUS exam HighLUS alone or HighLUS and IntLUS. An example of the 4 LUS probability patterns is shown in Fig. 1. Data on the prevalence of the light beam (as a binary variable) in the different clinical subgroups are shown in Online Resource 6.

Table 2 Diagnostic accuracy of LUS in patients suspected of COVID-19 considering HighLUS (positive exam) vs LowLUS/IntLUS/AltLUS (negative exam), or HighLUS/IntLUS (positive exam) vs LowLUS/AltLUS (negative exam) with RT-PCR as gold standard
Fig. 1
figure 1

Representative images of the four probability patterns in symptomatic patients, showing the distribution of the LUS signs of COVID-19 pneumonia. First row: bilateral distribution of typical LUS interstitial signs (high probability) in a case RT-PCR positive. Second row: monolateral distribution of interstitial LUS signs (intermediate probability) in a case RT-PCR positive. Third row: presence of atypical signs (alternative probability) in a case showing an isolated large consolidation with air bronchograms, due to bacterial pneumonia, and RT-PCR negative. Fourth row: absence of interstitial LUS signs (low probability) in a case RT-PCR negative. LUS lung ultrasound, COVID-19 Corona Virus Disease 2019, RT-PCR reverse transcriptase-polymerase chain reaction

A multivariate logistic regression analysis was performed to assess the independent role of selected variables to predict a positive RT-PCR. Both the HighLUS and the light beam were predictive of a positive RT-PCR, independently of desaturation (Online Resource 7). No severe multicollinearity was found between dyspnea and respiratory failure (VIF = 1.32) and between light beam and HighLUS (V = 2.95).

Respiratory failure was more prevalent in patients with the HighLUS compared to other LUS patterns (65.5% vs 50.5% in AltLUS, 47.6% in IntLUS and 5.8% in LowLUS, Chi-squared = 252, p < 0.0001).

A total of 75 sonographers were involved in the study. The average overall agreement on the training video-clips was 90.6%. The ICC of the whole series between each participating center principal investigator and the expert sonographer is listed in Online Resource 8. The average ICC was 0.980 (range 0.939–1.0); all sonographers showed an ICC > 0.90. In a fixed-effects model including the center as a predictor, no relevant changes were observed in the coefficients and their significance, neither in the univariate analysis, nor in the multivariable-adjusted model.

Discussion

Our findings show that some combinations between LUS patterns of probability for COVID-19 pneumonia and the clinical phenotype at presentation are accurate in predicting the result of the RT-PCR in patients with suspected SARS-CoV-2 infection. This standardized LUS protocol has shown low inter-operator variability among the US and European centers involved. Figure 2 reports the proposed operative algorithm based on LUS signs, and Fig. 3 details the main practical conclusions of our study.

Fig. 2
figure 2

Proposed operative algorithm on the interpretation of LUS signs in the first approach to patients suspected of COVID-19 pneumonia. The algorithm must be considered as a schematic guidance to be always clinically integrated with the overall picture, and never in isolation. Notes: *highly suggestive of bacterial pneumonia with isolated consolidation of large size and with dynamic air bronchogram; **suggestive of cardiogenic edema when visualized bilaterally with homogeneous and gravity-related distribution; ***when multiple clusters with light beam, coalescent B-lines and pleural irregularities are observed monolaterally (multifocal), it may be still classified HighLUS. HighLUS high-probability lung ultrasound pattern, LowLUS low-probability lung ultrasound pattern, IntLUS intermediate-probability lung ultrasound pattern, AltLUS alternative probability lung ultrasound pattern, COVID-19 Corona Virus Disease 2019.

Fig. 3
figure 3

Proposed operative flowchart on early management of patients suspected of COVID-19 during a pandemic surge, according to the clinical evaluation at presentation and the assignment of the LUS probability pattern. Final decision should consider that the protocol allows to rule-in or rule-out COVID-19 as the main cause of the presenting symptoms. HighLUS high-probability lung ultrasound pattern, LowLUS low-probability lung ultrasound pattern, IntLUS intermediate-probability lung ultrasound pattern, AltLUS alternative probability lung ultrasound pattern, COVID-19 Corona Virus Disease 2019

To the best of our knowledge, this is the first large multicenter study describing the application of a standardized LUS protocol that combines the sonographic probability of pneumonia with the clinical phenotypes for a rapid decision-making and resource allocation in patients suspected of COVID-19. The role of LUS in patients with suspected COVID-19 is to identify an associated interstitial pneumonia, the most common complication, and not to diagnose the infection, because pneumonia does not manifest in all patients with SARS-CoV-2 infection. Moreover, all the LUS signs of COVID-19 pneumonia are non-specific and may be present in other pulmonary conditions. These are the reasons why combining LUS patterns of probability for interstitial pneumonia with clinical phenotypes at presentation provides a structured guidance to the sonographer for a more effective and clinically-oriented use of LUS.

HighLUS: in times of high prevalence of the disease, the HighLUS is highly predictive of COVID-19 pneumonia. In our population, the percentage of discordant cases showing HighLUS and negative RT-PCR was low and might be explained by the non-negligible rate of false-negative swab tests. Indeed, the first RT-PCR is limited by a rather low sensitivity and only repeated swabs, or bronchoalveolar lavage in some cases, may correctly allocate the diagnosis in those patients with a first false-negative swab [10,11,12]. However, the possibility that an RT-PCR shifts to positivity after hospitalization and a reasonable incubation period always raises the suspicion of cross-infection; moreover, it is common experience that hospitalized patients with a typical clinical picture and suggestive computed tomography (CT) pattern but repeatedly negative RT-PCR have been considered and treated as COVID-19 pneumonia, particularly in the peak of the COVID-19 surge [10]. Notwithstanding these limitations when the prevalence of the disease is high, detection of HighLUS in patients suspected for COVID-19 predicts SARS-CoV-2 infection in all clinical subgroups, with a higher specificity in the mild phenotype.

The light beam, a relevant sign in the definition of the HighLUS, is also a strong independent predictor of RT-PCR positivity. We speculate that this sign can be the ultrasound representation of the very early interstitial involvement corresponding to the ground-glass opacities that are typically visible on CT studies in the lung periphery during the initial phase of the disease (Fig. 4) [13,14,15]. The light beam is not specific for COVID-19, as it can be seen in other conditions not necessarily COVID-related; however, it is a typical feature of the LUS pattern in COVID-19 pneumonia, and its presence during a pandemic surge should prompt high suspicion for COVID-19 pulmonary involvement.

Fig. 4
figure 4

The case of a 65-year-old patient complaining of fever, cough, and fatigue for 4 days, without signs of respiratory failure and showing normal saturation in room air. The CT scan shows bilateral early smooth GGO affecting main part of the lung periphery. The correspondent LUS exam shows the typical interstitial signs with patchy distribution well characterized by the “light beam” in abrupt alternance with “spared areas”. CT computed tomography, LUS lung ultrasound, GGO ground-glass opacity

IntLUS: as expected, the percentage of patients with IntLUS and negative RT-PCR was higher than in patients with HighLUS. However, the two patterns together achieved the highest sensitivity in all subgroups. Therefore, as a practical perspective, the absence of both HighLUS and IntLUS should be considered to rule out COVID-19 pneumonia with high sensitivity, which increases as the clinical situation worsens (from 90.2% in the overall population to 99.3% in respiratory failure).

LowLUS: overall, 35% of patients with the LowLUS had positive RT-PCR. This apparent discrepancy is explained by the presence of the virus in patients without pneumonia or with a very mild form of pulmonary involvement, invisible to LUS [16, 17]. However, in the absence of sonographically detectable pulmonary involvement (A-lines pattern throughout), it is unlikely that the respiratory failure is related to COVID-19. Interestingly enough, some patients with severe phenotype and LowLUS showed a positive RT-PCR. A possible explanation for this discrepancy may be the subjective nature of the reported dyspnea, which could be linked to factors other than pneumonia, including the stressful situation for patients with suspected COVID-19. Another possibility is the profound muscle weakness and general sense of fatigue, frequently accompanying symptoms of this viral disease. Indeed, when the analysis was restricted to the subgroup of patients with objective desaturation (respiratory failure subgroup), no patient with LowLUS resulted as RT-PCR positive. Thus, our study demonstrated that LowLUS in patients with objective signs of respiratory failure allows to rule out COVID-19 as the main cause of respiratory symptoms, suggesting searching for an alternative diagnosis.

AltLUS: bacterial pneumonia, cardiogenic pulmonary edema, pulmonary fibrosis, and neoplastic or infective pleural effusion were the main causes for AltLUS [18]. In both the severe and mixed phenotype, the vast majority of patients showing the AltLUS was confirmed RT-PCR negative (84.8% and 84.4%, respectively). Thus, even in a moment of high prevalence of COVID-19, the AltLUS correctly predicted the negativity of RT-PCR in most patients. These findings are important for clinicians trying to differentiate between the exacerbation of a chronic disease and COVID-19 pneumonia.

Mixed phenotype: in the group of patients assigned to the “mixed” phenotype, the role of LUS is particularly challenging. Pattern recognition may become difficult for the possibility of exacerbations of pre-existing pulmonary alterations. Indeed, many pulmonary conditions, both acute and chronic, share similar LUS signs with COVID-19 pneumonia. Despite these difficulties, the detection of the HighLUS in the mixed phenotype predicted a positive RT-PCR with good specificity. However, detection of IntLUS in patients with chronic pre-existing pulmonary diseases indicates the need for further testing.

LUS patterns and CT findings: our findings show the practical advantage of classifying LUS in four probability patterns within clinical phenotypes, which has not been used in previous LUS studies [17, 19,20,21,22,23]. The language used in this study to define the LUS patterns is similar to the language suggested for CT findings related to COVID-19 [8, 9, 13]. The “typical appearance” on CT can be assimilated to the HighLUS, the “indeterminate appearance” to IntLUS, the “atypical appearance” to AltLUS, and the “negative” to LowLUS. CT scan is the gold standard of chest imaging with a sensitivity superior to LUS. It is reasonable to describe LUS in terms of “probability”, whereas CT findings are described as “appearance”, and while the absence of pulmonary signs on CT indicates a “negative” study, the absence of LUS signs indicates “low probability” of COVID-19 pneumonia. We encourage a standard application of this language to LUS findings in the first approach to suspected cases. On the other hand, it is also evident the need to differentiate the main clinical phenotypes at presentation, considering that different combinations between clinical phenotypes and LUS patterns give different accuracy in predicting RT-PCR results. Indeed, the main principle of point-of-care LUS still remains the need for a close correlation with the clinical condition of the patient [22].

Limitations

Our study is limited by the lack of a definitive gold standard. We did not perform a systematic comparison with CT scan to confirm LUS findings of pneumonia and did not perform a follow-up of patients for definitive confirmation of COVID-19 infection in doubtful cases. COVID-19 poses several diagnostic dilemmas: studies on LUS for COVID-19 should combine the need to confirm the infection, which cannot be pursued by chest imaging, and the need to detect pneumonia, which is in the power of LUS. The RT-PCR test is limited by a low sensitivity and delay in the results, but it is the standard to indicate patients’ first diagnosis and guide hospital allocation in the clinical practice; therefore, it is the reference we used to assess LUS accuracy.

The LUS operators were blind to the RT-PCR results but aware of the patients’ clinical condition, which may have biased LUS interpretation. However, COVID-19 may occur with a wide variety of clinical pictures and the LUS diagnosis is not linked to the clinical severity and symptoms, minimizing this potential bias. Patients were enrolled when a sonographer with documented LUS experience was present; although this can also have introduced some selection bias, the selection was anyway completely random.

We assessed the inter-operator variability in the recognition of signs and patterns on clips. This is not the same as assessing LUS variability in the same patient, which was not feasible during the pandemic. However, our data show how a standardized LUS protocol may be implemented with high inter-center reproducibility and applied worldwide by operators with variable skills and experience.

While the number of severe pneumonias with respiratory failure is well estimated during the pandemic, the percentage of asymptomatic infections is undoubtedly underestimated. Even more, the number of interstitial pneumonias in mildly symptomatic or even asymptomatic patients is far from being fully investigated [24]. Future studies should address the real prevalence of pneumonia associated with COVID-19, and LUS can be the ideal sensitive tool to accomplish this task.

Finally, the interpretation of our results is linked to the prevalence of the disease, which was particularly high during the first pandemic wave. With a lower prevalence, the significance of these findings needs to be revisited.

Conclusions

A standardized LUS protocol combining ultrasound probability patterns with clinical phenotypes in patients suspected of COVID-19 pneumonia at presentation is feasible and reproducible. The HighLUS and IntLUS together are very sensitive in ruling out interstitial pneumonia, with increasing values with worsening clinical phenotypes; whereas in patients with mild symptoms, the HighLUS is very specific in ruling in interstitial pneumonia. The detection of the AltLUS suggests an alternative diagnosis. Our approach may be useful to rapidly guide management for wiser use of hospital resources during a pandemic surge, and may even represent the only available diagnostic tool in scarce resource areas.