Introduction

Since the end of 2019, a global pandemic spread in most countries all over the world. As in many others realities, since the end of February 2020, Italy experienced a first pandemic phase characterized by an exponential increase in the number of affected patients accessing emergency units and, due to government restrictions, a next phase lasting from May to September 2020, characterized by a slow progressive reduction of the affected between the ones presenting to hospitals with a clinical suspicion of COVID-19 infection [1].

Imaging played a major role in the management of the infected population, and chest computed tomography (CT) and chest X-ray (CXR) have been extensively used, revealing undisputed utility during the admission workout and the follow-up of hospitalized patients [2,3,4]. Even if CT imaging is considered the most effective method for the detection of lung abnormalities particularly in the early stage of the disease, portable CXR, in the emergency setting, has played an important role in the identification and follow-up of lung involvement [4,5,6,7,8]. In fact, CXR allowed chest imaging without moving patients from their rooms and so reducing the possibility of a cross-infection among other patients and medical staff and reducing the burden on radiological units, where the recommended times for decontamination of CT rooms have frequently led to a heavy reduction in CT activity [9, 10]. In our institute, as in many others Italian hospitals, each patient with clinical suspicion of COVID-19 infection underwent a supine CXR in the admission room, as a first-line triage tool in association with reverse transcription polymerase chain reaction (RT-PCR) testing. We evaluated each admission CXR for the presence of a large set of findings, some of which related to COVID-19 pneumonia, as reported in recent literature, and others related to cardio-respiratory non-COVID-19 affections and labeled as suggestive for COVID-19 infection or for an alternative diagnosis [5, 7, 11, 12]. For each COVID-19 patient, each admission CXR has also been scored for the extension of pulmonary findings with a score, previously validated for COVID-19 lung alterations, and all the acquired data were correlated with the patient’s final diagnosis and outcome [7].

The aims of our study were to prospectively valuate admission CXR accuracy at the reduction of positive cases between clinically suspects, to identify specific CXR findings strictly associated with COVID-19 final diagnosis and to correlate lung involvement of admission CXR with patients’ outcome. Previous studies have focused mainly on description of CXR findings at presentation and on CXR modifications related to the disease time course [5,6,7, 12]. Other studies were concentrated on the prognostic role of such methodic and have scored radiographic alterations, either with visual scores or using artificial intelligence (AI) technology, generally finding good correlations with patients’ outcome [6, 7, 13, 14]. Recently, a study by Hare and colleagues has evaluated CXR diagnostic accuracy and detected a very high specificity (100%) [8]. To our knowledge, however, all previous CXR evaluations were based on the assessment of lung alterations during pandemic peak and were retrospective ones.

Materials and method

Study design and patients characteristics

This was an observational monocentric prospective study approved by the Ethical Committee of our referring center (rif. CEAVC 17,104). Informed consent was obtained from study patients. From the 1st to the 30rd of April 2020, we prospectively evaluated all the admission CXR of patients presenting to our institute with a clinical suspicion of COVID-19 infection. According to recent literature, the following characteristics were evaluated: age and gender, fever, cough, dyspnea, reduction of SpO2, chest pain, myalgias, rhinorrhea, diarrhea, hemoptysis and smell or taste disturbances. All patients underwent a nasopharyngeal swab and a supine CXR while waiting in the isolation ward [11, 15,16,17]. Results of RT-PCR in respiratory specimens requested in ED were available within 6 h from the request. We decided not to consider dead patients in relation to the short-term follow-up of our study (one month since the admission date).

CXR examinations

CXRs were performed in the antero-posterior projection, with patients in the supine position, using portable X-ray units (FDR Go PLUS-Fujifilm, Italy) in the isolation ward. All images were stored in a picture archiving and communication system (PACS, Syngo-Siemens), and a report was provided before moving patients from the admission room. The CXRs were evaluated by two radiologists (a cardiothoracic one and a general one), blinded from any clinical information and, when discordant, the final decision was reached collegially. Case adjudication was dichotomic: COVID-19 infection ‘present’ or ‘absent.’ In COVID-19 absent cases, an alternative pre-specified clinical diagnosis was indicated. In each CXR, according to the Fleischer Society glossary of terms and according to previous literature it was searched the presence of ground glass opacity (GGO), consolidation (CO) and reticular-nodular opacities (RNO) [12, 18, 19]. For each CXR was then specified the site of lung alterations as: upper or lower, peripheral or central, focal or diffuse, monolateral or bilateral [12]. Nodules, pleural effusions, lung cavitation, pneumothorax, hilar congestion, peri-bronchial cuffing and cardiac enlargement were also searched and reported when present [18]. Patients with a positive RT-PCR in respiratory specimens, comprehensive of bronchoalveolar lavage (BAL) within 5 days from ED presentation, were considered COVID-19 present. In COVID-19 patients only, the extension of the CXR detected findings was scored using a severity score validated by Wong and colleagues for COVID-19 lung involvement. It is derived from a previously more complex one developed in order to assess lung edematous involvement in acute respiratory distress syndrome-ARDS (Radiographic Assessment of Lung Edema-RALE) [7, 20]. In this modified RALE score, each lung was classified for the extension of lung involvement from 0 to 4 (0 = no involvement; 1 =  < 25%; 2 = 25–50%; 3 = 50–75%; 4 =  ≥ 75% of involvement), and the scores of both lungs were summed with a maximum value of 8 [7].

Patient diagnosis and outcome

COVID-19 infection was confirmed by RT-PCR on nasopharyngeal and throat swabs. In our institute, sometimes were needed serial RT-PCR tests before reaching a diagnosis. In very few cases, COVID-19 diagnosis was confirmed with RT-PCR test on BAL. The final diagnosis of patients was that derived from the clinical report for hospitalized patients and from telephonic follow-up for non-hospitalized ones. The outcome of COVID-19 patients was graded considering the following steps: patients discharged (Group 0), patients requiring ordinary hospitalization (Group 1), noninvasive ventilation (Group 2) or intubation (Group 3).

Statistical analysis

Sensitivity and specificity of CXR examination in the diagnosis of COVID-19 were estimated. The diagnostic accuracy of CXR and the association of each radiological finding with the presence or absence of COVID-19 infection were assessed through Cohen's K test. A forward logistic regression analysis with maximum likelihood method was performed to evaluate the correlations of radiological findings and COVID-19 diagnosis, with sex and age as covariates. Odds ratios and corresponding 95% confidence intervals (95% CI) were thus estimated by logistic regression model. The nonparametric test of Kruskal–Wallis was carried out to evaluate the difference in the RALE score between patients with different outcome. Statistical analysis was performed with SPSS (IBM SPSS Statistics for Windows, version 25.0); statistical significance threshold was set with p = 0.05.

Results

During the study period, 327 CXRs of patients presenting to our institute with a clinical suspect of COVID-19 infection were evaluated in total. A percentage of 30.5% (100 patients) resulted COVID-19 positive at nasopharyngeal swab or at BAL RT-PCR tests. The clinical characteristics of the 327 patients and of the 100 infected on presentation are summarized in Table 1. Note the poor difference in age and sex distribution between the two populations, with a mean age of 71 years for all patients accessing the admission ward and of 69 years for COVID-19 ones. There is also an equal presence of males and females, with only a slight prevalence of the last ones in both cohorts. Likewise, the clinical presentation is very similar with a predominance of fever, dyspnea and cough as presenting symptoms. In Fig. 1 is shown the age stratification of COVID-19 affected patients (Fig. 1). Sensitivity and specificity of CXR examination in the diagnosis of COVID-19 were 60% and 83%, respectively (Table 2). CXR accuracy in the diagnosis of COVID-19 was fair (Cohen’s K = 0.42 with a p value < 0.001) (Table 3). It can be observed that findings as GGO, CO, RNO and a peripheral, basal, bilateral and diffuse distribution of lung alterations showed statistically significant Cohen’s K, indicating an association with COVID-19 diagnosis, although with low K values (< 0.3) (Fig. 2, 3). After logistic regression, three radiological findings showed p values < 0.05 with odds ratio > 1: presence of GGO and a basal and diffuse distribution, indicating the three main characteristic that should guide the diagnosis toward COVID-19 infection positivity (Table 4). Nonparametric Kruskal–Wallis test showed statistically significant differences of RALE score between the following groups with different outcomes: 0–1 (p = 0.014), 0–3 (p = 0.001), 0–2 (p = 0.03), 1–3 (p = 0.04). These results were obtained despite the large differences in numerosity among groups: Group 0 (n = 11), Group 1 (n = 85), Group 2 (n = 5), Group 3 (n = 5) (Fig. 4).

Table 1 Descriptive statistic of total suspects and of COVID-19 population
Fig. 1
figure 1

Age distribution of patients with COVID-19 pneumonia and positive nasopharyngeal swab

Table 2 CXR examination versus COVID-19 diagnosis contingency table. Reverse-transcription polymerase chain reaction (RT-PCR); chest X-ray (CXR)
Table 3 Cohen’s K for each radiological finding with corresponding p value
Fig. 2
figure 2

Ground glass opacities in COVID-19 pneumonia. Figure a shows an initial interstitial thickening in both lower pulmonary lobes (arrows). In figure b, a young male patient with a diffuse basal and bilateral GGO involvement

Fig. 3
figure 3

Diffuse lung involvement in COVID-19 pneumonia. Figure a shows a reticular-nodular pattern in both basal and subpleural parenchyma. Figures in b and c demonstrate diffuse lung involvement with bilateral consolidations and thickening of peri-bronchovascular interstitium

Table 4 Logistic regression results. Odds ratio with 95% confidence interval and p values of radiological findings showing significative association with COVID-19 diagnosis and odds ratio ≥ 1 after logistic regression. Ground glass opacities (GGO), distribution (D), confidence interval (CI)
Fig. 4
figure 4

Box and Whisker plot of RALE score estimated in each group by outcome: patients discharged (Group 0), patients requiring ordinary hospitalization (1), noninvasive ventilation (2) or intubation (3). RALE score showed a statistical correlation with the patients’ outcome confirming the prognostic value of CXR scores of COVID-19 involvement at the diagnosis

Discussion

While the high diagnostic accuracy of chest CT in detecting COVID-19 alterations has already been evaluated, with a specificity ranging, according to different studies, from 95 to 99%, and a sensitivity varying from 61 to 98%, to our knowledge, only Hare and colleagues has evaluated CXR specificity [7, 8, 21,22,23,24]. They have showed, in their recent work, a CXR specificity of 100% and a sensitivity of 44% in diagnosing COVID 19 infection using the British Society of Thoracic Imaging issued guidelines for the categorization of chest radiographs [25]. Our results show a moderate reduction in CXR specificity (83%) compared to them. This difference may be related in primis to the different CXR reporting system. In Hare work, patients were categorized as classic/probable COVID-19, indeterminate for COVID-19, normal, and non-COVID-19, according to the BSTI guidelines, while we asked the radiologist to assess only the presence or the absence of COVID-19 lung involvement using a set of previously identified radiological findings suggestive for COVID-19 infection, without an “indeterminate” or “normal” pattern [7, 12, 18, 25]. Most likely, our radiologists considered the “indeterminate” and “normal” CXRs as non-COVID-19 ones. In addition, our work is a prospective one, without a pre-selection of patients and may reflect more objectively the Italian reality and, maybe, the diagnostic accuracy of the methodic. Finally, our study took place in a period of non-pandemic peak, with a global reduction of the affected among the ones presenting with a clinical suspicion (in our study, COVID-19 patients were the 30% of the suspected ones), thereby affecting CXR predictive values in diagnosing COVID-19 patients. All these factors may concur in influencing the evaluation of the methodic diagnostic accuracy. In our opinion, a specificity of 83%, with a negative predictive value of 81%, although not high as those reported by Hare and not high as those reported for CT scans, reveal quite a good ability of CXR to rule out others cardiopulmonary affections mimicking COVID-19 at clinical presentation (Fig. 5) [8, 21, 22]. Our correlation of CXR findings with the final diagnosis of COVID-19 infection identified GGO, a peripheral and a diffuse distribution of lung alterations, as the ones highly related to COVID-19 final diagnosis (P values < 0.001). This result is concordant with previous ones and confirms the necessity for the radiologist to take special attention in these findings as suspicious of COVID-19 lung involvement [8, 12]. Furthermore, a logistic regression carried out between a huge number of variables including all radiographic findings (correlated or not correlated with COVID-19 diagnosis) plus patients’ sex and age identified GGO and a diffuse bibasal prevalence of lung alterations as the three independent variables as predictors of COVID-19 diagnosis. These results are significant as they enable radiologists to identify, with enough confidence, a subset of radiologically suspected patients among those presenting with similar symptoms in a non-pandemic peak and to ask the clinicians for a different management for the identified ones (such as continue isolation or repeat the negative RT-PCR tests), even in front of a low clinical suspect and of a low prevalence of the infected among the population.

Fig. 5
figure 5

Alternative diagnosis. In figure a, an upper-right lobar pneumonia (arrow) with diffuse inflammatory lung involvement. In figure b, an 82-year-old woman with cardiac failure and pulmonary edema, hilar congestion (arrow) and cardiomegaly

On the other hand, we found a sensitivity of admission CXR of about 60%. This value is similar to that found by Wong et al. (69%) and by Cozzi et al. (61.1%) at the pandemic peak and is presumably related to the low capability of CXR to identify subtle lung alterations generally occurring in mild disease or at the beginning of the illness [7, 26]. This occurring both at the pandemic peak and during the progressive reduction of the affected among the population. In our study, the RALE score showed a good correlation with the patients’ outcome confirming the prognostic value of CXR scores of COVID-19 involvement at the diagnosis [6, 7, 13, 26] (Fig. 2). In our experience, 85% of COVID-19 patients underwent an ordinary hospitalization while only a minimum percentage either was dismissed or required respiratory assistance. This evidence reflects the percentage of cases requiring respiratory assistance (approximately 10%) between the hospitalized ones, similar to that detected during the pandemic peak, and on the other hand reflects Italian policy regarding COVID-19 pandemic [27,28,29]. In fact, by now in Italy, people are managed at home while experiencing a mild symptomatology and are asked to apply for hospital admission only with the worsening of the clinical situation. These circumstances explain the few cases of non-hospitalized COVID-19 patients (11%) between all the ones admitting to our institute. This behavior may also explain our slightly higher sensitivity (60%) compared to Hare’s one (44%) as we have a lower percentage of mild cases between admitted patients, thus possibly lacking a sensible percentage of false negative CXR. In our study, sex and age were not predictors of COVID-19 infection. In fact, sex and age distribution in affected and non-affected populations are similar with an average of 70 years approximately (Table 1). In the affected population, we found a peak of incidence in patients aging between 80 and 85 years. So, although age did not result as a predictor factor for COVID-19 infection, the age stratification of our COVID-19 population reflects those previously described [30].

Our study has several limitations. First, we lack interobserver agreement in the diagnostic evaluation and in the prognostic scoring of the CXR (all decisions were reached collegially) while this evaluation has been previously carried out both for diagnostic and prognostic assessment with acceptable results [6,7,8, 13, 25]. Also, we evaluated only admission CXR and we lack follow-up imaging. This is the main limit of our study as it has been previously proved that CXR findings differ according to different time interval from the clinical onset and that radiological findings strongly modify along with the time course of the disease [7,8,9,10,11,12,13,14,15,16,17,18,19]. Furthermore, we have a limited period of follow-up with necessarily an incomplete picture of patients’ outcome. This may be a bias affecting the results of the correlation between the modified RALE score and patients’ outcome.

However, to our knowledge this is the first prospective study aimed to define CXR diagnostic accuracy, and it is the first study performed during the descending phase of COVID-19 pandemic. This work demonstrates that, although not sensible and specific as chest CT, CXR examination, a convenient fast and bed-side methodic, correlates with patients’ outcome and shows a good specificity, thus enabling the radiologist to rule out most of others cardiopulmonary affections. We have also identified three findings (presence of GGO and diffuse and basal lung alterations) which allows radiologists to point out to the clinician highly suspicious patients even in front of a low clinical suspect.