FormalPara Key Summary Points

Why carry out this study?

The long-term effects of COVID on the lungs remain unclear, but, given the extent of the pandemic, it has the potential to become a significant chronic global health problem.

New health services need to be rolled out to follow-up COVID survivors, and understanding what these might entail is helpful for future planning.

We provide the first description of findings from a COVID follow-up service designed in accordance with British Thoracic Society Guidelines

What was learned from this study?

A total of 221 patients were followed-up: all patients who had been admitted to ICU (n = 44) and 38 of 177 (21%) non-ICU patients who had persistent symptoms and/or persistent radiographic changes on chest X-ray at their initial 8-week follow-up visit required further pulmonary function testing and thoracic imaging.

An overall prevalence of PFT abnormalities were seen in around 24% of the entire cohort and radiological evidence of fibrosis was seen in around 7% of the entire cohort at 4 months after discharge.

Introduction

On 31 December 2019, the Wuhan Municipal Health Commission, China, reported a cluster of cases of pneumonia of unknown origin to the World Health Organisation (WHO) [1]. The causative agent was a novel coronavirus SARS-CoV-2, now widely known as COVID-19. A year later, on 30 December 2020, the WHO reported a total number of 80,453,105 confirmed cases globally, and 1,775,776 confirmed deaths globally [2].

There is very little knowledge of the incidence of long-term sequelae following COVID-19 [3]. Although COVID-19 is a systemic disease, evidence suggests that the lungs are the most affected organ [4]. Follow-up data from previous coronavirus outbreaks, SARS-CoV and MERS-CoV, indicate that between 25% and 35% of survivors experience persistent abnormalities of pulmonary function and radiographic changes consistent with pulmonary fibrosis [5, 6]. Early case reports indicate that this phenomenon will also be associated with post-COVID infection [7,8,9,10]. With a global population of more than 78 million COVID-19 survivors at the close of 2020, and case rates continuing to rise, a better understanding of what may become a significant chronic global health issue is essential.

A handful of studies have described pulmonary sequelae of COVID in a total of 1509 post-infection COVID patients (individual study sample sizes 18–246 patients). Follow-up time ranged from 2 weeks to 3 months. Most studies report persistent radiographic changes and abnormalities of pulmonary function tests (PFTs) in a proportion of their patients, with prevalence varying from 18 to 90% for computerised tomography (CT) changes and 25% to 88% for PFT abnormalities [11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27].

In this study, we report on the experience of a newly established COVID follow-up service. We aimed to add to the growing evidence of the long-term effects of COVID-19, and to make early estimates of the persistence of pulmonary morbidity in hospitalised COVID survivors, and therefore the potential future burden to health services. Our objectives were to ascertain: (1) the proportion of hospitalised COVID-19 survivors requiring follow-up investigations (PFTs and CT imaging) in accordance with British Thoracic Society (BTS) guidelines [28]; (2) factors associated with this need for follow-up; and (3) the prevalence of persistent PFT abnormalities and CT changes in those requiring follow-up investigations. Our findings will help inform the design and roll-out of services in the post-COVID era.

Methods

Study Subjects and Design

This was a retrospective service evaluation analysis of the post-COVID follow-up service established at Manchester University NHS Foundation Trust, Wythenshawe Hospital, a tertiary respiratory department in the United Kingdom, in the wake of the first wave of the COVID-19 pandemic in spring 2020.

All consecutive patients admitted with a diagnosis of COVID-19 under Respiratory Medicine at Wythenshawe Hospital between 1 March and 30 June 2020 were included (n = 283). Diagnosis was made either by a positive SARS-CoV-2 reverse-transcriptase polymerase chain reaction or a combination of high clinical suspicion and supportive imaging, at the discretion of a senior clinician. Fifty-three of these individuals died (19%) during admission, while nine declined follow up. The remaining 221 patients were referred for follow-up. Follow-up PFTs and CT thoracic imaging were planned for patients as per BTS guidelines [28].

Ethics Statement

The study was conducted according to the guidelines of the Declaration of Helsinki of 1964, and its later amendments. Ethical review and approval were waived for this study, due to the work conducted being part of local service evaluation, utilising only data routinely collected as part of the local COVID follow-up service. As such, specific ethical approval and informed patient consent were not required, as confirmed by the NHS Health Research Authority decision tool. This service evaluation is registered locally.

Methods

Patient demographic data, including age, ethnicity, gender, pre-admission body mass index (BMI) and pre-existing co-morbidities, were obtained from hospital electronic patient records (EPR) and electronic general practice records, where available. Need for supplemental oxygen, admission to the intensive care unit (ICU), length of hospital stay, maximum fraction of inspired oxygen (FiO2) required, peak positive end-expiratory pressure (PEEP) and peak inspiratory positive airway pressure (IPAP) for ventilated patients were obtained via EPR and ICU notes.

All patients admitted under respiratory medicine or ICU were assessed by telephone at approximately 8 weeks post-discharge. A standard follow-up proforma (supplementary material 1) was used to identify persistent symptoms, including cough, breathlessness and fatigue. Breathlessness both pre- and post-admission was recorded and graded as per the Medical Research Council (MRC) breathlessness score. Follow-up chest radiographs were offered to all patients, and were reported by the local radiology department as per the British Society of Thoracic Imaging guidelines [29]. Patients were either discharged from the follow-up service, or further investigated with pulmonary function testing and high-resolution computed tomography thorax. Reasons for further investigation were: persistent breathlessness from baseline (MRC breathlessness score increase of ≥ 1),, significantly reduced exercise tolerance (self-reported), or persistent or worsening chest radiograph changes at follow-up. All CT scans were reported by a specialist thoracic radiologist with expertise in interstitial lung disease. Based on the descriptive report, the changes were categorized as: (1) normal; (2) persistent ground glass opacification; or (3) features of fibrosis.

Analysis

Main outcome measures:

  1. 1.

    Factors associated with persistent symptoms and/or changes on chest radiograph at 8 weeks post-discharge.

  2. 2.

    Pulmonary function tests at 4 months post-discharge in discharged ICU patients and ward-based patients with persistent symptoms at 8 weeks.

  3. 3.

    Radiographic changes on CT thorax at 4 months post-discharge in ICU patients and ward patients with persistent symptoms.

Statistical analysis was performed using Microsoft Excel and GraphPad Prism 9 software. Categorical variables were presented as counts, whilst continuous variables were presented as means ± standard deviation (SD). Statistical significance was tested using the Student's t test, Fisher’s exact test, chi-squared test, Mann–Whitney test, or one-way ANOVA test, as appropriate. Correlation was tested using Spearman’s correlation coefficient. A p value of < 0.05 is denoted as statistically significant.

Results

Baseline Demographics (Table 1)

Table 1 Baseline demographics and admission details of all patients and comparison of different cohorts

A total of 221 patients were included for analysis, of which 135 (61%) were male. Average age at admission was 58 years and average pre-admission BMI was 30.0 kg/m2. The majority of patients were white (n = 172, 78%), 31 were Asian (14%), 9 were black (4%) and 9 were of another ethnicity (4%). The majority of patients were life-long non-smokers (n = 139, 63%). The most common pre-existing co-morbidity was hypertension (n = 74, 33%), followed by asthma (n = 43, 19%), diabetes mellitus (DM; n = 36, 16%), chronic obstructive pulmonary disease (COPD; n = 19, 9%) and cardiovascular disease (n = 19, 9%). Nearly a quarter of patients (n = 53, 24%) had two or more pre-existing co-morbidities (Table 1).

Forty-four patients were treated in the ICU. The average age of ICU patients was younger than non-ICU patients (54 vs. 59 years, p = 0.033). The prevalence of COPD was increased in the non-ICU cohort compared with the ICU cohort (11% vs. 0%, p = 0.016). This is likely due to pre-existing COPD being felt to represent a poor prognostic factor with respect to the likelihood of benefit and survival in ICU, therefore reducing the likelihood of escalation of affected patients beyond ward-level care. There were no other statistically significant differences between the two cohorts (Table 1).

We compared baseline demographics of non-ICU patients with persistent symptoms and/or radiographic changes at 8 weeks (and therefore required further follow-up) and those that were symptom-free at 8 weeks. A greater proportion of patients with persistent symptoms were ex- or current smokers compared to those who were symptom-free with normal chest X-rays (CXRs) at 8 weeks (53% vs. 34%, p = 0.040) (see Table 1). Pre-existing DM was also statistically significantly increased in those requiring further follow-up compared with those discharged at 8 weeks (29% vs. 10%, p = 0.007). There was a trend towards increased prevalence of COPD in the follow-up group (18% vs. 9%) but this did not reach statistical significance (p = 0.134) (Table 1). There was no association between pre-existing asthma and the need for further follow-up (Table 1).

Inhaled Corticosteroids and Oral Corticosteroids in Concurrent Airways Disease

Pre-admission inhaled corticosteroids (ICS) were prescribed in all 19 patients with a pre-existing diagnosis of COPD or asthma-COPD overlap syndrome, and in 33 of 40 (83%) patients with a pre-existing diagnosis of asthma. There was no difference between the rates of prescribed ICS between the ICU and non-ICU cohort (25% vs. 23%; p = 0.695) (Table 1). There was also no significant difference between rates of prescribed ICS between the follow-up (FU) and non-FU groups in the non-ICU cohort (34% vs. 19%; p = 0.078) (Table 1). Twelve of the patients in the present study received oral prednisolone to treat their concurrent airways disease during admission for COVID, but, given the small numbers involved, we were not able to draw any conclusions regarding the impact of this treatment (data not shown).

Hospital Length of Stay and Oxygen Therapy

The ICU cohort had a longer length of stay (LOS), a greater need for supplemental oxygen, and a higher maximum FiO2 than the non-ICU cohort (Table 1). LOS was also significantly different between the non-ICU group requiring follow-up compared to those discharged from follow-up at 8 weeks (8 days vs. 5 days; p = 0.0176) (Table 1). There was no difference between the number of patients requiring supplemental oxygen or the maximum FiO2 in the non-ICU group requiring follow-up compared to those discharged from follow-up at 8 weeks (Table 1).

Follow-Up Investigations

Follow-up PFTs and CT thoracic imaging were planned for all ICU patients (n = 44) and 38 of 177 (21%) non-ICU patients who had persistent symptoms and/or persistent radiographic changes on CXR at their initial 8-week follow-up visit, as per BTS guidelines. Missing data due to patients failing to attend or inability to complete tests is depicted in tables and figures where appropriate.

Abnormalities in Pulmonary Function Tests Correlate with Length of Stay and Maximum FiO2

Sixty-seven (30%) patients had follow-up PFTs: 37 of 44 (84%) ICU patients and 30 of 38 non-ICU patients intended for follow-up. Pre-admission PFTs were only available for 3 of these individuals (data not shown). The average time between discharge date and follow-up PFTs was 109 ± 37 days (median = 101). Fifty-three of the 67 (79%) had an abnormality (< 80% predicted value) in at least one PFT parameter. This equated to 24% of the entire cohort (n = 221). There was no difference in the proportion of patients with at least one abnormal PFT between the ICU and non-ICU groups (76% vs. 83%; p = 0.552).

The most common pulmonary function abnormality was a reduction in diffusion capacity of carbon monoxide (TLCO) of less than 80% predicted, seen in 31 of 59 patients (53%). This was more frequent in ICU patients compared with non-ICU patients, but did not reach statistical significance (64% vs. 38%; p = 0.0695) (Fig. 1). A restrictive deficit [forced vital capacity (FVC) < 80% predicted and forced expiratory volume in one second/forced vital capacity (FEV1/FVC) ratio > 70%] was seen in 22 of 67 patients (33%), and was similarly distributed in ICU and non-ICU patients. There was no statistical difference in absolute and percentage predicted FVC and TLCO values between the ICU and non-ICU groups (Table 2).

Fig. 1
figure 1

The proportion of patients with abnormal pulmonary function tests comparing ICU and non-ICU cohorts. Abnormal pulmonary function tests (PFT) are defined as < 80% predicted value for FEV1, FVC, and TLCO. Comparison is made between intensive care unit (ICU) patients and those non-ICU patients who required follow-up investigations. Not all PFTs were completed for all patients. The number of patients completing each test is shown under the appropriate bars. Statistical significance is measured using Fisher’s exact test and a p value of < 0.05 is denoted as statistically significant. NS not significant, FEV1 forced expiratory volume in 1 s, FVC forced vital capacity, TLCO diffusion capacity for carbon monoxide

Table 2 Pulmonary function tests (PFTs) values comparing ICU and non-ICU cohorts

The non-ICU cohort had a significantly lower absolute FEV1 (2.45 vs. 2.88; p = 0.027) and absolute and percentage predicted MFEF 75/25 (2.17 vs. 2.99; p = 0.027, and 80 vs. 102; p = 0.004, respectively) than the ICU cohort (Table 2). We speculate that the increased prevalence of obstructive airway disease and small airway disease in the non-ICU follow-up group compared to the ICU group is likely explained by the increased prevalence of COPD (18% vs. 0%) and ex- or current smokers (53% vs. 32%) in this group (Fig. 1).

For the cohort as a whole, longer LOS correlated with declining TLCO (r = − 0.518; p = < 0.0001). This remained true for the ICU cohort (r = − 0.656; p = 0.0002) but not the non-ICU cohort (r = − 0.286; p = 0.153) when analysed separately (Fig. 2a, b). For the cohort as a whole, higher maximum FiO2 correlated with declining TLCO (r = − 0.396; p = 0.0021) (Fig. 2c). When analysed separately, this correlation was not maintained in either the ICU cohort (r = − 0.0986; p = 0.624) or the non-ICU cohort (r = − 0.324; p = 0.1136). In the ICU cohort, higher maximum IPAP correlated with declining TLCO (r = − 0.391; p = 0.0299) (Fig. 2d) but not PEEP (r = 0.083; p = 0.693). In all patients, there was no correlation between TLCO and BMI (r = 0.0404; p = 0.765).

Fig. 2
figure 2

Percentage of predicted TLCO correlates negatively with increasing length of stay, increasing maximum FiO2 and peak inspiratory positive airway pressure (IPAP). a Percentage of predicted TLCO compared with length of stay for all patients. b Percentage of predicted TLCO compared with length of stay for intensive care unit (ICU) patients. c Percentage of predicted TLCO compared with maximum inspired FiO2 for all patients. d Peak IPAP correlates with percentage predicted TLCO in ventilated patients on ICU. Correlation is tested using Spearman’s correlation coefficient, and a p value of < 0.05 is denoted as statistically significant. TLCO diffusion capacity for carbon monoxide

Thoracic Radiology

A total of 72 (33%) follow-up CT scans were performed: 37 of 44 (84%) ICU patients and 35 of 38 (92%) non-ICU patients intended for follow-up. The mean time from discharge to follow-up CT imaging was 131 ± 52 days (median = 132). For those who had acute imaging from their admission with COVID, all were reported to show some temporal improvement in relation to the admission CT. Figure 3a shows the proportion of CT scans that were defined as normal (35%), ground-glass opacities (GGO; 44%), or had radiological signs of fibrosis (defined as evidence of traction bronchiectasis, architectural distortion, or honey combing; 21%). This equated to GGO in 14% and fibrosis in 7% of the entire cohort (n = 221). Radiological changes had a statistically significant increase in the ICU compared with the ward-based cohorts (chi-squared test, p = 0.0259): GGO in 43% ICU compared with 46% non-ICU, and fibrosis in 32% ICU compared with 9% non-ICU. TLCO abnormalities were more frequent in those with GGO or fibrosis compared to those with normal appearances (chi-squared test, p = 0.0293) (Fig. 3b).

Fig. 3
figure 3

Radiological findings on CT Thorax at an average of 5 months after admission. a Prevalence of ground glass changes (GGO) and fibrotic changes on CT thorax at follow up. Comparison is made between intensive care unit (ICU) patients and those non-ICU patients who required follow-up investigations. These changes were significantly increased in intensive care unit (ICU) patients as compared to non-ICU patients (chi-squared test). b Prevalence of abnormalities in TLCO (< 80% predicted value) in each of the three radiological groups. These were significantly increased in those with ground glass changes and fibrotic change as compared to those with normal appearance on CT. Statistical significance is measured using the chi-squared test and a p value of < 0.05 is denoted as statistically significant

Maximum inspired FiO2 was highest in the fibrosis group (mean = 74.3), followed by those with GGO (mean = 56.6) and lowest in the normal CT imaging group (mean = 47.4), and this trend reached statistical significance (one-way ANOVA test, p = 0.0038) (Table 3). There were no differences in peak PEEP or IPAP between the three groups. There was a statistically significant difference in TLCO between the three groups (one-way ANOVA test, p = 0.0072), with lowest average TLCO in those showing fibrotic changes (Table 3).

Table 3 Maximum inspired FiO2, ventilator settings and percentage of predicted values for pulmonary function tests (PFTs) in the different radiological categories

Discussion

We have presented findings from a retrospective evaluation of a ‘real-world’ experience of following-up hospitalised COVID survivors. We found that, in patients who had received ward-level care only, 21% had persistent symptoms or CXR changes at 8 weeks, warranting further investigation in accordance with BTS guidelines. In this cohort, statistically significant associations with persistent symptoms were being an ex- or current smoker, having pre-existing DM, and having a longer length of stay. To the best of our knowledge, we are the first to report these associations. The association with DM is of interest, as there is already considerable evidence that pre-existing DM contributes to the severity and mortality of COVID patients [30], and our data suggest that it may also contribute to the likelihood of survivors developing long-term pulmonary sequelae. Conversely, although premorbid obesity is well documented to increase severity and mortality in COVID patients [31, 32], in our cohort we found no association between BMI and persistent symptoms, nor did we find any relationship between BMI and percentage of predicted TLCO at follow-up. We also did not find any association between pre-admission inhaled corticosteroid use and persistent symptoms. This is unexpected in light of the recent STOIC study, which showed that COVID patients treated with inhaled corticosteroids in the community had decreased rates of hospital admission, and also a quicker resolution of self-reported and questionnaire-reported symptoms [33]. We speculate that the reasons we did not see similar results were partly due to the low numbers prescribed inhaled corticosteroids in our study, and also because of the different settings (real-world evidence compared with randomised control trial). Although we were able to obtain record of prescriptions, we had no means of measuring compliance. We speculate that compliance was likely to be lower in our patients, who were being treated for chronic conditions, compared to acutely unwell participants recruited to a COVID therapy trial.

In our ICU cohort, we observed a higher prevalence of TLCO abnormalities than the ward-based cohort, but this did not quite reach statistical significance. In the ICU cohort, the percentage of predicted TLCO correlated negatively with increased LOS and with higher maximum IPAP, but did not correlate with maximum inspired FiO2 (likely due to the high levels of FiO2 used in all these patients). In the entire patient group, the percentage of predicted TLCO correlated negatively with both increased LOS and higher maximum FiO2. Radiological signs of GGO and fibrosis showed a statistically significant increase in the ICU cohort compared to the ward-based cohort.

Our findings are in keeping with the few existing studies in the literature. Radiographic abnormalities have been reported in 18–94% of COVID patients at follow-up, and abnormalities of pulmonary function tests in 25–88% of these patients. One study of 18 patients reported radiological signs of fibrosis in 84% of their subjects [26], but others have reported such changes in between 7 and 26% of subjects, reflective of our findings above [13, 14, 16, 17, 24]. The most frequently reported PFT abnormality is an abnormal TLCO, reported in 16–72% of patients [13, 14, 16, 19, 21, 22, 24, 25, 27]. Very few studies have reported associations with persistent pathological CT findings and lung function abnormalities. Lerum et al. reported an association between ICU admission and pathological CT findings [14] in a cohort of 15 ICU patients, and Smet et al. reported an association between abnormalities in TLCO and increasing LOS, as well as with admission to ICU [22], all in keeping with our findings. Shah et al. reported an association between the number of days requiring supplemental oxygen and TLCO and CT abnormalities. Collectively, this allows us to start to build a picture of the patients who are at greater risk of long-term pulmonary sequelae after COVID infection, with LOS, oxygen requirements and ICU admission as useful indicators of severity of disease and increased risk of later problems.

A limitation of our study is the bias introduced by only performing PFTs and CT thorax on those with self-reported persistent symptoms at 8 weeks, or with persistent changes on chest radiograph. Furthermore, we did not have both PFTs and CTs for all patients intended for follow-up, due to delays in booking appointments, patients failing to attend, or inability to complete tests, again reflective of collecting data from a ‘real world’ follow-up setting. This very likely means that some cases of post-COVID morbidity are not captured. With this caveat, the prevalence of detected persistent pulmonary function test abnormalities within our entire cohort (n = 221) was 53 (24%). The prevalence of detected persistent radiological interstitial changes post-COVID within our entire cohort (n = 221) was 50 (23%). The prevalence of persistent fibrosis was 15 (7%). However, in the absence of CT scans for every individual, these may not be truly representative figures. Nonetheless, in real-world settings, it would not be practical or ethical to offer CTs scans to all follow-up patients, and therefore this gives some estimate of the likely prevalence of symptomatic post-COVID fibrosis based on a more pragmatic approach that should be applicable to most settings.

Of further note, we do not yet know the long-term natural history of the radiological appearances, and it remains possible that there may be significant improvement further down the line. Indeed, one very recent study has reported that CT abnormalities continue to improve at 6-months post-infection, although PFT abnormalities, in particular restrictive defects, may persist [34].

An additional limitation of our study is the absence of pre-morbid PFTs and imaging in the majority of our patients, meaning that we cannot attribute with certainty the abnormalities seen to the damaging effects of COVID infection. This is reflected in the increased prevalence of obstructive and small airway defects in our non-ICU follow-up cohort as compared to our ICU cohort, which we speculate is likely explained by the increased prevalence of COPD and ex- and current smokers in this group. Finally, our data report on follow-up from patients treated very early in the pandemic, when no medical interventions beyond best supportive care were in routine use for the treatment of COVID. Since then, advances have been made in the treatment of COVID-19, most notably with the introduction of dexamethasone for all hospitalised patients requiring supplemental oxygen, as well as the antiviral remdesivir and the anti-interleukin 6 monoclonal antibody tocilizumab. The advent of these treatments will very likely impact on the long-term consequences of COVID infection. The RECOVERY trial was introduced during the period of recruitment for this study, and a few of the patients in this study were recruited and randomised to this clinical trial (data not shown). It will be extremely interesting to discover whether any of these treatments also reduce long-term pulmonary sequelae in COVID survivors. However, we did not feel that our study was an appropriate platform to address this question, and felt this was better examined within the context of the clinical trial analysis.

Nonetheless, our study contributes to the emerging literature on post-COVID lung sequelae. Notably, we have one of the longest post-infection follow-up time of studies to date, with an average of 15 weeks from discharge to PFTs and 18 weeks from discharge to CT thorax. Of the existing published studies, there are only five reporting at 12 weeks or longer after infection [14, 16, 21, 24, 27], and only one reporting beyond 3 months after discharge [33]. BTS do not recommend evaluation of PFTs prior to 3 months after discharge [28], as premature evaluation may lead to errors in functional diagnosis, and any abnormalities may still reflect the resolving acute inflammatory process rather than longer-term sequelae. In addition, to the best of our knowledge, we have presented the first description of experience of implementing the new BTS guidelines for COVID follow-up, and, as such, the data presented here can help inform and guide the development of equivalent services as we enter the post-COVID era.

Conclusion

Our data add to the growing evidence that there will be pulmonary sequelae in a proportion of COVID survivors, providing some insight into what may become a significant chronic global health problem. We can offer the following estimate from our findings which others may find useful: services using BTS guidelines to guide follow-up of post-COVID hospitalised patients may see an overall prevalence of PFT abnormalities in around 24% and radiological fibrosis in around 7% at 4 months after discharge.