Significance

What is already known on this subject? The considerable volume of COVID-19 cases occurring during the pandemic has made it challenging for state and local health departments to conduct thorough case investigations and collect all requested data elements. As a result, the national COVID-19 surveillance database is incomplete for many requested data elements, including pregnancy status.

What this study adds? Sensitivity of the pregnancy field on the COVID-19 case report form (CRF) varied by state and demographic factors. In the absence of complete pregnancy status in COVID-19 case surveillance data, reporting jurisdictions might consider linkage to birth and fetal death certificates to ascertain this information. Use of data linkages in combination with pregnancy information available on the CRF will provide a more comprehensive picture of the burden of SARS-CoV-2 infection in pregnancy.

Purpose

State, local, and territorial public health departments voluntarily report infections from SARS-CoV-2, the virus that causes coronavirus disease 2019 (COVID-19), to the Centers for Disease Control and Prevention (CDC) as part of surveillance of nationally notifiable conditions. The COVID-19 case report form (CRF) was developed by CDC to standardize the reporting of information on cases, including demographics, exposures, contact history, and clinical course and management (Centers for Disease Control & Prevention, 2020c). The current version of the COVID-19 CRF is two pages in length and contains over 100 data elements. The considerable volume of COVID-19 cases occurring during the pandemic has made it challenging for state and local health departments to conduct thorough case investigations and collect all requested data elements. Jurisdictions must often prioritize collecting those data elements needed to trace close contacts and prevent transmission of infection (CDC, 2020a, d). As a result, the national COVID-19 surveillance database is incomplete for many requested data elements, including pregnancy status (Stokes et al., 2020).

Previous studies have shown that pregnant people are at increased risk of severe illness from COVID-19 when compared to nonpregnant people (Allotey et al., 2020; Delahoy et al., 2020; Ellington et al., 2020; Panagiotakopoulos et al., 2020; Zambrano et al., 2020). Two CDC analyses using national COVID-19 surveillance data found that pregnant women with SARS-CoV-2 infection were more likely to have adverse outcomes (e.g., intensive care unit [ICU] admissions, mechanical ventilation, extracorporeal membrane oxygenation, and death) than nonpregnant women (Ellington et al., 2020; Zambrano et al., 2020). However, a limitation of those analyses was that pregnancy status was missing for more than two-thirds of women of reproductive age (WRA), defined as women aged 15–44 years. We sought to examine the sensitivity of the COVID-19 CRF pregnancy field and to identify systematic differences in the percentage of confirmed pregnant women with SARS-CoV-2 infection with pregnancy indicated on the CRF.

Description

In both Illinois and Tennessee, local or regional health departments are responsible for case investigations and monitoring of reports of COVID-19, including collecting information needed to complete the CRF. The majority of COVID-19 reports are received through laboratory reporting of SARS-CoV-2 infections, which typically do not include information on whether the infected person was pregnant at the time of testing. CRF data collected by local health departments are entered into the National Electronic Disease Surveillance System (NEDSS) Base System (NBS), which houses all reportable conditions except HIV and sexually transmitted infections. CRFs for persons with laboratory-confirmed SARS-CoV-2 infection were linked to birth and fetal death certificates with pregnancy completion dates during January 1, 2020–December 31, 2020 in Illinois and Tennessee. Linkage methodology differed by state. Illinois used Linkplus software to conduct probabilistic linkages to match CRFs for females aged 10–55 years with birth and fetal death certificates, relying on phonetic first name, current last name, maiden last name, date of birth, and address as linkage variables. Tennessee used deterministic linkages in SAS version 9.4 to match CRFs for non-males born on or after January 1, 1960 to birth and fetal death certificates using phonetic first name, current last name, maiden last name, date of birth, and social security number (if available). For the remainder of this paper, CRFs that linked to a birth or fetal death certificate and had a positive SARS-CoV-2 specimen collected during pregnancy or on the delivery date will be referred to as “confirmed pregnant women with SARS-CoV-2 infection.” CRFs that linked with a vital record but had specimen collection date after the delivery date were excluded from further analysis.

Among confirmed pregnant women with SARS-CoV-2 infection, both states limited the analytic sample for this study to those aged 15–44 years. The sensitivity of the CRF pregnancy field was calculated as the percentage of confirmed pregnant women with SARS-CoV-2 infection for which pregnancy was indicated on the CRF; CRFs with pregnancy status ‘No,’ ‘Unknown’ or missing were considered as not indicating pregnancy. Chi-square tests were used to test for significant differences in sensitivity by: race/ethnicity, age, education, insurance (delivery payer), and 1st trimester initiation of prenatal care (derived from birth and fetal death certificates), and month and pregnancy trimester of SARS-CoV-2 specimen collection (derived from CRFs). This activity was reviewed by CDC and was conducted consistent with applicable federal law and CDC policy.Footnote 1

Assessment

The percentage of WRA with laboratory-confirmed SARS-CoV-2 infection that were confirmed to be pregnant at the time of infection was 1.8% (4276/235,209) in Illinois and 1.5% (2070/135,659) in Tennessee. Among all CRFs that indicated the infected woman was pregnant, 50.3% (1936/3849) in Illinois and 43.6% (872/1999) in Tennessee linked with a birth or fetal death certificate with delivery on or before December 31, 2020. The sensitivity of the CRF pregnancy field was 45.3% in Illinois (1936/4276) and 42.1% in Tennessee (872/2070) (Table 1).

Table 1 Women of reproductive age with laboratory-confirmed COVID-19 that linked to birth or fetal death certificates for completed deliveries through December 31, 2020, by pregnancy status

Sensitivity of the pregnancy field varied by demographic characteristics. Sensitivity varied by maternal race/ethnicity in both states (p < 0.05), with lower sensitivity observed among White, non-Hispanic women in Illinois and among Black, non-Hispanic women in Tennessee. Sensitivity did not vary by maternal age in either state. Sensitivity varied by maternal education in Illinois, with lower sensitivity observed among women with a Bachelor’s degree or higher, but did not vary significantly in Tennessee. Sensitivity varied significantly by maternal insurance status in both states, with lower sensitivity observed among women with no insurance or self-pay. In both states, sensitivity was higher among women who initiated prenatal care in the first trimester compared with those who had no prenatal care or initiated care in the second trimester or later. In both states, sensitivity differed significantly by month of specimen collection; however, the patterns differed by state. In Tennessee sensitivity tended to decrease with each month as the pandemic progressed, whereas Illinois observed more variability in sensitivity by month. Sensitivity also varied by trimester of positive specimen collection in both states, with higher sensitivity in the first and second semesters of pregnancy compared with the third trimester.

Conclusions

These findings highlight the importance of improving the completeness and accuracy of the COVID-19 CRF pregnancy field. Using birth and fetal death certificates as the gold standard for ascertaining pregnancy status, under-ascertainment of pregnant women with SARS-CoV-2 infection was observed in both Illinois and Tennessee. This level of incompleteness and inaccuracy has implications for the interpretation and generalizability of studies relying solely on CRF data, and reporting jurisdictions should identify strategies to improve ascertainment of pregnancy status.

Timely pregnancy information is particularly important for efforts to gather real-time information on the impact of emerging infections occurring throughout pregnancy. Linkages to additional data sources that can serve as an alternative method to identify pregnancy status, such as vital records as conducted in this study, might not be possible until after the pregnancy is completed, which can be several months after the infection occurred. Although valuable, more timely information on infections during pregnancy allows for the provision of information about the impact of COVID-19 in pregnancy, the importance of seeking appropriate medical care, the risk for severe illness or adverse pregnancy outcomes, and recommendations for preventing the spread of infection to close contacts including the neonate.

The sensitivity of the CRF pregnancy field varied by state and certain demographic factors. In Tennessee, sensitivity was higher among non-Hispanic White women compared with women of color and in Illinois sensitivity was higher among non-Hispanic Black women. In both states, sensitivity varied significantly by maternal race/ethnicity, insurance, trimester of prenatal care entry, month of specimen collection, and trimester of specimen collection. Sensitivity also varied by maternal education in Illinois but not in Tennessee. These results might reflect differences among populations in their response to case investigator attempts to contact them; willingness to disclose information, including pregnancy status, once contact has been made (assuming information on pregnancy status is systematically collected from all women reported to have SARS-CoV-2 infection); fear of discrimination; or distrust of government entities (Davis et at 2010; Kirst et al., 2013; Petkovic et al., 2019). These findings have implications for the interpretation of surveillance data on COVID-19 in pregnancy by race/ethnicity. National data indicate that persons of color are infected with SARS-CoV-2 at higher rates than white people (The COVID Tracking Project, 2020). If the sensitivity of the CRF pregnancy field is lower for women of color, as was observed in Tennessee, use of CRF data would underestimate inequities in SARS-CoV-2 infection in pregnancy by race/ethnicity. However, if the sensitivity of the pregnancy field is lower for White women, as was observed in Illinois, use of CRF data would overestimate inequities in SARS-CoV-2 infection in pregnancy. Furthermore, studies that rely on the CRF alone to examine outcomes for women with COVID-19 during pregnancy might fail to accurately estimate potential differences in severity of infection and adverse outcomes among racial and ethnic groups.

Both states found substantial numbers of CRFs that indicated a woman was pregnant but did not link with a birth or fetal death certificate. There are several reasons why a linkage might not have occurred. Many of these unlinked CRFs are likely to represent women who are still pregnant and therefore had not had a completed delivery resulting in a birth or fetal death certificate that was available at the time of data linkage. Both states include a data field on the CRF for estimated date of delivery for women indicated to be pregnant. While this field is not always completed, when it is complete, the information can be used to determine whether it is likely the individual is still pregnant as of the time of data linkage. Tennessee also collects the name of the obstetric provider when pregnancy is indicated on the CRF, which enables follow-up with the obstetrician for unlinked records to request information on the pregnancy outcome.

Some unlinked records might represent deliveries that occurred outside the maternal state of residence. While birth and fetal death certificates for out-of-state occurrences are reported to the jurisdiction where the woman resides, there is a substantial time lag associated with this process and the vital records might not have been available at the time the linkages were conducted. There is also a time lag for capturing vital events in provisional birth and fetal deaths files, and therefore some deliveries that have occurred might not have linked for the current analysis but will link in future matches. Both states will continue to perform cumulative linkages between CRF and birth/fetal death certificate data for pregnant women that do not link until it is reasonable to assume the pregnancy should have been completed and the result of the pregnancy remains unknown.

The unlinked CRFs might also represent situations in which the CRF will not link with vital records, including early pregnancy losses, pregnancy terminations, unreported fetal deaths or stillbirths, maternal deaths that occurred prior to delivery, or relocation of the woman to another jurisdiction. In both states fetal deaths are required to be reported when the fetus is at least 20 weeks gestation and/or 350 g; therefore, fetal deaths occurring before 20 weeks gestation might not have a corresponding fetal death certificate and will remain unlinked. Data quality issues including misspellings, data entry errors, and incorrectly entered pregnancy status can also result in unlinked records.

In summary, these findings from two states indicate that the sensitivity of the COVID-19 CRF pregnancy field varied by state and certain demographic factors. Analyses that rely on CRF data alone should be interpreted in the context of these limitations. Further investigation is needed to better understand and mitigate the factors that contribute to incomplete or inaccurate pregnancy information on the CRF. In addition to the factors discussed above that affect the likelihood of case investigators successfully contacting an infected person and that person’s willingness to disclose information, inadequate training for case investigators/contact tracers on the importance of collecting information on pregnancy can be an important contributor to incomplete data. In the absence of complete pregnancy status data on the COVID-19 CRF, reporting jurisdictions might consider linkage to birth and fetal death certificates to ascertain this information. Data linkage can reduce the amount of incomplete and inaccurate data when time and resources are insufficient to complete thorough case investigations for all reported cases. In addition to improving information on pregnancy status, linkages to vital records can be used to improve the quality of other data elements including race/ethnicity and other demographic characteristics. However, relying on data linkages alone to identify infections during pregnancy does introduce some bias because cases that will not have a birth of death certificate (such as early losses and terminations) will be systematically missed in the linkage. Therefore, use of data linkages in combination with pregnancy information available on the CRF will provide a more comprehensive picture of the burden of SARS-CoV-2 infection in pregnancy. Jurisdictions that are using data linkages to improve their identification of pregnant women for COVID-19 surveillance can consider establishing processes to provide information back to infectious disease surveillance systems to improve the completeness and quality of the CRF data. Because states vary in their processes for completing the COVID-19 CRF, timing of access to provisional birth and fetal death records, and experience with and capacity to conduct data linkages, it is uncertain how well these findings from two states can be generalized to other reporting jurisdictions.

These results also speak to the importance of implementing enhanced pregnancy surveillance efforts to provide more detailed information on the impact of SARS-CoV-2 infection on pregnant women and their infants. CDC, in collaboration with state and local health departments, has initiated COVID-19 pregnancy surveillance to report pregnancy-related information and outcomes among pregnant women with laboratory-confirmed SARS-CoV-2 infection (CDC, 2020b; Woodworth et al., 2020). Improving the quality of CRF pregnancy data or using data linkages to improve identification of pregnant women provides a more complete enumeration of women with SARS-CoV-2 infection during pregnancy that can serve as the sampling frame for medical record abstraction to gather more detailed information on maternal and infant outcomes. These data can provide timely and important information to inform clinical guidance and prevention strategies.