Abstract
Comprehensive data on transmission mitigation behaviors and both SARS-CoV-2 infection and serostatus are needed from large, community-based cohorts to identify COVID-19 risk factors and the impact of public health measures. We conducted a longitudinal, population-based study in the East Bay Area of Northern California. From July 2020-March 2021, approximately 5,500 adults were recruited and followed over three data collection rounds to investigate the association between geographic and demographic characteristics and transmission mitigation behavior with SARS-CoV-2 prevalence. We estimated the populated-adjusted prevalence of antibodies from SARS-CoV-2 infection and COVID-19 vaccination, and self-reported COVID-19 test positivity. Population-adjusted SARS-CoV-2 seroprevalence was low, increasing from 1.03% (95% CI: 0.50–1.96) in Round 1 (July-September 2020), to 1.37% (95% CI: 0.75–2.39) in Round 2 (October-December 2020), to 2.18% (95% CI: 1.48–3.17) in Round 3 (February-March 2021). Population-adjusted seroprevalence of COVID-19 vaccination was 21.64% (95% CI: 19.20–24.34) in Round 3, with White individuals having 4.35% (95% CI: 0.35–8.32) higher COVID-19 vaccine seroprevalence than individuals identifying as African American or Black, American Indian or Alaskan Native, Asian, Hispanic, two or more races, or other. No evidence for an association between transmission mitigation behavior and seroprevalence was observed. Despite >99% of participants reporting wearing masks individuals identifying as African American or Black, American Indian or Alaskan Native, Asian, Hispanic, two or more races, or other, as well as those in lower-income households, and lower-educated individuals had the highest SARS-CoV-2 seroprevalence and lowest vaccination seroprevalence. Results demonstrate that more effective policies are needed to address these disparities and inequities.
Figures
Citation: Adams C, Horton M, Solomon O, Wong M, Wu SL, Fuller S, et al. (2022) Health inequities in SARS-CoV-2 infection, seroprevalence, and COVID-19 vaccination: Results from the East Bay COVID-19 study. PLOS Glob Public Health 2(8): e0000647. https://doi.org/10.1371/journal.pgph.0000647
Editor: Victor Adagi Alegana, KEMRI-Wellcome Trust Research Programme Nairobi, KENYA
Received: February 11, 2022; Accepted: June 29, 2022; Published: August 15, 2022
Copyright: © 2022 Adams et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: De-identified individual participant data that underlie the results reported in this article may be shared for up to 36 months following publication after investigators whose proposed use of the data has been approved by an independent review committee. For individual participant data analysis or meta-analysis, proposals should be directed to Lynn Hollyer (lhollyer@berkeley.edu). Requests will be reviewed by an independent review committee and the UC Berkeley Institutional Review Board. Data may be shared upon approval. Data requestors will need to sign a data access agreement.
Funding: This study was supported by Open Philanthropy Projects (EH, LFB), Fast Track grants from Emergent Ventures (EH, LFB), the UC Berkeley Innovative Genomics Institute (EH, LFB), and the UC Berkeley School of Public Health Center for Population Health (EH, LFB). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: JAL reports receipt of grants unrelated to this study from Pfizer and Merck, Sharpe & Dohme, and consulting fees from Pfizer, Merck, Sharpe & Dohme, VaxCyte, and Kaiser Permanente. All other authors report no competing interests. There are no patents, products in development or marketed products associated with this research to declare. This does not alter our adherence to PLOS policies on sharing data and materials.
Introduction
The first confirmed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection in the California Bay Area was reported on February 28, 2020 [1]. Measures to prevent transmission were implemented soon after and included shelter-in-place orders, mask mandates, business and school closures, and social distancing recommendations. Despite these measures, spikes in reported infections occurred from July to September 2020 and December 2020 to February 2021 [2].
Local public health and healthcare systems experienced major challenges in preventing infections, identifying COVID-19 cases, and ensuring adherence to transmission mitigation strategies. Furthermore, disparities and inequities in COVID-19 risk and outcomes by race and ethnicity and socioeconomic factors such as income and education have been observed in the United States including in the San Francisco Bay Area [3–7]. A detailed understanding of the effectiveness of transmission mitigation behaviors and sociodemographic factors that contribute to the disproportionate impact of COVID-19 in vulnerable communities is critical. Public health and policy directives aimed at controlling ongoing transmission, developing future prevention strategies, and targeting health disparities and inequities must be evidence-based. Large population-representative cohorts with individual-level data on social and behavioral factors [8] associated with COVID-19 and SARS-CoV-2 infection and serostatus [9–13] are needed.
To address this need, we investigated individual-level characteristics and mitigation behaviors that contributed to SARS-CoV-2 seroprevalence, self-reported infection, and viral infection, and other outcomes in a large, population-based sample of over 5,500 individuals from 12 East Bay cities in Northern California followed longitudinally over three time periods (July-September 2020, October-December 2020, and February-March 2021). The East Bay region was chosen for study because it is densely populated, and both racially and ethnically diverse. We estimated the population-adjusted prevalence of SARS-CoV-2 outcomes and differences by age, sex, race/ethnicity, ZIP code, and other demographic strata as well as the effect of transmission mitigation behavior on SARS-CoV-2 prevalence with Bayesian multilevel regression and poststratification (MRP) models [14,15].
Methods
Recruitment and participants
Recruitment and selection of study participants was completed using a screening phase followed by a longitudinal study phase with three rounds of data collection. In the screening phase, all residential addresses within the East Bay cities and communities of Albany, Berkeley, El Cerrito, El Sobrante, Emeryville, Hercules, Kensington, Oakland, Piedmont, Pinole, Richmond, and San Pablo (~307,000 residential households) were mailed an invitation to complete a consent form and screening questionnaire. Within a household, the individual aged 18 years or older with the next upcoming birthday from the date of postcard receipt, was eligible to participate. Additional eligibility criteria included living within the study region, willingness to provide biospecimens and questionnaire responses, ability to read and speak English or Spanish, and having internet access and an email address.
The target sample size was 5,500 participants per study round. The distribution of racial and ethnic identification in the screening questionnaire responses was more White and non-Hispanic than the region population. To obtain a sample that resembled the racial and ethnic proportions in the 2018 American Community Survey (ACS) for the study region, we ranked assigned ranks for order of inclusion to all eligible individuals who responded to the screening questionnaire. Black and Hispanic individuals were ranked the highest followed by individuals identifying as Asian, Pacific Islander, American Indian or Alaskan Native, two or more races, or other. Order of inclusion for White individuals was randomly sampled. In Round 1, individuals ranked between 1 and 5,500 were offered study enrollment. Those who declined to enroll were replaced with next highest ranked individuals who had not been offered study entry. In subsequent rounds, individuals who had participated in the previous round(s) were offered participation in the next round. If participation was declined, individuals from the pool of participants who had not participated in a study round were invited. Approximate dates for each round were July-September 2020, October-December 2020, and February-March 2021. Further information in S1 File.
Ethics statement
All participants provided their formal written informed consent for the screening phase. All participants in the study phase provided their formal written informed consent for each study round. The study was approved by the University of California, Berkeley Committee on Protection of Human Subjects (Protocol #2020-03-13121).
Study procedures
At the start of each round, eligible participants were invited to participate. Those who agreed to participate received a kit via FedEx containing materials for self-collection of biospecimens, pre-paid return shipping labels, and instructions to complete an online-administered questionnaire at the same time as biospecimen collection.
Questionnaire
The questionnaire addressed sex, gender, age, race/ethnicity, income, employment, physical and mental health, as well as symptoms potentially related to COVID-19 within the previous 2 weeks, and SARS-CoV-2 testing outside of the study (S2 File). Participants were also asked about transmission mitigation behaviors including physical distancing practices, close contacts with others, and mask wearing. Questionnaires were available in English or Spanish.
SARS-CoV-2 viral and antibody testing
Anterior nasal nare swabs for viral RNA testing and dried blood spots (DBS) for antibody testing were collected from participants at each study round. Quantitative reverse transcription PCR (RT-qPCR) was used to identify SARS-CoV-2 viral infection. Three tests were used to assess anti-SARS-CoV-2 antibodies in DBS: Ortho VITROS Anti-SARS-CoV-2 Total IgG (ORTHO-IgG) and spike IgG ELISA (ELISA-IgG) targeted antibodies against the SARS-CoV-2 spike protein (indicating prior natural infection or vaccination), and Roche-NC (ROCHE-NC) Total IgG targeted antibodies against the nucleocapsid (NC) protein (indicating prior natural infection only) [16]. Before COVID-19 vaccines were available in the study region during rounds 1 and 2, detection of antibodies against the SARS-CoV-2 spike protein was considered evidence of SARS-CoV-2 infection. During Round 3, vaccinations were widely available, therefore detection of antibodies against the NC protein were considered evidence of SARS-CoV-2 infection, while detection of antibodies against the spike protein were considered evidence of SARS-CoV-2 infection or COVID-19 vaccination (S1 Fig) [17–20]. Further information on viral and antibody testing in S3 File.
The sensitivity and specificity of the antibody assays used in this study are reported in Wong et al and in S5 Table [16]. Briefly, in round 1, using a threshold of S/C>1 for evidence of anti-Spike antibodies, the sensitivity (Se) and specificity (Sp) of the ORTHO-IgG assay was 80.6% [95% CI: (64.0–91.8)] and 100% [95% CI: (63.1–100)], respectively. In rounds 2 and 3, we implemented a serial testing strategy, using the ORTHO-IgG assay as the screening test (S/C>0.7) and the ELISA-IgG assay as the confirmatory test. In these rounds, the sensitivity and specificity of the ORTHO-IgG assay 88.9% [95% CI: (73.9–96.9)] and 100% [95% CI: (63.1–100)], respectively. The sensitivity and specificity of the ELISA-IgG assay was 97.2% [95% CI: (88.7–99.9)] and 100% [95% CI: (87.7–1)], respectively. In round 3, the ROCHE-NC assay was used as a confirmatory test for presence of NC antibodies. The sensitivity and specificity were 86.7% [95% CI: (69.3–96.2)] and 97.9% [95% CI: (94.8–99.4)], respectively.
SARS-CoV-2 outcomes
The following outcomes were investigated in each round: (1) cumulative SARS-CoV-2 antibody positivity, (2) participants’ self-reported history of SARS-CoV-2 positivity from testing outside the study, (3) and a surveillance definition of “probable COVID-19 case” derived from self-reported symptoms and close contact with infected individuals [21], and (4) viral SARS-CoV-2 positivity from RT-qPCR testing of nasal swabs (S3 File). We also investigated antibodies induced by COVID-19 vaccination only in Round 3. Cumulative SARS-CoV-2 antibody positivity was defined as having detectable antibodies in the current and/or previous round(s). DBS samples that tested negative for anti-NC antibodies and positive for anti-spike antibodies in Round 3 were considered to have antibodies from COVID-19 vaccination alone. Self-reported COVID-19 viral positivity prevalence was defined as the proportion of participants reporting a positive viral test among all participants who reported being tested within a study round. Probable COVID-19 case prevalence was defined as the proportion of participants identified as a probable COVID-19 case among all participants who provided valid responses within a study round. Viral positivity prevalence was defined as testing positive by RT-qPCR from nasal swab samples.
Statistical analysis
Population-adjusted seroprevalence and other SARS-CoV-2 outcomes
Bayesian MRP was used to estimate population-adjusted cumulative seroprevalence, self-reported SARS-CoV-2 viral positivity prevalence at each study round, and “probable COVID-19” prevalence at each study round (See S4 File for further information on MRP and statistical methods). MRP is a regression-based method for estimating population and sub-population averages from survey data that has been shown to perform better than survey weighting, particularly with sparse data [14,15].
In addition to estimation of regional prevalence of our outcomes, we estimated prevalence within demographic groups and geographic areas in the study region. Variables of interest were gender, age, race, Hispanic ethnicity, income, education, household size, and ZIP code. We used a method described by Leeman and colleagues to generate a synthetic population for poststratification using data from the 2018 American Community Survey (ACS) and the Public Use Microdata Sample [15]. Poststratification was done using binary sex because gender is not reported by the ACS. Race and ethnicity were combined into a single variable to reduce the number of poststratification strata. This procedure yielded 44,640 strata.
At each study round, binary SARS-CoV-2 outcomes were modeled as a function of geographic and demographic characteristics using multilevel logistic regression models. Participant sex was included as a fixed effect. Vectors of random intercepts were defined for each category of race/ethnicity, age, education, income, household size, and ZIP code and two-way interactions between ZIP code, race/ethnicity, educational attainment, income, and age. To improve estimation of geographic effects we allowed for spatial correlations using the modified Besag-York-Mollié model and included the proportion of Spanish speaking households within the ZIP code of residence from the ACS as a fixed effect [22].
SARS-CoV-2 outcome prevalence and measures of association
We report populated-adjusted prevalence of SARS-CoV-2 outcomes across the study region and within geographic and demographic groups of interest. To calculate prevalence estimates, posterior distributions of the relevant poststratification stratum were aggregated. We also estimated prevalence differences (PD) and prevalence ratios (PR) for the association between populated-adjusted SARS-CoV-2 outcomes and race/ethnicity, education, and sex. For each parameter of interest, the mean of the posterior distribution was the point estimate, and the 95% credible interval (CI) was the 2.5% and 97.5% quantiles of a posterior distribution.
SARS-CoV-2 test-kit bias corrected seroprevalence
We estimated cumulative SARS-CoV-2 seroprevalence at each study round adjusted for the sensitivity and specificity of the SARS-CoV-2 antibody testing assays used in each study round (S5 Table).
Transmission mitigation behavior analyses
In each round, participants were asked about physical distancing practices, recent close contacts with others, mask wearing, and other behaviors and activities that might affect the risk of SARS-CoV-2 infection (S5 File). We classified participants into two behavior categories, “high-risk” and “low-risk”, with those responses using latent class analysis [23]. Crude associations between behavior categories and characteristics such as sex, age, race/ethnicity, education, and income were assessed with χ2 tests. Associations between high-risk vs. low-risk behavior and within round SARS-CoV-2 seroprevalence and self-reported test positivity were estimated using the MRP model described above with random intercepts for behavior categories and interactions between the behavior categories and ZIP code, age, race/ethnicity, education, and income.
Statistical analyses were completed in R 4.0.2. NIMBLE was used to implement MRP models [24]. Descriptions of MRP methods and code are provided in at github.com/adams-cam/ebcovid_prev.
Results
Enrollment and characteristics of study participants
Of the 16,115 residents who consented and completed the screening procedures between May-July 2020, 1,777 did not satisfy inclusion criteria and were excluded (Fig 1). Characteristics of participants are presented in Table 1 and S1 Table. Participation rates were high (Round 1: 76.8%, Round 2: 89.8%, and Round 3: 87.3%), and participants identified predominantly as female (~63%). Those aged 45–64 years were the largest age group of participants across all study rounds (ranging from 37.3% to 39.4%). Most participants identified as White (52.5% to 63.3%), followed by Asian/Pacific Islander (13.9% to 15.7%), Hispanic (11.0% to 15.6%), two or more races (6.9% to 9.1%), African American or Black (3.0% to 4.9%), and Native American/Alaska Native or other (1.7% to 2.2%). Approximately 4,750 participants (86.3% of Round 1 participants) completed all three study rounds.
Of those who completed the questionnaire, 87.3%, 95.3%, and 96.6% provided DBS and 93.6%, 98.1% and 98.5% provided nasal swabs in rounds 1, 2, and 3, respectively (Table 2). Antibodies against the SARS-CoV-2 spike protein were detected in 29 (0.6%) and 33 (0.6%) of DBS in rounds 1 and 2, respectively. In Round 3, NC antibodies from natural infection alone were detected in 84 participants (1.8% of 4,806) and spike antibodies from natural infection or vaccination were detected in 1,452 participants (31.3% of 4,806). Viral infection was detected in less than three nasal swabs in each round. The proportion of participants reporting both being SARS-CoV-2 tested outside the study and testing positive increased over the study period: 10/1,030 (1.0%) participants reporting a positive COVID-19 result in Round 1, 19/2,059 (0.9%) in Round 2, and 53/1,892 (2.8%) in Round 3. Few participants met the criteria for being a COVID-19 probable case (<0.5%) (S2 Table).
Population adjusted SARS-CoV-2 outcome prevalence in study region
Population adjusted SARS-CoV-2 seroprevalence, self-reported COVID-19 test positivity, and probable COVID-19 cases are reported in Table 3. Overall, populated-adjusted SARS-CoV-2 natural infection seroprevalence was low across the study region: Round 1 (July-September 2020) 1.03% (95% CI 0.50–1.96), Round 2 (October-December 2020 1.37% (0.75–2.39), and Round 3 (February-March 2021) 2.18% (95% CI 1.48–3.17). In Round 3 the populated-adjusted seroprevalence of COVID-19 vaccination was 21.64% (95% CI: 19.2, 24.34). Models incorporating sensitivity and specificity of the antibody assays yielded lower SARS-CoV-2 seroprevalence estimates in Round 1, similar estimates in Rounds 2 and 3, and a higher estimate of COVID-19 vaccine seroprevalence in Round 3. Population adjusted self-reported test positivity to SARS-CoV-2 was similar in Rounds 1 (1.11%, 95 CI: 0.39–2.40) and 2 (1.29%, 95% CI: 0.55, 2.17) to seroprevalence estimates and increased to 4.58% (95% CI: 2.56–7.64) in Round 3. Population-adjusted prevalence of being a COVID-19 probable case was <1% across all study rounds (S2 Fig).
SARS-CoV-2 seroprevalence and infection vary by geographic area
There was evidence for spatial differences in both populated-adjusted seroprevalence and self-reported test positivity (Fig 2). The northern areas (Richmond, San Pablo, Pinole, and Hercules) and southern areas (East Oakland) of the study region had higher seroprevalence than Berkeley, El Cerrito, and North/Downtown Oakland. Self-reported test positivity was also higher in the northern and southern areas. These trends were consistent across study rounds.
(A) Cumulative seroprevalence of SARS-CoV-2 antibodies to natural infection. (B) Prevalence of self-reported COVID-19 test positivity across the study region. Data collected in three rounds: Round 1, July-September 2020; Round 2, October-December 2020), and Round 3, February-March 2021. Base map and data from OpenStreetMap and OpenStreetMap Foundation (https://www.openstreetmap.org/copyright).
Population adjusted seroprevalence and self-reported test positivity prevalence within subgroups
Estimated populated-adjusted seroprevalence within demographic subgroups is reported in Fig 3. Cumulative seroprevalence increased across all demographic groups over the study period. Non-white individuals consistently showed evidence for higher seroprevalence than White individuals, specifically those identifying as African American or Black and Hispanic. We also found evidence for higher prevalence among those with less than a college degree compared to those with a college degree and those with a household income less than $100,000 compared those with household income greater than $100,000. In addition, there appeared to be a small threshold effect in household size, where those in households with five or more persons had a higher seroprevalence than those in households with four or less persons. There were no consistent trends in prevalence by age or sex. Similar relationships were seen between demographic groups and self-reported test prevalence (S3 Fig).
A) Sex, B) Age, C) Race/ethnicity, D) Income, E) Educational attainment, and F) Household size. Abbreviations: AA, African American; AMI, American Indian or Alaskan Native; PI, Pacific Islander.
COVID-19 vaccination seroprevalence differs by race/ethnicity and age
In Round 3, populated-adjusted COVID-19 vaccination seroprevalence was lower among individuals identifying as African American or Black, American Indian or Alaskan Native, Asian, Hispanic, two or more races, or other compared to White individuals (White vs. Non-White, PD: -4.35%, 95% CI: (-9.32, -0.35), Table 4, S4 Fig). The difference between racial/ethnic groups was largest among older participants; non-White individuals had a lower COVID-19 vaccination seroprevalence compared to White individuals among those aged between 65–74 (PD: -11.67%, 95% CI: (-20.98, -2.49)) and those aged 75 or older (PD: -10.15%, 95% CI: (-22.25, 0.43)).
Mask-wearing and association between high-risk vs. low-risk behavior and seroprevalence
More than 99% of participants reported ever wearing a mask 99% reported wearing a mask during leisure and exercise activities, >91% reported wearing a mask at work, and >88% reported wearing a mask while shopping (S3 Table). After clustering participants into “low-risk” and “high-risk” groups according to self-reported mitigation behaviors, most participants were considered low-risk across the study rounds (70%, 82%, and 77%, respectively; S4 Fig). Behaviors with the largest differences in high- versus low-risk behavior were reporting “yes” to: left home for work, medical/healthcare, care of relative, or other; worked with potential COVID-19 contact; attended gathering; and traveled to county outside of residence within last two weeks (S4 Fig). Results from χ2 tests indicated that age, race/ethnicity, and education were all significantly associated (P<0.001) with mitigation behavior (S4 Table), but we did not observe evidence for an association between mitigation behavior and either seroprevalence or self-reported test positivity (Table 5).
Discussion
In the current study, we investigated individual-level characteristics and behaviors that contributed to SARS-CoV-2 related outcomes, including seroprevalence and self-reported infection, in a large, population-based sample of over 5,500 participants from 12 East Bay (Northern California) cities. During three time periods from July 2020 to March 2021, we estimated the population-adjusted prevalence of SARS-CoV-2 outcomes across the study region and within strata of age, sex, race/ethnicity, ZIP code, and household size. We then characterized behaviors to mitigate transmission of SARS-CoV-2 and their association with related outcomes. Overall, prevalence of SARS-CoV-2 outcomes for natural infection were low which may be attributable to the high percentage of mask-wearing and other risk-mitigating behaviors among our participants. COVID-19 vaccination seroprevalence estimates in Round 3 were greater than 20%, with individuals identifying as African American or Black, American Indian or Alaskan Native, Asian, Hispanic, two or more races, or other having lower COVID-19 vaccination seroprevalence estimates than White individuals.
Despite the low overall SARS-CoV-2 seroprevalence and infection observed in our study, we observed evidence for differences in seroprevalence by ZIP code of residence, racial/ethnic identification, educational attainment, and household income. Further, ZIP codes with higher proportions of Spanish speakers had higher populated-adjusted seroprevalence estimates (S5 Fig). These differences persisted despite the high rates of mask wearing reported by our study sample, further adding to strong evidence that the risk of COVID-19 is distributed unequally and that structural inequities play an important role in COVID-19 risk [7,25–28]. Moreover, during Round 3 (February-March 2021), COVID-19 vaccines were widely available across the study region. We found that White individuals had higher prevalence of antibodies from COVID-19 vaccination in Round 3, than individuals identifying as African American or Black, American Indian or Alaskan Native, Asian, Hispanic, two or more races, or other, as reported elsewhere [3,29]. Notably, this difference was largest among those aged 65 or older. We believe this discrepancy is not explained by differences in vaccine hesitancy within racial and ethnic groups in our sample (S6 Table). In Round 3, among the n = 2,572 participants who self-reported not receiving at least one dose of a COVID-19 vaccine, 95.6% (n = 2,462) reported having plans to get vaccinated. This rate was greater than 91% across all racial and ethnic groups. Of the 110 individuals who did not report getting vaccinated, only 18 (16.4%) reported no plans to get vaccinated. These findings demonstrate that in the first few months of vaccine availability in the Bay Area, large disparities in vaccination rates by race/ethnicity were observed among older persons. Furthermore, White individuals, the group with the lowest prevalence of SARS-CoV-2 infection, were more likely to be vaccinated, underscoring the inequities that exist surrounding the coronavirus pandemic.
Another key finding was that almost all participants reported wearing masks. This contrasts with models of mask usage reported by the Institute for Health Metrics and Evaluation which reported mask usage ranging between 75–82% from December 2020 through March 2021 in California [30,31]. Mask wearing is one the most effective behaviors for controlling community spread of SARS-CoV-2 infection [32]. The high rate of mask usage by study participants may partially explain why we did not detect a differences between high-risk and low-risk mitigation behavior and SARS-CoV-2 prevalence, and partially explain why our estimates of SARS-CoV-2 seroprevalence and self-report test positivity were lower than public case reports.
A major strength of this study was the longitudinal design and collection of individual-level data, including biospecimens for antibody and virus testing, which is challenging but much needed in current studies of the pandemic. Comprehensive data on social distancing, self-quarantine, mask wearing, working from home, and other transmission mitigation efforts are also needed to inform current and future prevention strategies. At-home collection of biospecimens, including DBS for antibody testing, made regular testing without in-person interaction possible. This was a critical feature, particularly early in the pandemic, when recommendations were to travel only for essential purposes and to limit in-person interactions. At-home sample collection was used to obtain more than 30,000 biospecimens and is a feasible approach for large populations and geographic regions.
One limitation of this study was the under-representation of certain demographics in our sample. Supplementary mailings of recruitment postcards in Spanish were sent to residences in ZIP codes with high proportions of Spanish speaking households. We also placed recruitment flyers in local grocery stores and conducted outreach to community organizations, local government officials, and school districts in the study region. Despite these efforts, individuals identifying as African American or Black, American Indian or Alaskan Native, Asian, Hispanic, two or more races, or other, males, individuals from lower income households and with lower educational attainment, and individuals residing in lower socioeconomic ZIP codes were underrepresented in our sample. This was important given evidence that individuals who identify as Hispanic or Black, and other underrepresented groups, are at the highest risk for COVID-19 [28]. The low response rates of individuals from groups at higher COVID-19 infection likely contributed to an overall underestimation of SARS-CoV-2 seroprevalence in our study region. Evidence of the relationship between low-response rates and prevalence underestimation can be seen by comparing our seroprevalence estimates to COVID-19 case prevalence reported by Alameda County and Contra Costa County public health agencies [33]. In ZIP codes with low response rates the postcard invitation (e.g., 94601, 94603, 94621) we observed lower estimated prevalence in our study compared to public health agency case reports. The use of at-home testing kits may have excluded individuals from our sample who were either unable or unwilling to collect their own DBS, although we believe this limitation is offset by the insurmountable logistical challenges posed by in-person collection, especially when strict lockdowns were in place during the early phase of the study. Our testing algorithm for identifying individuals with antibodies from COVID-19 vaccination was unable to distinguish antibodies from vaccination versus natural infection in individuals who had both been previously infected and vaccinated. However, this should not have significantly affected our vaccination seroprevalence, as our natural infection seroprevalence estimates in Round 3 were low and only a subset of individuals with previous natural infection were also vaccinated. Another limitation is the possibility of self-selection bias, whereby individuals who joined our study were also more likely to be fervent adherers to COVID-19 public health measures than the general population. Additionally, a small proportion of individuals do not generate detectable antibodies after infection or vaccination for COVID-19. And lastly, there may be unmeasured confounding from variables not included in the analyses.
Our results underscore the substantial and persistent inequities that exist surrounding the coronavirus pandemic. Individuals identifying as African American or Black, American Indian or Alaskan Native, Asian, Hispanic, two or more races, or other, as well as those in lower-income households, and lower-educated individuals had the highest SARS-CoV-2 seroprevalence. We also observed large differences in COVID-19 vaccination seroprevalence between racial and ethnic groups. These disparities in COVID-19 infection and vaccination seroprevalence were observed despite low-response rates of individuals from groups at higher risk for COVID-19 infection in our sample and the near universal rates of mask wearing reported by participants. More work must be done to address these disparities and inequities, such as allocation of resources for high-risk communities and strategies to mitigate the structural barriers posed by social and structural determinants of health.
Supporting information
S1 Fig. Antibody testing algorithm for each study round.
Antibodies against the spike (S) protein indicate prior or current SARS-CoV-2 infection or COVID-19 vaccination. Antibodies against the nucleocapsid (NC) protein indicate prior or current SARS-CoV-2 infection.
https://doi.org/10.1371/journal.pgph.0000647.s001
(PDF)
S2 Fig. Population-adjusted covid probable prevalence by zip code (July 2020-April 2021).
Base map and data from OpenStreetMap and OpenStreetMap Foundation (https://www.openstreetmap.org/copyright).
https://doi.org/10.1371/journal.pgph.0000647.s002
(PDF)
S3 Fig. Populated-adjusted self-reported COVID-19 test positivity in each study round among demographic subgroups, A) sex, B) age, C) race/ethnicity, D) income, E) education, and F) household size.
https://doi.org/10.1371/journal.pgph.0000647.s003
(PDF)
S4 Fig. Proportion of participants stratified by high- and low-risk mitigation behaviors in each study round, A) overall, and B) according to specific behavior.
https://doi.org/10.1371/journal.pgph.0000647.s004
(PDF)
S5 Fig. Scatterplot of proportion of households who speak Spanish at home reported by the American Com-munity Survey (x-axis) and cumulative population-adjusted SARS-CoV-2 seroprevalence (y-axis) within ZIP codes.
https://doi.org/10.1371/journal.pgph.0000647.s005
(PDF)
S6 Fig.
Comparison of SARS-CoV-2 seroprevalence estimated in study to cumulative case prevalence reported by local public health agencies (July 2020-April 2021): A) Oakland/Piedmont, B) Albany, Berkeley, and Emeryville, and D) El Cerrito, Richmond, and San Pablo. The household response rate (RR) to the study invitation within each ZIP code is included in the upper left of each panel.
https://doi.org/10.1371/journal.pgph.0000647.s006
(PDF)
S1 Table. ZIP code of residence of participants at each round of the study compared to study region population.
https://doi.org/10.1371/journal.pgph.0000647.s007
(PDF)
S2 Table. Distribution of COVID-19 probable case definition variables.
https://doi.org/10.1371/journal.pgph.0000647.s008
(PDF)
S3 Table. Self-reported mask wearing behavior during each study round.
https://doi.org/10.1371/journal.pgph.0000647.s009
(PDF)
S4 Table. Characteristics of study participants stratified by high-risk and low-risk mitigation behavior.
https://doi.org/10.1371/journal.pgph.0000647.s010
(PDF)
S5 Table. Antibody assays used to detect antibodies against SARS-CoV-2 spike and nucleocapsid proteins.
https://doi.org/10.1371/journal.pgph.0000647.s011
(PDF)
S6 Table. Self-reported vaccine plans among participants who did not self-report receiving a vaccination in Round 3 stratified by race and ethnicity.
https://doi.org/10.1371/journal.pgph.0000647.s012
(PDF)
S1 File. Methods used for SARS-CoV-2 viral and antibody detection, and participant recruitment.
https://doi.org/10.1371/journal.pgph.0000647.s013
(PDF)
S2 File. Study questionnaires used for each data collection round.
https://doi.org/10.1371/journal.pgph.0000647.s014
(PDF)
S3 File. Details on data collection and SARS-CoV-2 outcome definitions.
https://doi.org/10.1371/journal.pgph.0000647.s015
(PDF)
S4 File. Statistical methods used for population adjusted prevalence analyses.
https://doi.org/10.1371/journal.pgph.0000647.s016
(PDF)
S5 File. Behaviors related to virus containment and mitigation.
https://doi.org/10.1371/journal.pgph.0000647.s017
(PDF)
S6 File. Population adjusted prevalence estimates of COVID-19 outcomes and 95% credible intervals across demographic and regional strata.
See tables for region and stratum population-adjusted estimates for COVID-19 outcomes and mitigation analyses.
https://doi.org/10.1371/journal.pgph.0000647.s018
(XLSX)
Acknowledgments
We want to acknowledge the contributions of the following individuals to this study: Ella Parsons, Jordan Keen, Janine Solomon, Jose Salinas, Kevin Duong, Joseph Egbunikeokye, Maya Talavera, Riya Shrestha, Colin Warnes, José Victor Zambrana, Nicholas Lo, Parnal Narvekar, Fausto Bustos, Gregorio Dias, Reinaldo Mercado-Hernandez, Julia Huffaker, Raymond Montes, Alexandra Zermeno, Alejandra Zeiger, William Dow, Michael Lu, Lila Krop, Kelly Lam, Yan Zhang, Sarah Folkmanis, Sophie Zhai, Dingjun Chen, Ruben Vargas Ethan Garcia, Oliver Li, Manisha Sahoo, Raina Walencewicz, Sophia Wang, Antonia Gibbs, Amrita Ramanathan, Catherine Livelo, Taylor Worley, Amanda Tanaka, Savinnie Ho, Jane Liu Ryan Allen, Sofia Soltero, Victoria Van Metter, Madeleine Fraix, Allie Coyne, Subeksha Sharma Lydia Yu, Shreeya Garg, Sanjeet Paluru, Malika Saxena, Talia Panadero, Ayra Rahman, Joshua Calangian, Dharaa Upadhyaya, Sophia Kemp, Ruhi Parikh, Amy Rich, Sophie Manoukian, Nola Vu, Crystal Nguyen, Jordyn Pinochi, Alma Kuc, Siri Ylenduri, Manvir Kaur, Angikaar Chana, and Sannidhi Sarvadhavabhatla, Benjamin T. Auch, Dinesha Walek, Evan Forsberg, Jerry Daniel, Veronica Tonnell, Ji Hyun (Jay) Kim, Mary Nieuwenhuis. Creative Testing Solutions: Valerie Green, Sherri Cyrus, Phillip Willamson, Brett Hirsch, Paul Contestable, Mars Stone, Joe Derisi, Emily Crawford, Emily Ahlvin, Armando Diaz, and Favianna Rodriquez.
References
- 1.
Eby K. Coronavirus Timeline: Tracking major moments of COVID-19 pandemic in San Francisco Bay Area. In: ABC7 San Francisco [Internet]. 27 Mar 2020 [cited 2021 Nov 29]. https://abc7news.com/6047519/.
- 2.
United States. Agency for Toxic Substances and Disease Registry. Geospatial Research A and SP, editor. CDC COVID data tracker. 2020 [cited 2021 Dec 29]. https://stacks.cdc.gov/view/cdc/96118.
- 3. Njoku A, Joseph M, Felix R. Changing the Narrative: Structural Barriers and Racial and Ethnic Inequities in COVID-19 Vaccination. International Journal of Environmental Research and Public Health. 2021;18: 9904. pmid:34574827
- 4. Qualls N. Community Mitigation Guidelines to Prevent Pandemic Influenza—United States, 2017. MMWR Recomm Rep. 2017;66. pmid:28426646
- 5.
CDC. Interim pre-pandemic planning guidance: community strategy for pandemic influenza mitigation in the United States: early, targeted, layered use of nonpharmaceutical interventions. [cited 2021 Nov 10]. https://stacks.cdc.gov/view/cdc/11425.
- 6. Bell DM, World Health Organization Working Group on International and Community Transmission of SARS. Public health interventions and SARS spread, 2003. Emerg Infect Dis. 2004;10: 1900–1906. pmid:15550198
- 7. Chamie G, Marquez C, Crawford E, Peng J, Petersen M, Schwab D, et al. Community Transmission of Severe Acute Respiratory Syndrome Coronavirus 2 Disproportionately Affects the Latinx Population During Shelter-in-Place in San Francisco. Clin Infect Dis. 2021;73: S127–S135. pmid:32821935
- 8. Hutchins HJ. COVID-19 Mitigation Behaviors by Age Group—United States, April–June 2020. MMWR Morb Mortal Wkly Rep. 2020;69. pmid:33119562
- 9. Havers FP, Reed C, Lim T, Montgomery JM, Klena JD, Hall AJ, et al. Seroprevalence of Antibodies to SARS-CoV-2 in 10 Sites in the United States, March 23-May 12, 2020. JAMA Internal Medicine. 2020;180: 1576–1586. pmid:32692365
- 10. Bendavid E, Mulaney B, Sood N, Shah S, Ling E, Bromley-Dulfano R, et al. COVID-19 Antibody Seroprevalence in Santa Clara County, California. medRxiv [Preprint]. 2020 [cited 2021 May 21]. https://www.medrxiv.org/content/10.1101/2020.04.14.20062463v2. pmid:33615345
- 11. Sood N, Simon P, Ebner P, Eichner D, Reynolds J, Bendavid E, et al. Seroprevalence of SARS-CoV-2–Specific Antibodies Among Adults in Los Angeles County, California, on April 10–11, 2020. JAMA. 2020;323: 2425–2427. pmid:32421144
- 12. Self WH. Seroprevalence of SARS-CoV-2 Among Frontline Health Care Personnel in a Multistate Hospital Network—13 Academic Medical Centers, April–June 2020. MMWR Morb Mortal Wkly Rep. 2020;69. pmid:32881855
- 13. Rosenberg ES, Tesoriero JM, Rosenthal EM, Chung R, Barranco MA, Styer LM, et al. Cumulative incidence and diagnosis of SARS-CoV-2 infection in New York. Ann Epidemiol. 2020;48: 23–29.e4. pmid:32648546
- 14. Gelman A, Little TC. Poststratification Into Many Categories Using Hierarchical Logistic Regression. Survey Research. 1997;23: 127–135. 10.1.1.44.5270.
- 15. Leemann L, Wasserfallen F. Extending the Use and Prediction Precision of Subnational Public Opinion Estimation: EXTENDING USE AND PRECISION OF MrP. American Journal of Political Science. 2017;61: 1003–1022.
- 16. Wong MP, Meas MA, Adams C, Hernandez S, Green V, Montoya M, et al. Development and Implementation of Dried Blood Spot-based COVID-19 Serological Assays for Epidemiologic Studies. Microbiology Spectrum. 2022. pmid:35612315
- 17. Wratil PR, Stern M, Priller A, Willmann A, Almanzar G, Vogel E, et al. Three exposures to the spike protein of SARS-CoV-2 by either infection or vaccination elicit superior neutralizing immunity to all variants of concern. Nat Med. 2022;28: 496–503. pmid:35090165
- 18. Reynolds CJ, Gibbons JM, Pade C, Lin K-M, Sandoval DM, Pieper F, et al. Heterologous infection and vaccination shapes immunity against SARS-CoV-2 variants. Science. 2022;375: 183–192. pmid:34855510
- 19. El Sahly HM, Baden LR, Essink B, Doblecki-Lewis S, Martin JM, Anderson EJ, et al. Efficacy of the mRNA-1273 SARS-CoV-2 Vaccine at Completion of Blinded Phase. N Engl J Med. 2021; NEJMoa2113017. pmid:34551225
- 20. Sadoff J, Gray G, Vandebosch A, Cárdenas V, Shukarev G, Grinsztejn B, et al. Safety and Efficacy of Single-Dose Ad26.COV2.S Vaccine against Covid-19. N Engl J Med. 2021; NEJMoa2101544. pmid:33882225
- 21.
CSTE Interim Position Statement: Update to COVID-19 Case Definition—Council of State and Territorial Epidemiologists. [cited 2021 Oct 13]. https://www.cste.org/news/520707/CSTE-Interim-Position-Statement-Update-to-COVID-19-Case-Definition.htm.
- 22. Simpson D, Rue H, Riebler A, Martins TG, Sørbye SH. Penalising Model Component Complexity: A Principled, Practical Approach to Constructing Priors. Statist Sci. 2017;32.
- 23. Linzer DA, Lewis JB. poLCA: An R Package for Polytomous Variable Latent Class Analysis. Journal of Statistical Software. 2011;42: 1–29.
- 24. de Valpine P, Turek D, Paciorek CJ, Anderson-Bergman C, Lang DT, Bodik R. Programming with models: writing statistical algorithms for general model structures with NIMBLE. Journal of Computational and Graphical Statistics. 2017;26: 403–413.
- 25. Reitsma MB, Claypool AL, Vargo J, Shete PB, McCorvie R, Wheeler WH, et al. Racial/Ethnic Disparities In COVID-19 Exposure Risk, Testing, And Cases At The Subcounty Level In California. Health Affairs. 2021;40: 870–878. pmid:33979192
- 26. Paremoer L, Nandi S, Serag H, Baum F. Covid-19 pandemic and the social determinants of health. BMJ. 2021;372: n129. pmid:33509801
- 27. Abrams EM, Szefler SJ. COVID-19 and the impact of social determinants of health. Lancet Respir Med. 2020;8: 659–661. pmid:32437646
- 28. Magesh S, John D, Li WT, Li Y, Mattingly-app A, Jain S, et al. Disparities in COVID-19 Outcomes by Race, Ethnicity, and Socioeconomic Status: A Systematic-Review and Meta-analysis. JAMA Network Open. 2021;4: e2134147. pmid:34762110
- 29. Agarwal R, Dugas M, Ramaprasad J, Luo J, Li G, Gao G (Gordon). Socioeconomic privilege and political ideology are associated with racial disparity in COVID-19 vaccination. PNAS. 2021;118. pmid:34326130
- 30. Reiner RC, Barber RM, Collins JK, Zheng P, Adolph C, Albright J, et al. Modeling COVID-19 scenarios for the United States. Nat Med. 2021;27: 94–105. pmid:33097835
- 31.
IHME | COVID-19 Projections. In: Institute for Health Metrics and Evaluation [Internet]. [cited 2021 Oct 1]. https://covid19.healthdata.org/.
- 32. Brooks JT, Butler JC. Effectiveness of Mask Wearing to Control Community Spread of SARS-CoV-2. JAMA. 2021;325: 998–999. pmid:33566056
- 33.
The Los Angeles Times’ open-source archive of California coronavirus data. Los Angeles Times Data and Graphics Department; 2021 [cited 2021 Nov 11]. https://github.com/datadesk/california-coronavirus-data.