Clinical and Demographic Factors Associated With COVID-19, Severe COVID-19, and SARS-CoV-2 Infection in Adults: A Secondary Cross-Protocol Analysis of 4 Randomized Clinical Trials

Deborah A. Theodore, MD¹; Angela R. Branche, MD²; Lily Zhang, PhD³; Daniel S. Graciaa, MD⁴; Madhu Choudhary, MD⁵; Timothy J. Hatlen, MD⁶; Raadhiya Osman, PhD⁷; Tara M. Babu, MD⁸; Samuel T. Robinson, PhD³; Peter B. Gilbert, PhD³,⁹; Dean Follmann, PhD¹⁰; Holly Janes, PhD³,⁹; James G. Kublin, MD³; Lindsey R. Baden, MD¹¹; Paul Goepfert, MD¹²; Glenda E. Gray, MBBCh¹³,¹⁴; Beatriz Grinsztejn, MD, PhD¹⁵; Karen L. Kotloff, MD¹⁶,¹⁷; Cynthia L. Gay, MD¹⁸; Brett Leav, MD¹⁹; Jacqueline Miller, MD¹⁹; Ian Hirsch, PhD²⁰; Jerald Sadoff, MD²¹; Lisa M. Dunkle, MD²²; Kathleen M. Neuzil, MD¹⁷; Lawrence Corey, MD³,²³; Ann R. Falsey, MD²; Hana M. El Sahly, MD²⁴,²⁵; Magdalena E. Sobieszczyk, MD¹; Yunda Huang, PhD³,²⁶; for the COVID-19 Prevention Network (CoVPN)

JAMA Netw Open. 2023;6(7):e2323349. doi:10.1001/jamanetworkopen.2023.23349

Key Points

Question What clinical and demographic factors are associated with rates of COVID-19, severe COVID-19, and SARS-CoV-2 infection?

Findings In this secondary analysis of 57 692 participants randomized to the placebo groups of 4 COVID-19 vaccine phase 3 efficacy trials, exposure risks, demographics (age and race), and evidence of previous infection had the strongest associations with study outcomes.

Meaning These findings could inform public health policy pertaining to prioritization for vaccination and risk mitigation efforts.

Abstract

Importance Current data identifying COVID-19 risk factors lack standardized outcomes and insufficiently control for confounders.

Objective To identify risk factors associated with COVID-19, severe COVID-19, and SARS-CoV-2 infection.

Design, Setting, and Participants This secondary cross-protocol analysis included 4 multicenter, international, randomized, blinded, placebo-controlled, COVID-19 vaccine efficacy trials with harmonized protocols established by the COVID-19 Prevention Network. Individual-level data from participants randomized to receive placebo within each trial were combined and analyzed. Enrollment began July 2020 and the last data cutoff was in July 2021. Participants included adults in stable health, at risk for SARS-CoV-2, and assigned to the placebo group within each vaccine trial. Data were analyzed from April 2022 to February 2023.

Exposures Comorbid conditions, demographic factors, and SARS-CoV-2 exposure risk at the time of enrollment.

Main Outcomes and Measures Coprimary outcomes were COVID-19 and severe COVID-19. Multivariate Cox proportional hazards regression models estimated adjusted hazard ratios (aHRs) and 95% CIs for baseline covariates, accounting for trial, region, and calendar time. Secondary outcomes included severe COVID-19 among people with COVID-19, subclinical SARS-CoV-2 infection, and SARS-CoV-2 infection.

Results A total of 57 692 participants (median [range] age, 51 [18-95] years; 11 720 participants [20.3%] aged ≥65 years; 31 058 participants [53.8%] assigned male at birth) were included. The analysis population included 3270 American Indian or Alaska Native participants (5.7%), 7849 Black or African American participants (13.6%), 17 678 Hispanic or Latino participants (30.6%), and 40 745 White participants (70.6%). Annualized incidence was 13.9% (95% CI, 13.3%-14.4%) for COVID-19 and 2.0% (95% CI, 1.8%-2.2%) for severe COVID-19. Factors associated with increased rates of COVID-19 included workplace exposure (high vs low: aHR, 1.35 [95% CI, 1.16-1.58]; medium vs low: aHR, 1.41 [95% CI, 1.21-1.65]; P < .001) and living condition risk (very high vs low risk: aHR, 1.41 [95% CI, 1.21-1.66]; medium vs low risk: aHR, 1.19 [95% CI, 1.08-1.32]; P < .001). Factors associated with decreased rates of COVID-19 included previous SARS-CoV-2 infection (aHR, 0.13 [95% CI, 0.09-0.19]; P < .001), age 65 years or older (aHR vs age <65 years, 0.57 [95% CI, 0.50-0.64]; P < .001) and Black or African American race (aHR vs White race, 0.78 [95% CI, 0.67-0.91]; P = .002). Factors associated with increased rates of severe COVID-19 included race (American Indian or Alaska Native vs White: aHR, 2.61 [95% CI, 1.85-3.69]; multiracial vs White: aHR, 2.19 [95% CI, 1.50-3.20]; P < .001), diabetes (aHR, 1.54 [95% CI, 1.14-2.08]; P = .005) and at least 2 comorbidities (aHR vs none, 1.39 [95% CI, 1.09-1.76]; P = .008). 
In analyses restricted to participants who contracted COVID-19, increased severe COVID-19 rates were associated with age 65 years or older (aHR vs <65 years, 1.75 [95% CI, 1.32-2.31]; P < .001), race (American Indian or Alaska Native vs White: aHR, 1.98 [95% CI, 1.38-2.83]; Black or African American vs White: aHR, 1.49 [95% CI, 1.03-2.14]; multiracial vs White: aHR, 1.81 [95% CI, 1.21-2.69]; overall P = .001), body mass index (aHR per 1-unit increase, 1.03 [95% CI, 1.01-1.04]; P = .001), and diabetes (aHR, 1.85 [95% CI, 1.37-2.49]; P < .001). Previous SARS-CoV-2 infection was associated with decreased severe COVID-19 rates (aHR, 0.04 [95% CI, 0.01-0.14]; P < .001).

Conclusions and Relevance In this secondary cross-protocol analysis of 4 randomized clinical trials, exposure and demographic factors had the strongest associations with outcomes; results could inform mitigation strategies for SARS-CoV-2 and viruses with comparable epidemiological characteristics.

Introduction

SARS-CoV-2 infection and COVID-19 remain a significant global health challenge.¹,² Despite the development of safe and effective vaccines, globally, billions of people remain unvaccinated. Greater understanding of risk factors for infection and severe disease can guide future vaccine uptake prioritization strategies and therapeutic allocation policies.

Prospective studies have demonstrated that higher risk of SARS-CoV-2 infection is associated with demographic and behavioral cofactors.³–¹⁸ In contrast, the associations of underlying medical conditions with infection acquisition are less well-defined, although retrospective studies have reported that diabetes and obesity were associated with increased risk.⁷,¹⁹–²⁴ Prospective studies have identified both demographic characteristics and comorbidities as risk factors associated with severe COVID-19.²⁵–³¹

Much of the data defining risk for either acquisition of SARS-CoV-2 or severe COVID-19 come from studies that relied on surveys, registries, or electronic medical records. These data often lack a harmonized definition of outcomes, and none used active surveillance. Moreover, high-quality data regarding risk factors for broadly symptomatic COVID-19 are sparse, despite clear, long-term impacts of disease.³²,³³

The COVID-19 Prevention Network (CoVPN) was formed by the US National Institutes of Health to conduct phase 3 vaccine clinical trials.³⁴ Four randomized, controlled, efficacy trials were conducted with harmonized protocols beginning in 2020.³⁵–³⁸ The placebo recipients from these trials afford a large, diverse, multinational cohort. Using these detailed prospective data, and controlling for trial, region, and time in the pandemic, we sought to identify independent risk factors associated with COVID-19, severe COVID-19, any SARS-CoV-2 infection, and subclinical SARS-CoV-2 infection.

Methods

Study Design and Setting

We performed a secondary cross-protocol analysis of participant-level data from the blinded phase of 4 randomized, placebo-controlled, phase 3 COVID-19 vaccine efficacy trials. Local or central institutional review board and ethics committee approvals were obtained by each site participating in the 4 trials.³⁵–³⁸ All participants provided written informed consent to participate in the trials.

We assessed data from participants in the placebo groups of the COVE/CoVPN3001 (ClinicalTrials.gov identifier: NCT04470427; mRNA-1273, Moderna), AZD1222 (ChAdOx1 nCoV-19/CoVPN3002; ClinicalTrials.gov identifier: NCT04516746; AZD1222, AstraZeneca), ENSEMBLE/CoVPN3003 (ClinicalTrials.gov identifier: NCT04505722; Ad26.COV2.S, Janssen), and PREVENT-19/CoVPN3004 (ClinicalTrials.gov identifier: NCT04611802; NVX-CoV2373, Novavax) trials (Figure 1), hereafter referred to as the Moderna, AstraZeneca, Janssen, and Novavax cohorts. Individual study protocols describing study design have been previously published.³⁵–³⁸

Major eligibility criteria were age of at least 18 years, stable health, and being at risk for SARS-CoV-2 infection.³⁹ This analysis comprises data accrued through the blinded, precrossover phases of the trials from placebo recipients who received at least the first study injection.

Trial start dates and blinded precrossover phase cutoff dates are presented in Figure 1. Participants were enrolled from 8 countries, with enrollment beginning in July 2020 and the last blinded phase data cutoff in July 2021.

Study End Points

Study end points diagnosed at least 1 day after the first placebo administration were included and censored at the earliest date of dropout, unblinding, crossover, receipt of outside vaccination, or blinded phase data cutoff. The primary end point was time to COVID-19, defined with minor differences across studies as a positive SARS-CoV-2 reverse transcriptase–polymerase chain reaction (RT-PCR) test result and systemic and/or respiratory symptoms (eTable 1 in Supplement 1).³⁹ The severe COVID-19 end point was defined per Centers for Disease Control and Prevention criteria, including shortness of breath at rest or respiratory distress, respiratory rate of at least 30 breaths per minute, heart rate of at least 125 beats per minute, oxygen saturation of 93% or less on room air, respiratory or multiorgan failure, intensive care unit admission, or death (eTable 1 in Supplement 1).
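As an illustration only, the severity criteria listed above can be written as a single predicate. This is a minimal sketch, not the trials' adjudication logic: the class name, field names, and function name are hypothetical, only the thresholds come from the text, and the sketch does not check that the participant first meets the COVID-19 end point or reflect the per-trial differences documented in eTable 1 in Supplement 1.

```python
from dataclasses import dataclass


@dataclass
class SeverityFindings:
    # All field names are hypothetical; only the thresholds come from the text.
    respiratory_rate: float          # breaths per minute
    heart_rate: float                # beats per minute
    spo2_room_air: float             # oxygen saturation on room air, percent
    dyspnea_at_rest_or_distress: bool = False
    respiratory_or_multiorgan_failure: bool = False
    icu_admission: bool = False
    died: bool = False


def meets_severe_criteria(f: SeverityFindings) -> bool:
    """Return True if any one of the listed severity criteria is met."""
    return (
        f.dyspnea_at_rest_or_distress
        or f.respiratory_rate >= 30      # at least 30 breaths per minute
        or f.heart_rate >= 125           # at least 125 beats per minute
        or f.spo2_room_air <= 93         # 93% or less on room air
        or f.respiratory_or_multiorgan_failure
        or f.icu_admission
        or f.died
    )
```

Any single criterion is sufficient in this sketch, which mirrors the "or" in the list above.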

The subclinical SARS-CoV-2 infection end point was defined as having positive SARS-CoV-2 RT-PCR test result or antinucleocapsid protein (anti-N) test result, but not meeting the definition for the primary (symptomatic) COVID-19 end point among participants whose test results were negative for SARS-CoV-2 at baseline. The testing schedule is shown in eTable 2 in Supplement 1.

The any SARS-CoV-2 infection end point was defined as meeting the primary COVID-19 end point or meeting the subclinical SARS-CoV-2 infection end point among participants whose test results were negative for SARS-CoV-2 at baseline. Each of the 4 trials included anti-N testing at designated visits; the Moderna and Novavax cohorts also included SARS-CoV-2 RT-PCR testing regardless of symptoms at designated visits (eTable 2 in Supplement 1). Additional analyses were performed to identify variables associated with COVID-19 rates among participants meeting the any SARS-CoV-2 infection end point and to identify variables associated with severe COVID-19 rates among participants meeting the COVID-19 end point.
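The relationship among the COVID-19, subclinical, and any-infection end points described above can be sketched as follows. This is a simplified illustration under stated assumptions: the function and argument names are hypothetical, symptoms are reduced to a single flag, and the timing rules (for example, diagnosis at least 1 day after first injection) and the per-protocol restriction are omitted.

```python
def classify_end_points(baseline_negative: bool,
                        pcr_positive: bool,
                        anti_n_positive: bool,
                        has_qualifying_symptoms: bool) -> dict:
    """Map one participant's results to the end points defined in the text.

    All argument names are hypothetical simplifications of trial data.
    """
    # Primary end point: positive RT-PCR result plus qualifying symptoms.
    covid19 = pcr_positive and has_qualifying_symptoms

    # Subclinical infection: positive RT-PCR or anti-N result without meeting
    # the primary end point, among participants negative at baseline.
    subclinical = (
        baseline_negative
        and (pcr_positive or anti_n_positive)
        and not covid19
    )

    # Any infection: primary or subclinical end point, among participants
    # negative at baseline.
    any_infection = baseline_negative and (covid19 or subclinical)

    return {
        "covid19": covid19,
        "subclinical": subclinical,
        "any_infection": any_infection,
    }
```

By construction, the subclinical and COVID-19 flags are mutually exclusive, and a participant with a positive baseline test can meet the COVID-19 end point but not the any-infection end point.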

Baseline Characteristics

We evaluated 23 pre-enrollment variables as potential risk factors for study outcomes. These variables included demographic characteristics: age (≥65 vs 18-64 years), sex assigned at birth, race, ethnicity, body mass index (BMI; calculated as weight in kilograms divided by height in meters squared), and region (SARS-CoV-2 epidemiological characteristics in South America, North America, or South Africa, where trials were conducted); comorbid conditions: asthma, cardiovascular disease, hypertension, diabetes, smoking, obesity (defined as BMI ≥30), lung disease, liver disease, kidney disease, HIV, and number of comorbidities; and SARS-CoV-2 exposure risk or history: Occupational Safety and Health Administration risk category (high, medium, and low), living situation risk score (very high, high, medium, and low), evidence of previous SARS-CoV-2 infection (defined as positive SARS-CoV-2 PCR test result or anti-N serostatus) at screening or enrollment, and calendar date of enrollment. Race and ethnicity were collected by self-report, and race was categorized as American Indian or Alaska Native, Asian, Black or African American, White, other, and multiple. Indigenous people from South America were classified together with the American Indian or Alaska Native US and Mexico demographic according to the US Food and Drug Administration definition (ie, a person having origins in any of the original peoples of North and South America [including Central America] and who maintains tribal affiliation or community attachment). Participants also had the option to select other race without specifying. Ethnicity was categorized as Hispanic or Latino or not Hispanic or Latino. Race and ethnicity were included in analysis to describe participant demographic characteristics. Definitions and derivations of these variables are provided in the eMethods and eTable 3 in Supplement 1.

Statistical Analysis

The COVID-19 and severe COVID-19 end points were evaluated with the full cohort of participants who received the first placebo administration. Analyses of the any SARS-CoV-2 infection and subclinical SARS-CoV-2 infection end points were restricted to the per-protocol population of participants with negative SARS-CoV-2 test results at baseline who received all planned placebo injections. Instantaneous hazard estimates over calendar time were obtained by smoothing the increments of the Nelson-Aalen estimator for the cumulative hazard function and plotted with global epidemiologic trends using information from the World Health Organization (WHO)⁴⁰ and GISAID.⁴¹ Missing data were imputed by the median for continuous variables and by the most frequent category for categorical variables. Univariate and multivariate analyses generated adjusted hazard ratios (aHRs), with study and region (reflecting SARS-CoV-2 epidemiological characteristics in North America, South America, and South Africa) as stratification variables to account for potentially different baseline hazard functions across trials and regions. Survival random forest, as implemented in the randomForestSRC package in R statistical software version 4.1.0 (R Project for Statistical Computing), was used to rank the relative importance of covariates, without imposing any parametric assumptions on the association between the risk factors and the study outcomes, while accounting for potential correlations among the risk factors.⁴² Univariate and multivariate Cox proportional hazards regression models, with and without 2-way interaction terms for age (18-64 vs ≥65 years) or ethnicity and each of the other covariates, were used to estimate the associations of baseline covariates using both calendar and study time scales. Given the similarity of the results, only calendar time models are presented.
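The idea of smoothing Nelson-Aalen increments into an instantaneous hazard can be illustrated with a short sketch. This is not the analysis code, which was written in R: the function names are hypothetical, the Epanechnikov kernel and the bandwidth are assumptions (the text does not name a kernel), and the sketch uses a simple follow-up time scale without the delayed entry that a calendar time scale would require.

```python
def nelson_aalen_increments(times, events):
    """Return distinct event times and Nelson-Aalen increments d_i / n_i.

    times  : follow-up time for each participant
    events : 1 if the end point occurred at that time, 0 if censored
    """
    ordered = sorted(zip(times, events))
    n_at_risk = len(ordered)
    event_times, increments = [], []
    i = 0
    while i < len(ordered):
        t = ordered[i][0]
        n_events = 0
        n_leaving = 0
        # Collect everyone whose follow-up ends at time t.
        while i < len(ordered) and ordered[i][0] == t:
            n_events += ordered[i][1]
            n_leaving += 1
            i += 1
        if n_events > 0:
            event_times.append(t)
            increments.append(n_events / n_at_risk)
        n_at_risk -= n_leaving
    return event_times, increments


def smoothed_hazard(t, event_times, increments, bandwidth):
    """Kernel-smooth the increments around time t (Epanechnikov kernel, assumed)."""
    total = 0.0
    for s, inc in zip(event_times, increments):
        u = (t - s) / bandwidth
        if abs(u) <= 1:
            total += 0.75 * (1 - u * u) * inc
    return total / bandwidth
```

Summing the increments gives the Nelson-Aalen cumulative hazard; evaluating `smoothed_hazard` on a grid of time points gives a curve of the kind plotted against epidemic trends.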
The Cox proportional hazards assumption was evaluated for each risk factor considered in the models.⁴³ The model assumption results for the primary COVID-19 end point are shown in eFigure 1 in Supplement 1. False-discovery rate–based P value adjustment was applied within each end point for all univariate analyses. The final multivariate Cox models with and without interactions were chosen by the Akaike information criterion in a stepwise algorithm. The final multivariate Cox model with 2-way interactions kept interaction terms with P < .01 for reliability of interpretation. Consequently, not all variables that were significant in the univariate analysis were included in the multivariate model. P < .01 was considered statistically significant. All analyses were performed using R software version 4.1.0. Data were analyzed from April 2022 to February 2023.
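For readers unfamiliar with false-discovery rate adjustment, the sketch below shows the Benjamini-Hochberg procedure. The text does not state which false-discovery rate method was used, so Benjamini-Hochberg is an assumption here, and the function name is hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the original order.

    Assumption: the article says only "false-discovery rate-based"
    adjustment; Benjamini-Hochberg is used here for illustration.
    """
    m = len(p_values)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

In this scheme, the adjustment would be applied separately to the set of univariate P values for each end point, as described above.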

Results

Demographics

Across the 4 studies, 57 692 participants were randomized and received placebo injections. Median (range) age was 51 (18-95) years; 31 058 (53.8%) were male, and 11 720 participants (20.3%) were aged 65 years or older (Table). The Novavax study enrolled the youngest population, with 12.6% of placebo participants aged 65 years or older.

Overall, 3270 participants (5.7%) were American Indian or Alaska Native, 7849 participants (13.6%) were Black or African American, 17 678 participants (30.6%) were Hispanic or Latino, and 40 745 participants (70.6%) were White. Most participants were enrolled in the US (43 643 participants [75.6%]), followed by Brazil (3635 participants [6.3%]), South Africa (3289 participants [5.7%]), Colombia (2123 participants [3.7%]), Argentina (1498 participants [2.6%]), Peru (1376 participants [2.4%]), Chile (1299 participants [2.3%]), and Mexico (829 participants [1.4%]). Demographic characteristics were largely consistent across the 4 trials; the Janssen study contributed the most diverse population racially, ethnically, and geographically (Table).

Clinical Characteristics at Enrollment

Comorbid conditions among placebo recipients were distributed similarly across trials and included cardiovascular disease (15 996 participants [27.7%]), diabetes (5629 participants [9.8%]), kidney disease (331 participants [0.6%]), liver disease (570 participants [1.0%]), lung disease (4993 participants [8.7%]), obesity (19 879 participants [34.5%]), HIV (953 participants [1.7%]), history of smoking (5840 participants [10.1%]), and hypertension (15 310 participants [26.5%]). Across trials, 21 164 participants (36.7%) had 2 or more comorbidities, 12 231 participants (21.2%) had 3 or more comorbidities, and 4960 participants (8.6%) had 4 or more comorbidities. Most participants had negative results on baseline anti-N antibody and SARS-CoV-2 RT-PCR testing (54 156 participants [93.9%]).

Exposure Risk

A low-risk living condition was most common (26 036 participants [45.1%]), followed by medium (20 582 participants [35.7%]), high (7448 participants [12.9%]), and very high (3626 participants [6.3%]). Living condition risk varied by study, reflecting regional differences of the individual trial populations. A low workplace exposure risk was most common (29 887 participants [51.8%]), followed by medium (11 189 participants [19.4%]) and high (9131 participants [15.8%]) workplace exposure risk. Participants in the Janssen trial had the lowest workplace exposure risks.

Incidence Rates by Study Outcome

Median (range) blinded precrossover follow-up time was 3.8 (0-11.1) months. Across the 4 trials, 2559 placebo recipients developed COVID-19, with an incidence rate of 13.9% (95% CI, 13.3%-14.4%); 367 placebo recipients developed severe COVID-19, with an incidence rate of 2.0% (95% CI, 1.8%-2.2%) (eTable 4 in Supplement 1). In the per-protocol cohort, 3774 placebo recipients developed any SARS-CoV-2 infection, with an incidence rate of 24.3% (95% CI, 23.5%-25.1%); 1612 placebo recipients developed subclinical infection, with an incidence rate of 10.3% (95% CI, 9.8%-10.9%). We plotted the smoothed hazard estimates over calendar time for COVID-19 in each trial (Figure 2; eFigure 2 in Supplement 1). To put our data into context, epidemiological trends for the regions of the trials are presented using information from the WHO⁴⁰ and GISAID.⁴¹ Smoothed hazard peaks largely aligned with case trends in the countries contributing data to each study. The trials enrolled primarily during pandemic waves with the ancestral and Alpha variants. However, trials also included participants with infections with Beta, Delta, Epsilon, Gamma, Mu, and Lambda variants (Figure 2; eFigure 2 and eTable 5 in Supplement 1).
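An annualized incidence of this kind is, in general, the number of events divided by the person-years at risk. The sketch below shows that arithmetic with a log-scale Poisson confidence interval. It is an illustration under stated assumptions, not the authors' method: the function name is hypothetical, the interval formula is one common approximation, and the person-years value in the usage note is a hypothetical figure chosen so that the rate matches the reported 13.9%, because person-time is not given in this section.

```python
import math


def annualized_incidence(events, person_years, z=1.96):
    """Return (rate, lower, upper) per 100 person-years.

    Uses the approximation SE(log rate) = 1 / sqrt(events) for a Poisson
    count; this choice of interval is an assumption for illustration.
    """
    rate = events / person_years
    half_width = z / math.sqrt(events)
    lower = rate * math.exp(-half_width)
    upper = rate * math.exp(half_width)
    return 100 * rate, 100 * lower, 100 * upper
```

For example, `annualized_incidence(2559, 18410.0)` uses the reported 2559 COVID-19 events with a hypothetical 18 410 person-years of follow-up.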

Variables Associated with Study Outcomes

Univariate Cox regression model results are included in eFigure 3 and eFigure 4 in Supplement 1. In the multivariate Cox regression models, covariates associated with an increased rate of COVID-19 included workplace exposure risk (high vs low risk: aHR, 1.35 [95% CI, 1.16-1.58]; medium vs low risk: aHR, 1.41 [95% CI, 1.21-1.65]; overall P < .001), living condition risk (very high vs low risk: aHR, 1.41 [95% CI, 1.21-1.66]; medium vs low risk: aHR, 1.19 [95% CI, 1.08-1.32]; overall P < .001), and BMI (aHR per 1-unit increase, 1.02 [95% CI, 1.01-1.03]; P < .001) (Figure 3). Covariates associated with a decreased rate of COVID-19 included age 65 years or older (aHR vs <65 years, 0.57 [95% CI, 0.50-0.64]; P < .001), Black or African American race (aHR vs White race, 0.79 [95% CI, 0.68-0.92]; overall P = .002), evidence of previous SARS-CoV-2 infection at enrollment (aHR, 0.13 [95% CI, 0.09-0.19]; P < .001), hypertension (aHR, 0.78 [95% CI, 0.66-0.93]; P = .004), and history of smoking (aHR, 0.76 [95% CI, 0.63-0.92]; P = .005).
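A reported hazard ratio and its 95% CI can be converted back to the log scale on which a Cox model is estimated, which helps when comparing the strength of evidence across covariates. The sketch below assumes a symmetric Wald interval on the log scale; the function name is hypothetical, and the results are approximations limited by the rounding of the published values.

```python
import math


def log_hr_summary(hr, ci_lower, ci_upper, z=1.96):
    """Approximate log hazard ratio, standard error, Wald z, and 2-sided P.

    Assumes the reported CI is exp(log HR +/- z * SE).
    """
    log_hr = math.log(hr)
    se = (math.log(ci_upper) - math.log(ci_lower)) / (2 * z)
    wald_z = log_hr / se
    # Two-sided P value from the standard normal distribution.
    p_value = math.erfc(abs(wald_z) / math.sqrt(2))
    return log_hr, se, wald_z, p_value
```

For a binary covariate such as age 65 years or older, `log_hr_summary(0.57, 0.50, 0.64)` can be compared with the reported P < .001. For covariates with more than 2 levels, the overall P value reported in the text tests the covariate as a whole, so a single contrast will not reproduce it.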

We then considered these covariates for the severe COVID-19 end point. Covariates associated with an increased rate of severe COVID-19 included American Indian or Alaska Native race (aHR vs White race, 2.61 [95% CI, 1.85-3.69]) and multiple races (aHR vs White race, 2.19 [95% CI, 1.50-3.20]; overall P < .001), higher BMI (aHR per 1-unit increase, 1.04 [95% CI, 1.03-1.06]; P < .001), diabetes (aHR, 1.54 [95% CI, 1.14-2.08]; P = .005), and having at least 2 comorbidities (aHR vs none, 1.39 [95% CI, 1.09-1.76]; P = .008). Previous SARS-CoV-2 infection at enrollment was associated with a decreased rate of severe COVID-19 (aHR, 0.04 [95% CI, 0.01-0.14]; P < .001).

In an analysis limited to participants meeting the COVID-19 end point, an increased rate of severe COVID-19 was associated with age at least 65 years (aHR vs <65 years, 1.75 [95% CI, 1.32-2.31]; P < .001), American Indian or Alaska Native race (aHR vs White race, 1.98 [95% CI, 1.38-2.83]), Black or African American race (aHR vs White race, 1.49 [95% CI, 1.03-2.14]), multiple races (aHR vs White race, 1.81 [95% CI, 1.21-2.69]; overall P for race = .001), BMI (aHR per 1-unit increase, 1.03 [95% CI, 1.01-1.04]; P = .001), and diabetes (aHR, 1.85 [95% CI, 1.37-2.49]; P < .001) (eFigure 5 and eFigure 6 in Supplement 1).

We next assessed covariates associated with rates of subclinical SARS-CoV-2 infection among participants in the per-protocol cohort. Covariates associated with an increased rate of subclinical SARS-CoV-2 infection included Black or African American race (aHR vs White race, 1.54 [95% CI, 1.34-1.77]; overall P < .001), higher BMI (aHR per 1-unit increase, 1.02 [95% CI, 1.01-1.03]; P < .001), high workplace exposure risk (aHR vs low risk, 1.56 [95% CI, 1.25-1.94]; overall P < .001), and very high–risk living situation (aHR vs low risk, 1.40 [95% CI, 1.16-1.69]; P = .008). Asian race was associated with a lower rate of subclinical SARS-CoV-2 infection compared with White race (aHR, 0.56 [95% CI, 0.38-0.82]; P < .001).

We next analyzed the data across all participants in the per-protocol cohort meeting the any SARS-CoV-2 infection end point (COVID-19 or subclinical SARS-CoV-2 infection). Covariates associated with an increased rate of any SARS-CoV-2 infection included higher BMI (aHR per 1-unit increase, 1.02 [95% CI, 1.01-1.02]; P < .001); workplace exposure risk (high vs low risk: aHR, 1.42 [95% CI, 1.23-1.64]; medium vs low risk: aHR, 1.25 [95% CI, 1.08-1.44]; overall P < .001); living situation risk (very high vs low risk: aHR, 1.46 [95% CI, 1.29-1.66]; high vs low risk: aHR, 1.15 [95% CI, 1.04-1.28]; medium vs low risk: aHR, 1.17 [95% CI, 1.08-1.27]; overall P < .001); and Black or African American race (aHR vs White race, 1.11 [95% CI, 1.00-1.23]; P < .001).

However, among participants meeting the any SARS-CoV-2 infection definition, race was significantly associated with a decreased rate of developing COVID-19 (American Indian or Alaska Native vs White race: aHR, 0.63 [95% CI, 0.54-0.75]; Black or African American vs White race: aHR, 0.56 [95% CI, 0.47-0.67]; multiracial vs White race: aHR, 0.71 [95% CI, 0.59-0.88]; overall P < .001) (eFigure 7 and eFigure 8 in Supplement 1).

The relative strength of association of covariates with each end point was assessed using a variable importance ranking analysis. Consistent with the Cox regression models, evidence of previous SARS-CoV-2 infection at enrollment, regional differences in SARS-CoV-2 epidemiological characteristics (eFigure 2 in Supplement 1), race, age, living condition, and BMI were top variables associated with COVID-19 rates; for severe COVID-19, evidence of previous SARS-CoV-2 infection at enrollment and region were top variables (Figure 4).

Because age and ethnicity, 2 demographic characteristics of COVID-19 research focus, were associated with COVID-19 rates in the univariate analyses, we examined their interactions with other covariates and found significant interactions between age and workplace exposure risk and between evidence of previous SARS-CoV-2 infection and ethnicity. High workplace exposure was associated with an increased rate of COVID-19 among adults aged 18 to 64 years but a decreased rate among adults 65 years or older (eFigure 9 in Supplement 1). The inverse association of older age with COVID-19 rates lessened in strength as workplace exposure risk decreased (high exposure risk aHR, 0.32 [95% CI, 0.24-0.43]; medium exposure risk aHR, 0.59 [95% CI, 0.47-0.74]; low exposure risk aHR, 0.69 [95% CI, 0.58-0.81]; P for interaction < .001). Evidence of previous SARS-CoV-2 infection at enrollment demonstrated a stronger association with a decreased rate of COVID-19 among Hispanic or Latino participants (aHR, 0.08 [95% CI, 0.04-0.14]) than among participants who were not Hispanic or Latino (aHR, 0.23 [95% CI, 0.15-0.36]; P for interaction = .004).

Discussion

In this secondary cross-protocol analysis of 4 randomized clinical trials, we present what is, to our knowledge, the largest, most diverse global cohort with prospective active surveillance follow-up, enabling precise outcome ascertainment and identification of variables associated with COVID-19 rates. We noted high incidence of SARS-CoV-2 infection in approximately 4 months of follow-up, highlighting the remarkable infectivity of the virus and providing sufficient events for a robust risk assessment. Risk factors identified in this analysis can inform mitigation strategies for SARS-CoV-2 and viruses with comparable epidemiological characteristics. In addition, the characterization of these risk factors will increase efficiency in future analyses of this unique data set.

The sequential rollout of the 4 clinical trials allowed evaluation over different pandemic waves and diverse SARS-CoV-2 strains, affording generalizable results by capturing infections during periods of circulation of the ancestral strain, B.1.1.7 (Alpha variant), B.1.617.2 (Delta variant), B.1.351 (Beta variant), P.1 (Gamma variant), and C.37 (Lambda variant).⁴⁴

In our analysis, hazard of COVID-19 was associated with workplace exposure risks and living condition risks and inversely associated with previous SARS-CoV-2 infection. Additionally, region had a strong association with hazard of COVID-19, which may reflect differences in study population and local mitigation practices, force of infection during the follow-up period, or the circulating variant.¹⁴,¹⁵ In this study, we confirm previous findings reporting associations of infection rates with exposure risk and baseline serostatus,¹⁴,¹⁵ and we offer additional insights, showing that prior infection and ongoing exposure risks at the individual, household, and regional level were associated not only with infection but also disease.

Some of our findings diverge from previously reported observations. First, large prospective studies have found male sex was associated with SARS-CoV-2 acquisition and severe disease,¹⁷,⁴⁵ which we did not find. Second, Black or African American race was associated with a higher rate of any SARS-CoV-2 infection and subclinical infection but a lower rate of COVID-19. This could reflect selection bias in our study population related to socioeconomic status or health care–seeking behavior of trial participants. Consistent with other reports, in the analysis of severe COVID-19 among participants with COVID-19, Black or African American race was associated with severe disease, as was American Indian or Alaska Native race. This striking finding for American Indian or Alaska Native participants has been previously reported⁴⁶–⁴⁸ with SARS-CoV-2 and other respiratory viruses⁴⁹,⁵⁰ and highlights the need to elucidate the mechanism behind the association.

Previous studies have demonstrated increased risk for severe COVID-19 among older individuals,³¹ which is consistent with other respiratory viruses, like influenza and respiratory syncytial virus.⁵¹–⁵³ In our analysis, age had a strong inverse association with any SARS-CoV-2 infection and COVID-19, with the strongest inverse association among those with the highest workplace exposure risk. These findings likely reflect stricter adherence to social distancing and masking and lower social activity by older participants. This association was not seen with severe COVID-19, suggesting that a lower rate of infection due to behavioral adjustment obscured a truly higher rate of severe disease in older adults as was seen in our ad hoc analysis of severe disease rates restricted to participants with COVID-19.

In contrast to other reports that have shaped Centers for Disease Control and Prevention and WHO guidelines for prevention and treatment,⁵,¹⁸,²¹ in this study, specific comorbidities did not drive COVID-19 rates. Rather, we found inverse associations for COVID-19 with smoking and hypertension. Previous studies of the association of smoking with COVID-19 outcomes have yielded conflicting results, while hypertension emerged as a perceived risk factor for COVID-19 early in the pandemic, potentially influencing the behavior of study participants. As these associations did not persist when assessing COVID-19 risk only among participants with SARS-CoV-2 infection, their link with COVID-19 outcomes remains unclear. Regarding severe COVID-19, diabetes was the sole individual comorbidity associated with an increased rate, although we did find that having at least 2 comorbidities was associated with an approximately 40% increased rate of severe disease. Overall, our data suggest preexisting immunity, demographics, and exposure risks were the strongest variables associated with disease rates.

Limitations

This study has some limitations. It is unclear how infection with specific variants (eg, Omicron) not captured in our study or vaccination would moderate identified risk factors. There were also differences in the severe COVID-19 definition for each trial, warranting additional study of the severe cases within this cohort. The relatively short follow-up time limits understanding of the durability of associations. Any infection and subclinical infection analyses were limited by lack of frequent molecular or anti-N testing, and emerging evidence of variable N antibody responses.⁵⁴ Differentiation between symptomatic and subclinical infection depended on self-report, which may vary by demographics. In addition, although our analyses considered aggregated study- and region-specific associations of the risk factors, there could still be heterogeneity across the trials (eg, percentage of participants with smoking history) that was not adequately accounted for in the final multivariate models. Furthermore, ascertainment of comorbidities also relied on self-report, excepting obesity, which was measured by BMI. Variable access to health care among participants may have limited self-awareness of comorbid conditions. These considerations suggest a role for individualized risk stratification.

Conclusions

In this secondary cross-protocol analysis, we conducted a detailed assessment of variables associated with rates of COVID-19, severe disease, any SARS-CoV-2 infection, and subclinical SARS-CoV-2 infection obtained systematically from the largest clinical trial cohort enrolled in diverse regions of the world with follow-up spanning multiple waves of the pandemic. Results offer generalizable and precise identification of risk factors and may inform future vaccination policy as SARS-CoV-2 becomes endemic to human populations. Results may also help identify populations at risk in the setting of potential future pandemics.
