Abstract
Preventive measures, health behaviors, environmental exposures, and sociodemographic characteristics affect individual-level cancer risks. It is unclear how they influence neighborhood-level cancer risks. We developed a large-scale neighborhood health dataset for 72,337 census tracts in the United States by combining data from three publicly available sources. We used Bayesian additive regression trees to identify the most important predictors of tract-level cancer prevalence among adults (age ≥18 years), and examined their impact on cancer prevalence using partial dependence plots. The five most important census tract-level correlates of cancer prevalence were the proportion of population who were aged 65 years and older, had routine checkup and were non-Hispanic White, the proportion of houses built before 1960, and the proportion of population living below the poverty line. The identified predictors of neighborhood-level cancer prevalence may inform public health practitioners and policymakers to prioritize the improvement of environmental and neighborhood factors in reducing the cancer burden.
| Original language | English |
|---|---|
| Article number | 100522 |
| Journal | Spatial and Spatio-temporal Epidemiology |
| Volume | 42 |
| DOIs | |
| State | Published - Aug 2022 |
Keywords
- Cancer prevalence
- Machine learning
- Neighborhood
- Variable selection