Training a model for estimating leukocyte composition using whole-blood DNA methylation and cell counts as reference

Jonathan A. Heiss, Lutz P. Breitling, Benjamin Lehne, Jaspal S. Kooner, John C. Chambers, Hermann Brenner

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

Aim: Whole-blood DNA methylation depends on the underlying leukocyte composition and confounding hereby is a major concern in epigenome-wide association studies. Cell counts are often missing or may not be feasible. Computational approaches estimate leukocyte composition from DNA methylation based on reference datasets of purified leukocytes. We explored the possibility to train such a model on whole-blood DNA methylation and cell counts without the need for purification. Materials & methods: Using whole-blood DNA methylation and corresponding five-part cell counts from 2445 participants from the London Life Sciences Prospective Population Study, a model was trained on a subset of 175 subjects and evaluated on the remaining. Results: Correlations between cell counts and estimated cell proportions were high (neutrophils 0.85, eosinophils 0.88, basophils 0.02, lymphocytes 0.84, monocytes 0.55) and estimated proportions explained more variance in whole-blood DNA methylation levels than counts. Conclusion: Our model provided precise estimates for the common cell types.

Original languageEnglish
Pages (from-to)13-20
Number of pages8
JournalEpigenomics
Volume9
Issue number1
DOIs
StatePublished - Jan 2017
Externally publishedYes

Keywords

  • DNA methylation
  • Infinium 450K
  • KAROLA
  • LOLIPOP
  • estimation of cell proportions
  • leukocyte composition
  • white-blood cell distribution

Fingerprint

Dive into the research topics of 'Training a model for estimating leukocyte composition using whole-blood DNA methylation and cell counts as reference'. Together they form a unique fingerprint.

Cite this