variancePartition: Interpreting drivers of variation in complex gene expression studies

Research output: Contribution to journal › Article › peer-review

345 Scopus citations

Abstract

Background: As large-scale studies of gene expression with multiple sources of biological and technical variation become widely adopted, characterizing these drivers of variation becomes essential to understanding disease biology and regulatory genetics.

Results: We describe a statistical and visualization framework, variancePartition, to prioritize drivers of variation based on a genome-wide summary, and identify genes that deviate from the genome-wide trend. Using a linear mixed model, variancePartition quantifies variation in each expression trait attributable to differences in disease status, sex, cell or tissue type, ancestry, genetic background, experimental stimulus, or technical variables. Analysis of four large-scale transcriptome profiling datasets illustrates that variancePartition recovers striking patterns of biological and technical variation that are reproducible across multiple datasets.

Conclusions: Our open source software, variancePartition, enables rapid interpretation of complex gene expression studies as well as other high-throughput genomics assays. variancePartition is available from Bioconductor: http://bioconductor.org/packages/variancePartition.
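To make the idea of "variation attributable to" a variable concrete, the following is a minimal sketch, not the package's implementation. It assumes a single grouping variable (for example, individual) in a balanced design and uses a method-of-moments (one-way ANOVA) estimator in place of the full linear mixed model that variancePartition fits per gene. The function name `variance_fractions` and the simulated data are hypothetical.

```python
import numpy as np


def variance_fractions(y, groups):
    """Fraction of variance in one expression trait explained by one grouping variable.

    Simplifying assumptions (not the variancePartition method itself):
    a single random effect, a balanced design, and ANOVA-style
    method-of-moments estimates of the variance components.
    """
    y = np.asarray(y, dtype=float)
    groups = np.asarray(groups)
    labels = np.unique(groups)
    k = len(labels)
    sizes = np.array([np.sum(groups == g) for g in labels])
    if k < 2 or sizes[0] < 2 or np.any(sizes != sizes[0]):
        raise ValueError("need a balanced design with >= 2 groups and >= 2 samples per group")
    n = int(sizes[0])

    means = np.array([y[groups == g].mean() for g in labels])
    grand_mean = y.mean()

    # Mean squares between and within groups.
    ms_between = n * np.sum((means - grand_mean) ** 2) / (k - 1)
    ss_within = sum(np.sum((y[groups == g] - m) ** 2) for g, m in zip(labels, means))
    ms_within = ss_within / (k * (n - 1))

    # Variance components; a negative between-group estimate is truncated at zero.
    var_group = max((ms_between - ms_within) / n, 0.0)
    var_resid = ms_within
    total = var_group + var_resid
    if total == 0:
        raise ValueError("trait has zero variance")
    return {"group": var_group / total, "residual": var_resid / total}


# Simulated trait: 50 individuals, 4 samples each,
# individual-level variance 4 and residual variance 1.
rng = np.random.default_rng(0)
individual = np.repeat(np.arange(50), 4)
expression = rng.normal(0.0, 2.0, size=50)[individual] + rng.normal(0.0, 1.0, size=200)

fractions = variance_fractions(expression, individual)
print(fractions)
```

In this simulation the true fraction attributable to individual is 4 / (4 + 1) = 0.8, so the estimate should land near that value. Repeating the calculation for every gene would give the kind of genome-wide summary the abstract describes, although a real analysis with several correlated variables needs the full mixed model.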

Original language: English
Article number: 483
Journal: BMC Bioinformatics
Volume: 17
Issue number: 1
State: Published - 25 Nov 2016

Keywords

  • Linear mixed model
  • RNA-seq
  • Transcriptome profiling
