Dominant effects of the huntington's disease HTT CAG repeat length are captured in gene-expression data sets by a continuous analysis mathematical modeling strategy

Jong Min Lee, Ekaterina I. Galkina, Rachel M. Levantovsky, Elisa Fossale, Mary Anne Anderson, Tammy Gillis, Jayalakshmi Srinidhi Mysore, Kathryn R. Coser, Toshi Shioda, Bin Zhang, Matthew D. Furia, Jonathan Derry, Isaac S. Kohane, Ihn Sik Seong, Vanessa C. Wheeler, James F. Gusella, Marcy E. MacDonald

Research output: Contribution to journalArticlepeer-review

22 Scopus citations

Abstract

In Huntington's disease (HD), the size of the expanded HTTCAGrepeat mutation is the primary driver of the processes that determine age at onset of motor symptoms. However, correlation of cellular biochemical parameters also extends across the normal repeat range, supporting the view that the CAG repeat represents a functional polymorphism with dominant effects determined by the longer allele. A central challenge to defining the functional consequences of this single polymorphism is the difficulty of distinguishing its subtle effects from the multitude of other sources of biological variation.We demonstrate that an analytical approach based upon continuous correlation with CAG size was able to capture the modest (̃21%) contribution of the repeat to the variation in genome-wide gene expression in 107 lymphoblastoid cell lines, with alleles ranging from 15 to 92CAGs. Furthermore, a mathematical model from an iterative strategy yielded predicted CAG repeat lengths that were significantly positively correlated with true CAG allele size and negatively correlated with age at onset of motor symptoms. Genes negatively correlated with repeat size were also enriched in a set of genes whose expression were CAG-correlated in human HD cerebellum. These findings both reveal the relatively small, but detectable impact of variation in the CAG allele in global data in these peripheral cells and provide a strategy for building multi-dimensional data-driven models of the biological network that drives the HD disease process by continuous analysis across allelic panels of neuronal cells vulnerable to the dominant effects of the HTT CAG repeat.

Original languageEnglish
Article numberddt176
Pages (from-to)3227-3238
Number of pages12
JournalHuman Molecular Genetics
Volume22
Issue number16
DOIs
StatePublished - Aug 2013
Externally publishedYes

Fingerprint

Dive into the research topics of 'Dominant effects of the huntington's disease HTT CAG repeat length are captured in gene-expression data sets by a continuous analysis mathematical modeling strategy'. Together they form a unique fingerprint.

Cite this