ChEA2: Gene-set libraries from ChIP-X experiments to decode the transcription regulome

Yan Kou, Edward Y. Chen, Neil R. Clark, Qiaonan Duan, Christopher M. Tan, Avi Ma'ayan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

13 Scopus citations

Abstract

ChIP-seq experiments provide a plethora of data regarding transcription regulation in mammalian cells. Integrating ChIP-seq studies into a computable resource is potentially useful for further knowledge extraction from such data. We continually collect and expand a database where we convert results from ChIP-seq experiments into gene-set libraries. The manual portion of this database currently contains 200 transcription factors from 221 publications for a total of 458,471 transcription-factor/target interactions. In addition, we automatically compiled data from the ENCODE project which includes 920 experiments applied to 44 cell-lines profiling 160 transcription factors for a total of ∼1.4 million transcription-factor/target-gene interactions. Moreover, we processed data from the NIH Epigenomics Roadmap project for 27 different types of histone marks in 64 different human cell-lines. All together the data was processed into three simple gene-set libraries where the set label is either a mammalian transcription factor or a histone modification mark in a particular cell line, organism and experiment. Such gene-set libraries are useful for elucidating the experimentally determined transcriptional networks regulating lists of genes of interest using gene-set enrichment analyses. Furthermore, from these three gene-set libraries, we constructed regulatory networks of transcription factors and histone modifications to identify groups of regulators that work together. For example, we found that the Polycomb Repressive Complex 2 (PRC2) is involved with three distinct clusters each interacting with different sets of transcription factors. Notably, the combined dataset is made into web-based application software where users can perform enrichment analyses or download the data in various formats. The open source ChEA2 web-based software and datasets are available freely online at http://amp.pharm.mssm.edu/ChEA2.

Original languageEnglish
Title of host publicationAvailability, Reliability, and Security in Information Systems and HCI - IFIP WG 8.4, 8.9, TC 5 International Cross-Domain Conference, CD-ARES 2013, Proceedings
Pages416-430
Number of pages15
DOIs
StatePublished - 2013
EventIFIP WG 8.4, 8.9, TC 5 International Cross-Domain Conference on Availability, Reliability, and Security in Information Systems and HCI, CD-ARES 2013 - Regensburg, Germany
Duration: 2 Sep 20136 Sep 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8127 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceIFIP WG 8.4, 8.9, TC 5 International Cross-Domain Conference on Availability, Reliability, and Security in Information Systems and HCI, CD-ARES 2013
Country/TerritoryGermany
CityRegensburg
Period2/09/136/09/13

Keywords

  • ChIP-chip
  • ChIP-seq
  • Data Integration
  • Data Visualization
  • ENCODE
  • Enrichment Analysis
  • JavaScript D3
  • Microarrays
  • Systems Biology
  • Transcriptional Networks

Fingerprint

Dive into the research topics of 'ChEA2: Gene-set libraries from ChIP-X experiments to decode the transcription regulome'. Together they form a unique fingerprint.

Cite this