TY - JOUR
T1 - MMBGX
T2 - A method for estimating expression at the isoform level and detecting differential splicing using whole-transcript Affymetrix arrays
AU - Turro, Ernest
AU - Lewin, Alex
AU - Rose, Anna
AU - Dallman, Margaret J.
AU - Richardson, Sylvia
PY - 2009/10/23
Y1 - 2009/10/23
N2 - Affymetrix has recently developed whole-transcript GeneChips-'Gene' and 'Exon' arrays-which interrogate exons along the length of each gene. Although each probe on these arrays is intended to hybridize perfectly to only one transcriptional target, many probes match multiple transcripts located in different parts of the genome or alternative isoforms of the same gene. Existing statistical methods for estimating expression do not take this into account and are thus prone to producing inflated estimates. We propose a method, Multi-Mapping Bayesian Gene eXpression (MMBGX), which disaggregates the signal at 'multi-match' probes. When applied to Gene arrays, MMBGX removes the upward bias of gene-level expression estimates. When applied to Exon arrays, it can further disaggregate the signal between alternative transcripts of the same gene, providing expression estimates of individual splice variants. We demonstrate the performance of MMBGX on simulated data and a tissue mixture data set. We then show that MMBGX can estimate the expression of alternative isoforms within one experimental condition, confirming our results by RT-PCR. Finally, we show that our method for detecting differential splicing has a lower error rate than standard exonlevel approaches on a previously validated colon cancer data set.
AB - Affymetrix has recently developed whole-transcript GeneChips-'Gene' and 'Exon' arrays-which interrogate exons along the length of each gene. Although each probe on these arrays is intended to hybridize perfectly to only one transcriptional target, many probes match multiple transcripts located in different parts of the genome or alternative isoforms of the same gene. Existing statistical methods for estimating expression do not take this into account and are thus prone to producing inflated estimates. We propose a method, Multi-Mapping Bayesian Gene eXpression (MMBGX), which disaggregates the signal at 'multi-match' probes. When applied to Gene arrays, MMBGX removes the upward bias of gene-level expression estimates. When applied to Exon arrays, it can further disaggregate the signal between alternative transcripts of the same gene, providing expression estimates of individual splice variants. We demonstrate the performance of MMBGX on simulated data and a tissue mixture data set. We then show that MMBGX can estimate the expression of alternative isoforms within one experimental condition, confirming our results by RT-PCR. Finally, we show that our method for detecting differential splicing has a lower error rate than standard exonlevel approaches on a previously validated colon cancer data set.
UR - http://www.scopus.com/inward/record.url?scp=75649121097&partnerID=8YFLogxK
U2 - 10.1093/nar/gkp853
DO - 10.1093/nar/gkp853
M3 - Article
C2 - 19854940
AN - SCOPUS:75649121097
SN - 0305-1048
VL - 38
SP - e4-e4
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 1
M1 - gkp853
ER -