TY - GEN
T1 - Normalization regarding non-random missing values in high-throughput mass spectrometry data
AU - Wang, Pei
AU - Tang, Hua
AU - Zhang, Heidi
AU - Whiteaker, Jeffrey
AU - Paulovich, Amanda G.
AU - Mcintosh, Martin
PY - 2006
Y1 - 2006
N2 - We propose a two-step normalization procedure for high-throughput mass spectrometry (MS) data, which is a necessary step in biomarker clustering or classification. First, a global normalization step is used to remove sources of systematic variation between MS profiles due to, for instance, varying amounts of sample degradation over time. A probability model is then used to investigate the intensity-dependent missing events and provides possible substitutions for the missing values. We illustrate the performance of the method with a LC-MS data set of synthetic protein mixtures.
AB - We propose a two-step normalization procedure for high-throughput mass spectrometry (MS) data, which is a necessary step in biomarker clustering or classification. First, a global normalization step is used to remove sources of systematic variation between MS profiles due to, for instance, varying amounts of sample degradation over time. A probability model is then used to investigate the intensity-dependent missing events and provides possible substitutions for the missing values. We illustrate the performance of the method with a LC-MS data set of synthetic protein mixtures.
UR - http://www.scopus.com/inward/record.url?scp=33747838164&partnerID=8YFLogxK
M3 - Conference contribution
C2 - 17094249
AN - SCOPUS:33747838164
SN - 9812564632
SN - 9789812564634
T3 - Proceedings of the Pacific Symposium on Biocomputing 2006, PSB 2006
SP - 315
EP - 326
BT - Proceedings of the Pacific Symposium on Biocomputing 2006, PSB 2006
T2 - 11th Pacific Symposium on Biocomputing 2006, PSB 2006
Y2 - 3 January 2006 through 7 January 2006
ER -