TY - GEN
T1 - Boosted PRIM with application to searching for oncogenic pathway of lung cancer
AU - Wang, Pei
AU - Kim, Young
AU - Pollack, Jonathan
AU - Tibshirani, Robert
PY - 2004
Y1 - 2004
N2 - Boosted PRIM (Patient Rule Induction Method) is a new algorithm developed for two-class classification problems. PRIM is a variation of those Tree-Based methods (Ch9.3), seeking box-shaped regions in the feature space to separate different classes. Boosted PRIM is to implement PRIM-styled weak learners in Adaboost, one of the most popular boosting algorithms. In addition, we improve the performance of the algorithm by introducing a regularization to the boosting process, which supports the perspective of viewing boosting as a steepest-descent numerical optimization by Jerry Friedman. The motivation for Boosted PRIM is to solve the problem of "searching for oncogenic pathways" based on array-CGH (Comparative Genomic Hybridization) data, though the algorithm itself is suitable for general classification problems. We illustrate the performance of the method through some simulation studies as well as an application on a lung cancer array-CGH data set.
AB - Boosted PRIM (Patient Rule Induction Method) is a new algorithm developed for two-class classification problems. PRIM is a variation of those Tree-Based methods (Ch9.3), seeking box-shaped regions in the feature space to separate different classes. Boosted PRIM is to implement PRIM-styled weak learners in Adaboost, one of the most popular boosting algorithms. In addition, we improve the performance of the algorithm by introducing a regularization to the boosting process, which supports the perspective of viewing boosting as a steepest-descent numerical optimization by Jerry Friedman. The motivation for Boosted PRIM is to solve the problem of "searching for oncogenic pathways" based on array-CGH (Comparative Genomic Hybridization) data, though the algorithm itself is suitable for general classification problems. We illustrate the performance of the method through some simulation studies as well as an application on a lung cancer array-CGH data set.
UR - http://www.scopus.com/inward/record.url?scp=14044260603&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:14044260603
SN - 0769521940
SN - 9780769521947
T3 - Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004
SP - 604
EP - 609
BT - Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004
T2 - Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004
Y2 - 16 August 2004 through 19 August 2004
ER -