Systematic comparison of machine learning methods for identification of miRNA species as disease biomarkers

Chihiro Higuchi, Toshihiro Tanaka, Yukinori Okada

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Micro RNA (miRNA) plays important roles in a variety of biological processes and can act as disease biomarkers. Thus, establishment of discovery methods to detect disease-related miRNAs is warranted. Human omics data including miRNA expression profiles have orders of magnitude with much more number of descriptors (p) than that of samples (n), which is so called “p >> n problem”. Since traditional statistical methods mislead to localized solutions, application of machine learning (ML) methods that handle sparse selection of the variables are expected to solve this problem. Among many ML methods, least absolute shrinkage and selection operator (LASSO) and multivariate adaptive regression splines (MARS) give a few variables from the result of supervised learning with endpoints such as human disease statuses. Here, we performed systematic comparison of LASSO and MARS to discover biomarkers, using six miRNA expression data sets of human disease samples, which were obtained from NCBI Gene Expression Omnibus (GEO). We additionally conducted partial least square method discriminant analysis (PLS-DA), as a control traditional method to evaluate baseline performance of discriminant methods. We observed that LASSO and MARS showed relatively higher performance compared to that of PLS-DA, as the number of the samples increases. Also, some of the identified miRNA species by ML methods have already been reported as candidate disease biomarkers in the previous biological studies. These findings should contribute to the extension of our knowledge on ML method performances in empirical utilization of clinical data.

Original languageEnglish
Title of host publicationBioinformatics and Biomedical Engineering - 3rd International Conference, IWBBIO 2015, Proceedings
EditorsFrancisco Ortuño, Ignacio Rojas
PublisherSpringer Verlag
Pages386-394
Number of pages9
ISBN (Electronic)9783319164793
DOIs
StatePublished - 2015
Externally publishedYes
Event3rd International Work-Conference on Bioinformatics and Biomedical Engineering, IWBBIO 2015 - Granada, Spain
Duration: 15 Apr 201517 Apr 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9044
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference3rd International Work-Conference on Bioinformatics and Biomedical Engineering, IWBBIO 2015
Country/TerritorySpain
CityGranada
Period15/04/1517/04/15

Keywords

  • LASSO
  • Least absolute shrinkage and selection operator
  • MARS
  • Machine learning
  • MiRNA
  • Micro RNA
  • Multivariate adaptive regression splines

Fingerprint

Dive into the research topics of 'Systematic comparison of machine learning methods for identification of miRNA species as disease biomarkers'. Together they form a unique fingerprint.

Cite this