TY - JOUR
T1 - IMDAILM
T2 - Inferring miRNA-Disease Association by Integrating lncRNA and miRNA Data
AU - Yao, Yuhua
AU - Ji, Binbin
AU - Shi, Sihong
AU - Xu, Junlin
AU - Xiao, Xiaofang
AU - Yu, Enchao
AU - Liao, Bo
AU - Yang, Jialiang
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2020
Y1 - 2020
N2 - In recent years, more and more studies have shown that microRNAs (miRNAs) play a key role in many important biological processes. Dysregulation of miRNAs can lead to a variety of diseases like cancers, thus predicting potential miRNA-disease associations is important for understanding drug development and disease pathogenesis, diagnosis and treatment. It is known that experimental methods to validate miRNA-disease associations typically involve miRNA knockout or knockdown, which is time and labor-intensive. As a result, computational models have been developed to predict unknown miRNA-disease associations from available information related to miRNAs, diseases, genes, and so on. However, their performances are yet to be improved. Noticing that appropriately combining multiple data-source is usually helpful for improving prediction accuracy, we have developed IMDAILM: Inferring miRNA-Disease Association by integrating lncRNA and miRNA data, a low-rank matrix completion model integrating miRNA, long noncoding RNA (lncRNA) and disease information to predict miRNA-disease associations. Specifically, the miRNA-disease association network and the lncRNA-disease association network are fused to form a new heterogeneous network consisting of 3 types of nodes representing miRNAs, lncRNAs and diseases. In addition, a negative sample inference method was proposed to infer unrelated miRNA-disease pairs. Based on both heterogeneous network and negative samples, a low-rank matrix completion model is proposed and solved. In practice, IMDAILM achieved an area under the curve (AUC) of 0.8884 for predicting miRNAs associated with diseases under the 5-fold cross-validation (CV), outperforming a few recent methods. IMDAILM also yielded an AUC of 0.8870 for predicting both lncRNAs and miRNAs associated with diseases. In addition, the 5-fold CV results indicate that IMDAILM is also superior to other methods in predicting miRNAs associated with isolated diseases. Finally, we confirmed a few novel predicted miRNAs associated with specific diseases like lung cancers by literature mining. In summary, the integration of lncRNA information into a matrix completion framework contributes to the prediction of miRNA-disease associations.
AB - In recent years, more and more studies have shown that microRNAs (miRNAs) play a key role in many important biological processes. Dysregulation of miRNAs can lead to a variety of diseases like cancers, thus predicting potential miRNA-disease associations is important for understanding drug development and disease pathogenesis, diagnosis and treatment. It is known that experimental methods to validate miRNA-disease associations typically involve miRNA knockout or knockdown, which is time and labor-intensive. As a result, computational models have been developed to predict unknown miRNA-disease associations from available information related to miRNAs, diseases, genes, and so on. However, their performances are yet to be improved. Noticing that appropriately combining multiple data-source is usually helpful for improving prediction accuracy, we have developed IMDAILM: Inferring miRNA-Disease Association by integrating lncRNA and miRNA data, a low-rank matrix completion model integrating miRNA, long noncoding RNA (lncRNA) and disease information to predict miRNA-disease associations. Specifically, the miRNA-disease association network and the lncRNA-disease association network are fused to form a new heterogeneous network consisting of 3 types of nodes representing miRNAs, lncRNAs and diseases. In addition, a negative sample inference method was proposed to infer unrelated miRNA-disease pairs. Based on both heterogeneous network and negative samples, a low-rank matrix completion model is proposed and solved. In practice, IMDAILM achieved an area under the curve (AUC) of 0.8884 for predicting miRNAs associated with diseases under the 5-fold cross-validation (CV), outperforming a few recent methods. IMDAILM also yielded an AUC of 0.8870 for predicting both lncRNAs and miRNAs associated with diseases. In addition, the 5-fold CV results indicate that IMDAILM is also superior to other methods in predicting miRNAs associated with isolated diseases. Finally, we confirmed a few novel predicted miRNAs associated with specific diseases like lung cancers by literature mining. In summary, the integration of lncRNA information into a matrix completion framework contributes to the prediction of miRNA-disease associations.
KW - MiRNA
KW - alternating gradient descent method
KW - lncRNA
KW - lncRNA-miRNA association
KW - low-rank matrix completion
KW - miRNA-disease association
UR - http://www.scopus.com/inward/record.url?scp=85079771840&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2019.2958055
DO - 10.1109/ACCESS.2019.2958055
M3 - Article
AN - SCOPUS:85079771840
SN - 2169-3536
VL - 8
SP - 16517
EP - 16527
JO - IEEE Access
JF - IEEE Access
M1 - 8926470
ER -