Bias of phenotype similarity scores between diseases

  • Jing Wang
  • , Xianxiao Zhou
  • , Jing Zhu
  • , Zheng Guo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Since diseases might be related with each other, systematically assessing their relationships could provide us novel insight into their mechanisms. One of the most important methods to study diseases' relationships is to calculate their phenotype similarity scores based on the text and clinical synopsis parts of their records in the OMIM database. However, as demonstrated in this paper, the similarity score between two diseases is highly dependent on the numbers of medical terms in the records describing the diseases (termed as record size). Because the descriptions of some diseases tend to be more detailed due to research biases, the similarity scores between these diseases tend to be larger. Thus, applications based on this phenotype similarity measure are problematic. In this paper, we also discuss some reasonable approaches to study the relationships between diseases, which may avoid the biased applications of disease similarity scores.

Original languageEnglish
Title of host publication2010 4th International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2010
DOIs
StatePublished - 2010
Externally publishedYes
Event4th International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2010 - Chengdu, China
Duration: 18 Jun 201020 Jun 2010

Publication series

Name2010 4th International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2010

Conference

Conference4th International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2010
Country/TerritoryChina
CityChengdu
Period18/06/1020/06/10

Keywords

  • Bias
  • Disease
  • Phenotype similarity

Fingerprint

Dive into the research topics of 'Bias of phenotype similarity scores between diseases'. Together they form a unique fingerprint.

Cite this