TY - GEN
T1 - Bias of phenotype similarity scores between diseases
AU - Wang, Jing
AU - Zhou, Xianxiao
AU - Zhu, Jing
AU - Guo, Zheng
PY - 2010
Y1 - 2010
N2 - Since diseases might be related with each other, systematically assessing their relationships could provide us novel insight into their mechanisms. One of the most important methods to study diseases' relationships is to calculate their phenotype similarity scores based on the text and clinical synopsis parts of their records in the OMIM database. However, as demonstrated in this paper, the similarity score between two diseases is highly dependent on the numbers of medical terms in the records describing the diseases (termed as record size). Because the descriptions of some diseases tend to be more detailed due to research biases, the similarity scores between these diseases tend to be larger. Thus, applications based on this phenotype similarity measure are problematic. In this paper, we also discuss some reasonable approaches to study the relationships between diseases, which may avoid the biased applications of disease similarity scores.
AB - Since diseases might be related with each other, systematically assessing their relationships could provide us novel insight into their mechanisms. One of the most important methods to study diseases' relationships is to calculate their phenotype similarity scores based on the text and clinical synopsis parts of their records in the OMIM database. However, as demonstrated in this paper, the similarity score between two diseases is highly dependent on the numbers of medical terms in the records describing the diseases (termed as record size). Because the descriptions of some diseases tend to be more detailed due to research biases, the similarity scores between these diseases tend to be larger. Thus, applications based on this phenotype similarity measure are problematic. In this paper, we also discuss some reasonable approaches to study the relationships between diseases, which may avoid the biased applications of disease similarity scores.
KW - Bias
KW - Disease
KW - Phenotype similarity
UR - https://www.scopus.com/pages/publications/77956164675
U2 - 10.1109/ICBBE.2010.5515892
DO - 10.1109/ICBBE.2010.5515892
M3 - Conference contribution
AN - SCOPUS:77956164675
SN - 9781424447138
T3 - 2010 4th International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2010
BT - 2010 4th International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2010
T2 - 4th International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2010
Y2 - 18 June 2010 through 20 June 2010
ER -