A new distance measure for comparing sequence profiles based on path lengths along an entropy surface

Gary Benson

Research output: Contribution to journalArticlepeer-review

15 Scopus citations

Abstract

We describe a new distance measure for comparing DNA sequence profiles. For this measure, columns in a multiple alignment are treated as character frequency vectors (sum of the frequencies equal to one). The distance between two vectors is based on minimum path length along an entropy surface. Path length is estimated using a random graph generated on the entropy surface and Dijkstra's algorithm for all shortest paths to a source. We use the new distance measure to analyze similarities within familes of tandem repeats in the C. elegans genome and show that this new measure gives more accurate refinement of family relationships than a method based on comparing consensus sequences.

Original languageEnglish
Pages (from-to)S44-S53
JournalBioinformatics
Volume18
Issue numberSUPPL. 2
DOIs
StatePublished - 2002

Fingerprint

Dive into the research topics of 'A new distance measure for comparing sequence profiles based on path lengths along an entropy surface'. Together they form a unique fingerprint.

Cite this