Evaluation of paired-end sequencing strategies for detection of genome rearrangements in cancer

Ali Bashir, Stanislav Volik, Colin Collins, Vineet Bafna, Benjamin J. Raphael

Research output: Contribution to journal › Article › peer-review

63 Scopus citations

Abstract

Paired-end sequencing is emerging as a key technique for assessing genome rearrangements and structural variation on a genome-wide scale. This technique is particularly useful for detecting copy-neutral rearrangements, such as inversions and translocations, which are common in cancer and can produce novel fusion genes. We address the question of how much sequencing is required to detect rearrangement breakpoints and to localize them precisely using both theoretical models and simulation. We derive a formula for the probability that a fusion gene exists in a cancer genome given a collection of paired-end sequences from this genome. We use this formula to compute fusion gene probabilities in several breast cancer samples, and we find that we are able to accurately predict fusion genes in these samples with a relatively small number of fragments of large size. We further demonstrate how the ability to detect fusion genes depends on the distribution of gene lengths, and we evaluate how different parameters of a sequencing strategy impact breakpoint detection, breakpoint localization, and fusion gene detection, even in the presence of errors that suggest false rearrangements. These results will be useful in calibrating future cancer sequencing efforts, particularly large-scale studies of many cancer genomes that are enabled by next-generation sequencing technologies.
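The question of how much sequencing is needed to detect a breakpoint can be illustrated with a rough coverage calculation. The sketch below is not the formula derived in the paper; it assumes a simplified Lander-Waterman-style model in which fragments of a fixed length are placed uniformly at random on the genome, so the number of fragments spanning a given breakpoint is approximately Poisson. The function names and the example genome and fragment sizes are hypothetical choices for illustration.

```python
import math


def breakpoint_detection_probability(num_fragments, fragment_length, genome_length):
    """Probability that at least one fragment spans a fixed breakpoint.

    Assumption (not from the paper): fragments of identical length are
    placed uniformly at random, so the count of fragments spanning the
    breakpoint is approximately Poisson with mean N * L / G.
    """
    clone_coverage = num_fragments * fragment_length / genome_length
    return 1.0 - math.exp(-clone_coverage)


def fragments_needed(target_probability, fragment_length, genome_length):
    """Smallest fragment count reaching the target detection probability
    under the same simplified model."""
    required_coverage = -math.log(1.0 - target_probability)
    return math.ceil(required_coverage * genome_length / fragment_length)


if __name__ == "__main__":
    genome = 3_000_000_000   # illustrative human-scale genome length (bp)
    fragment = 150_000       # illustrative large-insert fragment length (bp)
    n = fragments_needed(0.99, fragment, genome)
    p = breakpoint_detection_probability(n, fragment, genome)
    print(f"{n} fragments give detection probability {p:.4f}")
```

Under this model, detection probability depends only on the product of fragment count and fragment length, which is one way to see why a relatively small number of large fragments can suffice for detection. Localizing a breakpoint precisely is a separate question that this sketch does not address.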

Original language: English
Article number: e1000051
Journal: PLoS Computational Biology
Volume: 4
Issue number: 4
State: Published - Apr 2008
Externally published: Yes
