ClipSV: Improving structural variation detection by read extension, spliced alignment and tree-based decision rules

  • Peng Xu
  • , Yu Chen
  • , Min Gao
  • , Zechen Chong

Research output: Contribution to journalArticlepeer-review

Abstract

Structural variation (SV), which consists of genomic variation from 50 to millions of base pairs, confers considerable impacts on human diseases, complex traits and evolution. Accurately detecting SV is a fundamental step to characterize the features of individual genomes. Currently, several methods have been proposed to detect SVs using the next-generation sequencing (NGS) platform. However, due to the short length of sequencing reads and the complexity of SV content, the SV-detecting tools are still limited by low sensitivity, especially for insertion detection. In this study, we developed a novel tool, ClipSV, to improve SV discovery. ClipSV utilizes a read extension and spliced alignment approach to overcoming the limitation of read length. By reconstructing long sequences from SV-associated short reads, ClipSV discovers deletions and short insertions from the long sequence alignments. To comprehensively characterize insertions, ClipSV implements tree-based decision rules that can efficiently utilize SV-containing reads. Based on the evaluations of both simulated and real sequencing data, ClipSV exhibited an overall better performance compared to currently popular tools, especially for insertion detection. As NGS platform represents the mainstream sequencing capacity for routine genomic applications, we anticipate ClipSV will serve as an important tool for SV characterization in future genomic studies.

Original languageEnglish
JournalNAR Genomics and Bioinformatics
Volume3
Issue number1
DOIs
StatePublished - 1 Mar 2021
Externally publishedYes

Fingerprint

Dive into the research topics of 'ClipSV: Improving structural variation detection by read extension, spliced alignment and tree-based decision rules'. Together they form a unique fingerprint.

Cite this