TY - JOUR
T1 - MiST
T2 - A new approach to variant detection in deep sequencing datasets
AU - Subramanian, Sailakshmi
AU - Di Pierro, Valentina
AU - Shah, Hardik
AU - Jayaprakash, Anitha D.
AU - Weisberger, Ian
AU - Shim, Jaehee
AU - George, Ajish
AU - Gelb, Bruce D.
AU - Sachidanandam, Ravi
N1 - Funding Information:
Seed fund from The Icahn School of Medicine at Mount Sinai (to R.S.). Partial funding from National Institutes of Health [U01 HL098123 to R.S. and B.D.G.]. Funding for open access charge: Pilot fund from the Children’s Environmental Health Center at The Icahn School of Medicine.
PY - 2013/9
Y1 - 2013/9
N2 - MiST is a novel approach to variant calling from deep sequencing data, using the inverted mapping approach developed for Geoseq. Reads that can map to a targeted exonic region are identified using exact matches to tiles from the region. The reads are then aligned to the targets to discover variants. MiST carefully handles paralogous reads that map ambiguously to the genome and clonal reads arising from PCR bias, which are the two major sources of errors in variant calling. The reduced computational complexity of mapping selected reads to targeted regions of the genome improves speed, specificity and sensitivity of variant detection. Compared with variant calls from the GATK platform, MiST showed better concordance with SNPs from dbSNP and genotypes determined by an exonic-SNP array. Variant calls made only by MiST confirm at a high rate (>90%) by Sanger sequencing. Thus, MiST is a valuable alternative tool to analyse variants in deep sequencing data.
AB - MiST is a novel approach to variant calling from deep sequencing data, using the inverted mapping approach developed for Geoseq. Reads that can map to a targeted exonic region are identified using exact matches to tiles from the region. The reads are then aligned to the targets to discover variants. MiST carefully handles paralogous reads that map ambiguously to the genome and clonal reads arising from PCR bias, which are the two major sources of errors in variant calling. The reduced computational complexity of mapping selected reads to targeted regions of the genome improves speed, specificity and sensitivity of variant detection. Compared with variant calls from the GATK platform, MiST showed better concordance with SNPs from dbSNP and genotypes determined by an exonic-SNP array. Variant calls made only by MiST confirm at a high rate (>90%) by Sanger sequencing. Thus, MiST is a valuable alternative tool to analyse variants in deep sequencing data.
UR - http://www.scopus.com/inward/record.url?scp=84886868191&partnerID=8YFLogxK
U2 - 10.1093/nar/gkt551
DO - 10.1093/nar/gkt551
M3 - Article
C2 - 23828039
AN - SCOPUS:84886868191
SN - 0305-1048
VL - 41
SP - e154
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 16
ER -