TY - JOUR
T1 - Synthetic heparan sulfate standards and machine learning facilitate the development of solid-state nanopore analysis
AU - Xia, Ke
AU - Hagan, James T.
AU - Fu, Li
AU - Sheetz, Brian S.
AU - Bhattacharya, Somdatta
AU - Zhang, Fuming
AU - Dwyer, Jason R.
AU - Linhardt, Robert J.
N1 - Publisher Copyright:
© 2021 National Academy of Sciences. All rights reserved.
PY - 2021/3/16
Y1 - 2021/3/16
N2 - The application of solid-state (SS) nanopore devices to singlemolecule nucleic acid sequencing has been challenging. Thus, the early successes in applying SS nanopore devices to the more difficult class of biopolymer, glycosaminoglycans (GAGs), have been surprising, motivating us to examine the potential use of an SS nanopore to analyze synthetic heparan sulfate GAG chains of controlled composition and sequence prepared through a promising, recently developed chemoenzymatic route. A minimal representation of the nanopore data, using only signal magnitude and duration, revealed, by eye and image recognition algorithms, clear differences between the signals generated by four synthetic GAGs. By subsequent machine learning, it was possible to determine disaccharide and even monosaccharide composition of these four synthetic GAGs using as few as 500 events, corresponding to a zeptomole of sample. These data suggest that ultrasensitive GAG analysis may be possible using SS nanopore detection and well-characterized molecular training sets.
AB - The application of solid-state (SS) nanopore devices to singlemolecule nucleic acid sequencing has been challenging. Thus, the early successes in applying SS nanopore devices to the more difficult class of biopolymer, glycosaminoglycans (GAGs), have been surprising, motivating us to examine the potential use of an SS nanopore to analyze synthetic heparan sulfate GAG chains of controlled composition and sequence prepared through a promising, recently developed chemoenzymatic route. A minimal representation of the nanopore data, using only signal magnitude and duration, revealed, by eye and image recognition algorithms, clear differences between the signals generated by four synthetic GAGs. By subsequent machine learning, it was possible to determine disaccharide and even monosaccharide composition of these four synthetic GAGs using as few as 500 events, corresponding to a zeptomole of sample. These data suggest that ultrasensitive GAG analysis may be possible using SS nanopore detection and well-characterized molecular training sets.
KW - Glycosaminoglycan
KW - Polysaccharide
KW - Sequencing
KW - Single-molecule analysis
KW - Solid-state nanopore
UR - http://www.scopus.com/inward/record.url?scp=85102339130&partnerID=8YFLogxK
U2 - 10.1073/pnas.2022806118
DO - 10.1073/pnas.2022806118
M3 - Article
C2 - 33688052
AN - SCOPUS:85102339130
SN - 0027-8424
VL - 118
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 11
M1 - e2022806118
ER -