TY - JOUR
T1 - Prediction of antitubercular peptides from sequence information using ensemble classifier and hybrid features
AU - Usmani, Salman Sadullah
AU - Bhalla, Sherry
AU - Raghava, Gajendra P.S.
N1 - Publisher Copyright:
© 2018 Usmani, Bhalla and Raghava.
PY - 2018/8/28
Y1 - 2018/8/28
N2 - Tuberculosis is one of the leading cause of death worldwide, particularly due to evolution of drug resistant strains. Antitubercular peptides may provide an alternate approach to combat antibiotic tolerance. Sequence analysis reveals that certain residues (e.g., Lysine, Arginine, Leucine, Tryptophan) are more prevalent in antitubercular peptides. This study describes the models developed for predicting antitubercular peptides by using sequence features of the peptides. We have developed support vector machine based models using different sequence features like amino acid composition, binary profile of terminus residues, dipeptide composition. Our ensemble classifiers that combines models based on amino acid composition and N5C5 binary pattern, achieves highest Acc of 73.20% with 0.80 AUROC on our main dataset. Similarly, the ensemble classifier achieved maximum Acc 75.62% with 0.83 AUROC on secondary dataset. Beside this, hybrid model achieves Acc of 75.87 and 78.54% with 0.83 and 0.86 AUROC on main and secondary dataset, respectively. In order to facilitate scientific community in designing of antitubercular peptides, we implement above models in a user friendly webserver (http://webs.iiitd.edu.in/raghava/antitbpred/).
AB - Tuberculosis is one of the leading cause of death worldwide, particularly due to evolution of drug resistant strains. Antitubercular peptides may provide an alternate approach to combat antibiotic tolerance. Sequence analysis reveals that certain residues (e.g., Lysine, Arginine, Leucine, Tryptophan) are more prevalent in antitubercular peptides. This study describes the models developed for predicting antitubercular peptides by using sequence features of the peptides. We have developed support vector machine based models using different sequence features like amino acid composition, binary profile of terminus residues, dipeptide composition. Our ensemble classifiers that combines models based on amino acid composition and N5C5 binary pattern, achieves highest Acc of 73.20% with 0.80 AUROC on our main dataset. Similarly, the ensemble classifier achieved maximum Acc 75.62% with 0.83 AUROC on secondary dataset. Beside this, hybrid model achieves Acc of 75.87 and 78.54% with 0.83 and 0.86 AUROC on main and secondary dataset, respectively. In order to facilitate scientific community in designing of antitubercular peptides, we implement above models in a user friendly webserver (http://webs.iiitd.edu.in/raghava/antitbpred/).
KW - Antimycobacterial therapy
KW - Antitubercular peptides
KW - Drug discovery
KW - Ensemble classifier
KW - Machine learning
KW - Mycobacterium
KW - Tuberculosis
UR - http://www.scopus.com/inward/record.url?scp=85052836537&partnerID=8YFLogxK
U2 - 10.3389/fphar.2018.00954
DO - 10.3389/fphar.2018.00954
M3 - Article
AN - SCOPUS:85052836537
SN - 1663-9812
VL - 9
JO - Frontiers in Pharmacology
JF - Frontiers in Pharmacology
IS - AUG
M1 - 954
ER -