loading page

Structure-sequence features based prediction of phosphosites of Serine/Threonine Protein Kinases of Mycobacterium tuberculosis
  • Vipul Nilkanth,
  • Shekhar Mande
Vipul Nilkanth
National Centre for Cell Science

Corresponding Author:vipul.nilkanth@yahoo.com

Author Profile
Shekhar Mande
National Centre for Cell Science
Author Profile

Abstract

Elucidation of signalling events in a pathogen is potentially important to tackle the infection caused by it. Such events mediated by protein phosphorylation play important roles in infection and therefore to predict the phosphosites and substrates of the serine/threonine protein kinases, we have developed a Machine learning based approach and predicted the phosphosites for Mycobacterium tuberculosis serine/threonine protein kinases using kinase-peptide structure-sequence data. This approach utilizes features derived from kinase 3D-structure environment and known phosphosite sequences to generate Support Vector Machine based kinase specific predictions of phosphosites making it suitable for prediction of phosphosites of STPKs with no or scarce data of their phosphosites. Support vector machine outperformed the four machine learning algorithms we tried (random forest, logistic regression, support vector machine and k-nearest neighbours) with aucROC value of 0.88 on the independent testing dataset and a ten-fold cross validation accuracy of ~81.6% for the final model. Our predicted phosphosites of M. tuberculosis STPKs form an useful resource for experimental biologists enabling elucidation of STPK mediated post-translational regulation of important cellular processes. The training features file and model files, together with usage instructions file, are available at: https://github.com/vipulbiocoder/Mtb-KSPP
04 Mar 2021Submitted to PROTEINS: Structure, Function, and Bioinformatics
04 Mar 2021Submission Checks Completed
04 Mar 2021Assigned to Editor
01 Apr 2021Reviewer(s) Assigned
25 Apr 2021Review(s) Completed, Editorial Evaluation Pending
06 May 2021Editorial Decision: Revise Major
04 Jul 20211st Revision Received
12 Jul 2021Submission Checks Completed
12 Jul 2021Assigned to Editor
12 Jul 2021Reviewer(s) Assigned
12 Jul 2021Review(s) Completed, Editorial Evaluation Pending
12 Jul 2021Editorial Decision: Accept
26 Aug 2021Published in Proteins: Structure, Function, and Bioinformatics. 10.1002/prot.26195