Prediction of RNA binding sites in a protein using SVM and PSSM profile

Kumar, Manish ; Gromiha, Michael M. ; Raghava, G. P. S. (2008) Prediction of RNA binding sites in a protein using SVM and PSSM profile Proteins: Structure, Function, and Bioinformatics, 71 (1). pp. 189-194. ISSN 0887-3585

[img]
Preview
PDF - Publisher Version
168kB

Official URL: http://onlinelibrary.wiley.com/doi/10.1002/prot.21...

Related URL: http://dx.doi.org/10.1002/prot.21677

Abstract

RNA-binding proteins (RBPs) play key roles in post-transcriptional control of gene expression, which, along with transcriptional regulation, is a major way to regulate patterns of gene expression during development. Thus, the identification and prediction of RNA binding sites is an important step in comprehensive understanding of how RBPs control organism development. Combining evolutionary information and support vector machine (SVM), we have developed an improved method for predicting RNA binding sites or RNA interacting residues in a protein sequence. The prediction models developed in this study have been trained and tested on 86 RNA binding protein chains and evaluated using fivefold cross validation technique. First, a SVM model was developed that achieved a maximum Matthew's correlation coefficient (MCC) of 0.31. The performance of this SVM model further improved the MCC from 0.31 to 0.45, when multiple sequence alignment in the form of PSSM profiles was used as input to the SVM, which is far better than the maximum MCC achieved by previous methods (0.41) on the same dataset. In addition, SVM models were also developed on an alternative dataset that contained 107 RBP chains. Utilizing PSSM as input information to the SVM, the training/testing on this alternate dataset achieved a maximum MCC of 0.32. Conclusively, the prediction performance of SVM models developed in this study is better than the existing methods on the same datasets. A web server 'Pprint' was also developed for predicting RNA binding residues in a protein sequence which is freely available at http://www.imtech.res.in/raghava/pprint/.

Item Type:Article
Source:Copyright of this article belongs to John Wiley and Sons.
Keywords:Evolutionary Information; Interacting Residue; Protein; RNA; SVM
ID Code:37640
Deposited On:25 Apr 2011 13:07
Last Modified:17 May 2016 20:31

Repository Staff Only: item control page