Classification of nuclear receptors based on amino acid composition and dipeptide composition

Bhasin, Manoj ; Raghava, Gajendra P. S. (2004) Classification of nuclear receptors based on amino acid composition and dipeptide composition Journal of Biological Chemistry, 279 (22). pp. 23262-23266. ISSN 0021-9258

[img]
Preview
PDF - Publisher Version
114kB

Official URL: http://www.jbc.org/content/279/22/23262.short

Related URL: http://dx.doi.org/10.1074/jbc.M401932200

Abstract

Nuclear receptors are key transcription factors that regulate crucial gene networks responsible for cell growth, differentiation, and homeostasis. Nuclear receptors form a superfamily of phylogenetically related proteins and control functions associated with major diseases (e.g. diabetes, osteoporosis, and cancer). In this study, a novel method has been developed for classifying the subfamilies of nuclear receptors. The classification was achieved on the basis of amino acid and dipeptide composition from a sequence of receptors using support vector machines. The training and testing was done on a non-redundant data set of 282 proteins obtained from the NucleaRDB data base (1). The performance of all classifiers was evaluated using a 5-fold cross validation test. In the 5-fold cross-validation, the data set was randomly partitioned into five equal sets and evaluated five times on each distinct set while keeping the remaining four sets for training. It was found that different subfamilies of nuclear receptors were quite closely correlated in terms of amino acid composition as well as dipeptide composition. The overall accuracy of amino acid composition-based and dipeptide compositionbased classifiers were 82.6 and 97.5%, respectively. Therefore, our results prove that different subfamilies of nuclear receptors are predictable with considerable accuracy using amino acid or dipeptide composition. Furthermore, based on above approach, an online web service, NRpred, was developed, which is available at www.imtech.res.in/raghava/nrpred.

Item Type:Article
Source:Copyright of this article belongs to The American Society for Biochemistry and Molecular Biology.
ID Code:43046
Deposited On:09 Jun 2011 10:57
Last Modified:18 May 2016 00:09

Repository Staff Only: item control page