PURE: a webserver for the prediction of domains in unassigned regions in proteins

Reddy, Chilamakuri C. S.; Shameer, Khader; Offmann, Bernard O.; Sowdhamini, Ramanathan (2008) PURE: a webserver for the prediction of domains in unassigned regions in proteins. BMC Bioinformatics, 9. 281_1-281_8. ISSN 1471-2105

Official URL: http://www.biomedcentral.com/1471-2105/9/281

Related URL: http://dx.doi.org/10.1186/1471-2105-9-281

Abstract

Background: Protein domains are the structural and functional units of proteins. The ability to parse proteins into domains is important for effective classification and for understanding protein structure, function, and evolution, and is hence biologically relevant. Several computational methods are available to identify domains in a sequence. Domain-finding algorithms often employ stringent thresholds to recognize sequence domains. Identifying additional domains can be tedious, involving intense computation and manual intervention, but can lead to a better understanding of overall biological function. In this context, the problem of identifying new domains in the unassigned regions of a protein sequence assumes crucial importance.

Results: We had earlier demonstrated that accumulating domain information from sequence homologues can substantially aid the prediction of new domains. In this paper, we propose a computationally intensive, multi-step bioinformatics protocol, implemented as a web server named PURE (Prediction of Unassigned REgions in proteins), for the detailed examination of stretches of unassigned regions in proteins. The query sequence is processed using automated filtering steps based on length, presence of coiled-coil regions, transmembrane regions, homologous sequences, and percentage of secondary structure content. The filtered sequence segments and their sequence homologues are then fed to PSI-BLAST, cd-hit, and Hmmpfam. Data from the various programs are integrated, and information on the probable domains predicted from the sequence is reported.

Conclusion: We have implemented the PURE protocol as a web server for rapid and comprehensive analysis of unassigned regions in proteins. The server integrates data from different programs and provides information about the domains encoded in the unassigned regions.
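
The first filtering step mentioned in the abstract (selecting unassigned stretches by length) can be illustrated with a minimal sketch. This is not the PURE implementation: the function name `unassigned_regions`, the 1-based inclusive coordinate convention, and the default `min_len` of 30 residues are all assumptions made for illustration, since the abstract does not state the actual cut-offs.

```python
from typing import List, Tuple

# A region is (start, end), 1-based and inclusive. This convention is an
# assumption for the sketch, not something stated in the abstract.
Region = Tuple[int, int]


def unassigned_regions(
    seq_len: int,
    domains: List[Region],
    min_len: int = 30,  # hypothetical length threshold, not PURE's real value
) -> List[Region]:
    """Return stretches not covered by any assigned domain, keeping only
    those at least `min_len` residues long."""
    gaps: List[Region] = []
    cursor = 1  # first residue not yet covered by a domain seen so far
    for start, end in sorted(domains):
        if start > cursor:
            gaps.append((cursor, start - 1))
        # Overlapping or nested domains must not move the cursor backwards.
        cursor = max(cursor, end + 1)
    if cursor <= seq_len:
        gaps.append((cursor, seq_len))
    return [(s, e) for s, e in gaps if e - s + 1 >= min_len]


if __name__ == "__main__":
    # Two overlapping domain assignments on a 300-residue sequence.
    print(unassigned_regions(300, [(50, 120), (100, 180)]))
```

In a pipeline of the kind described, stretches that pass this length filter would then go through the remaining filters (coiled-coil, transmembrane, homologue, and secondary structure checks) before the domain searches.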

Item Type: Article
Source: Copyright of this article belongs to BioMed Central.
ID Code: 61222
Deposited On: 15 Sep 2011 04:00
Last Modified: 18 May 2016 11:01
