Reddy, Chilamakuri C. S.; Shameer, Khader; Offmann, Bernard O.; Sowdhamini, Ramanathan (2008) PURE: a webserver for the prediction of domains in unassigned regions in proteins BMC Bioinformatics, 9. 281_1-281_8. ISSN 1471-2105
Official URL: http://www.biomedcentral.com/1471-2105/9/281
Related URL: http://dx.doi.org/10.1186/1471-2105-9-281
Abstract
Background: Protein domains are the structural and functional units of proteins. The ability to parse proteins into domains is important for effective classification and for understanding protein structure, function, and evolution, and is hence biologically relevant. Several computational methods are available to identify domains in a sequence. Domain-finding algorithms often employ stringent thresholds to recognize sequence domains. Identifying additional domains can be tedious, involving intense computation and manual intervention, but can lead to a better understanding of overall biological function. In this context, the problem of identifying new domains in the unassigned regions of a protein sequence assumes crucial importance.

Results: We previously demonstrated that accumulating domain information from sequence homologues can substantially aid the prediction of new domains. In this paper, we propose a computationally intensive, multi-step bioinformatics protocol, implemented as a web server named PURE (Prediction of Unassigned REgions in proteins), for the detailed examination of stretches of unassigned regions in proteins. The query sequence is processed through automated filtering steps based on length, presence of coiled-coil regions, transmembrane regions, homologous sequences, and percentage of secondary structure content. The filtered sequence segments and their sequence homologues are then fed to PSI-BLAST, cd-hit, and Hmmpfam. Data from the various programs are integrated, and information about the probable domains predicted from the sequence is reported.

Conclusion: We have implemented the PURE protocol as a web server for rapid and comprehensive analysis of unassigned regions in proteins. The server integrates data from different programs and provides information about the domains encoded in the unassigned regions.
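The abstract mentions a length-based filter on unassigned stretches but gives no implementation details. The following is a minimal sketch of what such a step might look like, not the PURE server's code: the function name `unassigned_regions`, the 1-based inclusive coordinates, and the 30-residue default cutoff are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical representation: 1-based, inclusive residue positions.
Interval = Tuple[int, int]


def unassigned_regions(
    seq_len: int,
    domains: List[Interval],
    min_len: int = 30,  # assumed cutoff; the abstract does not state one
) -> List[Interval]:
    """Return stretches of the sequence not covered by any assigned domain,
    keeping only those at least `min_len` residues long."""
    regions: List[Interval] = []
    next_free = 1  # first residue not yet covered by a domain
    for start, end in sorted(domains):
        if start > next_free:
            # Gap between the previous domain (or N-terminus) and this one.
            regions.append((next_free, start - 1))
        # Advance past this domain; max() handles overlapping assignments.
        next_free = max(next_free, end + 1)
    if next_free <= seq_len:
        # Trailing gap up to the C-terminus.
        regions.append((next_free, seq_len))
    return [(s, e) for s, e in regions if e - s + 1 >= min_len]


if __name__ == "__main__":
    # A 300-residue protein with two overlapping domain assignments.
    print(unassigned_regions(300, [(50, 120), (100, 180)]))
```

Segments that pass this kind of filter would then go on to the later checks the abstract lists (coiled-coil, transmembrane, homologue, and secondary structure content), which are not sketched here.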
| Item Type: | Article |
|---|---|
| Source: | Copyright of this article belongs to BioMed Central. |
| ID Code: | 61222 |
| Deposited On: | 15 Sep 2011 04:00 |
| Last Modified: | 18 May 2016 11:01 |