Protein structure mining using a structural alphabet

Tyagi, M. ; Brevern, A. G. de. ; Srinivasan, N. ; Offmann, B. (2008) Protein structure mining using a structural alphabet Proteins: Structure, Function, and Bioinformatics, 71 (2). pp. 920-937. ISSN 0887-3585

Full text not available from this repository.

Official URL: http://onlinelibrary.wiley.com/doi/10.1002/prot.21...

Related URL: http://dx.doi.org/10.1002/prot.21776

Abstract

We present a comprehensive evaluation of a new structure mining method called PB-ALIGN. It is based on the encoding of protein structure as 1D sequence of a combination of 16 short structural motifs or protein blocks (PBs). PBs are short motifs capable of representing most of the local structural features of a protein backbone. Using derived PB substitution matrix and simple dynamic programming algorithm, PB sequences are aligned the same way amino acid sequences to yield structure alignment. PBs are short motifs capable of representing most of the local structural features of a protein backbone. Alignment of these local features as sequence of symbols enables fast detection of structural similarities between two proteins. Ability of the method to characterize and align regions beyond regular secondary structures, for example, N and C caps of helix and loops connecting regular structures, puts it a step ahead of existing methods, which strongly rely on secondary structure elements. PB-ALIGN achieved efficiency of 85% in extracting true fold from a large database of 7259 SCOP domains and was successful in 82% cases to identify true super-family members. On comparison to 13 existing structure comparison/mining methods, PB-ALIGN emerged as the best on general ability test dataset and was at par with methods like YAKUSA and CE on nontrivial test dataset. Furthermore, the proposed method performed well when compared to flexible structure alignment method like FATCAT and outperforms in processing speed (less than 45 s per database scan). This work also establishes a reliable cut-off value for the demarcation of similar folds. It finally shows that global alignment scores of unrelated structures using PBs follow an extreme value distribution.

Item Type:Article
Source:Copyright of this article belongs to John Wiley and Sons.
Keywords:Substitution Matrix; Protein Blocks; Local Protein Structure; Structure Mining; Local Alignment; Global Alignment; Structure Comparison
ID Code:62644
Deposited On:22 Sep 2011 02:41
Last Modified:22 Sep 2011 02:41

Repository Staff Only: item control page