Analysis of string-searching algorithms on biological sequence databases

Sheik, S. S. ; Aggarwal, Sumit K. ; Poddar, A. ; Sathiyabhama, B. ; Balakrishnan, N. ; Sekar, K. (2005) Analysis of string-searching algorithms on biological sequence databases Current Science, 89 (2). pp. 368-374. ISSN 0011-3891

[img]
Preview
PDF - Publisher Version
91kB

Official URL: http://www.ias.ac.in/currsci/jul252005/368.pdf

Abstract

String-searching algorithms are used to find the occurrences of a search string in a given text. The advent of digital computers has stimulated the development of string-searching algorithms for various applications. Here, we report the performance of all string-searching algorithms on widely used biological sequence databases containing the building blocks of nucleotides (in the case of nucleic acid sequence database) and amino acids (in the case of protein sequence database). The biological sequence databases used in the present study are Protein Information Resource (PIR), SWISSPROT, and amino acid and nucleotide sequences of all genomes available in the genome database. The average time taken for different search-string lengths considered for study has been taken as an indicator of performance for comparison between various methods.

Item Type:Article
Source:Copyright of this article belongs to Current Science Association.
Keywords:Pattern Matching; Protein Sequence; Nucleotide Sequence; Databases; String-searching Algorithms
ID Code:64443
Deposited On:10 Oct 2011 07:31
Last Modified:18 May 2016 12:52

Repository Staff Only: item control page