On detection and assessment of statistical significance of genomic islands

Chatterjee, Raghunath ; Chaudhuri, Keya ; Chaudhuri, Probal (2008) On detection and assessment of statistical significance of genomic islands BMC Genomics, 9 (1). p. 150. ISSN 1471-2164

Official URL: http://www.biomedcentral.com/1471-2164/9/150/abstr...

Related URL: http://dx.doi.org/10.1186/1471-2164-9-150

Abstract

Many of the available methods for detecting genomic islands (GIs) in prokaryotic genomes use markers such as transposons, proximal tRNAs and flanking repeats, or they use other supervised techniques that require training datasets. Most of these methods are primarily based on biases in the GC content or in the codon and amino acid usage of the islands. However, these methods either do not use any formal statistical test of significance or use statistical tests for which the critical values and the p-values are not adequately justified. We propose a method that is unsupervised in nature and uses Monte-Carlo statistical tests based on randomly selected segments of a chromosome. Such tests are supported by precise statistical distribution theory, and consequently the resulting p-values are quite reliable for making the decision. Our algorithm (named Design-Island, an acronym for Detection of Statistically Significant Genomic Island) runs in two phases. Some putative GIs are identified in the first phase, and those are refined into smaller segments containing horizontally acquired genes in the refinement phase. This method is applied to the Salmonella typhi CT18 genome, leading to the discovery of several new pathogenicity, antibiotic resistance and metabolic islands that were missed by earlier methods. Many of these islands contain mobile genetic elements such as phage-mediated genes, transposons, integrases and IS elements, confirming their horizontal acquisition. The proposed method is based on statistical tests supported by precise distribution theory and reliable p-values, along with a technique for visualizing statistically significant islands. The performance of our method is better than that of many other well-known methods in terms of sensitivity and accuracy, and in terms of specificity it is comparable to other methods.
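
To make the idea of a Monte-Carlo test based on randomly selected chromosome segments more concrete, here is a minimal sketch in Python. It is not the authors' Design-Island implementation: the choice of GC content as the test statistic, the function names (gc_fraction, monte_carlo_p_value), the default of 999 random segments and the two-sided comparison are all assumptions made for illustration only.

```python
import random


def gc_fraction(seq):
    """Fraction of G and C bases in a sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq.upper()) / len(seq)


def monte_carlo_p_value(genome, start, length, n_samples=999, rng=None):
    """Monte-Carlo p-value for the GC deviation of one candidate window.

    Hypothetical helper, not from the paper: the window's absolute GC
    deviation from the genome-wide GC fraction is compared with the
    deviations of n_samples segments of the same length drawn uniformly
    at random from the same sequence.
    """
    rng = rng or random.Random(0)
    genome_gc = gc_fraction(genome)
    observed = abs(gc_fraction(genome[start:start + length]) - genome_gc)

    extreme = 0
    for _ in range(n_samples):
        # Pick a random start so that the segment fits inside the genome.
        s = rng.randrange(len(genome) - length + 1)
        deviation = abs(gc_fraction(genome[s:s + length]) - genome_gc)
        if deviation >= observed:
            extreme += 1

    # Add-one correction: the observed window counts as one of the samples,
    # so the estimated p-value can never be exactly zero.
    return (extreme + 1) / (n_samples + 1)
```

A window whose p-value falls below a chosen significance level would be flagged as a putative island in this simplified setting. The two-phase identification and refinement procedure described in the abstract is not reproduced here.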

Item Type:Article
Source:Copyright of this article belongs to BioMed Central Ltd.
ID Code:8128
Deposited On:26 Oct 2010 04:19
Last Modified:16 May 2016 18:11
