Simplifying the mosaic description of DNA sequences

Azad, Rajeev K. ; Subba Rao, J. ; Li, Wentian ; Ramaswamy, Ramakrishna (2002) Simplifying the mosaic description of DNA sequences Physical Review E - Statistical, Nonlinear and Soft Matter Physics, 66 (3). 031913_1-031913_6. ISSN 1539-3755

[img]
Preview
PDF - Author Version
152kB

Official URL: http://pre.aps.org/abstract/PRE/v66/i3/e031913

Related URL: http://dx.doi.org/10.1103/PhysRevE.66.031913

Abstract

By using the Jensen-Shannon divergence, genomic DNA can be divided into compositionally distinct domains through a standard recursive segmentation procedure. Each domain, while significantly different from its neighbors, may, however, share compositional similarity with one or more distant (non-neighboring) domains. We thus obtain a coarse-grained description of the given DNA string in terms of a smaller set of distinct domain labels. This yields a minimal domain description of a given DNA sequence, significantly reducing its organizational complexity. This procedure gives a new means of evaluating genomic complexity as one examines organisms ranging from bacteria to human. The mosaic organization of DNA sequences could have originated from the insertion of fragments of one genome (the parasite) inside another (the host), and we present numerical experiments that are suggestive of this scenario.

Item Type:Article
Source:Copyright of this article belongs to The American Physical Society.
ID Code:45334
Deposited On:28 Jun 2011 04:50
Last Modified:18 May 2016 01:37

Repository Staff Only: item control page