Azad, Rajeev K. ; Subba Rao, J. ; Li, Wentian ; Ramaswamy, Ramakrishna (2002) Simplifying the mosaic description of DNA sequences Physical Review E - Statistical, Nonlinear and Soft Matter Physics, 66 (3). 031913_1-031913_6. ISSN 1539-3755
|
PDF
- Author Version
152kB |
Official URL: http://pre.aps.org/abstract/PRE/v66/i3/e031913
Related URL: http://dx.doi.org/10.1103/PhysRevE.66.031913
Abstract
By using the Jensen-Shannon divergence, genomic DNA can be divided into compositionally distinct domains through a standard recursive segmentation procedure. Each domain, while significantly different from its neighbors, may, however, share compositional similarity with one or more distant (non-neighboring) domains. We thus obtain a coarse-grained description of the given DNA string in terms of a smaller set of distinct domain labels. This yields a minimal domain description of a given DNA sequence, significantly reducing its organizational complexity. This procedure gives a new means of evaluating genomic complexity as one examines organisms ranging from bacteria to human. The mosaic organization of DNA sequences could have originated from the insertion of fragments of one genome (the parasite) inside another (the host), and we present numerical experiments that are suggestive of this scenario.
Item Type: | Article |
---|---|
Source: | Copyright of this article belongs to The American Physical Society. |
ID Code: | 45334 |
Deposited On: | 28 Jun 2011 04:50 |
Last Modified: | 18 May 2016 01:37 |
Repository Staff Only: item control page