Simplifying the mosaic description of DNA sequences

Azad, Rajeev K. ; Rao, Subba J. ; Li, Wentian ; Ramaswamy, Ramakrishna (2002) Simplifying the mosaic description of DNA sequences Physical Review E - Statistical, Nonlinear and Soft Matter Physics, 66 . Article ID 031913. ISSN 1539-3755

Full text not available from this repository.

Official URL: http://journals.aps.org/pre/abstract/10.1103/PhysR...

Related URL: http://dx.doi.org/10.1103/PhysRevE.66.031913

Abstract

By using the Jensen-Shannon divergence, genomic DNA can be divided into compositionally distinct domains through a standard recursive segmentation procedure. Each domain, while significantly different from its neighbors, may, however, share compositional similarity with one or more distant (non-neighboring) domains. We thus obtain a coarse-grained description of the given DNA string in terms of a smaller set of distinct domain labels. This yields a minimal domain description of a given DNA sequence, significantly reducing its organizational complexity. This procedure gives a new means of evaluating genomic complexity as one examines organisms ranging from bacteria to human. The mosaic organization of DNA sequences could have originated from the insertion of fragments of one genome (the parasite) inside another (the host), and we present numerical experiments that are suggestive of this scenario.

Item Type:Article
Source:Copyright of this article belongs to The American Physical Society.
ID Code:98871
Deposited On:26 May 2015 12:01
Last Modified:26 May 2015 12:01

Repository Staff Only: item control page