Self-organizing maps: a tool to ascertain taxonomic relatedness based on features derived from 16S rDNA sequence

Raje, D. V. ; Purohit, H. J. ; Badhe, Y. P. ; Tambe, S. S. ; Kulkarni, B. D. (2010) Self-organizing maps: a tool to ascertain taxonomic relatedness based on features derived from 16S rDNA sequence Journal of Biosciences, 35 (4). pp. 617-627. ISSN 0250-5991

[img]
Preview
PDF - Publisher Version
1MB

Official URL: http://www.ias.ac.in/jbiosci/dec2010/617.pdf

Related URL: http://dx.doi.org/10.1007/s12038-010-0070-y

Abstract

Exploitation of microbial wealth, of which almost 95% or more is still unexplored, is a growing need. The taxonomic placements of a new isolate based on phenotypic characteristics are now being supported by information preserved in the 16S rRNA gene. However, the analysis of 16S rDNA sequences retrieved from metagenome, by the available bioinformatics tools, is subject to limitations. In this study, the occurrences of nucleotide features in 16S rDNA sequences have been used to ascertain the taxonomic placement of organisms. The tetra- and penta-nucleotide features were extracted from the training data set of the 16S rDNA sequence, and was subjected to an artificial neural network (ANN) based tool known as self-organizing map (SOM), which helped in visualization of unsupervised classification. For selection of significant features, principal component analysis (PCA) or curvilinear component analysis (CCA) was applied. The SOM along with these techniques could discriminate the sample sequences with more than 90% accuracy, highlighting the relevance of features. To ascertain the confidence level in the developed classification approach, the test data set was specifically evaluated for Thiobacillus, with Acidiphilium, Paracocus and Starkeya, which are taxonomically reassigned. The evaluation proved the excellent generalization capability of the developed tool. The topology of genera in SOM supported the conventional chemo-biochemical classification reported in the Bergey manual.

Item Type:Article
Source:Copyright of this article belongs to Indian Academy of Sciences.
Keywords:Curvilinear Component Analysis; Self-organizing Maps; Principal Component Analysis
ID Code:85714
Deposited On:05 Mar 2012 14:06
Last Modified:19 May 2016 01:38

Repository Staff Only: item control page