Analysis of the genome and transcriptome of Cryptococcus neoformans var. grubii reveals complex RNA expression and microevolution leading to virulence attenuation

Janbon, Guilhem ; Ormerod, Kate L. ; Paulet, Damien ; Byrnes, Edmond J. III ; Yadav, Vikas ; Chatterjee, Gautam ; Mullapudi, Nandita ; Hon, Chung-Chau ; Billmyre, R. Blake ; Brunel, François ; Bahn, Yong-Sun ; Chen, Weidong ; Chen, Yuan ; Chow, Eve W. L. ; Coppée, Jean-Yves ; Floyd-Averette, Anna ; Gaillardin, Claude ; Gerik, Kimberly J. ; Goldberg, Jonathan ; Gonzalez-Hilarion, Sara ; Gujja, Sharvari ; Hamlin, Joyce L. ; Hsueh, Yen-Ping ; Ianiri, Giuseppe ; Jones, Steven ; Kodira, Chinnappa D. ; Kozubowski, Lukasz ; Lam, Woei ; Marra, Marco ; Mesner, Larry D. ; Mieczkowski, Piotr A. ; Moyrand, Frédérique ; Nielsen, Kirsten ; Proux, Caroline ; Rossignol, Tristan ; Schein, Jacqueline E. ; Sun, Sheng ; Wollschlaeger, Carolin ; Wood, Ian A. ; Zeng, Qiandong ; Neuvéglise, Cécile ; Newlon, Carol S. ; Perfect, John R. ; Lodge, Jennifer K. ; Idnurm, Alexander ; Stajich, Jason E. ; Kronstad, James W. ; Sanyal, Kaustuv ; Heitman, Joseph ; Fraser, James A. ; Cuomo, Christina A. ; Dietrich, Fred S. (2014) Analysis of the genome and transcriptome of Cryptococcus neoformans var. grubii reveals complex RNA expression and microevolution leading to virulence attenuation PLoS Genetics, 10 (4). Article ID e1004261. ISSN 1553-7390


Official URL: http://journals.plos.org/plosgenetics/article?id=1...

Related URL: http://dx.doi.org/10.1371/journal.pgen.1004261

Abstract

Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence.

Item Type:Article
Source:Copyright of this article belongs to Public Library of Science.
ID Code:109890
Deposited On:25 Oct 2017 13:09
Last Modified:25 Oct 2017 13:09
