Comparative genomic analysis of thermophilic fungi reveals convergent evolutionary adaptations and gene losses

Open archive

Steindorff, Andrei, S | Aguilar-Pontes, Maria, Victoria | Robinson, Aaron, J | Andreopoulos, Bill | Labutti, Kurt | Kuo, Alan | Mondo, Stephen | Riley, Robert | Otillar, Robert | Haridas, Sajeet | Lipzen, Anna | Grimwood, Jane | Schmutz, Jeremy | Clum, Alicia | Reid, Ian, D | Moisan, Marie-Claude | Butler, Gregory | Nguyen, Thi, Truc Minh | Dewar, Ken | Conant, Gavin | Drula, Elodie | Henrissat, Bernard | Hansel, Colleen | Singer, Steven | Hutchinson, Miriam, I | de Vries, Ronald, P | Natvig, Donald, O | Powell, Amy, J | Tsang, Adrian | Grigoriev, Igor, V

Published by CCSD; Nature Publishing Group

International audience. Thermophily is a trait scattered across the fungal tree of life, with its highest prevalence within three fungal families (Chaetomiaceae, Thermoascaceae, and Trichocomaceae), as well as some members of the phylum Mucoromycota. We examined 37 thermophilic and thermotolerant species and 42 mesophilic species for this study and identified thermophily as the ancestral state of all three prominent families of thermophilic fungi. Thermophilic fungal genomes were found to encode various thermostable enzymes, including carbohydrate-active enzymes such as endoxylanases, which are useful for many industrial applications. At the same time, the overall gene counts, especially in gene families responsible for microbial defense such as secondary metabolism, are reduced in thermophiles compared to mesophiles. We also found a reduction in the core genome size of thermophiles in both the Chaetomiaceae family and the Eurotiomycetes class. The Gene Ontology terms lost in thermophilic fungi include primary metabolism, transporters, UV response, and O-methyltransferases. Comparative genomics analysis also revealed higher GC content in the third base of codons (GC3) and a lower effective number of codons in fungal thermophiles than in both thermotolerant and mesophilic fungi. Furthermore, using the Support Vector Machine classifier, we identified several Pfam domains capable of discriminating between genomes of thermophiles and mesophiles with 94% accuracy. Using AlphaFold2 to predict protein structures of endoxylanases (GH10), we built a similarity network based on the structures. We found that the number of disulfide bonds appears important for protein structure, and the network clusters based on protein structures correlate with the optimal activity temperature. Thus, comparative genomics offers new insights into the biology, adaptation, and evolutionary history of thermophilic fungi while providing a parts list for bioengineering applications.
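The GC3 statistic mentioned in the abstract is the fraction of codons whose third base is G or C. The following is a minimal sketch of that calculation for a single coding sequence, not the authors' pipeline: the function name `gc3_content` is hypothetical, and the sketch assumes the input is an in-frame DNA coding sequence (any trailing partial codon is ignored).

```python
def gc3_content(cds: str) -> float:
    """Return the fraction of codons whose third base is G or C.

    Assumes `cds` is an in-frame DNA coding sequence; a trailing
    partial codon (1 or 2 leftover bases) is ignored.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3      # length covered by whole codons
    third_bases = seq[2:usable:3]         # every third base, starting at position 3
    if not third_bases:
        raise ValueError("sequence is shorter than one codon")
    gc_count = sum(base in "GC" for base in third_bases)
    return gc_count / len(third_bases)


if __name__ == "__main__":
    # Two codons (ATG, GCC): both third positions are G or C.
    print(gc3_content("ATGGCC"))
```

A genome-wide value would be obtained by pooling third positions across all predicted coding sequences; how the study aggregated them is not stated in this record.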

Suggestions

By the same author

Comparative genomics of Aspergillus nidulans and section Nidulantes

Open archive | Theobald, Sebastian | CCSD

International audience. Aspergillus nidulans is an important model organism for eukaryotic biology and the reference for the section Nidulantes in comparative studies. In this study, we de novo sequenced the genomes...

Genome-scale phylogeny and comparative genomics of the fungal order Sordariales

Open archive | Hensen, Noah | CCSD

Corresponding author: hanna.johannesson@su.se (H. Johannesson). International audience. The order Sordariales is taxonomically diverse, and harbours many species with different lifestyles and large economic importan...

Ecological generalism drives hyperdiversity of secondary metabolite gene clusters in xylarialean endophytes

Open archive | Franco, Mario | CCSD

International audience. Although secondary metabolites are typically associated with competitive or pathogenic interactions, the high bioactivity of endophytic fungi in the Xylariales, coupled with their abundance a...
