High-resolution species assignment of Anopheles mosquitoes using k-mer distances on targeted sequences

Open archive

Boddé, Marilou | Makunin, Alex | Ayala, Diego | Bouafou, Lemonde | Diabaté, Abdoulaye | Ekpo, Uwem Friday | Kientega, Mahamadi | Le Goff, Gilbert | Makanga, Boris, K | Ngangue, Marc, F | Omitola, Olaitan Olamide | Rahola, Nil | Tripet, Frederic | Durbin, Richard | Lawniczak, Mara Kn

Published by CCSD; eLife Sciences Publication

International audience. The ANOSPP amplicon panel is a genus-wide targeted sequencing panel to facilitate large-scale monitoring of Anopheles species diversity. Combining information from the 62 nuclear amplicons present in the ANOSPP panel allows for a more sensitive and specific species assignment than single-gene (e.g. COI) barcoding, which is desirable in the light of permeable species boundaries. Here, we present NNoVAE, a method using Nearest Neighbours (NN) and Variational Autoencoders (VAE), which we apply to k-mers resulting from the ANOSPP amplicon sequences in order to hierarchically assign species identity. The NN step assigns a sample to a species-group by comparing the k-mers arising from each haplotype’s amplicon sequence to a reference database. The VAE step is required to distinguish between closely related species, and also has sufficient resolution to reveal population structure within species. In tests on independent samples with over 80% amplicon coverage, NNoVAE correctly classifies to species level 98% of samples within the An. gambiae complex and 89% of samples outside the complex. We apply NNoVAE to over two thousand new samples from Burkina Faso and Gabon, identifying unexpected species in Gabon. NNoVAE presents an approach that may be of value to other targeted sequencing panels, and is a method that will be used to survey Anopheles species diversity and Plasmodium transmission patterns through space and time on a large scale, with plans to analyse half a million mosquitoes in the next five years.
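To make the nearest-neighbour idea in the abstract concrete, here is a minimal Python sketch of assigning a sequence to a group by k-mer distance. It is not the NNoVAE implementation: the function names (`kmer_counts`, `kmer_distance`, `nearest_group`), the choice of k, the distance definition and the toy sequences are all assumptions made for illustration, and the VAE step is not shown.

```python
from collections import Counter


def kmer_counts(seq, k=8):
    """Count every overlapping k-mer in a sequence (hypothetical helper)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kmer_distance(a, b):
    """Fraction of k-mers not shared by two count tables (0.0 = identical).

    This particular distance is an assumption for illustration only.
    """
    shared = sum((a & b).values())  # multiset intersection
    total = max(sum(a.values()), sum(b.values()))
    return 1.0 - shared / total if total else 1.0


def nearest_group(query_seq, reference, k=8):
    """Return the label whose closest reference sequence is nearest to the query.

    `reference` maps a group label to a list of reference sequences.
    """
    q = kmer_counts(query_seq, k)
    return min(
        reference,
        key=lambda label: min(
            kmer_distance(q, kmer_counts(s, k)) for s in reference[label]
        ),
    )


# Toy data, invented for this example (not real ANOSPP amplicons).
reference = {
    "group_A": ["ACGTACGTTAGCTAGCTAGGATCCA"],
    "group_B": ["TTTTGGGGCCCCAAAATTTTGGGGC"],
}
query = "ACGTACGTTAGCTAGCTAGGATCCT"  # group_A sequence with one substitution
assigned = nearest_group(query, reference, k=4)
```

In a real setting the comparison would be repeated per amplicon and per haplotype and the results combined across the panel; how that combination is done is described in the paper itself rather than in this sketch.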

Suggestions

By the same author

Host preference patterns in domestic and wild settings: Insights into Anopheles feeding behavior

Open archive | Bouafou, Lemonde | CCSD

International audience. The adaptation of Anopheles malaria vectors to domestic settings is directly linked to their ability to feed on humans. The strength of this species–habitat association is unequal across the ...

Improved species assignments across the entire Anopheles genus using targeted sequencing

Open archive | Boddé, Marilou | CCSD

International audience. Accurate species identification of the mosquitoes in the genus Anopheles is of crucial importance to implement malaria control measures and monitor their effectiveness. We use a previously de...

Genomic Signatures of Microgeographic Adaptation in Anopheles coluzzii Along an Anthropogenic Gradient in Gabon

Open archive | Daron, Josquin | CCSD

Species distributed across heterogeneous environments often evolve locally adapted populations, but understanding how these persist in the presence of homogenizing gene flow remains puzzling. In Gabon, Anopheles coluzzii, a major ...
