Souporcell: robust clustering of single-cell RNA-seq data by genotype without reference genotypes

Archive ouverte

Heaton, Haynes | Talman, Arthur, M. | Knights, Andrew | Imaz, Maria | Gaffney, Daniel | Durbin, Richard | Hemberg, Martin | Lawniczak, Mara

Edité par CCSD ; Nature Publishing Group -

International audience. Methods to deconvolve single-cell RNA-sequencing (scRNA-seq) data are necessary for samples containing a mixture of genotypes, whether they are natural or experimentally combined. Multiplexing across donors is a popular experimental design that can avoid batch effects, reduce costs and improve doublet detection. By using variants detected in scRNA-seq reads, it is possible to assign cells to their donor of origin and identify cross-genotype doublets that may have highly similar transcriptional profiles, precluding detection by transcriptional profile. More subtle cross-genotype variant contamination can be used to estimate the amount of ambient RNA. Ambient RNA is caused by cell lysis before droplet partitioning and is an important confounder of scRNA-seq analysis. Here we develop souporcell, a method to cluster cells using the genetic variants detected within the scRNA-seq reads. We show that it achieves high accuracy on genotype clustering, doublet detection and ambient RNA estimation, as demonstrated across a range of challenging scenarios.

Consulter en ligne

Suggestions

Du même auteur

The Malaria Cell Atlas: Single parasite transcriptomes across the complete Plasmodium life cycle

Archive ouverte | Howick, Virginia | CCSD

International audience. Mapping the malaria parasite Several species of the parasite Plasmodium cause human malarial diseases, and, despite determined control efforts, a huge global disease burden remains. Howick et...

The genome sequence of the malaria mosquito, Anopheles funestus, Giles, 1900

Archive ouverte | Ayala, Diego | CCSD

International audience. We present a genome assembly from an individual female Anopheles funestus (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae). The genome sequence is 251 megabases in span. The ma...

A chromosomal reference genome sequence for the malaria mosquito, Anopheles moucheti, Evans, 1925

Archive ouverte | Nsango, Sandrine, N | CCSD

International audience. We present a genome assembly from an individual male Anopheles moucheti (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae), from a wild population in Cameroon. The genome sequenc...

Chargement des enrichissements...