Abstract
The boom in single-cell technologies has brought a surge of high dimensional data that come from different sources and represent cellular systems from different views. With advances in these single-cell technologies, integrating single-cell data across modalities arises as a new computational challenge. Here, we present an adversarial approach, sciCAN, to integrate single-cell chromatin accessibility and gene expression data in an unsupervised manner. We benchmarked sciCAN with 5 existing methods in 5 scATAC-seq/scRNA-seq datasets, and we demonstrated that our method dealt with data integration with consistent performance across datasets and better balance of mutual transferring between modalities than the other 5 existing methods. We further applied sciCAN to 10X Multiome data and confirmed that the integrated representation preserves biological relationships within the hematopoietic hierarchy. Finally, we investigated CRISPR-perturbed single-cell K562 ATAC-seq and RNA-seq data to identify cells with related responses to different perturbations in these different modalities.
Original language | English |
---|---|
Article number | 33 |
Journal | npj Systems Biology and Applications |
Volume | 8 |
Issue number | 1 |
DOIs | |
State | Published - Dec 2022 |
Funding
This work was supported by NIH NIGMS grant R35GM133557 to R.P.M. E.B. has dual affiliations with both Oak Ridge National Laboratory and University of Tennessee, Knoxville. The manuscript has been in part co-authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. We would also like to thank Heng Li for manuscript proofreading and editing.
Funders | Funder number |
---|---|
U.S. Department of Energy | |
National Institute of General Medical Sciences | R35GM133557 |
Oak Ridge National Laboratory | |
University of Tennessee | |
UT-Battelle | DE-AC05-00OR22725 |