Last updated: 2018-08-27

workflowr checks: (Click a bullet for more information)
Expand here to see past versions:


Cardelino : Integrating whole exomes and single-cell transcriptomes to reveal phenotypic impact of somatic variants

Decoding the clonal substructures of somatic tissues sheds light on cell growth, development and differentiation in health, ageing and disease. Targeted DNA sequencing, whole exome sequencing, and most recently single-cell DNA sequencing have been applied to reconstruct trees describing this clonal substructure. However, the functional and phenotypic differences between clonal populations identified have been difficult to characterise and remain unclear in most cases.

To address this, we present cardelino (https://github.com/PMBio/cardelino), a computational method to assign single-cell transcriptome profiles to nodes in a clonal tree, based on variants identified from single-cell RNA-seq data. After validating the model on simulated data, we apply cardelino to matched single-cell RNA-seq and exome sequencing data from 32 human dermal fibroblast lines.

The somatic mutation landscape reveals non-neutral clonal evolution in a subset of these healthy donors. In line with this evolutionary selection pressure, we identify gene expression differences between clones. Interestingly, cell cycle and proliferation pathways separate clones repeatedly in our fibroblast samples. In summary, we develop and apply an approach that allows for reconstructing transcriptome profiles of clonal subpopulations within tissues. This has the potential to inform on mutational processes at single-cell resolution, both in cancer and healthy ageing.

Key findings:

Authors

The full author list is as follows:

Davis J. McCarthy1,4*, Raghd Rostom1,2,*, Yuanhua Huang1*, Daniel J. Kunz2,5,6, Petr Danecek2, Marc Jan Bonder1, Tzachi Hagai1,2, HipSci Consortium, Wenyi Wang, Daniel J. Gaffney2, Benjamin D. Simons5,6,7, Oliver Stegle1,3,#, Sarah A. Teichmann1,2,5,#

1European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, CB10 1SD Hinxton, Cambridge, UK; 2Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, UK; 3European Molecular Biology Laboratory, Genome Biology Unit, 69117 Heidelberg, Germany; 4St Vincent’s Institute of Medical Research, Fitzroy, Victoria 3065, Australia. 5Cavendish Laboratory, Department of Physics, JJ Thomson Avenue, Cambridge, CB3 0HE, UK. 6The Wellcome Trust/Cancer Research UK Gurdon Institute, University of Cambridge, Cambridge, CB2 1QN, UK. 7The Wellcome Trust/Medical Research Council Stem Cell Institute, University of Cambridge, Cambridge, UK.

* These authors contributed equally to this work.

# Corresponding authors.


This reproducible R Markdown analysis was created with workflowr 1.1.1