6533b7d7fe1ef96bd1269508

RESEARCH PRODUCT

Single-cell ChIP-seq imputation with SIMPA by leveraging bulk ENCODE data

Steffen AlbrechtTommaso AndreaniMiguel A. Andrade-navarroJean-fred Fontaine

subject

description

Abstract Single-cell ChIP-seq analysis is challenging due to data sparsity. We present SIMPA ( https://github.com/salbrec/SIMPA ), a single-cell ChIP-seq data imputation method leveraging predictive information within bulk ENCODE data to impute missing protein-DNA interacting regions of target histone marks or transcription factors. Machine learning models trained for each single cell, each target, and each genomic region enable drastic improvement in cell types clustering and genes identification.

10.1101/2019.12.20.883983http://dx.doi.org/10.1101/2019.12.20.883983