Selected Publications

Adjacent CpG sites in mammalian genomes can be co-methylated owing to the processivity of methyltransferases or demethylases, yet discordant methylation patterns have also been observed, which are related to stochastic or uncoordinated molecular processes. We focused on a systematic search and investigation of regions in the full human genome that show highly coordinated methylation. We defined 147,888 blocks of tightly coupled CpG sites, called methylation haplotype blocks, after analysis of 61 whole-genome bisulfite sequencing data sets and validation with 101 reduced-representation bisulfite sequencing data sets and 637 methylation array data sets. Using a metric called methylation haplotype load, we performed tissue-specific methylation analysis at the block level. Subsets of informative blocks were further identified for deconvolution of heterogeneous samples. Finally, using methylation haplotypes we demonstrated quantitative estimation of tumor load and tissue-of-origin mapping in the circulating cell-free DNA of 59 patients with lung or colorectal cancer.
In Nat Genet

Generation of human induced pluripotent stem cells (hiPSCs) by the expression of specific transcription factors depends on successful epigenetic reprogramming to a pluripotent state. Although hiPSCs and human embryonic stemcells (hESCs) display a similar epigenome, recent reports demonstrated the persistence of specific epigenetic marks from the somatic cell type of origin and aberrant methylation patterns in hiPSCs. However, it remains unknownwhether the use of different somatic cell sources, encompassing variable levels of selection pressure during reprogramming, influences the level of epigenetic aberrations in hiPSCs. In this work, we characterized the epigenomic integrity of 17 hiPSC lines derived from six different cell types with varied reprogramming efficiencies. We demonstrate that epigenetic aberrations are a general feature of the hiPSC state and are independent of the somatic cell source. Interestingly, we observe that the reprogramming efficiency of somatic cell lines inversely correlateswith the amount of methylation change needed to acquire pluripotency. Additionally, we determine that both shared and linespecific epigenetic aberrations in hiPSCs can directly translate into changes in gene expression in both the pluripotent and differentiated states. Significantly, our analysis of different hiPSC lines from multiple cell types of origin allow us to identify a reprogrammingspecific epigenetic signature comprised of nine aberrantly methylated genes that is able to segregate hESC and hiPSC lines regardless of the somatic cell source or differentiation state.
In PNAS

Targeted quantification of DNA methylation allows for interrogation of the most informative loci across many samples quickly and cost-effectively. Here we report improved bisulfite padlock probes (BSPPs) with a design algorithm to generate efficient padlock probes, a library-free protocol that dramatically reduces sample-preparation cost and time and is compatible with automation, and an efficient bioinformatics pipeline to accurately obtain both methylation levels and genotypes from sequencing of bisulfite-converted DNA.
In Nat Methods

Recent Publications

  • Identification of methylation haplotype blocks aids in deconvolution of heterogeneous tissue samples and tumor tissue-of-origin mapping from plasma DNA

    Details PDF Dataset Project

  • Advances in the profiling of DNA modifications: cytosine methylation and beyond

    Details PDF

  • Identification of a specific reprogramming-associated epigenetic signature in human induced pluripotent stem cells

    Details PDF Dataset

  • Library-free methylation sequencing with bisulfite padlock probes

    Details PDF

Recent Posts

Software

All

BisReadMapper

Pipeline for high throughput bisulfite sequencing data analysis

MONOD2

Toolkit for methylation haplotype analysis for bisulfite sequencing data

cgDMR-miner

Generalized method for the identification of DMRs from low coverage bisulfite sequencing data

Contact