2.3 Higher order C-techs
C-Techs (chromosome conformation capture)-coupled
Introduction
Overview of 3C-based methods
2.1. Specificity
2.2. Through-put and resolution
Hi-C
ChIA-PET
Selected methods comparison
2.3.1 Introduction
The foundamental object of 3C(Chromosome Conformation Capture) techniques and 3C-derived methods is to understand the physical wiring diagram of the genome by identifying the physical interaction between chromosomes.
To capture the interaction (crosslink between strings), there are few steps in general:
Take a snapshot of the flowing cells - Crosslink with fixative agent (formaldehyde)
Zoom in on crosslinked part and exclude untangled parts - Digested with a restriction enzyme
Analyze the components come from the same chromatin - Reverse crosslink and sequence
Finish the jigsaw puzzle and get the results - Align the reads and summarize the contacts
Based on these general ideas, then we'll dive deeper by walking through two of the most popular techniques and then briefly introduce some other methods.
2.3.2 Overivew of 3C methods
Figure1. Schematic Representation of Chromosome Conformation Capture (3C) and 3C-Derived Methods. These methods help to elucidate nuclear organization by detecting physical interactions between genetic elements located throughout the genome. Abbreviations: IP, immunoprecipitation; RE, restriction enzyme. Figure by Sotelo-Silveira, Mariana, et al. Trends in Plant Science (2018).
To better understand the difference between these methods, I'd like to distinguish them between the following couple of aspects:
1) Specificity - What does one, all, many mean
‘1’, ‘Many’ and ‘All’ indicate how many loci are interrogated in a given experiment. For example, ‘1 versus All’ indicates that the experiment probes the interaction profile between 1 locus and all other potential loci in the genome. ‘All versus All’ means that one can detect the interaction profiles of all loci, genome-wide, and their interactions with all other genomic loci [1].
These kind of specificity is determined by the primer when people use specific primers before PCR.
2) Through-put and resolution
Hi-C techniques has the highest through-put (billion reads per sample) but suffering of a relative low resolution of 0.1-1Mb. However, the other methods usually have a higher resolution around 1kb. For more details one can refer to table2 in [2].
2.3.3 Hi-C
Hi-C is the highest through-put version of 3C-derived technologies. Due to the decreasing cost of 2nd generation sequencing, Hi-C is widely used.
The principle of Hi-C can be illustrated as:
Hi-C critical steps [8]
Fixation: keep DNA conformed
Digestion: enzyme frequency and penetration
Fill-in: biotin for junction enrichment
Ligation: freeze interactions in sequence
Biotin removal: junctions only
Fragment size: small fragments sequence better
Adapter ligation: paired-end and indexing
PCR: create enough material for flow cell
Hi-C derived techniques
Hi-C original: Lieberman-Aiden et al., Science 2010
Hi-C 1.0: Belton-JM et al., Methods 2012
In situ Hi-C: Rao et al., Cell 2014
Single cell Hi-C: Nagano et al., Genome Biology 2015
DNase Hi-C Ma, Wenxiu, Methods et al
Hi-C 2.0: Belaghzal et al., Methods 2017
DLO-Hi-C: Lin et al., Nature Genetics 2018
Hi-C improving: Golloshi et al., Methods 2018
Arima 1-day Hi-C: Ghurye et al., BioRxiv 2018
2.3.4 ChIA-PET
ChIA-PET is another method that combines ChIP and pair-end sequencing to analysis the chromatin interaction. It allows for targeted binding factors such as: estrogen receptor alpha, CTCF-mediated loops, RNA polymerase II, and a combination of key architectural factors. On the one hand, it has the benefit of achieving a higher resolution compared to Hi-C, as only ligation products involving the immunoprecipitated molecule are sequenced, on the other hand, ChIA-PET has systematic biases due to ChIP process:
Only one type of binding factor selected
Different antibodies
ChIP conditions
2.3.5 Selected methods comparison
Method
Targets
Resolution
Notes
one-vs-one
~1–10 kb
Sequence of bait locus must be known
Data analysis low throughput
one-vs-all
~2 kb
Sequence of bait locus must be known
Detects novel contacts
Long-range contacts
many-vs-many
~1 kb
High dynamic range complete contact map of a locus 3C with ligation-mediated amplification (LMA) of a ‘carbon copy’ library of oligos designed across restriction fragment junctions of interest 3C
all-vs-all
0.1–1 Mb
Genome-wide nucleosome core positioning
Relative low resolution high cost
Interaction of whole genome mediated by protein
Depends on read depth and the size of the genome region bound by the protein of interest
Lower noise with ChIP
Biased method since selected protein
Last updated