3dgenome
  • Initial page
  • Cover
  • Preface
  • Figurelist
  • Chap0 Preparation
    • 0.1 Molecular biology
    • 0.2 Sequencing technologies
    • 0.3 RNA-seq Data Mapping & Gene Quantification
    • 0.4 RNA-seq Differential Analysis
  • Chap1 Why we care about 3D genome
    • 1.1 From 2D to 3D nuclear structure
    • 1.2 From static to dynamic
    • 1.3 From intra to inter chromosomes "talk"
    • 1.4 From aggregation to division - phase separation
  • Chap2 experiment tools for exploring genome interaction
    • 2.1 Image based
    • 2.2 Primary order
    • 2.3 Higher order C-techs
  • Chap3 Computational analysis
    • 3.1 Primary order analysis
    • 3.2 Higer order data analysis
      • 3.2.1 Read mapping consideration
      • 3.2.2 Analytical Pipelines
        • GITAR Pipeline
        • HiC-Pro Pipeline
      • 3.2.3 TAD calling algorithms
    • 3.3 3D structure
  • Chap4 RNA-genome interaction
    • 4.1 Experimental Methods
    • 4.2 Computational Analysis
  • Chap5 Integrative Data Visualization Tools
    • 5.1 GIVE
    • 5.2 HiGlass
  • Chap6 4DN Project
  • Appendix
    • Homework
    • Student's presentation
      • A Brief Introduction to Machine Learning
      • Precision medicine
      • CHIP-Seq
Powered by GitBook
On this page
  • Integrative data visualization
  • 1.Motivation for Vis:
  • 2.What kind of data:
  • 3.Desired feaures of Hi-C Vis tools:
  • 4.Current plights
  • 5.Overview of tools
  • Referrence

Chap5 Integrative Data Visualization Tools

Integrative data visualization

  1. Motivation for Vis

  2. What kind of data

  3. Desired feaures of Hi-C Vis tools

  4. Current plights

  5. Overview of tools

Here we'll mainly focused on Hi-C data visulazation, while most of the tools designed for Hi-C data are also able to integragate other 3C-based data to improve the interpretability.

We'll first walk through the motivation for visualizing complex dataset and then discuss the object, demand, current limitations in Hi-C data visualization. Then we'll provide a roadmap with some comparison of some of the most popular softwares available, based on some criteria that people may refer to. Finally we'll choose GIVE and HIGlass as showcases to prepare folk for practical analysis.

1.Motivation for Vis:

  • Explore data feature, find patterns

  • Interpretate the data

  • Verify or raise hypothesis

2.What kind of data:

i) Hi-C data:

  • What is hi-c data, brief intro. may refer to other chapter

ii) Other useful types of data:

  • genome, epigenome

  • SNP

  • CTCF, ChiP-seq

  • ChIA-PET

  • ...

3.Desired feaures of Hi-C Vis tools:

To take a cruise on a Hi-C map is just like navigate on a google map: we want to zoom in & out anytime we want, we want to focus on a special location, get relative information about that location... In a word, we're immersed in the ocean of data but we still want to be as free as a fish.

  • Multi-omics data, multi-techs data. Optional for users

  • Vivid presentation: Circos, Heatmaps, Birdview.

  • Interactive presentation: mouse zoom-in & out, panning, sliding.

  • Easy to share the data:

    • Cloud based, docker based, block chain(for server), URL share, QR code.

    • Records for eaxct parameters that enable collaborators and public to repeat the trials.

  • Easy for comparison:

    • Compare own data with published papaer —— preloaded dataset.

    • Compare data from different analysis pipelines.

    • Compare 2 datasets in one screen, with exact same scalability and same region, which needs fast and intelligent localization.

  • Desktop applicaton, off-line usage.

  • 3D ,even 4D vis.

  • One stop service, the only desidered input is the raw reads data :D

4.Current plights

  • Size: trillions of pixels of one high resolution heatmap.

    • Resolution at different scale:

      • Compartments:1-10Mb 100-500 Kb bins

      • TADs: 0.1-1Mb 40-100Kb bins

      • Loops: 0.01-0.1Mb 10-40Kb bins

  • Storage of the data, need a standard format ".hic"

  • Smart computation with the need of in time zoom-in & out

  • Input format not uniform. A contact matrix, compressed TAD data or others.

  • Normalization methods for comparing different datasets.

  • ...

5.Overview of tools

Referrence

[1] R. Calandrelli, Q. Wu, J. Guan, and S. Zhong, “GITAR : An open source tool for analysis and visualization of Hi-C data,” 2018.

[2] B. Informatics, “HiGlass : Web-based Visual Exploration and Analysis of Genome Interaction Maps,” pp. 1–31, 2017.

[3] D. Yang et al., “3DIV: A 3D-genome Interaction Viewer and database,” Nucleic Acids Res., vol. 46, no. D1, pp. D52–D57, 2018.

[4] J. T. Robinson, D. Turner, N. C. Durand, H. Thorvaldsdóttir, J. P. Mesirov, and E. L. Aiden, “Juicebox.js Provides a Cloud-Based Visualization System for Hi-C Data,” Cell Syst., vol. 6, no. 2, p. 256–258.e1, 2018.

[5] M. N. Djekidel, M. Wang, M. Q. Zhang, and J. Gao, “HiC-3DViewer: a new tool to visualize Hi-C data in 3D space,” Quant. Biol., vol. 5, no. 2, pp. 183–190, 2017.

[6] G. G. Yardimci and W. S. Noble, “Software tools for visualizing Hi-C data,” Genome Biol., vol. 18, no. 1, 2017.

Previous4.2 Computational AnalysisNext5.1 GIVE

Last updated 6 years ago