0.1 Molecular biology

Molecular biology concerns all molecular basis of a life from composition to activity, including the interactions between DNA, RNA, proteins, their biosynthesis, as well as the regulation of these interactions. Molecular biology also is the study of molecular underpinnings of the processes of replication, transcription, translation and cell function, which is a good starting point to understand the field.

To better understand the content about 3D genome and serve a broad spectrum of readers, let's first review some important concepts. Folks who are familiar with these content feel free to skip this chapter.

Restriction enzyme

Restriction Enzymes that cuts DNA at specific nucleotide sequences known as restriction sites, were first discovered in bacteria where these enzymes are designed to selectively cut exogenous DNA (like virus) to protect themselves.

To cut the DNA, restriction enzyme makes two incisions, each strand of the DNA double helix. These restriction sites are palindrome ( which means the sequence read the same forwards as backwards ). There are over 3000 RE have been identified and more than 600 of them are commercially available. Details of enzyme can be found in database or commercial website.

Naturally occurring restriction endonucleases are categorized into four groups (Types I, II III, and IV) based on their composition and enzyme cofactor requirements, the nature of their target sequence, and the position of their DNA cleavage site relative to the target sequence. Here we will focus on the last criteria.

Cleavage Site

Examples

Type I

cut DNA at random far from their recognition sequences.

EcoK I

EcoA I

CfrA I

Type II

Specific

Within the recognition site

EcoR I

BamH I

Hind III

Type III

Random

24-26 bp away from recognition site

EcoP I

Hinf III

EcoP15 I

Hydrogen bonds

Hydrogen bond is a partially electrostatic attraction between a hydrogen (H) which is bound to a more electronegative atom such as nitrogen (N), oxygen(O), or fluorine (F), and another adjacent atom bearing a lone pair of electrons (according to wiki). In another word it is an attraction between a slightly positive hydrogen atom and a slightly negative atom.

All life depends on hydrogen bonds in basic form water. It also plays an important role in determining the three-dimensional structures and the properties adopted by many synthetic and natural proteins. DNA, proteins, cellulose and so on.

In epigenetics, there are 142 hydrogen bonds between DNA and the histone core in each nucleosome. More than 1/5 of the amino acids in each of the core histones are either lysine or arginine, and their positive charges neutralize the negatively charged DNA backbone.

For protein-DNA recognition, the mechanism is based on base readout and shape readout. Base readout is when the protein recognizes the specific chemical signatures of different nucleic acid bases which extensively depends on hydrogen bonds. In protein-DNA recognition, it is a greater source of specificity in the major groove as compared to the minor groove due to the pattern of hydrogen bond donors and acceptors available [1].

Gel electrophoresis

Gel electrophoresis is a method to separate DNA, RNA or even proteins by size and charge. It is used in biochemistry and molecular biology to separate a mixed population of DNA and RNA fragments by length, to estimate the size of DNA and RNA fragments or to separate proteins by charge.[2]

This is achieved by moving negatively charged nucleic acid molecules (like DNA) through an agarose matrix with an electric field (electrophoresis). The speed of molecular depends on various factors like: strength of the electrical field, buffer, density of agarose gel and so on, but most importantly is the size of DNA. Shorter molecules move faster and migrate farther than longer ones due to their mobility through pores in the gel. Within an agarose gel, linear DNA migrate inversely proportional to the log10 of their molecular weight.

Gel electrophoresis has many applications such like:

  • Estimation of the size of DNA molecules after using restriction enzyme digestion, the size could be estimated by comparing with some standard reference markers ( whose length is known ).

  • Analysis of PCR products, e.g. in molecular genetic diagnosis or genetic fingerprinting.

  • Separation of restricted genomic DNA by cutting down the block of gel contains specific sequence.

PCR

Polymerase Chain Reaction is a method to amplify a particular piece of DNA, it can make billions of copies of a target sequence of DNA in a few hours. PCR was invented in the 1984 as a way to make numerous copies of DNA fragments in the laboratory, this technique has been applied in enormous various applications and really enriched the field of molecular biology.

We can think PCR as an in vitro version of DNA replication in vivo. DNA replication is semi-conservative which means only one strand of the DNA is used as the template for the growth of a new DNA strand. We need the following major components to start with a PCR reaction:

  • DNA template: include regions of interest to be amplified.

  • DNA Polymerase: a type of enzyme that synthesizes new strands of DNA complementary to the target sequence and heat resistant.

  • two DNA primers that are complementary to the 3' (three prime) ends of each of the sense and anti-sense strands of the DNA target, at which DNA polymerase will start synthesizing.

  • dNTP: deoxynucleoside triphosphates, building blocks of newly synthesized DNA.

Besides all these materials, there are three main steps that finish the magic to replicate the desired sequence from one to millions:

  • Step 1: Denature DNA

    At 95°C, the DNA is denatured (i.e. the two strands are separated).

  • Step 2: Primers Anneal.

    At 40°C- 65°C, the primers anneal (or bind to) their complementary sequences on the single strands of DNA.

  • Step 3: DNA polymerase Extends the DNA chain

    At 72°C, DNA Polymerase extends the DNA chain by adding nucleotides to the 3’ ends of the primers.

Even though PCR is such an powerful tool, there are still some caveats we should notice and they may produce biases during many applications. Reviews about this topic can be found [3], [4], [5].

Last updated