Deoxyribonucleic acid (DNA) is a nucleic acid usually in the form of a double helix that contains the genetic instructions specifying the biological development of all cellular forms of life, and most viruses. DNA is a long polymer of nucleotides and encodes the sequence of the amino acid residues in proteins using the genetic code, a triplet code of nucleotides.

In complex eukaryotic cells such as those from plants, animals, fungi and protists, most of the DNA is located in the cell nucleus. By contrast, in simpler cells called prokaryotes, including the eubacteria and archaea, DNA is not separated from the cytoplasm by a nuclear envelope. The cellular organelles known as chloroplasts and mitochondria also carry DNA.

DNA is often referred to as the molecule of heredity as it is responsible for the genetic propagation of most inherited traits. In humans, these traits can range from hair colour to disease susceptibility. During cell division, DNA is replicated and can be transmitted to offspring during reproduction. Lineage studies can be done based on the facts that the mitochondrial DNA only comes from the mother, and the male Y chromosome only comes from the father.

Every person's DNA, their genome, is inherited from both parents. The mother's mitochondrial DNA together with twenty-three chromosomes from each parent combine to form the genome of a zygote, the fertilized egg. As a result, with certain exceptions such as red blood cells, most human cells contain 23 pairs of chromosomes, together with mitochondrial DNA inherited from the mother.


Contrary to a common misconception, the DNA is not a single molecule, but rather a pair of molecules joined by hydrogen bonds: it is organized as two complementary strands, head-to-toe, with the hydrogen bonds between them. Each strand of DNA is a chain of chemical "building blocks", called nucleotides, of which there are four types: adenine (abbreviated A), cytosine (C), guanine (G) and thymine (T). In some organisms, most notably the PBS1 phage, Uracil (U) replaces T in the organism's DNA. These allowable base components of nucleic acids can be polymerized in any order giving the molecules a high degree of uniqueness.

Between the two strands, each base can only "pair up" with one single predetermined other base: A+T, T+A, C+G and G+C are the only possible combinations; that is, an "A" on one strand of double-stranded DNA will "mate" properly only with a "T" on the other, complementary strand; therefore, naming the bases on the conventionally chosen side of the strand is enough to describe the entire double-strand sequence. Two nucleotides paired together are called a base pair. On rare occasions, wrong pairing can happen, when thymine goes into its enol form or cytosine goes into its imino form. The double-stranded structure of DNA provides a simple mechanism for DNA replication: the DNA double strand is first "unzipped" down the middle, and the "other half" of each new single strand is recreated by exposing each half to a mixture of the four bases. An enzyme makes a new strand by finding the correct base in the mixture and pairing it with the original strand. In this way, the base on the old strand dictates which base will be on the new strand, and the cell ends up with an extra copy of its DNA.

DNA contains the genetic information that is inherited by the offspring of an organism; this information is determined by the sequence of base pairs along its length. A strand of DNA contains genes, areas that regulate genes, and areas that either have no function, or a function yet unknown. Genes can be loosely viewed as the organism's "cookbook" or "blueprint".

Other interesting points:
* DNA is an acid because of the phosphate groups between each deoxyribose. This the primary reason why DNA has a negative charge.
* The "polarity" of each pair is important: A+T is not the same as T+A, just as C+G is not the same as G+C (note that "polarity" as such is never used in this context -- it's just a suggestive way to get the idea across)
* Mutations are chemical imperfections in this process, where a base is accidentally skipped, inserted, or incorrectly copied, or the chain is trimmed, or added to; many basic mutations can be described as combinations of these accidental "operations". Mutations can also occur through chemical damage (through mutagens), light (UV damage), or through other more complicated gene swapping events.
* DNA molecules that act as enzymes are known in laboratories, but none have been known to be found in life so far.
* In addition to the traditionally viewed duplex form of DNA, DNA can also acquire triplex and quadruplex forms. Here instead of the Watson Crick base pairing, Hoogsten base pairing comes into picture.
* DNA differs from ribonucleic acid (RNA) by having a sugar 2-deoxyribose instead of ribose in its backbone. This is the basic chemical distinction between RNA and DNA. In addition, in RNA, the nucleotides T are replaced by U.

Molecular Structure

Although sometimes called "the molecule of heredity", DNA macromolecules as people typically think of them are not single molecules. Rather, they are pairs of molecules, which entwine like vines to form a double helix.

The general structure of a section of DNA

Each vine-like molecule is a strand of DNA: a chemically linked chain of nucleotides, each of which consists of a sugar (deoxyribose), a phosphate and one of five kinds of nucleobases ("bases"). Because DNA strands are composed of these nucleotide subunits, they are polymers.

The diversity of the bases means that there are five kinds of nucleotides, which are commonly referred to by the identity of their bases. These are adenine (A), thymine (T), uracil (U), cytosine (C), and guanine (G). U is rarely found in DNA except as a result of chemical degradation of C, but in some viruses, notably PBS1 phage DNA, U completely replaces the usual T in its DNA. Similarly, RNA usually contains U in place of T, but in certain RNAs such as transfer RNA, T is always found in some positions. Thus, the only true difference between DNA and RNA is the sugar, 2-deoxyribose in DNA and ribose in RNA.

In a DNA double helix, two polynucleotide strands can associate through the hydrophobic effect and pi stacking. Specificity of which strands stay associated is determined by complementary pairing. Each base forms hydrogen bonds readily to only one other - A to T and C to G - so that the identity of the base on one strand dictates the strength of the association; the more complementary bases exist, the stronger and longer-lasting the association.

The cell's machinery is capable of melting or disassociating a DNA double helix, and using each DNA strand as a template for synthesizing a new strand which is nearly identical to the previous strand. Errors that occur in the synthesis are known as mutations. The process known as PCR (polymerase chain reaction) mimics this process in vitro in a nonliving system.

Because pairing causes the nucleotide bases to face the helical axis, the sugar and phosphate groups of the nucleotides run along the outside; the two chains they form are sometimes called the "backbones" of the helix. In fact, it is chemical bonds between the phosphates and the sugars that link one nucleotide to the next in the DNA strand.

Strand Direction

The asymmetric shape and linkage of nucleotides means that a DNA strand always has a discernible orientation or directionality. Because of this directionality, close inspection of a double helix reveals that nucleotides are heading one way along one strand (the "ascending strand"), and the other way along the other strand (the "descending strand"). This arrangement of the strands is called antiparallel.

Chemical nomenclature (5' and 3')

For reasons of chemical nomenclature, people who work with DNA refer to the asymmetric ends of ("five prime" and "three prime"). Within a cell, the enzymes that perform replication and transcription read DNA in the "3' to 5' direction", while the enzymes that perform translation read in the opposite directions (on RNA). However, because chemically produced DNA is synthesized and manipulated in the opposite or in non-directional manners, the orientation should not be assumed. In a vertically oriented double helix, the 3' strand is said to be ascending while the 5' strand is said to be descending.

Sense and antisense

As a result of their antiparallel arrangement and the sequence-reading preferences of enzymes, even if both strands carried identical instead of complementary sequences, cells could properly translate only one of them. The other strand a cell can only read backwards. Molecular biologists call a sequence "sense" if it is translated or translatable, and they call its complement "antisense". It follows then, somewhat paradoxically, that the template for transcription is the antisense strand. The resulting transcript is an RNA replica of the sense strand and is itself sense.

Distinction between sense and antisense strands

A small proportion of genes in prokaryotes, and more in plasmids and viruses, blur the distinction made above between sense and antisense strands. Certain sequences of their genomes do double duty, encoding one protein when read 5' to 3' along one strand, and a second protein when read in the opposite direction (still 5' to 3') along the other strand. As a result, the genomes of these viruses are unusually compact for the number of genes they contain, which biologists view as an adaptation. This merely confirms that there is no biological distinction between the two strands of the double helix. Typically each strand of a DNA double helix will act as sense and antisense in different regions.

As viewed by topologists

Topologists like to note that the juxtaposition of the 3' end of one DNA strand beside the 5' end of the other at both ends of a double-helical segment makes the arrangement a "crab canon".


Go to Start | This article uses material from the Wikipedia