Genetics

MOLECULAR GENETICS

The transmission rules of genetic material are of fundamental importance to the theory of natural selection and to all components of population genetics. It is essential that you have a solid background in the basics of molecular and mendelian genetics.

DNA is the genetic material. A phosphate sugar backbone with purine or pyrimidine bases bound to the sugar moiety. In the double helix structure of DNA, Adenine pairs with Thymine and Cytosine pairs with Guanine. A:T or C:G pairs are held together with hydrogen bonds, the C:G pairing being stronger than the A:T pairing.

The eukaryotic Chromosomes are made up of chromatin which is about half protein (histones) and half DNA. The DNA is coiled around the histones in a nucleosome structure, and these strings of nucleosomes are then coiled upon themselves making the hierarchical packaging of the genetic material very space-efficient. The prokaryotic "chromosome" (usually one per cell) does not have the tight packaging of eukaryotic chromosomes, but is associated with various proteins.

The Central Dogma describes the general view that information transfer in genetics is unidirectional from DNA to RNA to protein, and has come to refer to the general mechanisms by which this information is retrieved. Transcription is the polymerization of a strand of RNA from DNA by the enzyme RNA polymerase. Translation is the various mechanisms by which the sequence of nucleotides in the RNA is translated into a polypeptide and requires transfer RNAs, the messenger RNA, ribosomes, amino acids (among other things; see Table 2.1, Figure 2.3 pg. 25-26).

The original view of the central dogma was One gene=>One protein (one gene codes for the production, through transcription and translation, of one protein). This has been modified to the view of one gene => one polypeptide since some genes code for parts of a protein; and further defined as one gene => one function, since some genes code for RNA products (ribosomal RNA, transfer RNA) which clearly are not polypeptides. Protein genes can be structural (e.g., collagen in your skin); catalytic (e.g., amylase enzyme in your saliva). How would you classify hemoglobin?? RNA genes can also be structural (ribosomal RNA), catalytic (RNase P acts as a cleavage enzyme).

Gene structure in prokaryotes often takes the form of an operon which is a set of adjacent structural and regulatory genes. The coding regions (= genes) are uninterrupted open reading frames of DNA that are transcribed as one RNA and translated into distinct polypeptides. The adjacent regulatory regions can alter the expression of the genes in response to specific signals from the cell.

Gene structure in eukaryotes is quite different , most notably in that the coding regions are often broken up into exons (expressed) and introns (intervening sequences). Other specific sequences in the DNA can serve as promoter elements to stimulate transcription. The intron/exon structure of eukaryotic genes means that after transcription into RNA (called the pre-messenger RNA), the intron sequences in the RNA must be removed and the exon sequence spliced back together. After splicing, the RNA is called the messenger RNA (or "mature" messenger RNA) and is transported out of the cell nucleus into the cytoplasm where it will be transcribed into protein.

The translation process goes on at the ribosomes and woks according to the rules of the genetic code. With 20 amino acids found in proteins and only 4 nucleotides (A,C,T,G), the DNA must be read in blocks of 3 nucleotides to provide enough information to translate DNA into polypeptides (4¹ only = 4 ; 4² = 16 [still 4 amino acids short]; 4³ = 64 [more than needed]). The very important result of the fact the 43 = 64 means the genetic code is a redundant code, i.e., some three-nucleotide "words" will translate into the same amino acid (see table 2.1 page 26). This means that some nucleotide positions in the coding region of a gene will be silent with respect to the amino acid sequence that gets translated from that DNA -> RNA. "Silent" means that if a mutation occured at such a position and changed the nucleotide, the resulting amino acid sequence would still be the same (e.g., see the third position for Leu in the table). Other sites will affect the amino acid sequence if mutated (see second positions for all codons in the code). We can thus refer to two general types of nucleotide positions in the DNA of coding regions silent or synonymous sites (if mutated, no amino acid change) and replacement or nonsynonymous sites (if mutated, a different amino acid sequence will result).

If we tabulate all possible changes in a theoretical "random" coding sequence, 61 codons could code for amino acids (3 codons of the 64 total code for stop codons) with 3 nucleotides per codon = 183 nucleotides. Each of these 183 different types of positions could change 3 ways (A could mutate to C, G or T; C could mutate to A, G or T, etc.) for a total of 549 possible types of changes in a random coding sequence. If we now look at the genetic code and ask whether each of these possible changes is a silent or a replacement site, the following is obtained: 1st position: 4% silent, 96% replacement; 2nd position: 0% silent, 100% replacement; 3rd position: 69% silent, 31% replacement. This means that mutation at 3rd positions in codons will be much less likely to affect protein sequence and hence much less likely to be subject to natural selection (more later on Molecular Evolution).

The genome is the total genetic complement of the cell. In humans this is about 3 billion nucleotides in a haploid genome (sperm or egg cell). Given a rough estimate of 100,000 genes per genome and 1000 base pairs (bp) per gene, this adds up to 10⁵ x 10³ = 10⁸ , or only 10% of our genome!! What the hell is all the rest of it for?? Good question (more late on Molecular Evolution), but it appears to be spacer DNA and repetitive DNA.

MENDELIAN GENETICS

Adult human (mice, fruit flies, etc.) are diploid organisms having received one set of chromosomes from the mother and one from the father. Humans have 46 chromosomes in 23 pairs; each pair is made up of one maternal and one paternal chromosome. The sex chromosomes are a "pair" in females (two X chromosomes), but in males the X and Y sex chromosomes are very different in size. The other 22 pairs of chromosomes are autosomes.

The chromosomes need not segregate as a "top" and "bottom" set together because there is independent assortment of chromosomes during meiosis making many different possible combinations. Note that in the lower gamete, there is no F locus. This is the case in males when there is no section of the Y that is homologous to the X.

Crosses between genotypes are best diagrammed in a Punnet square. These are made by putting all possible types of gametes of one parent along one side of the square and all possible types of gametes of the other parent along the other side. The boxes are filled in by pairing gametes together to make genotypes. In a cross of two AA homozygotes, all offspring are identical. In a cross between two heterozygous Aa x Aa, each parent can produce two possible gametes: A or a. In the cross the genotypes come out as 1:2:1 of AA:Aa:aa. In a two locus situation where the loci are on separate chromosomes (unlinked) and each locus is heterozygous, each adult can produce 4 different gametes: AB, Ab, aB and ab. These four gametes go along each edge of a 4x4 Punnet square resulting in 16 possible 2-locus genotypes: AABB, AABb. AaBB, etc (figure 2.6, pg. 32). How would the Punnet square be different if the two parents had these genotypes: AABb x AaBB? Work it out!

An important issue for Darwin was the mode of transmission of traits. The Lamarkian view of the "inheritance of acquired characteristics" was workable under the Darwinian view, but was wrong. Before the rediscovery of Mendel's work at the turn of the century, blending inheritance was accepted mainly because offspring tended to look like a mixture of parents. But note that if blending inheritance was the mechanism, then all individuals in the population would eventually come to look alike and there would be no variation upon which to select! and Natural Selection would not operate. This troubled Darwin to his death. Unfortunately Darwin never found out about Mendel's experiments even though they were published long before Darwin's death. Mendel's proposal of a "particulate" mechanism of inheritance solved the problem but the history of Darwinism would have been interestingly different if Darwin had understood genetics before he died. A simple example of a cross between a red (RR) and a white (rr) flower illustrates the point: RRxrr gives all Rr (lets say they are pink). Now a cross of two pink flowers: RrxRr gives back the parental types 1:2:1 RR:Rr:rr. The blending is a function of gene expression, not gene inheritance (figures 2.8, 2.9, pg. 35, 37).