| 
			MOLECULAR & CELLULAR 
			NEUROBIOLOGY  
			
			Master Course Cognitive Neuroscience - Radboud 
			University, Nijmegen | 
|  | 
| Chapter 5: Molecular biological research methodology | 
| Bioinformatics - data analysis | CRISPR-cas genome editing | 
| ChIP-chip/seq | 
|  | 
Molecular Biology
| Modern molecular biology has revolutionized research and our understanding of the molecular pathogenesis of disease in various branches of medical disciplines. In the past, the nomenclature of recombinant DNA and other techniques of molecular biology remained somewhat foreign to the practicing neuroscientist, in part because the techniques were only recently developed and their application to brain diseases became prominent only over the past decade. Here the historical development of the techniques of recombinant DNA and of molecular biology will be described. The unique features of these techniques over that of conventional scientific techniques will be discussed together with how they solve problems in a way that was previously not feasible. The historical perspective is intended to provide insight into why molecular techniques have blossomed and why they have an advantage over existing scientific techniques. It should be emphasized that the terminology and techniques are generic and are essentially the same regardless of the organ, organism or field of research to which they are applied. | 
| Modern molecular biology is almost synonymous with the development of recombinant DNA technology. Despite the fundamental discoveries in the 1950s and 1960s, application of these techniques did not emerge until the late 1970s and early 1980s. Miescher isolated DNA for the first time in 1869 (he noted there was a substance derived from cell nuclei which differed from protein, and which dissolved in alkali but not in water; he isolated the substance from salmon sperm and bandages from surgical wounds and called it “nuclein”), and in 1944 Avery provided evidence beyond doubt that DNA, rather than protein, is responsible for transferring genetic information during bacterial transformation. In 1953, Watson and Crick deduced the double helix structure for DNA, which was based on the results of X-ray diffraction by Franklin and Gosling, and Wilkins. The work of Watson, Crick, and Wilkins was rewarded with a Nobel Prize, the first of three Nobel Prizes rewarded for advances in the understanding of molecular genetics. Marmor, Lane, and Doty showed the double helix of DNA could be separated into single strands by high temperatures (denaturation) and reannealed (double stranded) with return to lower temperatures. This is a major property of DNA enabling many processes such as DNA amplification by the polymerase chain reaction (PCR). In 1964, Nirenberg and Matthaei, and Nishimura elucidated the genetic code by discovering the sequence of three DNA bases (the triplet codon) code for each amino acid in a protein, a discovery that irrefutably linked DNA as the molecule of life and resulted in another Nobel Prize in the field of modern genetics. In 1968, Olivera discovered DNA ligase, the enzyme used to join DNA fragments together, helping set the stage for recombinant DNA technology. However, the large size of the DNA molecule and its monotonous nature made it difficult to isolate and manipulate. The ability to isolate, manipulate, and clone DNA was, in large part, due to six further seminal contributions, as follows: (1) the discovery of restriction endonucleases; (2) the discovery of reverse transcriptase; (3) the cloning of DNA; (4) the ability to sequence the bases comprising a DNA fragment; (5) the ability to mutate specific DNA residues (altering specific amino acid codes) allowing for structure/function studies of proteins; and (6) the advent of PCR, which provided the tool to rapidly amplify exponentially selected fragments of DNA sequence (Table 1). | 
Table 1. Discoveries seminal to modern molecular biology
| 1889 | Isolation of DNA | 
| 1944 | DNA as hereditary material | 
| 1953 | DNA structure deduced | 
| 1964 | Genetic code deduced | 
| 1970 | Discovery of specific restriction endonuclease | 
| 1970 | Discovery of reverse transcriptase | 
| 1972/73 | Development of the cloning technique | 
| 1975/77 | DNA sequencing | 
| 1980 | Polymerase chain reaction | 
| 1982 | Site-directed mutagenesis | 
| 
 (1) To the molecular biologist, restriction endonucleases are what the scalpel is to the surgeon. They cut double-stranded DNA within the molecule (hence, endonucleases) at base pair sequences that are specific for each enzyme. The recognition sites for most enzymes are four to eight base pairs in length, with a few having recognition sites of only three base pairs, and even fewer recognize eight base pairs. The restriction endonucleases are isolated from bacteria where their normal function is as a defense mechanism, to digest foreign DNA, restricting it from being incorporated into the genome, and so are referred to as restriction endonucleases. It is possible to cut DNA into fragments of a desired and consistent size, knowing specifically where each cut is performed. The ability to cut DNA into fragments of specific length was absolutely essential to all of the recombinant techniques and more specifically for the development of cloning (see below). The existence of a DNA restriction endonuclease was first discovered by Linn in bacteria in 1962; however, it was not until later work, in 1970, that specific endonucleases were isolated and applied to molecular genetic techniques. 
 | 
| (2) The second contribution was the independent discovery of reverse transcriptase in 1970 by two laboratories, which made it possible to generate DNA complementary (cDNA) to messenger RNA (mRNA). The first stage in generating a protein from a gene requires transcribing nuclear DNA (transcription) into mRNA, which has the encoding sequence for the amino acids of the protein. Thus, isolation of an mRNA and conversion to cDNA provides one with a probe that will only bind with its complementary DNA (gene). The dogma for a long time was that DNA to RNA could not be reversed. The discovery of reverse transcription was worthy of more than a Nobel Prize for several reasons. RNA represents the expressed form of a DNA gene. Only the gene in its DNA form can be replicated such as with the cloning technique. The sequence of the DNA coding for protein is less than 1.5% of all DNA. Finding a gene is like finding a needle in a haystack. Thus, being able to convert the RNA into DNA sequences provided us with a new tool for identifying genes. The observation, that cDNA contains only the coding sequence, was all that was needed to clone or express the gene was a major revelation. The cDNA also provides a specific means to index its location on the chromosome, as well as a more stable nucleic acid structure (mRNA is easily degradable) for multiple applications, such as cloning. 
 | 
| (3) The third contribution was the birth of cloning. Cloning is a method to obtain multiple copies of a DNA fragment including a gene. In 1972, the first recombinant DNA molecule was generated at Stanford (CA, USA), and in 1973, the first foreign DNA fragment was inserted (recombined) into a plasmid (DNA vector). Plasmids are autonomously replicating DNA molecules, commonly present in bacteria, and capable of replicating in great numbers within a bacterium using the molecular machinery of the organism. This first recombined plasmid was successfully reinserted (the process of transformation) into a bacterium, which was grown in cultured media. The plasmid replicated providing millions of copies, and hence, the first successful cloning of a foreign DNA fragment. Thus, it was now possible to isolate mRNA known to code for a specific protein, and, with reverse transcriptase, convert it into a stable cDNA, which could then be recombined with a plasmid vector, transformed into a bacterium, and grown in culture, allowing for the cloning of large quantities of a specific DNA sequence or gene. 
 | 
| (4) The fourth contribution was made in 1977, when Sanger at Cambridge (UK) and Maxam and Gilbert at Harvard (USA) independently developed rapid nucleic acid sequencing techniques. These investigators were subsequently awarded a Nobel Prize. Thus, DNA of unknown sequence could now be cut into fragments of reasonable size, cloned into plasmid vectors, replicated into large quantities, and the specific DNA sequence determined. 
 | 
| 
 | 
| (6) The development of the PCR to amplify selected DNA or RNA fragments to several million copies instead of the need to clone provided the final tool for modern molecular biology. While cloning was a major breakthrough, PCR provides a more rapid and robust means to obtain millions of copies of a specific DNA molecule within hours. This discovery in 1985, also awarded with a Nobel Prize, is a major discovery that has markedly facilitated advances over the last decade in the study of human genetic diseases. The technique of PCR is the foundation of almost all investigations in modern molecular genetics. Although many applications exist, as will be described later, PCR was first utilized to detect the genome of pathogens responsible for human disease. For example, myocardial biopsies are obtained routinely in patients suspected of cardiomyopathy where the diagnosis is not evident and can be analyzed by PCR for the responsible pathogen. In essence, the presence of only one or two copies of RNA of DNA in a cell, which cannot be detected by conventional techniques, can be amplified by PCR to several million copies, putting particles such as viral RNA within the threshold of detection for conventional techniques. In addition to the tremendous advances modern molecular genetic techniques have provided in the study of human health and disease, such techniques have also proven invaluable in therapeutic drug development. For example, the first cardiac drug made by recombinant DNA techniques was recombinant tissue plasminogen activator (rt-PA) in 1983, which revolutionized the therapy of acute myocardial infarction. The development of this therapeutic agent serves to illustrate the value of the techniques of recombinant DNA and molecular biology. A cDNA containing all of the coding regions of the TPA gene was mass produced in bacterial and mammalian cell-culture systems. Site-directed mutagenesis of the cDNA gene and its expression led to the identification of its functions, namely lytic activity, fibrin affinity, and fibrin-dependent enhanced lytic activity. Five domains were recognized to have specific functions that are coded by separate and autonomous portions (exons) of the gene. Hundreds of drugs have been generated by recombinant DNA technology. | 
Recombinant DNA technology
| Recombinant DNA (rDNA) has various definitions, ranging from very simple to strangely complex. The following are three examples of how recombinant DNA is defined: (i) A DNA molecule containing DNA originating from two or more sources; (ii) DNA that has been artificially created. It is DNA from two or more sources that is incorporated into a single recombinant molecule; (iii) According to the National Institutes of Health (NIH) guidelines, recombinant DNA are molecules constructed outside of living cells by joining natural or synthetic DNA segments to DNA molecules that can replicate in a living cell, or molecules that result from their replication. When we say that something is "cloned" we mean that it grew from a single genetic origin. Molecular cloning (i.e. gene cloning) involves creating recombinant DNA and introducing it into a host cell to be replicated. One of the basic strategies of molecular cloning is to move desired genes from a large, complex genome to a small, simple one. In 1974 the National Institutes of Health (NIH, USA) took the lead in regulating research with recombinant DNA. Important tools for recombinant DNA technology include bacteria, viruses, plasmids, viral vectors and restriction enzymes. 
 | 
| Bacteria and viruses | 
| A bacterium is a prokaryotic cell with its own circular DNA. A virus is a disease-causing agent consisting of a nucleic acid molecule and protein coat. Viruses are incapable of autonomous replication and have to use a host cell's translational system. A bacteriophage is a virus that infects bacteria only. Viruses would appear to be the simplest infectious particle. The discovery of viroids, nucleic acid without a protein capsule and prions, infectious proteins, subtracts another level of complexity. Both viroids and prions can cause diseases. 
 | 
| Plasmids and viral vectors | 
| Plasmids are small, circular, extrachromosomal DNA molecules found in bacteria, which can replicate on their own, outside of a host cell. They have a cloning limit of 100 to 10,000 base pairs or 0.1-10 kilobases (kb). A plasmid vector is made from natural plasmids by removing unnecessary segments and adding essential sequences. Plasmids make excellent cloning vectors for various laboratory techniques, including recombinant DNA. A viral vector is a virus that carries a modified or foreign gene. They are commonly used in gene therapy where the viral vector delivers the desired gene to a target cell. Some of the viruses used as vectors in gene therapy include retroviruses, adenoviruses, parvoviruses, herpesviruses and Poxviruses. Plasmids typically have three important elements: 
 A viral vector is a virus that carries a modified or foreign gene. They are commonly used in gene therapy where the viral vector delivers the desired gene to a target cell. Some of the viruses used as vectors in gene therapy include: 
 | 
| Restriction enzymes | 
| As mentioned above, restriction enzymes, also known as restriction endonucleases, recognize and cut specific sequences in double-stranded DNA. Restriction enzymes are made to protect the bacteria from foreign DNA. Bacteria have a method of marking their own DNA as being "self". Any DNA not recognized as self is digested into smaller pieces by the restriction enzymes. Restriction enzymes search for exact sequences of a defined length. Some enzymes recognize sequences 4 bp long (e.g., GTAC), some 6 (e.g., GAATTC), and still others 8 or more. One of the common features of most enzyme recognition sites is that they are palindromes. A palindrome is a sequence which is read the same on both strands in the 5' --> 3' direction (see example). The example shows the recognition sequence for Eco RI. The sequence thus reads the same from either direction. The arrows are the points at which the enzyme actually causes breaks in the backbone of the DNA: 
 Click Cutting and pasting DNA for an animation. Click Construction of recombinant DNA for a movie. | 
| 
 | 
|  | 
| Next page: Techniques used in Molecular Biology | Go back to: Molecular networks | 
|  |