Sunday 29 March 2015

Amplification and PCR: the concepts and the methodology


The final lab sessions of this term, that form the core of the UTC’s skills passport, involves the technique called simply, PCR: the Polymerase Chain Reaction. It is arguably the most important laboratory method to come out of Molecular Biology apart from nucleotide sequencing, although I am sure some would disagree! Nevertheless, since the concept was proposed by the distinguished Biochemist, Arthur Kornberg and subsequently developed into the practical technique we know today by Kary Mullis, it has become invaluable not only to Life Science Researchers, but to the Police, Medics, Archaeologists, Historians and Lawyers! Before I discuss the methodology in detail, there are several generic principles that underpin the PCR method, and the technique of PCR provides me with an opportunity to discuss them. The first is the phenomenon that is perhaps best described in large scale data analysis using computational methods: "garbage in garbage out", a phrase attributed in more eloquent form to the computational pioneer Charles Babbage:

On two occasions I have been asked, "Pray, Mr. Babbage, if you put into the machine wrong figures, will the right answers come out?" ... I am not able rightly to apprehend the kind of confusion of ideas that could provoke such a question.
Image result for tooth extractionPersonally, I don't believe that enough experimental scientists appreciate the significance of sample extraction and preparation. After all, we place a great deal of trust in our dentist if an extraction is required! This is largely an experiential view and one that I was forced to confront when I first had to come to terms with transitioning basic to applied research in collaboration with a number of commercial organisations. The robustness of a method, such as ion exchange chromatography, electron microscopy, NMR spectroscopy etc., all stand or fall on the quality and quantity of the input sample. It doesn't matter how sophisticated your instrument is, if the sample under investigation does not meet a certain level of purity, or is present at too low a concentration, or has been prepared at the wrong pH, or salt concentration etc., you may as well not bother! As Arthur Kornberg himself commented: "Why waste pure thoughts on impure proteins!"

If part of your business is the sale of DNA purification kits, then you will not only sell on price, but on the simplicity, efficiency and reproducibility of your particular kit in producing samples fit for purpose in the downstream process. In most research labs, a method is often developed by an individual, which may subsequently becomes a key part of that particular laboratory's repertoire (often for several years). This process of method development and dissemination to the wider scientific community has been a pillar of experimental science in Universities and Research Institutes for many years (this is reflected in the many journals and books dedicated to experimental methodology). However, in our high-throughput, data-hungry world, the importance of personal experimental failure and improvement in the laboratory, has largely been sacrificed in the oncoming juggernaut which is our thirst for answers. Clearly, the current search for an effective Ebola vaccine, in the same way that Jonas Salk raced to deliver his polio vaccine, against a background of escalating human tragedy, focuses the mind in an important way. In many ways, the two competing forces: delivery of solutions and the provision of high quality scientific training are complementary: a good scientist knows when to slow down and take care at the bench, and when to buy an off the peg solution to move a project along. What I would advocate is that greater importance is placed on teaching robust sample preparation methods, as a means of mitigating some of the trouble-shooting shortcomings of many young scientists, in order that they are capable of making informed decisions in their work. The quality and quantity of the input sample in PCR (usually referred to as the "template" is critical for success.
Moving on, amplification and magnification are generally associated with electronics and optics respectively. However, they both have the same general meaning. Consider a singer in a small room ( a typical classroom for example): the singer would normally be audible to everyone in that room. As the size of the room increases, for example a small assembly hall or a large theatre, the singer becomes inaudible, especially at the back of the room. Clearly this phenomenon limited the size of many concert venues, and at the same time stimulated architects to incorporate acoustic criteria into their designs. Nevertheless, there is always a point reached where the singer becomes inaudible to even someone with the keenest of ears! This not only stimulated architects, but also electronic engineers, and thus was born, the microphone: a device for capturing sound waves and converting them into an electrical current. Similarly an amplifier is an electronic device that increases the power of a signal, such as "amplifying" the normal level of sound from a solid body electric guitar. As a result of these two developments, it has become commonplace to hold concerts in football stadia, with suitable equipment. In an analogous way, it was impossible to observe single bacterial cells and certainly not viruses, until the development of optical and electron microscopes respectively. In the former, usually through a combination of optical lenses, the image of a small object can be obtained and otherwise "invisible" objects appear as "virtual" images. Electron microscopes use electrons as the source of illumination and can produce images around 5 000 times greater in magnification, owing to the difference in resolving power resulting in the use of electrons. PCR is similarly a method that does for DNA samples what the electron microscope does for imaging, or the modern microphone does for sound. So let's consider how PCR works.

The objective of (the) PCR is to amplify a specific sequence or set of sequences from a vanishingly small sample of DNA (it could be a research sample or a scene of crime swab). The products of this amplification are called amplicons, and the sample of DNA is referred to as the template. The components of PCRs are pretty logical: a DNA Polymerase enzyme to replicate the DNA, a mixture of dATP, dGTP, dCTP and dTTP (collectively called dNTPs); the building blocks of DNA. All polymerases usually require magnesium ions and a suitable buffer. The missing ingredients are the "primers". These are short (usually between 18-50 nucleotides in length) sequences of DNA; they must be complementary to the ends of the amplicon, following the standard Watson-Crick base pairing rules (A pairs with T and G with C). It is important to appreciate that the two strands of the double helix are anti-parallel. This is illustrated in the top RHS figure. We refer to the direction of the strand as 5'-3' (spoken 5-prime to 3-prime), this is reflected in the orientation of the sugar phosphates that form the backbone of the DNA molecule.

The key to amplification lies in a series of repeated denaturation steps, or cycles during which the Watson-Crick base pairs are "broken", allowing the primers to find their complementary sites. Denaturation is achieved by heating the reaction; the primers (which are in considerable excess over the template DNA) then anneal to form the substrate binding site for the polymerase enzyme. The dNTPs are then incorporated as the polymerase copies each strand in the 5'-3' direction. DNA replication proceeds only in one direction, each double helix that is copied is analogous to two railway lines (say from Liverpool to London): just as the train keeps to one set of tracks, so to do the DNA polymerase molecules. During replication in vivo, this unidirectionality causes topological challenges for the genome in the cell, but this will be the subject of another post.

By heating and cooling around 20-30 times, the amplification proceeds as 1 duplex becomes 2, 2 become 4, 4 become 8 etc. How many cycles are needed to achieve 1 000 000 fold amplification? This is made possible by the thermostability of the polymerase enzyme. Taq polymerase (LHS) was the first commercial enzyme used in PCR. Its technical suitability was accompanied by considerable financial success! The patents surrounding PCR (including the instrumentation) have not only been lucrative, but also highly controversial. Today, there are many polymerases to choose from, some are more accurate than others (higher fidelity), some are more robust, some are better for long amplicons etc. Depending on the application of PCR, there are many choices of kits and enzymes now available. We shall be working through the logistics of planning a PCR experiment prior to carrying out some amplifications in the lab after Easter.

Thursday 12 March 2015

From peas to drugs: What to expect from the new Genetic Engineering Unit

On Monday we begin the Genetics and Genetic Engineering Unit for Y13 BTEC students in the innovation labs. The unit comprises two halves: classical genetics as defined conceptually by Gregor Mendel using Pisum sativum (left) as his model organism. This was later given a physical basis by a number of scientists scientists in the USA, including, Morgan, Beadle and Tatum and others, and in molecular terms in the post war years (WWII) by Watson, Crick and Brenner to name but a few illustrious players! There is a nice timeline here. By the end of the 1960s, scientists knew that Mendel's ideas on inheritance could be explained by the genotype as conveyed by DNA, which passed on the phenotype through RNA and proteins and which was replicated by the process of semi-conservative replication. The latter discovery was a result of key experimental work by Arthur Kornberg and Matt Meselson and Frank Stahl. You can read all about 
these landmark experiments by following the links attached to their names. The flow of information from genotype to phenotype is often referred to as the central dogma of Molecular Biology. You must familiarise yourselves with this concept and the molecular basis of it.

The key to understanding Darwinian evolution, is an appreciation of the  intrinsically low, but finite levels of mutation that accompany cell division. The replication machinery of human cells is extremely accurate (we refer to this as high fidelity), but is not perfect. In addition mechanisms exist for genetic exchange as chromosomes "cross over", that is, closely related DNA sequences often swap over as chromosomes prepare for separation during meiosis in sexually reproducing organisms. Finally, there are a group of genetic elements that jump around the genome and insert, semi randomly: these are referred to as transposable elements. Again, these genetic elements can facilitate genetic diversity, as first discovered by Barbara McClintock in maize (top RHS). You will be given a grounding in the key concepts of genetics by Mr. Houseman and his team during the Unit.

As the fundamental knowledge of genetics shifted towards more molecular ideas during the 1950s and 1960s, several bacterial geneticists exploring the mechanisms of plasmid replication and eukaryotic genes, including Stanley Cohen, Herb Boyer and Paul Berg in California, together with a Swiss microbiologist, Werner Arber, a giant of a New Yorker, Hamilton (Ham) Smith and Dan Nathans on the East Coast of the USA, brought their knowledge of restriction enzymes to facilitate the construction of the first organism (Escherichia coli) harbouring foreign DNA. Thus was born molecular cloning, recombinant DNA technology, or more simply, Genetic Engineering. They managed to obtain genetic material from one organism, splice it into a "cloning vector" (a plasmid) and transform E. coli, thereby changing its genotype and its phenotype. These landmark experiments not only earned most of the scientists the Nobel Prize, but paved the way for the first Genetic Engineering company: Genentech (Genetic Engineering Technology) which remains one of the world's most successful drug company's (now as part of Roche), using genetic engineering technology to produce new medicines.

Mendel's ideas are at the heart of all contemporary Genetic Engineering. The difference between Genetics and Genetic Engineering is simply that in the latter, we are attempting to create new combinations of genes and genetic elements in order to produce a novel organism or gene product (either a protein or an RNA). Since the first recombinant DNA molecules were described in the 1970s, there have been many improvements in the strategies and methods for cloning genes. However, despite these modification, the basic principles remain the same. The introduction of the Polymerase Chain Reaction in the 1980s revolutionised Molecular cloning, so much so that it is probably one of the most common techniques in any research or diagnostic laboratory today. It is also one of the few techniques in Life Sciences that crosses the disciplinary boundaries. I have seen PCR instruments in Chemistry, Physics and Archaeology Departments!

During the first session in the lab we shall use agarose gel electrophoresis combined with a set of restriction enzymes to analyse the genome of the simple bacteriophage lambda. This will be combined with your experience from last year in basic bioinformatics to gain an appreciation of the experimental basis of Genetic Engineering and to compare the information deposited in the NCBI data bases with experimentally obtained "restriction mapping". Thanks to New England Biolabs, we not only have access to a selection of restriction enzymes, but we shall use their online suite of software tools to explore the results further. The experiments will be covered in more detail in microblog here, over the next few weeks.

The second phase of work will involve an introduction to the elements of PCR. This will include an explanation of the exponential phase of amplification together with the principles of primer design. From past experience, everyone will get this wrong, before you get it right! PCR relies on the availability of heat stable (thermostable) DNA Polymerases, and the application of the complementary base pairing described so elegantly by Watson and Crick over 60 years ago! You should refresh your knowledge of the molecular properties of DNA here. This will also be covered in class. I am most grateful to Bioline for their support of the laboratory work, they have provided us with samples of their Taq polymerase "master-mixes" in order that we can demonstrate PCR using primers from my own research lab. I am really looking forward to the unit: it is a close second favourite to Biochemistry!  The combination of Biochemistry and Genetics, or Molecular Biology, and its application in the form of Genetic Engineering, or as it is often now called Synthetic Biology is widely perceived to be one of the most vibrant areas in the Life Sciences today.