Team:Heidelberg/pages/PCR 2.0
From 2014.igem.org
Contents |
Introduction
Motivation
The invention of the polymerase chain reaction in 1983 by Kary Mulliy revolutionized the modern world by enabling amplification of DNA in an exponential manner. Further improvements including the use of thermo-stable DNA-polymerases made the method even more efficient, allowing its widespread use in nearly every field of modern diagnostics and research. However, a major part of information is lost when using conventional PCR due to missing transfer of DNA modifications. The most abundant DNA modification is methylation, which is present in all kingdoms of life where it acts as prominent regulator of gene expression. Such modifications that influence the DNA function without changing its actual sequence are studied in the field of epigenetics (greek: epi/επί-over, above, around).
To empower epigenetics, we propose the PCR 2.0 as an easy and efficient way to amplify DNA templates in an exponential manner while maintaining their specific methylation pattern. The pivotal element of this PCR reaction is the use of a heat-stable DNA methyltransferase (DNMT). Although comparable enzymes exist in thermophile organisms [1] until now no suitable protein has been found or synthesized that withstands the harsh conditions of a PCR.
Even though the detection and mapping of methylation patterns has become feasible in a high-throughput manner by using bisulfite sequencing and array techniques [2], further functional analysis is still hindered by the small amount of primary material. Our approach therefore aims to provide unlimited amounts of any methylated DNA sample of interest, without having to know all of its properties or the need of expensive methods and de-novo synthesis – just by running a PCR.
To achieve this goal of a PCR2.0 that can be used to amplifiy DNA with its according methylation pattern, the main aim was the generation of a heat stable Dnmt1 that can be produced in a large scale and shows efficient and specific methylation of DNA. Our approach applies protein circularization by using self-splicing protein sequences, socalled inteins from our toolbox as well as our linker design software. Since the restriction of conformational changes through intramolecular bonds has been known to increase the stability of proteins [3] [4] and joining of C and N terminus were reported to increases in thermostability of smaller peptides [5] we started the circularization of the so far biggest protein…
Epigenetics - there is more than just A, T, C and G…
In mammals and other vertebrates the methylation of cytosine nucleotides that are followed by guanines (CpG) is a prominent key regulator of transcription, embryonic development, X chromosome inactivation and many other cellular functions. [6] [7]. Approximately 1% of the genetic code in human somatic cells is consisting of methylated cytosines, amounting to 70-80% of all present CpG Dinucleotides [8]. Due to this great prevalence and the heredity of this modification, the 5-methylcytosine (5mC) is also known as the “fifth base” of eukaryotic genomes [9]. DNA methylation is preferentially occurring at intergenic regions and repetitive sequences, where it is known to silence gene expression [10] [11]. As against that, GC-rich promoter regions are usually characterized by a low methylation status, allowing transcription of the associated genes [12]. Deregulation of cytosine methylation has been reported to play a crucial role in the development of numerous diseases, including cancer, imprinting diseases and repeat-instability based diseases such as morbus huntington [13].
A closer look at DNA Methyltransferases
The family of enzymes called DNA methyltransferases (DNMTs) is responsible for the establishment and maintenance of most cell-type specific DNA methylation patterns. Whereas the two enzymes DNMT3a and DNMT3b contribute to de novo methylation of DNA during development, DNMT1 is responsible to preserve existing methylation patterns throughout cell divisions. Therefore DNMT1 exploits the principle of semiconservative replication, using the parental strand as a template to create an exact copy of methylation patterns. Subsequent to recognition of a so called hemi-methylated CpG site, the enzymatic machinery transferes a methyl group from the methyl donor S-Adenosyl-methionine (SAM) to the C5 position of the complementary cytosine residue. [14]
For our experiments we used the smallest truncated version of DNMT1 that has been reported to efficiently mediate methylation maintenance. This truncated version derives from Mus musculus and comprises amino acids 731-1602 (mDNMT1(731-1602) [15] [19] kindly provided by Dr. Bashtrykov after approval from Prof. Patel). mDNMT1(731-1602) contains the C-terminal catalytic methyltransferasedomain as well as the aligning bromo-adjacent homology (BAH) domains, which prevent the binding of unmethylated DNA in the catalytic core. The enzyme is missing the CXXC domain, which has a high affinity to CpG hemi-methylated dinucleotides, but was shown to be dispensable for protein function [16]. In contrast to the full-length DNMT1 that can only be expressed in insect cells using a baculovirus based system, the mDNMT1(731-1602) has the great advantage of being efficiently expressed in E.coli. Another advantage of the truncated DNMT1 is, that the N and C terminal ends of DNMT1 (731-1602) are closer together than in the full-lentgh DNMT1. Thus cirularization might cause less deformation and show lower impact on the overall activity, making this truncated mDNMT1 an ideal target for our purposes.
Circularization - The missing link
Heat stabilization of mDNMT1 as a pivotal element of our methylation-maintaining PCR2.0 was approached by circularization of the protein. Until now, only smaller proteins have been stabilized using this method. Therefore, to our knowledge, mDNMT1 – even in its shortest truncated version of 871 amino acids and a molecular weight of approximately 100kDa – is the largest protein that has ever been tried to circularize. According to the crystal structure, the distance between the termini of the truncated DNMT1 is 48 Å. We suspected that circularization of the protein by direct fusion of both termini might cause deformation of the protein structure. Therefore we used our own linker design software to create a linking amino acid sequence, adapted to the structure of DNMT1. The software is able to design protein linkers with the required length to bridge the gap between the protein termini while bypassing the catalytic core of the enzyme.
Since we were not able to estimate the impact of the linker introduction, we focused on two different kinds of linkers: a rather rigid linker that has been calculated and optimized during the establishment of our linker software and a flexible peptide connection consisting mostly of Glycine and Serine amino acids. To perform circularization we have used the split NpuDnaE Intein since it is known as the golden standard of protein splicing and has been efficiently tested by us through circularization of GFP, Lysosyme and Xylanase. Moreover, we tried to implement protein circularization by using Sortase A, which is recognizing and cleaving a carboxyterminal sorting signal followed by a transpeptidation reaction that can be exploited for protein circularization. This method has been reported to be very efficient, especially when used in vitro [4] and has therefore been included in our study as possible candidate for large scale productions of circular proteins.
Material and methods
Constructs
We designed constructs in order to characterize efficiency of sortase as well as intein mediated circularization of DNMT1. Both approaches are based on the comparison of the circularized protein with a corresponding linear counterpart at different temperatures and incubation times.
For circularization of DNMT1 with inteins we fused the obtained truncated version of DNMT1 (731-1602) [15] with an appropriate linker and split NpuDnaE domains at either site of the construct. For efficient purification of the protein, all constructs contained a hexa-histidine tag. Moreover, we have cloned the ubiquitin-like Smt3 protein from Saccharomyces cerevisiae in front of the DNMT1-Intein complex. This attachment is used to increase the yield when expressing large mammalian proteins like DNMT1 in E.coli [17]. The constructs have been designed in a way that Smt3 is no longer included in the final circular protein, and therefore cannot interfere with its function. All cloning steps were carried out using our new RFC(LINK) standard procedure.
The mDNMT1 (731-1602) constructs that have been designed to allow Sortase A mediated cirularization are flanked by the Sortase A recognition sequence LPETGG on the C–terminus as well as N-terminal glycines that that can be exposed through previous TEV treatment. Since the transpeptidation via sortase requires an additional in vitro reaction, subsequent purification is necessary. To facilitate this step and enrich the circular product in the sample, a hexa-histidine tag is present in the initial protein that is lost during circularization. Hence, an additional affinity chromatography via his trap can be used to separate the histidine tag containing sortase and not circularized educt from the circular flow-through. All cloning steps were carried out using the Sortase standard that you can find here (LINK).
The following linkers that had been optimized along with the linker software were used:
Definition | Sequence |
---|---|
Rigid linker | GGAEAAAKEAAAKVNLTAAEAAAKEAAAKEAAAKEAAAKEAAAKAVNLTAAEAAAKAHHHHHHSGRGT |
Flexible linker | CWEGGGSGGGSGGGSGGGSGGGSGGGSGGGSGGGSGGGSGGGSHHHHHHSGRGT |
Expression and purification
In comparison to the full length murine DNMT1, our modified mDNMT1(731-1602) can be expressed in E.coli (Rosetta DE3) and does not need the establishment of Baculovirus based system and the use of insect cell cultures. First experiments were conducted to determine the optimal conditions for protein induction and expression. We tested several concentrations of IPTG as well as different temperatures to maximize the yield of mDNMT1. Still, even though expression of the constructs can be simplified by using E.coli , purification of the recombinant protein remains a challenging task requiring multiple different steps that had to be adapted and established from published protocols [19].
In a first step, we had to optimize the process of cell lysis, to be able to extract a maximum amount of active protein from our cultures. Therefore we compared the two commonly used methods sonication and disruption via French press. Subsequently, the sample was cleared from cell debris by ultracentrifugation. The protein of interest containing a hexa-histidie tag was further enriched via immobilized metal ion affinity chromatography (IMAC) using a His trap and eluted with imidazol. In a next step, low molecular weight solutes such as salts and the imidazol that would interfere with the protein function were removed from the sample by dialysis. Finally, the purified protein was concentrated by size-exclusion centrifugation using filters with an appropriate mass cutoff.
Methylation assay
To measure the efficiency of mDNMT1 maintenance methylation, we used an assay that is based on two methylation-sensitive restriction enzymes Sau3AI and HpaII. It relies on the inhibition of restriction enzymes when methyl groups are attached to the cleavage site. We used a 40-mer DNA template with one hemi-methylated and one unmethylated CpG site which are located within the Sau3AI and HpaII restriction site, respectively (adapted from Bashtrykov et colleagues [16].
The restriction enzyme Sau3AI is capable of cleaving the template despite of the hemimethylated state of the first CpG nucleotide. Similarly, HpaII can attack its unmethylated native site, leading to fragmentation of the template as long as no further methylation occurs. Accordingly, the methylation activity of circular and linear mDNMT1 can be measured by quantifying the cleavage efficiency of the restriction enzymes after incubation with the protein.
Impairment of Sau3AI cleavage indicates maintenance activity of DNMT1, whereas a decrease in HpaII activity would result from de novo methylation. Therefore the assay is not only detecting activity of the enzyme but also includes its specificity for hemimethylated DNA over unmethylated DNA. To characterize mDNMT1 heat stability as well as kinetics, we perfomed methylation reactions after different heat treatments and looked at the amount of methylation that could be detected due to increasing incubation times.
Assay analysis
For evaluation cleaved DNA was made visible by gel electrophoresis and intensities were measured using ImageJ.
Results: Linear mDNMT1(731-1602) - Establishment of expression and assays
Efficient induction of linear mDNMT1(731-602)
To obtain maximum yields of active mDNMT1(731-1602) from expression in E.coli Rosetta (DE3), different amounts of IPTG and temperatures during induction have been compared. Generally, solubility tags such as the fused Smt3 as well as lower temperatures are known to prevent the occurrence of insoluble inclusion bodies under conditions of overexpression. Therefore we aimed to increase the yield of soluble and properly folded proteins by using lower temperatures during induction. We could verify, that expression of mDNMT1(731-1602) is more efficient at 15°C than 25°C (Figure ). Another factor that is influencing the yield of recombinantly expressed proteins is the amount of IPTG. Here, the optimal amount of inducer is again essential to allow efficient induction while preventing protein aggregation upon to strong induction. For the conditions tested, we could see increasing amounts of detectable protein upon increasing concentrations of IPTG. Therefore, optimal induction of our mDNMT1 construct obtained for incubation at 15°C and concentrations of 1mM IPTG.
Expression and extraction of linear mDNMT1(731-1602)
Even though the full length mDNMT1 can only be expressed in eukaryotic cells using the Baculovirus System, we could verify the successful expression of its truncated version in E. coli (Figure 6) . Even though the overall yield of recombinant protein is quite low, the presence of mDNMT1 after induction with IPTG was detectable through Coomassie staining and specifically verified by Western blotting. Whereas the largest protein identified at a corresponding molecular weight of ~140kDa can be attributed to the fully expressed mDNMT1(731-1602) construct, there can also be detected several other peptides with lower molecular weight. These peptides can possibly result from incomplete translation or degradation processes, since the construct used for the preliminary experiment (Figure 6) comprised an N-terminal hexa-histidine tag. To minimize the amount of incomplete protein products that would be enriched in the following purification, the optimized constructs that were used in further experiments contained C-terminal tags.
Upon successful expression In order to optimize protein extraction we compared cell lysis by sonification with disruption via French press (Figure 6A and 6B). Assuming similar amounts of overall protein that should be present in the consecutively generated samples, sonification seems to be slightly less efficient for extraction of recombinant protein compared to usage of the French press. Beyond this observation, the disruption of E.coli using French press has been reported to preserve the biological activity of susceptible proteins more efficiently than sonication [18]. Therefore we used the French press for further extraction of linear as well as circular mDNMT1.
Purification of linear mDNMT1(731-1602)
Activity and specificity of linear mDNMT1(731-1602)
After successful expression and purification of linear DNMT1 using a His trap column, we characterized the protein in its activity and specificity for hemi-methylated DNA by performing the restriction enzyme based assay described above. There has been detected an increasing amount of completely methylated template over time (Figure 7), represented by increasing amounts of template that are unsusceptible towards SauAI cleavage. In fact, methylation efficiency of mDNMT1(731-1602) has been high enough to methylate most of the template even when performing direct inactivation of the reaction after addition of the enzyme. Still, there could been seen an increase of methylation for longer incubation periods. The truncated DNMT1 (731-1602) showed maximal methylation of the template after approximately one hour. Therefore, our data agrees with efficiencies, that have been reported in literature for wild type and truncated versions of mDNMT1 [15] [16] .Overall, the reaction kinetics of the linear protein seem to be promising for later use in a PCR2.0, assuming that circularization of the protein will not ablate its function.
Since high specificity of the mDNMT1 represents a necessary property for its use in the targeted PCR2.0, especially with regard to the exponential multiplication of any products during the process, we included HpaII digestion in our assay to determine de novo methalation of CpG sites. Restriction efficiency of HpaII was not 100% as it can be seen from the hemimethylated control. Therefore, uncut template is possibly not resulting from unspecific methylation activity of the enzyme. Moreover, there has not been detected an increase of HpaII repression over time, confirming this assumption. Never the less, further test, especially when using circularized protein will have to be performed to confirm significance of this data.
Circular mDNMT1(731-1602) - The privotal element of PCR2.0
Expression and purification of circular mDNMT1(731-1602)
Expression and purification were conducted as they had been established for the linear DNMT1. Elutions from the His-Trap showed an enrichment and concentration of peptides that correlate with the expected molecular weight of the truncated mDNMT1(731-1602) (Figure 9). Nevertheless, purification with a His-Trap alone does not seem to exclude a variety of impurities that either result from degraded mDNMT1 that still contains a His-Tag or unspecific binding to the affinity column. Advanced protocols as they were used by Song and collegues [19] for analysis of the proteins crystal structure are therefore including several more steps. Nevertheless, additional steps of purification increase the risk of reducing the overall protein yield and activity, necessitating greater amounts of starting material. Since the active protein should be included and efficiently enriched in our sample despite of present impurities, we to continued with functional analysis of the sample.
Proof of efficient circularization
Activity and specificity of circular mDNMT1(731-1602)
Thermal stability of circular mDNMT1(721-1602) using different linkers
Discussion
References
[1] Watanabe, M., Yuzawa, H., Handa, N., & Kobayashi, I.. Hyperthermophilic DNA methyltransferase M.PabI from the archaeon Pyrococcus abyssi. Applied and Environmental Microbiology, 72(8), 5367–75 (2006).
[2] Meissner, A. et al. Genome-scale DNA methylation maps of pluripotent and differentiated cells. Nature 454, 766–70 (2008)
[3] Vieille, C. & Zeikus, G. Hyperthermophilic enzymes: sources, uses, and molecular mechanisms for thermostability. Microbiol. Mol. Biol. Rev. 65, 1–43 (2001).
[4] Antos, J. M., Popp, M. W.-L., Ernst, R., Chew, G.-L., Spooner, E., & Ploegh, H. L.. A straight path to circular proteins. The Journal of Biological Chemistry, 284(23), 16028–36 (2009).
[5] Tam, J. P. & Wong, C. T. T. Chemical synthesis of circular proteins. J. Biol. Chem. 287, 27020–5 (2012).
[6] Robertson, K. D. DNA methylation and human disease. Nat. Rev. Genet. 6, 597–610 (2005).
[7] Jaenisch, R. & Bird, A. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat. Genet. 33 Suppl, 245–54 (2003).
[8] Ehrlich, M.. Amount and distribution of 5-methycytosine in human DNA from different types of tissues or cells. Nucleic Acids Res. 10: 2709–2721 (1982).
[9] Lister, R. & Ecker, J. Finding the fifth base: genome-wide sequencing of cytosine methylation. Genome Res. 959–966 (2009).
[10] Bird, A.. DNA methylation patterns and epigenetic memory. Genes & Development 16, 6-21 (2002).
[11] Jaenisch, R., and Bird, A.. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet 33, 245-254 (2003).
[12] Weichenhan, D., and Plass, C.. The evolving epigenome. Human Molecular Genetics 22, R1 (2013).
[13] Robertson, K. D. DNA methylation and human disease. Nat. Rev. Genet. 6, 597–610 (2005).
[14] Jurkowska, R. Z., Jurkowski, T. P. & Jeltsch, A. Structure and function of mammalian DNA methyltransferases. Chembiochem 12, 206–22 (2011).
[15] Song, J., Teplova, M., Ishibe-Murakami, S. & Patel, D. J. Structure-based mechanistic insights into DNMT1-mediated maintenance DNA methylation. Science 335, 709–12 (2012).
[16] Bashtrykov, P. et al. Specificity of Dnmt1 for methylation of hemimethylated CpG sites resides in its catalytic domain. Chem. Biol. 19, 572–8 (2012).
[17] Lee, C.-D., Sun, H.-C., Hu, S.-M., Chiu, C.-F., Homhuan, A., Liang, S.-M., Wang, T.-F. An improved SUMO fusion protein system for effective production of native proteins. Protein Science : A Publication of the Protein Society, 17(7), 1241–8 (2008).
[18] Benov, L. & Al-ibraheem, J. Disrupting Escherichia coli : A Comparison of Methods. 35, 428–431 (2002).
[19] Song, J., Rechkoblit, O., Bestor, T. H. & Patel, D. J. Structure of DNMT1-DNA complex reveals a role for autoinhibition in maintenance DNA methylation. Science 331, 1036–40 (2011).