Team:Heidelberg/Project/Toolbox

From 2014.igem.org

– giving YOU the full power of inteins

A primary mission of the iGEM team Heidelberg 2014 is to provide a new toolbox based on appropriate standards that introduces the highly functional INTEINS to the parts registry and thereby paving the way to a new level of posttranslational modification possibilities for the iGEM community. Inteins are self-excising peptide segments that mediate the reaction of protein trans-splicing. They don't require any additional energy sources to perform the splicing reaction. Another valuable feature of inteins is their specific recognition, also in the presence of other proteins, which makes them functional in vitro as well as in vivo. On this page we describe experimental approaches to provide and evaluate five related but still quite different applications using inteins and their splicing capabilities. Thereby we present scientific prove of the diversity and efficiency of an intein toolbox that can be used for a wide range of posttranslational modifications of proteins. Explore the new possibilities, intein usage provides! On this page we describe the scientific work in form of modeling and experiments that lead to the establishment of the first intein toolbox in the history of iGEM!

Jump right into splicing and use the toolbox guide to design your intein-equipped proteins of choice!

Contents

Circularization

Enzymes represent a major tool for many branches of the chemical industry, including food, brewing, paper, detergent or biofuel. Millions of years of evolution have allowed these proteins to performed extremely specific chemical modifications that are not only essential for living organisms but can also be of great benefit to produce useful molecules for our life, efficiently and at low cost. A major limitation of the use of enzyme for industrial application and in general out of their natural environment is their stability. They can be destroyed by other enzymes and they can unfold and take non-functional conformation when exposed to non-physiological temperature and pH. Such limitations has motivated research in species that can grow at extreme temperatures . Another major area of chemical research is the design of strategies to stabilize enzymes, and more generally proteins and peptides. Protein circularization, meaning ligation of the N- and C-terminal ends of a protein, represents a promising way to achieve this stabilization. While conserving the functionality of their linear counterpart, circular proteins can be superior in terms of thermostability, resistance against chemical denaturation and protection from exopeptidases . Moreover, a circular backbone can improve in vivo stability of therapeutical proteins and peptides. All these remarkable properties motivated us to develop new tools to circularize any protein of interest. Our Toolbox Guide provides a step-by-step strategy to clone a circularization linker and express it in E. coli. Moreover, in case of complex structures where the protein extremities are far from each other, we have developed the software tool CRAUT that will design the appropriate rigid linkers.

Please find more information about the circularization of proteins and the theory that laid the foundation for the circularization kit of the toolbox on the Circularization pages.

Circularization Constructs

The most promising approaches to circularize proteins are protein trans-splicing using split inteins and Sortase A-catalyzed cyclization. Both methods require the addition of specific proteins domains or peptides to the protein to be circularized. Consequently, on DNA level, creating circular proteins is equivalent to creating fusion proteins. However, existing protein fusion standards like RFC[23] cause scars. Those scars on protein level may affect protein function and further complicate 3D-structure modeling. Therefore, we decided to create the new RFC[i] that allows scarless cloning of inteins. Our intein circularization constructs apply to this standard, while our sortase constructs are closely related and can be used similarly. Detailed instructions on how to use our constructs are provided in our Toolbox Guide.

Split Intein Circularization

Figure 1) <a href='http://parts.igem.org/Part:BBa_K1362000'> BBa_K1362000</a>
Figure 1) BBa_K1362000

NpuDnaE intein RFC [i] circularization construct

Between the coding sequences of the Npu DnaE C-intein and the N-intein we placed BBa_J04450, an mRFP selection marker flanked by BsaI sites that can be replaced by the protein to be circularized.

Exteins, RFC [i] standard overhangs and BsaI sites have to be added to the coding sequence of the protein to be circularized without start- and stop codons by PCR. By Golden Gate assembly, the mRFP selection marker has to be replaced with the protein insert. After addition of an inducible promotor the circular protein is ready to be expressed. For detailed step-by-step instructions please use our Toolbox Guide.

Upon expression of the fusion protein, the split intein domains reassemble to the active intein and thus ligate the termini of the protein to be circularized in trans-splicing reaction.

These constructs were successfully used to circularize lambda lysozyme and xylanase and probably DNMT1.

Sortase Circularization

Figure 2) <a href='http://parts.igem.org/Part:BBa_K1362002'> BBa_K1362002</a>
Figure 2) BBa_K1362002

Sortase A circularization construct (with His6)

An mRFP selection marker BBa_J04450, which can be removed by restriction with BsaI, is flanked by TEV protease cleavage site (left) and a sorting signal (right) with a His6 tag.

BsaI sites and overhangs corresponding to the sortase A circularization construct have to be added to the coding sequence of the protein to be circularized without start- and stop codons by PCR. By Golden Gate assembly, the mRFP selection marker has to be replaced with the protein insert. After addition of an inducible promotor protein is ready to be expressed. The protein has to be purified and treated with TEV protease and sortase A. The sortase A catalyzes the transpeptidation reaction that leads to backbone circularization.

Please find more information about our the toolkit for cirularization on the Circularization Construct pages.

Oligomerization

Split inteins constitute a useful tool to produce huge polymers in vivo: Hauptmann et al. managed to fabricate synthetic spider silk with microfiber structure. The results using an easy-to-handle split intein system were stunning: The polymers had a molecular weight of 250 kDa and more [7]. Further application of oligomerization by inteins includes the posttranslational complexation of multi-domain proteins, after their domains have been expressed individually. This approach is very valuable considering great difficulities of expressing large eukaryotic proteins in E.coli.

Standardization of oligomerization

The valuable properties of spider silk, for example its exceptional strength and elasticity, result from numerous repeats of certain protein motifs. Convenitonal methods to multimerize these motifs bear a lot of difficulties: Often genetic and mRNA instability constitute a barrier for the production of multimers as fusion proteins [1]. Posttranslational assembly through split inteins is therefore the solution to overcome these problems. The successfull polimerization of spider silk potein motifs demonstrates the potential of split inteins to be a useful tool for the production of new biomaterials by performing oligomerization reactions with split inteins. The iGEM team Heidelberg standardized (lik to toolbox guide) the oligomerization procedure with split inteins to allow easy handling with different proteins.

The use of non-orthogonal split inteins can further be exploited to direct the oligomerisation of several protein domains at once.

The mechanism

Oligomerization with inteins.
Oligomerization with inteins.

Oligomerization reactions require the same constructs as the ones used for protein circularization.

Circularization is achieved by bringing the N and C terminus of a protein very close together, so both intein parts can asseble, cut out off the protein and thereby circularize it. In contrast, oligomerization occurs when both termini of a protein cannot reach each other and the intein parts of two neighbouring proteins assemble.

Fusion and Tagging

Introduction

Post-translational modifications are present in nature in great numbers.Synthetic Biology, however, has not yet made use of the innumerable possibilities nature has developed. With our collection of intein assembly constructs we expand the arsenal of synthetic biology by enabling unlimited changes of a protein's amino acid sequence even after translation.

Standard Construction

To be able to fuse any two halves of a protein together can have many different uses. We therefore saw the need for a standardised construct, the intein assembly part. This BioBrick part allows the user to clone two DNA-sequences coding for two parts of a peptide into a plasmid prepared with selection markers and standardised overhangs. Those parts were all send in with additional hexahistidine-tags to enable quick analysis on a western blot, however there are highly customisable parts available as well.Visit our parts page to get an overview of our assembly constructs. In an extensive assay we proved the principle behind split protein assembly by showing that GFP can be artificially split into two halves and thereafter be reassembled so the fluorescence is restored. Visit the split Fluorescent Protein.


N-intein fusion construct.
N-intein fusion construct.

C-intein fusion construct.
C-intein fusion construct.


Posttranslational Modifications

Introduction

Posttranslational modifications are highly prevalent in nature: Almost every protein in a cell is modified after having been translated, adding numerous varieties of the protein to the mere protein backbone. Synthetic Biology, however, can expand the possibilities offered by nature and introduce synthetic posttranslational modifications or attachments, such as biophysical probes. As the two parts of a split intein assemble in a highly specific manner, the modifications are introduced controllable a certain locus. There are different publications on intein-based introduction of posttranslational modifications, including phosphorylation, lipidation, glycosylation, acetylation and ubiquitination [4]. Phosphorylation, for example has been applied with tyrosine kinase C-terminal Src kinase (Csk) in order to be able to study the structure and function of this specifically modified protein [5].

Standard construction

Applying the intein assembly constructs the iGEM team Heidelberg provides a tool for all kinds of natural as well as synthetic posttranslational modifications by chemoselective addition of a peptide to a recombinant protein. The principle is based on intein-mediated protein fusion using the SspDnaB split intein for N-terminal modifications and SspDnaX-S11 for C-terminal ones. The split SspDnaB intein has a very short N-terminal part, consisting of only 11 amino acids intein and 5 amino acids extein sequence and a much longer C-terminal part. By contrast, SspDnaX-S11 has a C-terminal part consisting of only 6 amino acids intein and 3 amino acids extein sequence, but a much longer N-terminal part. The short part including the desired modification is easy to obtain by chemical synthesis. This offers the possibility to introduce this modification at a specific locus.


In general, split inteins are powerful tools to easily introduce all kinds of posttranslational modifications in a highly chemoselective manner. Use our toolbox guide to attach posttranslational modifications and attachments to your protein of interest!

Translocation

Introduction

Adding and removal of translocation tags is one application example for split intein-mediated fusion of two protein domains. Translocation tags offer the possibility to transfer proteins to a certain locus inside the cell by attaching a short tag sequence to the terminus of one's protein of interest. Expressed fused to the protein, which is the conventional way, such a tag is neither removable nor attachable to a protein at a certain time point. Usage of split inteins for tagging offers new dimensions of mobility and control: Tags can be attached (using fusion constructs) or removed (using the intein protease) posttranslationally at a specific time point.

Standard construction

Regulated targeting of proteins to certain loci inside the cell can be achieved using the iGEM Heidelberg 2014 translocation tags for either E. coli or eucaryotic expression systems. Many of them are based on parts created by previous iGEM teams, but with split intein usage now can be applied in much diverser ways. We provide following tags:

Our toolbox provides the following tags:
MinD membrane targeting sequence: BBa_K1362055 → membrane localization (E. coli)
OmpA: BBa_K1362060 → surface display (E. coli)
pelB leader sequence: BBa_K1362058 → periplasmic localization (E. coli)
myristooilation signal sequence: BBa_K1362056 → membrane localization (eucaryotic)
PKI-NES: BBa_K1362059 → cytoplasmic localization (eucaryotic)
NLS: BBa_K1362057 → nuclear localization (eucaryotic)

Use our toolbox guide to post-translationally assemble proteins, attach tags or change the location of your protein!

All translocation tags except from the OmpA surface protein tag were ordered as DNA oligos containing the tag sequence, extein sequence and BsaI oberhangs. OmpA surface protein was obtained from the Spring Distribution 2012 and also PCR amplified with overhangs. By Golden Gate Assembly the tags were cloned into the carrier constructs BBa_K1362052 (for C-terminal tags) and BBa_K1362053 (for N-terminal tags). These carrier constructs, pSB1C3 backbones with two BsaI sites as an insert, were used to send the tags to the registry of standard biological parts. The user can afterwards easily use Golden Gate Assembly to clone the tags from the carrier constructs into intein assembly constructs.

Results

The parts were sequenced, the results were positive. You can obtian them here:


MinD MTS:BBa_K1362055

pelB leader sequence: BBa_K1362058

myristoilation signal sequence: BBa_K1362056

PKI-NES: BBa_K1362059

NLS: BBa_K1362057

OmpA (part1, part2 , part3 , part4 ): BBa_K1362060

Purification

Therapeutic proteins have changed modern medicine. For instance, the monoclonal antibody Trastuzumab, is used to treat breast cancers that are positive for the epidermal growth factor HER2/neu (Ref. Clinical outcome in women with HER2-positive de novo or recurring stage IV breast cancer receiving trastuzumab-based therapy). For application in medicine proteins have to be of very high purity. However, purification at industrial scale remains time consuming and cost intensive. The purification procedure includes numerous steps: First of which is the binding of the protein to an affinity matrix, followed by washing and finally release of the protein from the affinity column. The affinity tag, used to achieve selective binding of the intended protein to the matrix remains bound to the protein of interest and has to be selectively removed by protease cleavage. This often leads to undesirable secondary cleavage within the sequence of the protein. Proteases are the most expensive component of a industrial scale purification process. Eventually, the affinity tag has to be separated from the protein causing further effort and costs. The split-intein based system is manifold easier: All these steps can be combined into one by using Ssp DnaB mini-intein, an artificial split intein. It performs the trans-splicing reaction only under certain pH and temperature conditions. A mutation of the SspDnaB N-intein at Cys1 to Ala prevents cleavage at the N-terminus, cleavage at the C-terminus still occurs. The N-terminal intein part, attached to a chitin binding domain can be used as an affinity ligand, while the C-terminal part, fused to a protein of interest, can be used as an affinity tag binding highly specific to the ligand part. After loading the protein of interest on a chitin column, the trans-splicing reaction can be induced by adjusting temperature and pH: the both split inteins fuse together, disrupting the fusion to the protein of interest in one single step [6]. Thereby, split-inteins simplify conventional protein purification in many aspects: The system is cheaper, faster and more efficient than other purification procedures (Ref.). In fact, only a single chromatographic step is necessary for both, binding of the protein fused to an affinity tag to the affinity column and tag removal. Also, there is no need for expensive proteases.

We standardized this split-intein purification system, so it is easy applicable to all kinds of proteins. Visit our toolbox guide to purify your protein!


Purification with inteins.
Purification with inteins.


On Off

On

NpuDnaE N-intein assembly construct
NpuDnaE N-intein assembly construct

Illustration of the part BBa_K1362100, the N-construct.

NpuDnaE C-intein assembly construct
NpuDnaE C-intein assembly construct

Illustration of the part BBa_K1362101, the C-construct.

In many systems biology projects, it is desirable to have an external trigger to turn a protein’s function ON and OFF in cells at a specific time point. The most prominent approach to achieve the ON-OFF switch is the regulation via gene expression. However this method is unfavorable in terms of temporal and spatial resolution. Using our intein toolbox it could become possible to activate a specific protein in a shorter period of time in single cell resolution. Imaging the most prominent system currently used in molecular biology- the CRISPR/Cas technology temporal and spatial activated within a single cell. This could be achieved using our assembly construct with two insertion sides to fuse proteins or protein domains combined with our light induction approach. Cas9 could be split in two parts resulting in an inactive state of each half. Upon splicing reaction the protein can be reassembled and its functionality restored. Unfortunately, we ran out of time to apply our assembly construct for the experiments with Cas9. However using our toolbox this can easily be achieved within few days. As proof of principle for the fusion and activation of proteins, we showed the reconstitution of fluorescence of superfold GFP that was previously split and connected to split NpuDnaE inteins.

Our assembly construct with two insertion sides provides an easy to handle approach to clone in front and behind of each split intein. Initially N- and C-terminal parts are separated on two plasmids and each intein provides the two insertion sides. The iGEM team Heidelberg 2014 created both constructs: BBa_K1362100, the N-construct and BBa_K1362101, the C-construct. As place holder for the insertion sides mRFP selection markers were included that are easily exchangeable via Golden Gate cloning using the restriction enzyme BsaI. The user only needs to make sure to clone the protein halves, which should reassemble, into the correct insertion sides; namely the N-terminal protein half into the N-terminal “gate” and the C-terminal protein into the C-terminal “gate”.

Confused? All your questions will be answered once looking into our RFC!

In this manner the separated plasmids containing either N- or C-terminal splicing construct can be coexpressed by transforming both plasmid into one cell. Alternatively they can be assembled onto one plasmid by standard Biobrick cloning.

We generated the assembly construct by joining and amplification of the three inserts – first insertion side, Npu DnaE intein and second insertion side- via polymerase cycling assembly (PCA) cloning method. PCA is a modification of CPEC cloning, resulting only in the amplification of the assembled insert fragment. Using CPEC this fragment was incorporated in our expression vector. The sequencing of our assembly construct with two insertion sides is available.

Off

The intein protease construct.
The intein protease construct.

Principle of the intein protease.
Principle of the intein protease.

We provide two different tools for deactivation of proteins. One of them is the C-terminal degradation tag SsrA, the other one the intein protease.The SsrA tag directs the tagged protein to the ClpXP protease, causing degradation of this protein [10]. The intein protease is a singular tool for site-specific in vivo protein cleavage. Under in vitro conditions, protease usage is relatively easy compared to in vivo conditions. The inside of a cell constitutes a very complex environment, so that conventional proteases would simply cleave not only the desired site of the protein of interest, but also other proteins inside this cell, causing huge interference with pathways crucial for cell functions. So far, only the tobacco etch virus (TEV) protease has been used with success for in-vivo cleavage without severe decline of cell viability.[12] Volkmann et al. first showed in vitro functionality of the intein protease [11]. Later it was shown, that the intein proteease is also applicable in vivo in E. coli and yeast cells without impairment of cell functions.[12]

The intein protease consists of the 144-aa SspDnaB S1 C-intein (with an Asn-to-Ala mutation at the end of the C-intein to prevent protein splicing) followed by the extein sequence. This C-terminal part of the split intein recognizes its counterpart, the N-terminal split intein inserted into the sequence of the proteine to be cleaved, and cleaces there between extein and intein sequence.

Intein protease standard construction

SspDnaB C intein was PCR amplified from a plasmid (pCL20) received from Prof. Dr. Henning Mootz. The PCR primers contained overhangs to get the following constuct:

XbaI site – T7RBS - His6 – SspDnaB C intein (N → A) – SspDnaB C extein – SpeI site – PstI site


This PCR product and pSB1C3 were digested with XbaI and PstI. The digested PCR product was ligated into pSB1C3. The resulting plasmid is BBa_K1362050 (Intein protease – T7RBS-His-SspDnaBC ).

The next step was to add a promotor. Therefore, BBa_K1362050 was digested with EcoRI and XbaI. BBa_K808000, an Ara promotor, was digested with SpeI and PstI. These two parts were ligated, resulting in BBa_K1362051 (Intein protease with arabinose inducible regulatory promoter/ repressor araC-Pbad-T7RBS-His-SspDnaBC).


Results

Both intein protease constructs BBa_K1362050 and BBa_K1362051 were sequenced. We obtained positive results. Here you can see the sequencing results of BBa_K1362051, the longer of the constructs (and including BBa_K1362050),


BBa_K1362051

part1:
part2:
part3:
part4:
part5:
part6:
part7:
part8:

SsrA degradation tag standard construction

The SsrA degradation tag was ordered as DNA oligos containing the tag sequence, extein sequence and BsaI overhangs. By Golden Gate Assembly the tag was cloned into the carrier construct BBa_K1362052 (for C-terminal tags). This carrier construct consists of a pSB1C3 backbone with two BsaI sites as an insert. It was used to send the tag to the registry of standard biological parts. The user can afterwards easily use Golden Gate Assembly to clone the tag from the carrier constructs into intein assembly constructs.

Results

The part was sequenced, the results were positive. You can see the sequencing results here:

SsrA: BBa_K1362054

References

[1] McGinness, KE et al.: Engineering controllable protein degradation. Molecular Cell 22, 701–707, June 9, 2006. DOI 10.1016/j.molcel.2006.04.027

[2] Volkmann, G. et al: Controllable protein cleavages through intein fragment complementation. Protein Sci, 18 (2009), pp. 2393–2402. DOI: 10.1002/pro.249

[3] Volkmann, Gerrit et al.: Site-specific protein cleavage in vivo by an intein-derived protease. FEBS Letters 586 (2012) 79–84. doi:10.1016/j.febslet.2011.11.028.

[4] Vila-Perello, Miquel et al.: Biological Applications of Protein Splicing. Cell 143. October 15, 2010. DOI 10.1016/j.cell.2010.09.031.

[5] Muir, Tom W. et al.: Expressed protein ligation: A general method for protein engineering. Proc. Natl. Acad. Sci. USA 95 (1998).

[6] Lu, Wei et al.: Split intein facilitated tag affinity purification for recombinant proteins with controllable tag removal by inducible auto-cleavage. J. Chromatogr. A 1218 (2011)

[7] Hauptmann, V. et al.: Native-sized spider silk proteins synthesized in planta via intein-based multimerization. Transgenic Res (2013) 22:369–377. DOI 10.1007/s11248-012-9655-6.

[8/D] Vila-Perello, Miquel et al.: Biological Applications of Protein Splicing. Cell 143. October 15, 2010. DOI 10.1016/j.cell.2010.09.031.

[9/E] Muir, Tom W. et al.: Expressed protein ligation: A general method for protein engineering. Proc. Natl. Acad. Sci. USA 95 (1998).

[10] McGinness, KE et al.: Engineering controllable protein degradation. Molecular Cell 22, 701–707, June 9, 2006. DOI 10.1016/j.molcel.2006.04.027

[11] Volkmann, G. et al: Controllable protein cleavages through intein fragment complementation. Protein Sci, 18 (2009), pp. 2393–2402. DOI: 10.1002/pro.249

[12] Volkmann, Gerrit et al.: Site-specific protein cleavage in vivo by an intein-derived protease. FEBS Letters 586 (2012) 79–84. doi:10.1016/j.febslet.2011.11.028.