Team:Tsinghua-A/Modeling

From 2014.igem.org

(Difference between revisions)
 
(19 intermediate revisions not shown)
Line 2: Line 2:
{{Team:Tsinghua-A/Killbanner}}
{{Team:Tsinghua-A/Killbanner}}
{{Team:Tsinghua-A/Menubar}}
{{Team:Tsinghua-A/Menubar}}
 +
<html>
<html>
-
<body>
+
<head>
 +
    <style type="text/css" media="screen">
 +
 
 +
 
 +
      #parallax_wrapper {
 +
        position: relative;
 +
        z-index: 0;
 +
        -webkit-transition: all 0.25s ease-in;
 +
        transition: all 0.25s ease-in;
 +
      }
 +
      #parallax_field {
 +
        overflow: hidden;
 +
        position: absolute;
 +
        left: 0;
 +
        top: 0;
 +
        height: 370px;
 +
        width: 100%;
 +
      }
 +
      #parallax_field #parallax_bg {
 +
        position: absolute;
 +
        top: -20px;
 +
        left: -20px;
 +
        width: 110%;
 +
        height: 425px;
 +
        z-index: 1;
 +
      }
 +
      #parallax_illustration {
 +
        display: block;
 +
        margin: 0 auto;
 +
        width: 940px;
 +
        height: 370px;
 +
        position: relative;
 +
        overflow: hidden;
 +
        clear: both;
 +
      }
 +
      #parallax_illustration img {
 +
        position: absolute;
 +
      }
 +
      #parallax_illustration #parallax_error_text {
 +
        top: 72px;
 +
        left: 72px;
 +
        z-index: 10;
 +
      }
 +
      #parallax_illustration #human {
 +
        top: 140px;
 +
        left: 200px;
 +
        z-index: 9;
 +
      }
 +
      #parallax_illustration #gene {
 +
        top: 180px;
 +
        left: 220px;
 +
        z-index: 8;
 +
      }
 +
      #parallax_illustration #parallax_octocatshadow {
 +
        top: 297px;
 +
        left: 371px;
 +
        z-index: 7;
 +
      }
 +
      #parallax_illustration #parallax_speedershadow {
 +
        top: 263px;
 +
        left: 442px;
 +
        z-index: 6;
 +
      }
 +
      #parallax_illustration #logo {
 +
        top: 150px;
 +
        left: 630px;
 +
        z-index: 5;
 +
      }
 +
      #parallax_illustration #brick {
 +
        top: 180px;
 +
        left: 200px;
 +
        z-index: 4;
 +
      }
 +
    </style>
 +
  <style>[touch-action="none"]{ -ms-touch-action: none; touch-action: none; }[touch-action="pan-x"]{ -ms-touch-action: pan-x; touch-action: pan-x; }[touch-action="pan-y"]{ -ms-touch-action: pan-y; touch-action: pan-y; }[touch-action="scroll"],[touch-action="pan-x pan-y"],[touch-action="pan-y pan-x"]{ -ms-touch-action: pan-x pan-y; touch-action: pan-x pan-y; }</style><style id="style-1-cropbar-clipper">/* Copyright 2014 Evernote Corporation. All rights reserved. */
 +
.en-markup-crop-options {
 +
    top: 18px !important;
 +
    left: 50% !important;
 +
    margin-left: -90px !important;
 +
    width: 180px !important;
 +
    border: 2px rgba(255,255,255,.38) solid !important;
 +
    border-radius: 4px !important;
 +
}
 +
 +
.en-markup-crop-options div div:first-of-type {
 +
    margin-left: 0px !important;
 +
}
 +
</style>
 +
 +
</head>
 +
</html>
 +
 +
 +
 +
 +
 +
<html>
 +
<body> 
 +
<section>
 +
<div class="span12 ">
 +
<div id="parallax_wrapper">
 +
      <div id="parallax_field"> 
 +
      </div>
 +
      <div id="parallax_illustration">
 +
        <div id="auth"></div>
 +
  <img alt="" class="js-plaxify" data-xrange="10" data-yrange="10" height="162" id="human" width="113" src="https://static.igem.org/mediawiki/2014/6/68/Tsinghua-A_Human.png" style="top: 93.01263823064771 px; left: 100.7109375px;">
 +
 +
        <img alt="" class="js-plaxify" data-xrange="8" data-yrange="8" height="111" id="gene" width="460" src="https://static.igem.org/mediawiki/2014/d/d6/Tsinghua-A_Gene.png" style="top: 180.01263823064772px; left: 220.7109375px;">
 +
 +
<img alt="" class="js-plaxify" data-invert="true" data-xrange="20" data-yrange="20" height="84" id="logo" width="120" src="https://static.igem.org/mediawiki/2014/a/a7/Tsinghua_A_logo.png" style="top: 150.97472353870458px; left: 630.4453125px;">
 +
 +
      <img alt="" class="js-plaxify" data-invert="true" data-xrange="10" data-yrange="10" height="30" id="brick" width="30" src="https://static.igem.org/mediawiki/2014/c/cc/Tsinghua-A_Small_brick.png" style="top: 180.96208530805687px; left: 210.66796875px;">
 +
    </div>
 +
 +
    </div>
 +
 +
</div>
 +
</section>
-
 
 
-
 
 
-
 
 
-
   
 
-
 
 
   <!--Content Section One-->
   <!--Content Section One-->
 +
 
 +
 
 +
 
 +
 
 +
 
     <section id="blog">
     <section id="blog">
     <div class="container">
     <div class="container">
Line 41: Line 159:
    <div class="span10">
    <div class="span10">
   
   
-
    <h2><b>Overview</b></h2>
+
    <h2><b>Algorithms</b></h2>
-
    <p><B>What is TAL effector?</B><br>
+
    <p><B>Overview</B><br>
-
TAL (transcription activator-like) effectors are proteins secreted by Xanthomonas bacteria via their type III secretion system when they infect various plant species. These proteins are able to bind promoter sequences in the host plant and to activate the expression of plant genes that aid bacterial infection.[1]<br><br>
+
<p>In order to lower down the repeatability rate of codons, we use intelligent optimization algorithm.  
-
<B>What can TAL effectors do and how do they function?</B><br>
+
We exploit amino acid degeneracy and alternate nucleotides to reduce the repetition rate of bases of TALE DNA sequence. (See Figure 1) Then the optimized TALE sequence will be tested in wet lab to verify our conjecture whether TALE DNA sequence of lower repeatability rate works at higher efficiency.<br><br>
-
There appears to be a one-to-one correspondence between the identity of two critical amino acids in each repeat and each DNA base in the target sequence. [2]As a result, TAL effectors have attracted great interest as DNA targeting tools.<br><br>
+
<div align="center"><img src="https://static.igem.org/mediawiki/2014/3/3c/T14.jpg" alt="Title"><br></div>
-
Their targeting specificity is determined by a central domain of tandem, 33–35 amino acid repeats, followed by a single truncated repeat of 20 amino acids. The majority of naturally occurring TAL effectors examined have between 12 and 27 full repeats. A polymorphic pair of adjacent residues at positions 12 and 13 in each repeat, the ‘repeat-variable di-residue’ (RVD), specifies the target, one RVD to one nucleotide, with the four most common RVDs each preferentially associating with one of the four bases.[3][4]
+
<div align="center"><p>Figure 1. RNA codon Table<br><br></div>
-
<img src="https://static.igem.org/mediawiki/2014/d/d7/Tsinghua-A-overview.jpg" alt="Title"><br>
+
<p>For instance, we may change UUC into UUU to make one of the repeats different from others while the amino acid(PhenylalaninePhe) is identical.<br>
-
Figure 1. Structure of a naturally occurring TAL effector[5]<br><br>
+
<p>By this way, we use two types of intelligent optimization algorithm to optimize TALE DNA sequence.<br><br>
-
+
-
<B>What’s the weakness of TAL effector?</B><br>
+
-
This simple code between amino acids in TAL effectors and DNA bases in their target sites might be useful for protein engineering applications. Numerous groups have designed artificial TAL effectors capable of recognizing new DNA sequences in a variety of experimental systems. It has been reported that TAL effectors target genes efficiently in many eukaryocytic cells like mammalian cells and yeasts. However, the former experiments with E. coli indicate that TAL effectors don’t work well in E. coli cells.<br><br><br>
+
-
<b>References</b><br>
+
-
[1] Bogdanove AJ, Schornack S, Lahaye T. TAL effectors: finding plant genes for disease and defense. Curr. Opin. Plant Biol. 2010; 13: 394-401.<br>
+
-
[2] Boch J, Bonas U. Xanthomonas AvrBs3 family-type III effectors: discovery and function. Annu. Rev. Phytopathol. 2010; 48: 419-436.<br>
+
-
[3] Moscou MJ, Bogdanove AJ. A simple cipher governs DNA recognition by TAL effectors. Science. 2009; 326: 1501.<br>
+
-
[4] Boch J, Scholze H, Schornack S, Landgraf A, Hahn S, Kay S, Lahaye T, Nickstadt A, Bonas U. Breaking the code of DNA binding specificity of TAL-type III effectors. Science 2009; 326:1509-1512.<br>
+
-
[5] Tomas Cermak, Adam J. Bogdanove, and Daniel F. Voytas. Efficient design and assembly of custom TALEN and other TAL effector-based constructs for DNA targeting. Nucl. Acids Res. 2011; 39: 7879-7879
+
-
+
-
</p><br />
+
-
    <hr>
 
-
    </div>
 
-
 
 
-
    </div><!--End Article-->
 
-
   
 
-
   
 
-
    <!--Article Two -->
 
-
    <div class="row-fluid blog-article">
 
-
    <div class="span2 text-center">
 
-
                <div class="news-date">
 
-
                <h3>2</h3>
 
-
                </div>
 
-
              </div>
 
-
   
 
-
    <div class="span10">
 
-
   
 
-
    <h2><b>Hypothesis</b></h2>
 
-
    <p>It has been reported that TAL effectors target genes efficiently in many eukaryocytes like mammalian cells and yeasts. However, the former experiments with E. coli indicate that TAL effectors don’t work well in E. coli cells. There are a variety of factors can possibly be attributed to its low efficiency, and homologous recombination is a highly possible one.<br><br>
 
-
Some proteins originated from prokaryocytes, with their sequences paralleling with DNA sequences of E. coli, their expression might be hampered because of homologous recombination. By parity of reasoning, we propose a hypothesis that the inefficiency of TAL effector expression is caused by its sequence resemblance with E. coli genes.<br><br>
 
-
Our solution is based on the codon usage bias, which refers to differences in the frequency of occurrence of synonymous codons in coding DNA.[1] The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism.[2]<br>
 
-
<img src="https://static.igem.org/mediawiki/2014/4/4a/Tsinghua-A-condon.jpg" alt="Title"><br>
 
-
Figure 1. RNA codon table[3]<br><br>
 
-
<b>References</b><br>
+
<B>Genetic Algorithm(GA)</B><br><br>
-
[1] Susanta K. Behura* and David W. Severson, Codon usage bias: causative factors, quantification methods and genome-wide patterns: with emphasis on insect genomes. Biological Review. 2012; 88: 49-61<br>
+
<B>-Introduction of Genetic Algorithm</B><br>
-
[2] Lanza, Amanda M.; Curran, Kathleen A.; Rey, Lindsey G.; et al. A condition-specific codon optimization approach for improved heterologous gene expression in Saccharomyces cerevisiae. BMC Systems Biology. 2014; 8: 33. <br>
+
Genetic Algorithm is a search heuristic that mimics the process of natural selection. The heuristic is routinely used to generate useful solutions to optimization and search problems.[1]<br>
-
[3] http://bioinfo.bisr.res.in/cgi-bin/project/crat/theory_codon_restriction.cgi
+
GA has lots of applications in many fields, such as in bioinformatics, using GA to optimize DNA successfully made the sequences have better physic-chemical properties for PCR. <br><br>
 +
 +
<B>-Our strategy</B><br>
 +
<B> Initialization</B><br> Create a population of hundreds of TALE sequences.<br>
 +
<B> Mutation</B><br> Each sequence changes a base randomly under the premise that 1. Amino acids sequence remain unchanged 2. To prevent TALE sequence from being cut, no restriction enzyme site we exploit in experiment is allowed to be created 3. Mutation should not take place on the overhang of each repeat.<br>
 +
<B> Crossover</B><br> In GA, crossover is a simulation of the process of synapsis. We randomly choose a point of amino acid, and two sequences exchange their parts from the point we choose to one of the ends. <br>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/a/ad/T18.jpg"><br></div>
 +
<div align="center"><p>Figure 2.The process of crossover is a simulation of natural synapsis of chromosome.<br></div>
 +
<p><B> Fitness Scoring</B><br>
 +
We take two main factors into consideration:<br>
 +
1.Repeatability rate of the sequence<br>
 +
To judge the repeatability, we compare the changed sequence with the original one and count the Hamming distance between them. When a restriction enzyme site sequence occurs on the mutated TALE sequence, there will be a penalty.<br>
 +
2.Codon Usage<br>
 +
When the sequence contains too many rare codons, which means the number of the homologous tRNA is extremely low in E.coli, the whole sequence can hardly express. In order to avoid the appearance of rare codons, each occurrence of rare codons will lead to penalty.<br>
 +
<B> Selection</B><br> 
 +
In each generation we sort the 200 sequences according to the scores of each single sequence. And we terminate the half of lower scores, the rest of them will have better fitness. Repeat the process of mutation and crossover, the average score of each generation will increase. (See Results.)<br>
 +
We repeat the process for 600 generations.<br><br>
 +
<B>Simulated Annealing(SA)</B><br><br>
 +
<B>-Introduction of Simulated Annealing</B><br>
 +
Simulated annealing is a simple and general algorithm for finding global minima. It operates by simulating the cooling of a (usually fictitious) physical system whose possible energies correspond to the values of the objective function being minimized. The analogy works because physical systems occupy only the states with lowest energy as the temperature is lowered to absolute zero.<br><br>
-
</p><br />
+
<B>-Our Strategy</B><br>
-
   
+
<B>Mutation</B><br> We randomly choose 5 points on the sequence to mutate. To each single point, the process is the same with our computational mutation in genetic algorithm.<br>  
-
    <hr>
+
<B>Scoring</B><br> The same as fitness scoring in genetic algorithm.<br>
-
    </div>
+
<B>Parameters</B><br> The initial temperature T is 10000.0, after each generation, the temperature reduces to 99% of the former temperature. If the new generation has higher score, it will be accepted. Otherwise it will be accepted at a probability P(A)<br>
-
   
+
-
    </div><!--End Article--> 
+
-
   
+
-
    <!--Article Three -->
+
-
    <div class="row-fluid blog-article">
+
-
    <div class="span2 text-center">
+
-
                <div class="news-date">
+
-
                <h3>3</h3>
+
-
                </div>
+
-
              </div>
+
-
   
+
-
    <div class="span10">
+
-
   
+
-
    <h2><b>TALE Assembly</b></h2>
+
-
    <p>The TALE assembly strategy uses the Golden Gate cloning method, which is based on the ability of type IIS enzymes to cleave outside of their recognition site. When type IIS recognition sites are placed to the far 5’ and 3’ end of any DNA fragment in inverse orientation, they are removed in the cleavage process, allowing two DNA fragments flanked by compatible sequence overhangs, termed fusion sites, to be ligated seamlessly. Since type IIS fusion sites can be designed to have different sequences, directional assembly of multiple DNA fragments is feasible. Using this strategy, DNA fragments can be assembled from undigested input plasmids in a one-pot reaction with high efficiency.<br><br>
+
-
We chose the native TALE AvrBs3  as a scaffold for customized assembly of TALE constructs. The central DNA binding domain of AvrBs3 is formed by 17.5 tandemly arranged 34 amino acid repeats, with the last half repeat showing similarity to only the first 20 amino acids of a full repeat. To reduce the risk of recombination events between the 17.5 highly homologous repeat sequences which is mentioned in the hypothesis part, we codon-optimized AvrBs3 applying the codon usage.<br><br>
+
-
In a single Golden Gate cloning reaction, cloning efficiency is significantly reduced for assembly of 17 repeat modules. Therefore, we split the assembly in two successive steps. In the first cloning step, 10 repeats were assembled in one vector. The preassembly vectors confer spectinomycin resistance and encode a lacZ-α fragment for blue/white selection. On both sides of the lacZ-α fragment a type IIS recognition sequence, BsaI, was positioned. Similarly, 11~17 repeats and NG-last-repeat were respectively ligated and inserted into another vector. After preassembly of the 10 and 7 and last repeats using BsaI, the intermediate blocks were released via Esp3I and cloned into the final assembly vector (modified pTAL1). As is explained in the backbone part, we constructed a backbone with constitutive promoter which can express under normal condition, and another with a tetracycline-induced promoter, which is expressed with tet. Modified pTAL1 confers AmpR, and allows plasmid replication in E.coli. The vector pTAL1 also contains all elements of the final TALE expression construct, including TALE N’ and C’ arms, replication origin, etc., but a lacZ-α in between the left and right arms. <br><br>
+
-
During the construction of wild-type control plasmid, we used the modified modules provided by our lab. In order to confirm our hypothesis, the designer TALEs were oligo-synthesized according to the results given by optimizing algorithm. We reserved the fusion sites for golden gate reaction, and broke each repeats into 2 parts. After annealing in PCR amplifier, the linear sequence of the repeats can be added to the golden gate reaction and be ligated with the primary vector and continue further ligation to accomplish the final construction. <br><br>
+
-
<img src="img/blog3.png" alt="Title">
+
-
<b>Reference</b><br>
+
-
  http://www.ncbi.nlm.nih.gov/nuccore/NG_034463.1
+
-
</p><br />
+
-
   
+
-
    <a class="more" href="individual-blog.html">More</a>
+
-
    <hr>
+
-
    </div>
+
-
   
+
-
    </div><!--End Article-->
+
-
 
+
-
<!--Article Four -->
+
-
    <div class="row-fluid blog-article">
+
-
    <div class="span2 text-center">
+
-
                <div class="news-date">
+
-
                <h3>4</h3>
+
-
                </div>
+
-
              </div>
+
-
   
+
-
    <div class="span10">
+
-
   
+
-
    <h2><b>TALE Expression</b></h2>
+
-
    <p><b>Background – pTAL1 vector</b><br>
+
-
Based on the fact that Golden Gate is an effective way to assembly TALE (Transcription activator-like effectors) and various eukaryotic expression systems have been established but few in prokaryotic systems, we are determined to construct such efficient expression system in Esherichia coli so that we can test our brilliant idea. Through referring to numbers of paper, we find that most scientists choose to construct stable cell line via homologous arm such as attL1 and attL2 or they just introduce exogenous TALE[1].  Considering the principle of Golden Gate Assembly, we only need to reconstruct the final vector pTAL1 (figure 1) to solve this problem. The vector pTAL1 contains TALE N-terminal, TALE C-terminal, lacZ for blue white scanning and attL1, attL2 homologous arm. However, it lacks necessary elements for prokaryotic creature such as promoter, RBS and terminator. Here comes to our story of establish a TALE expression system in prokaryotic creature.<br><br>
+
-
<img src="https://static.igem.org/mediawiki/2014/7/79/Tsinghua-A-pTAL1.jpg" alt="Title"><br>
+
-
Figure 1. The original vector pTAL1<br><br>
+
-
<b>Constitutive pTAL</b><br>
+
-
Wisely, we choose 3A assembly (http://parts.igem.org/Help:Assembly/3A_Assembly) to construct our expression system. Firstly, we design forward and reverse primers with extension on which contains EcoRI, XbaI and SpeI, PstI restriction enzyme sites to get PCR prodcuts of pTAL. Then through naïve enzyme digestion and liagtion we can ligate pTAL with terminator, promoter and RBS one by one. And finally, we can easily get our constitutive TALE expression vector (Figure 2). And we also submit this expression vector as K1311003 (http://parts.igem.org/Part:BBa_K1311003:Design) in part.igem.org.<br><br>
+
-
<b>Regulative pTAL</b><br>
+
-
Similar to the method of constitutive construction, we make use of the ligated pTAL with terminator to continue our regulative pTAL construction. After browsing on the igem parts website (http://parts.igem.org/Main_Page), we find that there is no ideal regulatory parts that can be directly applied. We need to make use of some parts to get our ideal composite regulatory parts. Based on parts C0040 (http://parts.igem.org/Part:BBa_C0040), we added promoter and RBS (K081005, http://parts.igem.org/Part:BBa_K081005), terminator (B0015, http://parts.igem.org/Part:BBa_B0015), pTet (TetR repressible promoter, R0040, http://parts.igem.org/Part:BBa_R0040) and RBS one by one via 3A assembly. Similarly, we insert this large fragment into the upstream of the ligated pTAL with terminator (Figure 2). Eventually, we successfully reconstruct regulative pTAL and offer 3 our own parts this year. One is regulative pTAL (K1311004, http://parts.igem.org/Part:BBa_K1311004:Design); one is K1311005 (http://parts.igem.org/Part:BBa_K1311005) and the other is K1311006 (http://parts.igem.org/Part:BBa_K1311006). <br><br>
+
-
<img src="https://static.igem.org/mediawiki/2014/7/77/Tsinghua-A-workflow.jpg" alt="Title">
+
-
 
+
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/d/de/T15.jpg" alt="Title"><br></div>
 +
<p><B>-References</B><br>
 +
[1]Wikipedia http://en.wikipedia.org/wiki/Genetic_algorithm<br><br>
 +
</p><br />
</p><br />
-
   
 
-
   
 
-
    <hr>
 
-
    </div>
 
-
   
 
-
    </div><!--End Article--> 
 
-
<!--Article Five -->
 
-
    <div class="row-fluid blog-article">
 
-
    <div class="span2 text-center">
 
-
                <div class="news-date">
 
-
                <h3>5</h3>
 
-
                </div>
 
-
              </div>
 
-
   
 
-
    <div class="span10">
 
-
   
 
-
    <h2><b>Report System</b></h2>
 
-
    <p>We construct a report system so as to test the reliability and efficiency of our ‘Marvelous TALE’. In this section, we test the TALE’s DNA binding ability and report it with a common report gene ‘RFP’. We attempt to put the target of TALE’s DNA binding target sequence inside the expression cassette of report gene and binding TALE can disrupt the express of report gene. We use iGEM standard parts to build our report system.<br><br>
 
-
We designed a standard iGEM part BBa_K1311007 to complete all the tasks. This part contains<br>
 
-
Promoter (J23102)-TALE binding site (repeats three times)-RBS-LacI coding sequence-Terminator(B0015)-LacI Regulative Promoter(R0010)-RBS(B0034)-mRFP1(E1010)-Terminator(B0015)<br>
 
-
<img src="https://static.igem.org/mediawiki/2014/c/c1/Reportsystem3tsinghuaa.jpg" alt="Title"><br>
 
-
This part can convert the binding ability of TALE protein to its target DNA sequence to an easier available parameter, the florescent intensity of RFP. When the TALE protein is expressed, the TALE make binds to its target, which may interrupt the transcription of LacI. The lack of repressors may lead to the expression of RFP. So the stronger florescent intensity means the better binding ability of TALE protein. In this part, the target of TALE recognition site is chosen to be the 18bp sequence (ACCTCATCAGGAACATGTT).<br><br>
 
-
<b>Our Circuit Design</b><br>
 
-
<img src="https://static.igem.org/mediawiki/2014/3/3a/%E6%9C%AA%E6%A0%87%E9%A2%98-1tsinghuaa.jpg" alt="Title"><br>
 
-
This parts can convert the binding ability of TALE protein to its target DNA sequence to a easier available parameter, the florescent intensity of RFP. When the TALE protein is expressed, the TALE make binds to its target, which may interrupt the transcription of LacI. The lack of repressors may lead to the expression of RFP. So the stronger florescent intensity means the better binding ability of TALE protein. <br><br>
 
-
<b>Validation</b><br>
 
-
We change the normal RBS sequence in the LacI coding sequence into an RBS containing three tandem TALE binding site sequences. So we have to validate that the LacI protein can express normally and can normally inhibit the expression of RFP.<br>
 
-
We transformed this plasmid in to E.coli DH5α with electroporation and chemical transformation. We spread the plate and culture it in 37 degrees Centigrade for more than 15 hours until small colonies can be seen in the plate. At this moment, the colonies might look red for the sake of the delay of expression of LacI. We picked colonies into the 10mL tubes and added 5mL LB broth with Chl antibiotics. <br>After six hours’ shaking at 37 degrees Centigrade and 220 rpm, we got the tubes out and double, four times, eight times diluted. (40μL of 0.1M IPTG was added as the positive control group) After another shaking for 12h, florescence of the bacteria was evaluated with an enzyme-labeled instrument and OD600 was tested with spectrometry.<br><br>
 
-
<b>Picture of our tubes </b><br><br>
 
-
<img src="https://static.igem.org/mediawiki/2014/b/b1/TsinghuaaParts1.jpg" alt="Title"><br>
 
-
(The two red ones on the back are the positive control groups, others are not red)
 
-
</p><br />
 
-
   
 
    <hr>
    <hr>
    </div>
    </div>
-
   
+
 
-
     </div><!--End Article-->
+
     </div><!--End Article-->
    
    
    
    
-
 
+
     
     </div><!--End Blog Article Container-->
     </div><!--End Blog Article Container-->
    
    
    
    
-
    <!--Right Side Bar-->
+
    <!--Right Side Bar-->
-
    <div class="span3 sidebar-container">
+
      <div class="span3 sidebar-container">
-
    <h2><b>Our Team</b></h2>
+
      <h2>Our Team</B></h2>
-
    <p><img src="https://static.igem.org/mediawiki/2014/c/cb/Tsinghua-A-sidebar.jpg"></p>
+
<a href = https://2014.igem.org/Team:Tsinghua-A/Team>
-
    <br />
+
      <p><img src="https://static.igem.org/mediawiki/2014/c/cb/Tsinghua-A-sidebar.jpg"</p>
 +
</a>
 +
      <br />
    
    
    
    
Line 202: Line 229:
   <ul class="unstyled blog-links">
   <ul class="unstyled blog-links">
       <li><a href="https://2014.igem.org"><i class="icon-angle-right"></i> iGEM 2014</a></li>
       <li><a href="https://2014.igem.org"><i class="icon-angle-right"></i> iGEM 2014</a></li>
-
       <li><a href="http://http://www.tsinghua.edu.cn/publish/newthuen/index.html"><i class="icon-angle-right"></i> Tsinghua University</a></li>
+
       <li><a href="http://www.tsinghua.edu.cn/publish/newthu/index.html"><i class="icon-angle-right"></i> Tsinghua University</a></li>
       <li><a href="http://www.au.tsinghua.edu.cn/publish/auen/index.html"><i class="icon-angle-right"></i> Department of Automation,<br>Tsinghua University</a></li>
       <li><a href="http://www.au.tsinghua.edu.cn/publish/auen/index.html"><i class="icon-angle-right"></i> Department of Automation,<br>Tsinghua University</a></li>
       <li><a href="http://bioinfo.au.tsinghua.edu.cn/CSSB/web/english/index.html"><i class="icon-angle-right"></i> Center for Synthetic and Systems Biology,<br>Tsinghua University</a></li>
       <li><a href="http://bioinfo.au.tsinghua.edu.cn/CSSB/web/english/index.html"><i class="icon-angle-right"></i> Center for Synthetic and Systems Biology,<br>Tsinghua University</a></li>
Line 216: Line 243:
     </div>
     </div>
     </section>
     </section>
 +
 +
 +
<section id="Results">
 +
    <div class="container">
 +
    <div class="row-fluid">
 +
        <div class="span12">
 +
         
 +
          <!--Bread Crumbs-->
 +
       
 +
        </div>
 +
      </div>
 +
     
 +
      <div class="row-fluid"
 +
   
 +
    <!--Blog Article Container-->
 +
    <div class="span9 article-container">
 +
   
 +
 
 +
   
 +
   
 +
    <!--Article Two -->
 +
    <div class="row-fluid blog-article">
 +
    <div class="span2 text-center">
 +
                <div class="news-date">
 +
                <h3>2</h3>
 +
                </div>
 +
              </div>
 +
   
 +
    <div class="span10">
 +
   
 +
    <h2><b>Results</b></h2>
 +
    <p><B>Genetic Algorithms</B><br>
 +
<p>We output the average scores of each generation. And we can see that the scores range from -4500 to 350, and after about 100 generations, the average score becomes stable.(See Figure.3) <br>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/f/f9/03.jpg"><br></div>
 +
<div align="center"><p>Figure 3. The fitness score's variation among the whole 600 generations.<br></div>
 +
<p>In order to see the variation of the score more clearer, we output the score of last 50 generations.<br>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/3/3f/02.jpg"><br></div>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/4/4e/01.jpg"><br></div>
 +
<div align="center"><p>Figure 4. The fitness score of the last 50 generations.<br></div>
 +
<p>The second graph in Figure 4 shows the highest score, lowest score and average score in each generation. We can see that the extreme values are relatively stable. The sequence which has the highest score can be seen as the best sequence that we found.<br>
 +
 +
<p><B>Simulated Annealing</B><br>
 +
<p>To compare with GA, we output the scores of each generation.<br>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/2/2d/04.jpg"><br></div>
 +
<div align="center"><p>Figure 5. The score variation.<br></div>
 +
<p>Because of the parameters we gave, the probablity of accepting the new solution when the score becomes lower is relatively low. <br>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/d/db/Gaiguode.jpg"><br></div>
 +
 +
<div align="center"><p>Figure 6. The score of last 50 solutions.<br></div>
 +
<p>Compared with genetic algorithm, we can see that simulated annealing has lower convergence rate and higher probability randomicity, which can be affected by the temperature, cooling velocity and coefficient that are set.<br>
 +
 +
 +
 +
</p><br />
 +
   
 +
    <hr>
 +
    </div>
 +
   
 +
    </div><!--End Article--> 
 +
     
 +
    </div><!--End Blog Article Container-->
 +
   
 +
   
 +
   
 +
    </div>
 +
    </div>
 +
    </section>
    
    
 +
 
 +
<section id="Simulation">
 +
    <div class="container">
 +
    <div class="row-fluid">
 +
        <div class="span12">
 +
         
 +
          <!--Bread Crumbs-->
 +
       
 +
        </div>
 +
      </div>
 +
     
 +
      <div class="row-fluid"
 +
   
 +
    <!--Blog Article Container-->
 +
    <div class="span9 article-container">
 +
   
 +
   
 +
    <!--Article Three -->
 +
    <div class="row-fluid blog-article">
 +
    <div class="span2 text-center">
 +
                <div class="news-date">
 +
                <h3>3</h3>
 +
                </div>
 +
              </div>
 +
   
 +
    <div class="span10">
 +
   
 +
    <h2><b>Simulation</b></h2>
 +
    <p><B>System Model</B><br> 
 +
<img src="https://static.igem.org/mediawiki/2014/f/fb/T6.jpg"><br>
 +
<div align="center"><p>Figure 7. The model of the report system.<br></div>
 +
<p>We use the following formula to simulate the lac repressor system.<br>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/3/33/T3.jpg"><br></div>
 +
<p>We use Michaelis-Menten equation to simulate the process.<br>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/c/c3/T9.jpg"><br></div>
 +
<br>
 +
<p><B>Simulation Results</B><br>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/2/27/T10.png"><br></div>
 +
<div align="center"><p>Figure 8. Simulation of the IPTG-lac repressor system.<br></div>
 +
<p>We infer that the situation of TALE will be similar with IPTG. When TALE protein binds on the plasmid, the expression of lacI will be repressed. Consequently, the concentration of RFP will be increased. If no TALE proteins
 +
are expressed, the concentration of RFP will be very low.<br>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2014/d/d0/T13.jpg"><br></div>
 +
<div align="center"><p>Figure 9. Situations that TALE successfully express and not express.<br></div>
 +
<p>From the simulation, we can conclude that RFP can be a signal of whether our TALE has successfully expressed, moreover, the concentration of RFP, which shows as the fluorescence intensity, has positive correlation with the quantity of TALE protein.<br><br>
 +
 +
<B>-References</B><br>
 +
[1]Uri Alon,An Introduction to Systems Biology[B],Chapman & Hall/CRC, P244<br><br>
 +
 +
</p><br />
 +
<hr>
 +
</div>
 +
   
 +
    </div><!--End Article-->  
 +
    </div><!--End Blog Article Container-->
 +
   
 +
   
 +
   
 +
   
 +
    </div>
 +
    </div>
 +
    </section>
    
    
Line 222: Line 377:
</html>
</html>
-
 
+
{{Team:Tsinghua-A/LogoCode}}
{{Team:Tsinghua-A/Footer}}
{{Team:Tsinghua-A/Footer}}
{{Team:Tsinghua-A/Script}}
{{Team:Tsinghua-A/Script}}

Latest revision as of 03:39, 18 October 2014




1

Algorithms

Overview

In order to lower down the repeatability rate of codons, we use intelligent optimization algorithm. We exploit amino acid degeneracy and alternate nucleotides to reduce the repetition rate of bases of TALE DNA sequence. (See Figure 1) Then the optimized TALE sequence will be tested in wet lab to verify our conjecture whether TALE DNA sequence of lower repeatability rate works at higher efficiency.

Title

Figure 1. RNA codon Table

For instance, we may change UUC into UUU to make one of the repeats different from others while the amino acid(PhenylalaninePhe) is identical.

By this way, we use two types of intelligent optimization algorithm to optimize TALE DNA sequence.

Genetic Algorithm(GA)

-Introduction of Genetic Algorithm
Genetic Algorithm is a search heuristic that mimics the process of natural selection. The heuristic is routinely used to generate useful solutions to optimization and search problems.[1]
GA has lots of applications in many fields, such as in bioinformatics, using GA to optimize DNA successfully made the sequences have better physic-chemical properties for PCR.

-Our strategy
Initialization
Create a population of hundreds of TALE sequences.
Mutation
Each sequence changes a base randomly under the premise that 1. Amino acids sequence remain unchanged 2. To prevent TALE sequence from being cut, no restriction enzyme site we exploit in experiment is allowed to be created 3. Mutation should not take place on the overhang of each repeat.
Crossover
In GA, crossover is a simulation of the process of synapsis. We randomly choose a point of amino acid, and two sequences exchange their parts from the point we choose to one of the ends.


Figure 2.The process of crossover is a simulation of natural synapsis of chromosome.

Fitness Scoring
We take two main factors into consideration:
1.Repeatability rate of the sequence
To judge the repeatability, we compare the changed sequence with the original one and count the Hamming distance between them. When a restriction enzyme site sequence occurs on the mutated TALE sequence, there will be a penalty.
2.Codon Usage
When the sequence contains too many rare codons, which means the number of the homologous tRNA is extremely low in E.coli, the whole sequence can hardly express. In order to avoid the appearance of rare codons, each occurrence of rare codons will lead to penalty.
Selection
In each generation we sort the 200 sequences according to the scores of each single sequence. And we terminate the half of lower scores, the rest of them will have better fitness. Repeat the process of mutation and crossover, the average score of each generation will increase. (See Results.)
We repeat the process for 600 generations.

Simulated Annealing(SA)

-Introduction of Simulated Annealing
Simulated annealing is a simple and general algorithm for finding global minima. It operates by simulating the cooling of a (usually fictitious) physical system whose possible energies correspond to the values of the objective function being minimized. The analogy works because physical systems occupy only the states with lowest energy as the temperature is lowered to absolute zero.

-Our Strategy
Mutation
We randomly choose 5 points on the sequence to mutate. To each single point, the process is the same with our computational mutation in genetic algorithm.
Scoring
The same as fitness scoring in genetic algorithm.
Parameters
The initial temperature T is 10000.0, after each generation, the temperature reduces to 99% of the former temperature. If the new generation has higher score, it will be accepted. Otherwise it will be accepted at a probability P(A)

Title

-References
[1]Wikipedia http://en.wikipedia.org/wiki/Genetic_algorithm



2

Results

Genetic Algorithms

We output the average scores of each generation. And we can see that the scores range from -4500 to 350, and after about 100 generations, the average score becomes stable.(See Figure.3)


Figure 3. The fitness score's variation among the whole 600 generations.

In order to see the variation of the score more clearer, we output the score of last 50 generations.



Figure 4. The fitness score of the last 50 generations.

The second graph in Figure 4 shows the highest score, lowest score and average score in each generation. We can see that the extreme values are relatively stable. The sequence which has the highest score can be seen as the best sequence that we found.

Simulated Annealing

To compare with GA, we output the scores of each generation.


Figure 5. The score variation.

Because of the parameters we gave, the probablity of accepting the new solution when the score becomes lower is relatively low.


Figure 6. The score of last 50 solutions.

Compared with genetic algorithm, we can see that simulated annealing has lower convergence rate and higher probability randomicity, which can be affected by the temperature, cooling velocity and coefficient that are set.



3

Simulation

System Model

Figure 7. The model of the report system.

We use the following formula to simulate the lac repressor system.


We use Michaelis-Menten equation to simulate the process.



Simulation Results


Figure 8. Simulation of the IPTG-lac repressor system.

We infer that the situation of TALE will be similar with IPTG. When TALE protein binds on the plasmid, the expression of lacI will be repressed. Consequently, the concentration of RFP will be increased. If no TALE proteins are expressed, the concentration of RFP will be very low.


Figure 9. Situations that TALE successfully express and not express.

From the simulation, we can conclude that RFP can be a signal of whether our TALE has successfully expressed, moreover, the concentration of RFP, which shows as the fluorescence intensity, has positive correlation with the quantity of TALE protein.

-References
[1]Uri Alon,An Introduction to Systems Biology[B],Chapman & Hall/CRC, P244