Team:SJTU-BioX-Shanghai/Modeling

From 2014.igem.org

(Difference between revisions)
 
(134 intermediate revisions not shown)
Line 2: Line 2:
{{Template:Team:SJTU-BioX-Shanghai/top-nav}}
{{Template:Team:SJTU-BioX-Shanghai/top-nav}}
{{Team:SJTU-BioX-Shanghai/clear}}
{{Team:SJTU-BioX-Shanghai/clear}}
 +
{{Template:Team:SJTU-BioX-Shanghai/article}}
 +
{{Template:Team:SJTU-BioX-Shanghai/preview}}
<html>
<html>
<style type="text/css">
<style type="text/css">
-
   .header_logo{ background-image:url("https://static.igem.org/mediawiki/2014/c/c1/SJTU14_achievement.png");}
+
   .header_logo{ background-image:url("/wiki/images/d/d8/SJTU14_modeling.png");}
 +
.projtile {
 +
    margin-right:1.167%;
 +
    margin-left:1.167%;
 +
    width:31%;}
 +
 
 +
#passage{
 +
    width:100%;}
 +
#passage p{
 +
    padding:4%;}
 +
 
 +
 
 +
 
 +
#subtitleyjn2{
 +
    margin-left:40%;}
</style>
</style>
<div class="content">
<div class="content">
-
<div class="overview">
+
<div class="jiao" >
-
  <h2>Aims</h2><hr />
+
-
  <ol><li>Flexbox is a new layout mode in CSS3 that is designed for the more sophisticated needs of the modern web. </li><li>This article will describe the newly-stablized Flexbox syntax in technical detail.</li> <li>Browser support is going to grow quickly, so you’ll be ahead of the game when support is wide enough for Flexbox to be practical. </li><li>Read on if you want to know what it does and how it works!</li></ol>
+
-
  <h2>Background</h2><hr />
+
-
  <p>Authors have long been using tables, floats, inline-blocks, and other CSS properties to lay out their site content. However, none of these tools were designed for the complex webpages and webapps we are making nowadays. Simple things like vertical centering require work. Complex things like flexible grid layouts are so hard that it’s considered ambitious to roll your own, hence the success of CSS grid frameworks. Still, if so many projects needs to do these things, why can’t it just be easy? Flexbox aims to change all that.
+
-
  </p>
+
-
  <h2>Results</h2><hr />
+
-
  <p>Though Flexbox makes it trivial to create layouts that would have been difficult or impossible in the past, it takes some time to get used to the Flexbox way of doing things. New terminology and new abstractions can be a barrier to using Flexbox, so let’s discuss them up-front.</p>
+
-
  </div>
+
-
  <article class="post__article">
+
-
<p>Biology is a dynamic, exciting, and important subject. It is dynamic because it is constantly changing, with new discoveries about the living world being made every day. (Although it is impossible to pinpoint an exact number, approximately 1 million new research articles in biology are published each year.) The subject is exciting because life in all of its forms has always fascinated people. As active scientists who have spent our careers teaching and doing research in a wide variety of fields, we know this first hand.<br>
+
-
  Biology has always been important in peoples’ daily lives, if only through the effects of achievements in medicine and agriculture. Today more than ever the science of biology is at the forefront of human concerns as we face challenges raised both by recent advances in genome science and by the rapidly changing environment.<br>
+
-
  Life’s new edition brings a fresh approach to the study of biology while retaining the features that have made the book successful in the past. A new coauthor, the distinguished entomologist May R. Berenbaum (University of Illinois at Urbana-Champaign) has joined our team, and the role of evolutionary biologist David Hillis (University of Texas at Austin) is greatly expanded in this edition. The authors hail from large, medium-sized, and small institutions. Our multiple perspectives and areas of expertise, as well as input from many col- leagues and students who used previous editions, have in- formed our approach to this new edition.</p>
+
-
  <h2>Enduring Features</h2>
+
-
  <p>We remain committed to blending the presentation of core ideas with an emphasis on introducing students to the process of scientific inquiry. Having pioneered the idea of depicting seminal experiments in specially designed figures, we continue to develop this here, with 79 INVESTIGATING LIFE figures. Each of these figures sets the experiment in perspective and relates it to the accompanying text. As in previous editions, these figures employ a structure: Hypothesis, Method, Results, and Conclusion. They often include questions for further research that ask students to conceive an experiment that would explore a related question. Each Investigating Life figure has a reference to BioPortal (yourBioPortal.com), where citations to the original work as well as additional discussion and references to follow-up re- search can be found.<br>
+
-
  A related feature is the TOOLS FOR INVESTIGATING LIFE figures, which depict laboratory and field methods used in biology. These, too, have been expanded to provide more useful context for their importance.<br>
+
-
  Over a decade ago—in Life’s Fifth Edition—the authors and publishers pioneered the much-praised use of BALLOON CAP- TIONS in our figures. We recognized then, and it is even truer today, that many students are visual learners. The balloon captions bring explanations of intricate, complex processes directly into the illustration, allowing students to integrate information without repeatedly going back and forth between the figure, its legend, and the text.<br>
+
-
  Life is the only introductory textbook for biology majors to begin each chapter with a story. These OPENING STORIES pro- vide historical, medical, or social context and are intended to intrigue students while helping them see how the chapter’s biological subject relates to the world around them. In the new edition, all of the opening stories (some 70 percent of which are new) are revisited in the body of the chapter to drive home their relevance.<br>
+
-
  We continue to refine our well-received chapter organization. The chapter-opening story ends with a brief IN THIS CHAPTER preview of the major subjects to follow. A CHAPTER OUTLINE asks questions to emphasize scientific inquiry, each of which is answered in a major section of the chapter. A RECAP at the end of each section asks the student to pause and answer questions to review and test their mastery of the previous material. The end-of-chapter summary continues this inquiry framework and highlights key figures, bolded terms, and activities and animated tutorials available in BioPortal.</p>
+
-
  <h2>New Features</h2>
+
-
  <p>Probably the most important new feature of this edition is new authorship. Like the biological world, the authorship team of Life continues to evolve. While two of us (Craig Heller and David Sadava) continue as coauthors, David Hillis has a greatly expanded role, with full responsibility for the units on evolution and diversity. New coauthor May Berenbaum has rewritten the chapters on ecology. The perspectives of these two ac- claimed experts have invigorated the entire book (as well as their coauthors).<br>
+
-
  Even with the enduring features (see above), this edition has a different look and feel from its predecessor. A fresh new de- sign is more open and, we hope, more accessible to students. The extensively revised art program has a contemporary style and color palette. The information flow of the figures is easier to follow, with numbered balloons as a guide for students. There are new conceptual figures, including a striking visual timeline for the evolution of life on Earth (Figure 25.12) and a single overview figure that summarizes the information in the genome (Figure 17.4).<br>
+
-
  In response to instructors who asked for more real-world data, we have incorporated a feature introduced online in the Eighth Edition, WORKING WITH DATA. There are now 36 of these exercises, most of which relate to an Investigating Life figure. Each is referenced at the end of the relevant chapter and is available online via BioPortal (yourBioPortal.com). In these exercises, we describe in detail the context and approach of there search paper that forms the basis of the figure. We then ask the student to examine the data, to make calculations, and to draw conclusions.<br>
+
-
  We are proud that this edition is a greener Life, with the goal of reducing our environmental impact. This is the first introductory biology text to be printed on paper earning the Forest Steward- ship Council label, the “gold standard” in green paper products, and it is manufactured from wood harvested from sustainable forests. And, of course, we also offer Life as an eBook.</p>
+
-
  <h2>The Ten Parts</h2>
+
-
  <p>We have reorganized the book into ten parts. Part One, The Science of Life and Its Chemical Basis, sets the stage for the book: the opening chapter focuses on biology as an exciting science. We begin with a startling observation: the recent, dramatic de- cline of amphibian species throughout the world. We then show how biologists have formed hypotheses for the causes of this environmental problem and are testing them by carefully de- signed experiments, with a view not only to understanding the decline, but reversing it. This leads to an outline of the basic principles of biology that are the foundation for the rest of the book: the unity of life at the cellular level and how evolution unites the living world. This is followed by chapters on the basic chemical building blocks that underlie life. We have added a new chapter on nucleic acids and the origin of life, introducing the concepts of genes and gene expression early and expanding our coverage of the major ideas on how life began and evolved at its earliest stages.<br>
+
-
  In Part Two, Cells, we describe the view of life as seen through cells, its structural units. In response to comments by users of our previous edition, we have moved the chapter on cell signal- ing and communication from the genetics section to this part of the book, with a change in emphasis from genes to cells. There is an updated discussion of ideas on the origin of cells and organelles, as well as expanded treatment of water transport across membranes.<br>
+
-
  Part Three, Cells and Energy, presents an integrated view of bio- chemistry. For this edition, we have worked to clarify such challenging concepts as energy transfer, allosteric enzymes, and bio- chemical pathways. There is extensive revision of the discussions of alternate pathways of photosynthetic carbon fixation, as well as a greater emphasis on applications throughout these chapters.<br>
+
-
  Part Four, Genes and Heredity, is extensively revised and reorganized to improve clarity, link related concepts, and provide updates from recent research results. Separate chapters on prokaryotic genetics and molecular medicine have been re- moved and their material woven into relevant chapters. For ex- ample, our chapter on cell reproduction now includes a discus- sion of how the basic mechanisms of cell division are altered in cancer cells. The chapter on transmission genetics now includes coverage of this phenomenon in prokaryotes. New chapters on gene expression and gene regulation compare prokaryotic and eukaryotic mechanisms and include a discussion ofPREFACE xi epigenetics. A new chapter on mutation describes updated applications of medical genetics.<br>
+
-
  In Part Five, Genomes, we reinforce the concepts of the previous part, beginning with a new chapter on genomes—how they are analyzed and what they tell us about the biology of prokaryotes and eukaryotes, including humans. This leads to a chapter describing how our knowledge of molecular biology and genetics underpins biotechnology (the application of this knowledge to practical problems). We discuss some of the latest uses of biotechnology, including environmental cleanup. Part Five finishes with two chapters on development that explore the themes of molecular biology and evolution, linking these two parts of the book.<br>
+
-
  Part Six, The Patterns and Processes of Evolution, emphasizes the importance of evolutionary biology as a basis for comparing and understanding all aspects of biology. These chapters have been extensively reorganized and revised, as well as updated with the latest thinking of biologists in this rapidly changing field. This part now begins with the evidence and mechanisms of evolution, moves into a discussion of phylogenetic trees, then covers speciation and molecular evolution, and concludes with the evolutionary history of life on Earth. An integrated time- line of evolutionary history shows the timing of major events of biological evolution, the movements of the continents, floral and faunal reconstructions of major time periods, and depicts some of the fossils that form the basis of the reconstructions.<br>
+
-
  In Part Seven, The Evolution of Diversity, we describe the latest views on biodiversity and evolutionary relationships. Each chapter has been revised to make it easier for the reader to appreciate the major changes that have evolved within the various groups of organisms. We emphasize understanding the big picture of organismal diversity, as opposed to memorizing a taxonomic hierarchy and names (although these are certainly important). Throughout the book, the tree of life is emphasized as a way of understanding and organizing biological information. A Tree of Life Appendix allows students to place any group of organisms mentioned in the text of our book into the context of the rest of life. The web-based version of this appendix provides links to photos, keys, species lists, distribution maps, and other information to help students explore biodiversity of specific groups in greater detail.<br>
+
-
  After modest revisions in the past two editions, Part Eight, Flowering Plants: Form and Function, has been extensively reorganized and updated with the help of Sue Wessler, to include both classical and more recent approaches to plant physiology. Our emphasis is not only on the basic findings that led to the elucidation of mechanisms for plant growth and reproduction, but also on the use of genetics of model organisms. There is expanded coverage of the cell signaling events that regulate gene expression in plants, integrating concepts introduced earlier in the book. New material on how plants respond to their environment is included, along with links to both the book’s earlier descriptions of plant diversity and later discussions of ecology.<br>
+
-
  Part Nine, Animals: Form and Function, continues to provide a solid foundation in physiology through comprehensive coverage of basic principles of function of each organ system and then emphasis on mechanisms of control and integration. An important reorganization has been moving the chapter on immunology from earlier in the book, where its emphasis was on molecular genetics, to this part, where it is more closely allied to the information systems of the body. In addition, we have added a number of new experiments and made considerable effort to clarify the sometimes complex phenomena shown in the illustrations.<br>
+
-
  Part Ten, Ecology, has been significantly revised by our new coauthor, May Berenbaum. A new chapter of biological interactions has been added (a topic formerly covered in the community ecology chapter). Full of interesting anecdotes and discus- sions of field studies not previously described in biology texts, this new ecology unit offers practical insights into how ecologists acquire, interpret, and apply real data. This brings the book full circle, drawing upon and reinforcing prior topics of energy, evolution, phylogenetics, Earth history, and animal and plant physiology.</p>
+
-
  <h2>Exceptional Value Formats</h2>
+
-
  <p>We again provide Life both as the full book and as a cluster of paperbacks. Thus, instructors who want to use less than the whole book can choose from these split volumes, each with the book’s front matter, appendices, glossary, and index.<br>
+
-
  Volume I, The Cell and Heredity, includes: Part One, The Science of Life and Its Chemical Basis (Chapters 1–4); Part Two, Cells (Chapters 5–7); Part Three, Cells and Energy (Chapters 8–10); Part Four, Genes and Heredity (Chapters 11–16); and Part Five, Genomes (Chapters 17–20).<br>
+
-
  Volume II, Evolution, Diversity, and Ecology, includes: Chapter 1, Studying Life; Part Six, The Patterns and Processes of Evolution (Chapters 21–25); Part Seven, The Evolution of Diversity (Chapters 26–33); and Part Ten, Ecology (Chapters 54–59).<br>
+
-
  Volume III, Plants and Animals, includes: Chapter 1, Study- ing Life; Part Eight, Flowering Plants: Form and Function (Chapters 34–39); and Part Nine, Animals: Form and Function (Chapters 40–53).<br>
+
-
  Responding to student concerns, we offer two options of the entire book at a significantly reduced cost. After it was so well received in the previous edition, we again provide Life as a loose- leaf version. This shrink-wrapped, unbound, 3-hole punched ver- sion fits into a 3-ring binder. Students take only what they need to class and can easily integrate any instructor handouts or other resources.<br>
+
-
  Life was the first comprehensive biology text to offer the en- tire book as a truly robust eBook. For this edition, we continue to offer a flexible, interactive ebook that gives students a new way to read the text and learn the material. The ebook integrates the student media resources (animations, quizzes, activities, etc.) and offers instructors a powerful way to customize the textbook with their own text, images, Web links, documents, and more.</p>
+
-
  <h2>Media and Supplements for the Ninth Edition</h2>
+
-
  <p>The wide range of media and supplements that accompany Life, Ninth Edition have all been created with the dual goal of help- ing students learn the material presented in the textbook more efficiently and helping instructors teach their courses more effectively. Students in majors introductory biology are faced with learning a tremendous number of new concepts, facts, and terms, and the more different ways they can study this mate- rial, the more efficiently they can master it.<br>
+
-
  All of the Life media and supplemental resources have been developed specifically for this textbook. This provides strong consistency between text and media, which in turn helps students learn more efficiently. For example, the animated tutorials and activities found in BioPortal were built using textbook art, so that the manner in which structures are illustrated, the colors used to identify objects, and the terms and abbreviations used are all consistent.<br>
+
-
  For the Ninth Edition, a new set of Interactive Tutorials gives students a new way to explore many key topics across the text- book. These new modules allow the student to learn by doing, including solving problem scenarios, working with experimental techniques, and exploring model systems. All new copies of the Ninth Edition include access to the robust new version of BioPortal, which brings together all of Life’s student and instructor resources, powerful assessment tools, and new integration with Prep-U adaptive quizzing.<br>
+
-
  The rich collection of visual resources in the Instructor’s Media Library provides instructors with a wide range of options for enhancing lectures, course websites, and assignments. High- lights include: layered art PowerPoint® presentations that break down complex figures into detailed, step-by-step presentations; a collection of approximately 200 video segments that can help capture the attention and imagination of students; and Power- Point slides of textbook art with editable labels and leaders that allow easy customization of the figures.<br>
+
-
  For a detailed description of all the media and supplements available for the Ninth Edition, please turn to “Life’s Media and Supplements,” on page xvii.</p>
+
-
  <h2>Many People to Thank</h2>
+
-
  <p>“If I have seen farther, it is by standing on the shoulders of giants.” The great scientist Isaac Newton wrote these words over 330 years ago and, while we certainly don’t put ourselves in his lofty place in science, the words apply to us as coauthors of this text. This is the first edition that does not bear the names of Bill Purves and Gordon Orians. As they enjoy their “retirements,” we are humbled by their examples as biologists, educators, and writers.<br>
+
-
  One of the wisest pieces of advice ever given to a textbook author is to “be passionate about your subject, but don’t put your ego on the page.” Considering all the people who looked over our shoulders throughout the process of creating this book, this advice could not be more apt. We are indebted to many people who gave invaluable help to make this book what it is. First and foremost are our colleagues, biologists from over 100 institutions. Some were users of the previous edition, who suggested many improvements. Others reviewed our chapter drafts in detail, including advice on how to improve the illustrations. Still others acted as accuracy reviewers when the book was al- most completed. All of these biologists are listed in the Reviewer credits.<br>
+
-
  Of special note is Sue Wessler, a distinguished plant biologist and textbook author from the University of Georgia. Sue looked critically at Part Eight, Flowering Plants: Form and Function, wrote three of the chapters (34–36), and was important in the revision of the other three (37–39). The new approach to plant biology in this edition owes a lot to her.<br>
+
-
  The pace of change in biology and the complexities of preparing a book as broad as this one necessitated having two developmental editors. James Funston coordinated Parts 1–5, and Carol Pritchard-Martinez coordinated Parts 6–10. We benefit- ted from the wide experience, knowledge, and wisdom of both of them. As the chapter drafts progressed, we were fortunate to have experienced biologist Laura Green lending her critical eye as in-house editor. Elizabeth Morales, our artist, was on her third edition with us. As we have noted, she extensively revised al- most all of the prior art and translated our crude sketches into beautiful new art. We hope you agree that our art program remains superbly clear and elegant. Our copy editors, Norma Roche, Liz Pierson, and Jane Murfett, went far beyond what such people usually do. Their knowledge and encyclopedic recall of our book’s chapters made our prose sharper and more accurate. Diane Kelly, Susan McGlew, and Shannon Howard effectively coordinated the hundreds of reviews that we described above. David McIntyre was a terrific photo editor, finding over 550 new photographs, including many new ones of his own, that enrich the book’s content and visual statement. Jefferson Johnson is responsible for the design elements that make this edition of Life not just clear and easy to learn from, but beautiful as well. Christopher Small headed the production department—Joanne Delphia, Joan Gemme, Janice Holabird, and Jefferson Johnson—who contributed in innumerable ways to bringing Life to its final form. Jason Dirks once again coordinated the creation of our array of media and supplements, including our superb new Web resources. Carol Wigg, for the ninth time in nine editions, oversaw the editorial process; her influence pervades the entire book.<br>
+
-
  W. H. Freeman continues to bring Life to a wider audience. Associate Director of Marketing Debbie Clare, the Regional Specialists, Regional Managers, and experienced sales force are effective ambassadors and skillful transmitters of the features and unique strengths of our book. We depend on their expertise and energy to keep us in touch with how Life is perceived by its users. And thanks also to the Freeman media group for eBook and BioPortal production.<br>
+
-
  Finally, we are indebted to Andy Sinauer. Like ours, his name is on the cover of the book, and he truly cares deeply about what goes into it. Combining decades of professionalism, high standards, and kindness to all who work with him, he is truly our mentor and friend.</p>
+
-
  <p>DAVID SADAVA DAVID HILLIS CRAIG HELLER MAY BERENBAUM</p>
+
-
  </article>
+
-
  </div>
+
-
<!--main content -->
+
-
<table width="70%" align="center">
+
 +
<div class="projtile_only">
 +
      <center><h2>Modeling&ensp;and&ensp;Simulation</h2></br></center>
 +
<center><p>"All models are wrong, but some are useful."
 +
When we decided to use TAL effectors building CROWN, our project, there were three main challenges concerning the efficiency of this system.
 +
First, allowing some DNA mutations, can the CROWN be as efficient as before?
 +
Second, given that CROWN can be successfully distributed on certain area of single cell, can it make sense?
 +
Third, how to design the sequence of Golden Gate?
 +
The following three parts focus on the three questions. </p>
 +
</center>
 +
</div>
 +
 
 +
    <div class="projtile" id="dingweidian2">
 +
  <a href="#dingweidian2" title="Part I Single Cell">
 +
    <center><h2>Part I Single Cell</h2></center></a>
 +
    </div>
 +
     
 +
    <div class="projtile">
 +
  <a href="#dianweidian9"  title="Part II Docking">
 +
    <center> <h2>Part II Docking</h2></center></a>
 +
    </div>
 +
     
-
<!--welcome box -->
+
    <div class="projtile">
-
<tr>
+
<a href="#dianweidian10" title="Part III K-clique Problem">
-
<td style="border:1px solid black;" colspan="3" align="center" height="150px" bgColor=#FF404B>
+
    <center><h2>Part III K-clique Problem</h2></center></a>
-
<h1 >WELCOME TO iGEM 2014! </h1>
+
    </div>
-
<p>Your team has been approved and you are ready to start the iGEM season!
+
-
<br>On this page you can document your project, introduce your team members, document your progress <br> and share your iGEM experience with the rest of the world! </p>
+
-
<br>
+
-
<p style="color:#E7E7E7"> <a href="https://2014.igem.org/wiki/index.php?title=Team:SJTU-BioX-Shanghai/Modeling&action=edit"style="color:#FFFFFF"> Click here  to edit this page!</a> </p>
+
-
</td>
+
-
</tr>
+
-
<tr> <td colspan="3" height="5px"> </td></tr>
+
    <div class="projtile">
-
<!-- end welcome box -->
+
<a href="#Discussion" title="Reference">
-
<tr>  
+
    <center><h2>Reference</h2></center></a>
 +
    </div>
-
<!--navigation menu -->
 
-
<td align="center" colspan="3">
 
-
<table  width="100%">
+
</div>
-
<tr heigth="15px"></tr>
+
-
<tr heigth="75px">  
+
 +
<div style="clear:both;"></div>
 +
<article class="post__article">
 +
<!--  <h2 id="partisinglecell">Overview&ensp;of&ensp;Modeling&ensp;and&ensp;Simulation</h2>
 +
<p>"All models are wrong, but some are useful."</p>
 +
<p>When we decide to use TAL effectors building CROWN, our project, there are three main challenges concerning the efficiencies of this system.</p>
 +
<p>First, allowing some DNA mutations, whether the CROWN can be efficient as before?</p>
 +
<p>Second, given that CROWN can be successfully distributed on certain area of single cell, can it make sense?</p>
 +
<p>Third, how to design the sequence of Golden Gate?</p>
 +
<p>The following three parts focus on the three questions. </p>
 +
        -->
-
<td style="border:1px solid black;" align="center" height ="45px" onMouseOver="this.bgColor='#d3d3d3'" onMouseOut="this.bgColor='#e7e7e7'" bgColor=#e7e7e7>
+
<h2 id="partisinglecell">Part&ensp;I&ensp;Single&ensp;Cell</h2>
-
<a href="https://2014.igem.org/Team:SJTU-BioX-Shanghai"style="color:#000000">Home </a> </td>
+
<p>Our project is about the system involving various enzymes, mostly the series enzymes, combining into certain area. This area can be more efficient when it comes to synthesizing or degrading chemicals. So the first question is, whether this system can be so useful when distributing multiple similar areas in a single cell.</p>
-
<td style="border:1px solid black;" align="center" height ="45px" onMouseOver="this.bgColor='#d3d3d3'" onMouseOut="this.bgColor='#e7e7e7'" bgColor=#e7e7e7>  
+
<h3 id="fourtypesofdistribution">Four Types of Distribution</h3>
-
<a href="https://2014.igem.org/Team:SJTU-BioX-Shanghai/Team"style="color:#000000"> Team </a> </td>
+
 +
<p><strong>Type 1: Membrane & Random</strong>
 +
The position of enzyme is distributed randomly on the cell membrane.</p>
 +
<p><strong>Type 2: Membrane & Polymerization</strong>
 +
Certain enzymes are polymerized on the cell membrane.</p>
 +
<p><strong>Type 3: Matrix & Random</strong>
 +
The position of enzyme is distributed randomly inside the cell.</p>
 +
<p><strong>Type 4: Matrix & Polymerization</strong>
 +
The polymerization of certain enzymes is distributed randomly inside the cell.</p>
-
<td style="border:1px solid black;" align="center"  height ="45px"  onMouseOver="this.bgColor='#d3d3d3'" onMouseOut="this.bgColor='#e7e7e7'" bgColor=#e7e7e7>  
+
<h3 id="hypothesisofsimulation">Hypothesis of Simulation</h3>
-
<a href="https://igem.org/Team.cgi?year=2014&team_name=SJTU-BioX-Shanghai"style="color:#000000"> Official Team Profile </a></td>
+
<h4>1. Metabolism</h4>
 +
<center><img src="https://static.igem.org/mediawiki/2014/4/4e/SJTU14_matabolism.png" width=800px></img></center>
 +
<center><small>Figure2.2.1 The process of the metabolism: s0, s1, s2, s3 are the substrates and E0, E1,E2 are the enzymes </small></center>
 +
<p>Enzymes: E0, E1,E2</p>
 +
<p>Substrates:S0,S1,S2,S3</p>
 +
                        <center><img src="https://static.igem.org/mediawiki/2014/0/01/SJTU14_animation-synthesis.gif"width=800x></img></center>
 +
                        <center><small>Figure2.2.2 the simulation of the CROWN</small></center>
 +
<h4>2. Initial Distribution of Substrates</h4>
 +
<p>All substrates are randomly distributed OUTSIDE the cell in all four simulations.</p>
 +
<h4>3. Movement of Substrates</h4>
 +
<p>The motion of molecules is random, including the rate and orientation.</p>
 +
<h4>4. Catalytic reaction</h4>
 +
<p>The time period of reaction is neglected. When the type of chemical match the type of enzyme, distance is less than threshold, then the enzyme reaction is recognized and recorded.</p>
 +
<h4>5. Other Hypothesis</h4>
 +
<p>Other physical and chemical parameters are under the scaling rule. The whole modeling combined with periodic boundary condition(PBC) to show the real performance of substrates and enzyme system.</p>
-
<td style="border:1px solid black" align="center" height ="45px" onMouseOver="this.bgColor='#d3d3d3'" onMouseOut="this.bgColor='#e7e7e7'" bgColor=#e7e7e7>
+
<h3 id="results:">Results:</h3>
-
<a href="https://2014.igem.org/Team:SJTU-BioX-Shanghai/Project"style="color:#000000"> Project</a></td>
+
<h4>All Results</h4>
 +
<center><embed width="560" height="315" src="//www.youtube.com/embed/3msikNu8D7M" frameborder="0" allowfullscreen></embed></center>
 +
 +
<a href="http://v.youku.com/v_show/id_XODAyOTY5MzM2.html" > <center>Click to watch the video</a></center>
 +
                        <center><img src="https://static.igem.org/mediawiki/2014/d/de/SJTU14-All_Results.JPG" width=700x></img></center>
 +
                        <center><small>Figure2.2.3 All the results of the four types.</small></center>
-
<td style="border:1px solid black;" align="center" height ="45px" onMouseOver="this.bgColor='#d3d3d3'" onMouseOut="this.bgColor='#e7e7e7'" bgColor=#e7e7e7>  
+
<h4 >Type 1</h4>
-
<a href="https://2014.igem.org/Team:SJTU-BioX-Shanghai/Parts"style="color:#000000"> Parts</a></td>
+
<p>Click to watch the video<a href="https://www.youtube.com/watch?v=EJQXhFBMqN4&list=UUjN3REkaTC_YulQONweFpSA"> Youtube</a>
 +
<a href="http://v.youku.com/v_show/id_XODAyMzQ4OTYw.html">Youku</a></p>
 +
                        <center><img src="https://static.igem.org/mediawiki/2014/b/ba/SJTU14-Membrane_%26_Random.JPG" width=700x></img></center>
 +
                        <center><small>Figure2.2.4 The extent of reaction of type 1.</small></center>
 +
<h4>Type 2</h4>
 +
<p>Click to watch the video<a href="https://www.youtube.com/watch?v=hjr4DZ7nJwA&list=UUjN3REkaTC_YulQONweFpSA"> Youtube</a> <a href="http://v.youku.com/v_show/id_XODAyMzQ3NjA4.html">Youku</a></p>
 +
                        <center><img src="https://static.igem.org/mediawiki/2014/a/ad/SJTU14-Membrane_%26_Polymerization.JPG" width=700x></img></center>
 +
                        <center><small>Figure2.2.5 The extent of reaction of type 2.</small></center>
 +
<h4>Type 3</h4>
 +
<p>Click to watch the video<a href="https://www.youtube.com/watch?v=W-AkV9MUITQ&list=UUjN3REkaTC_YulQONweFpSA&index=2"> Youtube</a> <a href="http://v.youku.com/v_show/id_XODAyMzQ1MDAw.html">Youku</a></p>
 +
                        <center><img src="https://static.igem.org/mediawiki/2014/0/0b/SJTU14-Matrix_%26_Random.JPG" width=700px></img></center>
 +
                        <center><small>Figure2.2.6 The extent of reaction of type 3</small></center>
 +
<h4>Type 4</h4>
 +
<p>Click to watch the video<a href="https://www.youtube.com/watch?v=eS2nZS7mFsc"> Youtube</a> <a href="http://v.youku.com/v_show/id_XODAyMzQwODg4.html">Youku</a></ul></p>
 +
                        <center><img src="https://static.igem.org/mediawiki/2014/7/7a/SJTU14-Matrix_%26_Polymerization.JPG" width=700x ></img></center>
 +
                        <center><small>Figure2.2.7 The extent of reaction of type 4</small></center><p id="dianweidian9"></br></br></br></br></p>
-
<td style="border:1px solid black;" align="center" height ="45px" onMouseOver="this.bgColor='#d3d3d3'" onMouseOut="this.bgColor='#e7e7e7'" bgColor=#e7e7e7>  
+
<!--Part II-->
-
<a href="https://2014.igem.org/Team:SJTU-BioX-Shanghai/Modeling"style="color:#000000"> Modeling</a></td>
+
<h2 id="part2">Part&ensp;II&ensp;Docking</h2>
-
<td style="border:1px solid black;" align="center" height ="45px" onMouseOver="this.bgColor='#d3d3d3'" onMouseOut="this.bgColor='#e7e7e7'" bgColor=#e7e7e7> 
+
<!--
-
<a href="https://2014.igem.org/Team:SJTU-BioX-Shanghai/Notebook"style="color:#000000"> Notebook</a></td>
+
<p>Our project is about the system involving various enzymes, mostly the series enzymes, combining into certain area. This area can be more efficient when it comes to synthesizing or degrading chemicals. So the first question is, whether this system can be so useful when distributing multiple similar areas in a single cell.</p>-->
-
<td style="border:1px solid black;" align="center"  height ="45px" onMouseOver="this.bgColor='#d3d3d3'" onMouseOut="this.bgColor='#e7e7e7'" bgColor=#e7e7e7>  
+
<h3 id="WhydoweneedDocking?">Why do we need Docking?</h3>
-
<a href="https://2014.igem.org/Team:SJTU-BioX-Shanghai/Safety"style=" color:#000000"> Safety </a></td>
+
<p>Biobrick designers and users want to understand the characteristics of a particular biobrick, especially the performance and accuracy. Because they need to answer a question, that is, were there to be a certain mutation, whether a huge change would happen to the protein function. We hope to introduce evaluation methods of bioinformatics, to evaluate binding of protein and DNA.</p>
-
<td style="border:1px solid black;" align="center"  height ="45px" onMouseOver="this.bgColor='#d3d3d3'" onMouseOut="this.bgColor='#e7e7e7'" bgColor=#e7e7e7>
+
<h3 id="Materials">Materials</h3>
-
<a href="https://2014.igem.org/Team:SJTU-BioX-Shanghai/Attributions"style="color:#000000"> Attributions </a></td>
+
 +
<p>TAL (transcription activator-like) effectors, secreted by phytopathogenic bacteria, recognize host DNA sequences through a central domain of tandem repeats. Each repeat consists of 33 to 35 conserved amino acids and targets a specific base pair by using two hypervariable residues [known as repeat variable diresidues (RVDs)] at positions 12 and 13.</p>
 +
<p><strong>PDB:3V6T</strong></p>
 +
<center><img src="https://static.igem.org/mediawiki/2014/1/10/SJTU14_original_seq.jpg" width=800px></img></center>
 +
<center><img src="https://static.igem.org/mediawiki/2014/d/d3/SJTU14_3V6T.gif"></img></center>
-
<td align ="center"> <a href="https://2014.igem.org/Main_Page"> <img src="https://static.igem.org/mediawiki/igem.org/6/60/Igemlogo_300px.png" width="55px"></a> </td>
+
<h3 id="Mutations">Mutations</h3>
-
</tr>
+
-
</table>
+
<p>We designed fifteen sequences derived from raw sequence. These mutated sequences contain different mutations, ranging from one to five. Through a series of calculations, we obtained scores to represent the binding of TAL effectors and DNA.</p>
-
</tr>
+
<p><strong><li>The highlighted Letters represent the mutation site.</li></strong></p>
-
</tr>
+
<p><strong><li>The white DNA sequences on the graph is the originated position and orange one represents the possible binding DNA.</li></strong></p>
-
</td>
+
                        <p><strong><li>The higher Docking scores, the better Docking</li></strong></p>
 +
 +
<br><br><ul style="padding-left:5%;">
 +
<li><strong>mutation-1</strong></li>
 +
<li><strong>Score:1164.128</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/8/8f/SJTU14_4M2.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/9/9b/SJTU14_seq01.jpg"></img>
 +
                        <br/><br/>
-
<tr> <td colspan="3" height="15px"> </td></tr>
+
-
<tr><td bgColor="#e7e7e7" colspan="3" height="1px"> </tr>
+
-
<tr> <td colspan="3" height="5px"> </td></tr>
+
<li><strong>mutation-2</strong></li>
 +
<li><strong>Score:1170.910</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/5/53/SJTU_14M2.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/9/9e/SJTU14_seq02.jpg"></img>
 +
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-3</strong></li>
 +
<li><strong>Score:1153.537</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/e/e1/SJTU14_3M2.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/d/d2/SJTU14_seq03.jpg"></img>
 +
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-4</strong></li>
 +
<li><strong>Score:1377.231</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/4/47/SJTU14_4.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/8/8f/SJTU14_seq04.jpg"></img>
 +
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-5</strong></li>
 +
<li><strong>Score:1169.283</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/b/b9/SJTU14_2M2.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/1/1c/SJTU14_seq05.jpg"></img>
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-6</strong></li>
 +
<li><strong>Score:1179.122</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/1/14/SJTU14_6.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/f/f0/SJTU14_seq06.jpg"></img>
 +
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-7</strong></li>
 +
<li><strong>Score:1482.902</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/7/73/SJTU14_7.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/d/dc/SJTU14_seq07.jpg"></img>
 +
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-8</strong></li>
 +
<li><strong>Score:1161.824</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/b/be/SJTU14_8.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/2/22/SJTU14_seq08.jpg"></img>
 +
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-9</strong></li>
 +
<li><strong>Score:1482.897</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/c/cb/SJTU14_9.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/f/f9/SJTU14_seq09.jpg"></img>
 +
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-10</strong></li>
 +
<li><strong>Score:1174.229</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/d/d1/SJTU14_10.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/a/ab/SJTU14_seq10.jpg"></img>
 +
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-11</strong></li>
 +
<li><strong>Score:1237.449</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/f/f3/SJTU14_11.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/0/0d/SJTU14_seq11.jpg"></img>
 +
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-12</strong></li>
 +
<li><strong>Score:1482.896</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/5/55/SJTU14_12.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/c/cc/SJTU14_seq12.jpg"></img>
 +
 +
<br/><br/>
 +
 +
 +
<li><strong>mutation-13</strong></li>
 +
<li><strong>Score:1483.352</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/6/60/SJTU14_13.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/f/f5/SJTU14_seq13.jpg"></img>
 +
-
<!--modeling content -->
+
<br/><br/>
-
<tr><td colspan="3"> <h3>Modeling</h3></td></tr>
+
-
</tr>
+
<li><strong>mutation-14</strong></li>
 +
<li><strong>Score:1482.048</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/4/4b/SJTU14_14.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/7/77/SJTU14_seq14.jpg"></img>
 +
 +
<br/><br/>
 +
 +
<li ><strong>mutation-15</strong></li>
 +
<li><strong>Score:1164.128</strong></li>
 +
<img src="https://static.igem.org/mediawiki/2014/5/58/SJTU14_15.gif"></img>
 +
<img src="https://static.igem.org/mediawiki/2014/2/21/SJTU14_seq15.jpg"></img>
 +
-
<tr>
+
                        </ul>
-
<td width="45%valign="top">  
+
<h3 id="Analysis">Analysis</h3>
-
<p>If you choose to create a model during your project, please write about it here. Modeling is not an essential part of iGEM, but we encourage any and all teams to model some aspect of their project. See previous "Best Model" awards for more information.</p>
+
<center><p><strong>Table</strong></p></center>
-
</td>
+
<center><img src="https://static.igem.org/mediawiki/2014/7/7f/SJTU14_Dcok_score.PNG"  width="800px"></img></center>
 +
<br><center><p><strong>Scatter Diagram</strong></p></center>
 +
<center><img src="https://static.igem.org/mediawiki/2014/c/c5/SJTU14_scatter.JPGwidth="800px"></img></center>
 +
                        <center><small>Figure 2.2.8 The correlation between the number of mutation sites and the docking scores.The higher docking scores indicates the better combination of TAL and target sequence.</small></center>
 +
<p>From the docking scores, we can see that in the event of single nucleotide mutation, binding of TAL effectors and DNA differs greatly from normal. However, when there are more than two mutation sites, the difference becomes less drastic.</p>
 +
<p id="dianweidian10">From the PDB document, we can find that mutation at certain sites may lead to huge conformational distortions of TAL-DNA complex. With as many as five mutations, the binding site changes greatly.</p>
 +
<p>In conclusion, we strongly recommend that TAL designers and users ensure the accuracy of DNA binding sequence. If not, the specificity of binding site will not be guaranteed.</p>
 +
        <h2>Part Ⅲ K-clique Problem</h2>
 +
              <h3 id="Problem Identification">Problem Identification</h3>
 +
                        <p>Before designing the TAL sequence, what we need to do first is to design a set of sequences, which are consist of seven fragments containing four nucleotides. In order to accomplish this, we offer a new sequence alignment algorithm for these fragmental sequences comparison. By using Loose Algorithm and Strict Algorithm, two fragmental sequences can be scored, and when the score is less than or equal to a specific value (eg 2), we can accept that they are a valid combination.</p>
 +
                        <p>As we can see, the sequence database contains a total of 4 * 4 * 4 * 4 = 256 kinds of fragmental sequences. Now, what we should do is to find out some seven sequences whose each two fragments can be a valid combination, based on a large (256*256) score table.</p>
 +
              <h3 id="Assumption and Model Formulation">Assumption and Model Formulation</h3>
 +
                        <p>We try to solve this problem by making full use of graph theory. In our model, each fragmental sequence can be treated as a node, and two “node” is connected, if they are a valid combination (the score less than or equal to 2). After doing that, we just need to find such seven nodes which are consist of a complete graph from a graph with 256 nodes. (Note: not every two nodes in original graph is connected)</p>
 +
              <h3 id="Brief introduction to graph theory">Brief introduction to graph theory</h3>
 +
                        <p>For an undirected graph G = (V, E),if  U⊆V, and for any u, v ⊆ U, (u, v) ⊆ E,  U is called complete subgraph of G.</p>
 +
                        <p>A clique in an undirected graph G = (V, E) is a subset of the vertex set C ⊆ V, such that for every two vertices in C, there exists an edge connecting the two. A maximal clique is a clique that cannot be extended by including one more adjacent vertex, that is, a clique which does not exist exclusively within the vertex set of a larger clique.</p>
 +
                        <p>U is the maximal complete subgraph of G, if and only if U is a clique of U and U is not contained in a larger subgraph.</p>
 +
                        <p>The k-clique problem is to find the complete graph with k nodes in a specific graph. What’s more, k-clique algorithm is defined in the paper "Uncovering the overlapping community structure of complex networks in nature and society" - G. Palla, I. Derényi, I. Farkas, and T. Vicsek - Nature 435, 814–818 (2005).</p>
 +
                        <p>Although a deterministic algorithm for this problem with an O(n*2^n) algorithm time complexity, fortunately, in our experiment, the problem we should solve is just with 256 nodes, so a non-deterministic algorithm can be applied.</p>
 +
              <h3 id="Solution">Solution</h3>
 +
                        <p>Early in 2005, a scientist have tried to use some kinds of Backtracking Algorithm to solve this problem, which have published in Science. Based on this excellent work, we have gone further to offer two more efficient algorithms.</p>
 +
              <h3 id="Algorithm 1">Algorithm 1</h3>
 +
                        <p>In the solution space tree containing all available solutions, we search the solution space tree from the root according to the depth-first strategy. When reaching a node, we always firstly determine whether the node include any solutions or not. If not, we don't need to search the subtree whose root is such node, and then backtrack the ancestor nodes step-by-step; otherwise, we should continue our depth-first search.</p>
 +
              <h3 id="Algorithm 2">Algorithm 2</h3>
 +
                        <p>1.Sort the degree of each node.</p>
 +
                        <p>2.In the current data set, from the first degree to the last, we remain the nodes which is relative to such degree, and delete the unconnected one.</p>
 +
                        <p>3.In the set, containing the nodes connected to the previous one, for each node, determine whether it is connected to others, and then sort the nodes based on their degree.</p>
 +
                        <p>4.Divide the problem into some much smaller size problems, repeat above method.</p>
 +
              <h3 id="Result">Result</h3>
 +
                        <p>Based on our efficient algorithm, we have found a possible solution:</p>
 +
              <img src="https://static.igem.org/mediawiki/2014/1/18/SJTU14-model_table.png" width=700x></img>
 +
              <h3 id="Discussion">Discussion</h3>   
 +
                        <p>Clique problem play an important role in graph theory as well as is quite complex. However, we have provided some valid combinations for the TAL users here.</p>     
 +
<h2 id="Reference">Reference</h2>
 +
<ol style="font-style: italic;">
 +
<li>Pierce, Brian G., Yuichiro Hourai, and Zhiping Weng. "Accelerating protein docking in ZDOCK using an advanced 3D convolution library." PloS one 6.9 (2011): e24657.</li>
 +
<li>Mintseris, Julian, et al. "Integrating statistical pair potentials into protein complex prediction." Proteins: Structure, Function, and Bioinformatics 69.3 (2007): 511-520.</li>
 +
                        <li>  G. Palla, I. Derényi, I. Farkas, and T. Vicsek. "Uncovering the overlapping community structure of complex networks in nature and society" Nature 435, 814–818 (2005)</li>
 +
</ol>
 +
 
 +
  </article>
 +
  </div>
-
<td></td>
 
-
<td></td>
 
-
</tr>
 
-
</table>
 
</html>
</html>
{{Team:SJTU-BioX-Shanghai/footer}}
{{Team:SJTU-BioX-Shanghai/footer}}

Latest revision as of 02:36, 18 October 2014

Modeling and Simulation


"All models are wrong, but some are useful." When we decided to use TAL effectors building CROWN, our project, there were three main challenges concerning the efficiency of this system. First, allowing some DNA mutations, can the CROWN be as efficient as before? Second, given that CROWN can be successfully distributed on certain area of single cell, can it make sense? Third, how to design the sequence of Golden Gate? The following three parts focus on the three questions.

Part I Single Cell

Our project is about the system involving various enzymes, mostly the series enzymes, combining into certain area. This area can be more efficient when it comes to synthesizing or degrading chemicals. So the first question is, whether this system can be so useful when distributing multiple similar areas in a single cell.

Four Types of Distribution

Type 1: Membrane & Random The position of enzyme is distributed randomly on the cell membrane.

Type 2: Membrane & Polymerization Certain enzymes are polymerized on the cell membrane.

Type 3: Matrix & Random The position of enzyme is distributed randomly inside the cell.

Type 4: Matrix & Polymerization The polymerization of certain enzymes is distributed randomly inside the cell.

Hypothesis of Simulation

1. Metabolism

Figure2.2.1 The process of the metabolism: s0, s1, s2, s3 are the substrates and E0, E1,E2 are the enzymes

Enzymes: E0, E1,E2

Substrates:S0,S1,S2,S3

Figure2.2.2 the simulation of the CROWN

2. Initial Distribution of Substrates

All substrates are randomly distributed OUTSIDE the cell in all four simulations.

3. Movement of Substrates

The motion of molecules is random, including the rate and orientation.

4. Catalytic reaction

The time period of reaction is neglected. When the type of chemical match the type of enzyme, distance is less than threshold, then the enzyme reaction is recognized and recorded.

5. Other Hypothesis

Other physical and chemical parameters are under the scaling rule. The whole modeling combined with periodic boundary condition(PBC) to show the real performance of substrates and enzyme system.

Results:

All Results

Click to watch the video
Figure2.2.3 All the results of the four types.

Type 1

Click to watch the video Youtube Youku

Figure2.2.4 The extent of reaction of type 1.

Type 2

Click to watch the video Youtube Youku

Figure2.2.5 The extent of reaction of type 2.

Type 3

Click to watch the video Youtube Youku

Figure2.2.6 The extent of reaction of type 3

Type 4

Click to watch the video Youtube Youku

Figure2.2.7 The extent of reaction of type 4





Part II Docking

Why do we need Docking?

Biobrick designers and users want to understand the characteristics of a particular biobrick, especially the performance and accuracy. Because they need to answer a question, that is, were there to be a certain mutation, whether a huge change would happen to the protein function. We hope to introduce evaluation methods of bioinformatics, to evaluate binding of protein and DNA.

Materials

TAL (transcription activator-like) effectors, secreted by phytopathogenic bacteria, recognize host DNA sequences through a central domain of tandem repeats. Each repeat consists of 33 to 35 conserved amino acids and targets a specific base pair by using two hypervariable residues [known as repeat variable diresidues (RVDs)] at positions 12 and 13.

PDB:3V6T

Mutations

We designed fifteen sequences derived from raw sequence. These mutated sequences contain different mutations, ranging from one to five. Through a series of calculations, we obtained scores to represent the binding of TAL effectors and DNA.

  • The highlighted Letters represent the mutation site.
  • The white DNA sequences on the graph is the originated position and orange one represents the possible binding DNA.
  • The higher Docking scores, the better Docking


    • mutation-1
    • Score:1164.128


    • mutation-2
    • Score:1170.910


    • mutation-3
    • Score:1153.537


    • mutation-4
    • Score:1377.231


    • mutation-5
    • Score:1169.283


    • mutation-6
    • Score:1179.122


    • mutation-7
    • Score:1482.902


    • mutation-8
    • Score:1161.824


    • mutation-9
    • Score:1482.897


    • mutation-10
    • Score:1174.229


    • mutation-11
    • Score:1237.449


    • mutation-12
    • Score:1482.896


    • mutation-13
    • Score:1483.352


    • mutation-14
    • Score:1482.048


    • mutation-15
    • Score:1164.128

    Analysis

    Table


    Scatter Diagram

    Figure 2.2.8 The correlation between the number of mutation sites and the docking scores.The higher docking scores indicates the better combination of TAL and target sequence.

    From the docking scores, we can see that in the event of single nucleotide mutation, binding of TAL effectors and DNA differs greatly from normal. However, when there are more than two mutation sites, the difference becomes less drastic.

    From the PDB document, we can find that mutation at certain sites may lead to huge conformational distortions of TAL-DNA complex. With as many as five mutations, the binding site changes greatly.

    In conclusion, we strongly recommend that TAL designers and users ensure the accuracy of DNA binding sequence. If not, the specificity of binding site will not be guaranteed.

    Part Ⅲ K-clique Problem

    Problem Identification

    Before designing the TAL sequence, what we need to do first is to design a set of sequences, which are consist of seven fragments containing four nucleotides. In order to accomplish this, we offer a new sequence alignment algorithm for these fragmental sequences comparison. By using Loose Algorithm and Strict Algorithm, two fragmental sequences can be scored, and when the score is less than or equal to a specific value (eg 2), we can accept that they are a valid combination.

    As we can see, the sequence database contains a total of 4 * 4 * 4 * 4 = 256 kinds of fragmental sequences. Now, what we should do is to find out some seven sequences whose each two fragments can be a valid combination, based on a large (256*256) score table.

    Assumption and Model Formulation

    We try to solve this problem by making full use of graph theory. In our model, each fragmental sequence can be treated as a node, and two “node” is connected, if they are a valid combination (the score less than or equal to 2). After doing that, we just need to find such seven nodes which are consist of a complete graph from a graph with 256 nodes. (Note: not every two nodes in original graph is connected)

    Brief introduction to graph theory

    For an undirected graph G = (V, E),if U⊆V, and for any u, v ⊆ U, (u, v) ⊆ E, U is called complete subgraph of G.

    A clique in an undirected graph G = (V, E) is a subset of the vertex set C ⊆ V, such that for every two vertices in C, there exists an edge connecting the two. A maximal clique is a clique that cannot be extended by including one more adjacent vertex, that is, a clique which does not exist exclusively within the vertex set of a larger clique.

    U is the maximal complete subgraph of G, if and only if U is a clique of U and U is not contained in a larger subgraph.

    The k-clique problem is to find the complete graph with k nodes in a specific graph. What’s more, k-clique algorithm is defined in the paper "Uncovering the overlapping community structure of complex networks in nature and society" - G. Palla, I. Derényi, I. Farkas, and T. Vicsek - Nature 435, 814–818 (2005).

    Although a deterministic algorithm for this problem with an O(n*2^n) algorithm time complexity, fortunately, in our experiment, the problem we should solve is just with 256 nodes, so a non-deterministic algorithm can be applied.

    Solution

    Early in 2005, a scientist have tried to use some kinds of Backtracking Algorithm to solve this problem, which have published in Science. Based on this excellent work, we have gone further to offer two more efficient algorithms.

    Algorithm 1

    In the solution space tree containing all available solutions, we search the solution space tree from the root according to the depth-first strategy. When reaching a node, we always firstly determine whether the node include any solutions or not. If not, we don't need to search the subtree whose root is such node, and then backtrack the ancestor nodes step-by-step; otherwise, we should continue our depth-first search.

    Algorithm 2

    1.Sort the degree of each node.

    2.In the current data set, from the first degree to the last, we remain the nodes which is relative to such degree, and delete the unconnected one.

    3.In the set, containing the nodes connected to the previous one, for each node, determine whether it is connected to others, and then sort the nodes based on their degree.

    4.Divide the problem into some much smaller size problems, repeat above method.

    Result

    Based on our efficient algorithm, we have found a possible solution:

    Discussion

    Clique problem play an important role in graph theory as well as is quite complex. However, we have provided some valid combinations for the TAL users here.

    Reference

    1. Pierce, Brian G., Yuichiro Hourai, and Zhiping Weng. "Accelerating protein docking in ZDOCK using an advanced 3D convolution library." PloS one 6.9 (2011): e24657.
    2. Mintseris, Julian, et al. "Integrating statistical pair potentials into protein complex prediction." Proteins: Structure, Function, and Bioinformatics 69.3 (2007): 511-520.
    3. G. Palla, I. Derényi, I. Farkas, and T. Vicsek. "Uncovering the overlapping community structure of complex networks in nature and society" Nature 435, 814–818 (2005)