Team:Kent/sequencealign

From 2014.igem.org

(Difference between revisions)
 
Line 32: Line 32:
<div class="content">
<div class="content">
-
    <p>&nbsp;</p>
 
     <table width="950" border="0">
     <table width="950" border="0">
       <tr>
       <tr>

Latest revision as of 22:11, 17 October 2014

Univeristy of Kent iGEM

 

Sequences, Sequence Alignment and Primer Design

Overview:

Sequence Alignment:

After researching fragrance-producing enzymes in plants, we selected two terpene synthase genes from a protein database (Uniprot). These genes were Zingerberene Synthase and R-linalool Synthase respectively. We modified these sequences to optimize translation of the gene product according to the frequency of codon usage in bacteria. The sequences were then further modified to remove endogenous restriction enzyme sites, allowing us to carry out restriction digests without cutting the coding region. We then added a 50 base pair sequence prefix and suffix region to the 5' and 3' ends of our construct respectively corresponding to the vector pSB1C3, allowing us to clone our construct into this plasmid using Gibson Assembly.

Gibson Assembly Schematic:

Where A (in red) corresponds to the vector 50 bp pSB1C3 prefix region, B (in green) corresponds to the pSB1C3 suffix region, and the yellow region corresponding to either the R-linalool Synthase or Zingiberene Synthase gene. (Figure1.)

 
 

Primers:

The red and green arrows represent the forward and reverse primers respectively that correspond to the forward and reverse complement of the sequences in A (region containing prefix) and B (region containing suffix) from the pSB1C3 plasmid. Since these regions were incorporated at the 3' and 5' ends of both our constructs, these primers can therefore be used to amplify either one of our gene fragments as well the pSB1C3 vectors using the polymerase chain reaction (PCR). We designed the primers using IDT oligo analyzer1 to ensure that they had a suitable melting temperature, did not form secondary structures and were of a suitable length. The forward and reverse primer sequences and parameters are outlined below.

 
 

The annealing sites of the oligonucleotides are outlined in white in the construct sequences for R-linalool Synthase and Zingiberene Synthase in the next sections.

Our Genes:

R linalool synthase:

This gene encodes an enzyme that catalyzes the conversion of the mevalonate pathway intermediate Geranyl diphosphate (GPP) into R-linalool. This particular isomer of Linalool has a fragrance resembling that of a woody lavender smell. We found the gene sequence (taken from Lavender) on Uniprot:

This sequence was then codon optimised for use in E.coli by running the sequence through the IDT codon optomisation tool2. Running this new codon optimised sequence through NEB cutter3 then identified endogenous restriction enzyme sites. Restriction sites corresponding to SpeI, EcoRI, XbaI, and PstI where then removed by creating a one base silent substitution at each endogenous site. Appropriate bases for the substitutions were selected via a codon usage table optimised for E.coli. A fifty base pair sequence corresponding to the prefix and upstream region of the vector PSB1C3 was added to the 5' end of the codon optimized gene such that the first base of the start codon overlapped with the final base of the XbaI restriction site. The 50bp suffix region of the same vector (PSB1C3) was added to the 3'end of the gene to obtain our final construct. The coding part of the final construct was run through Expasy Translate tool4 and aligned with the uniprot protein primary sequence with a 100% match using EBI Clustal Omega5 sequence alignment software. We ordered this sequence as a fragment from the DNA synthesis company Life Technologies and, on receiving the artificially synthesized fragment, cloned it into the pSB1C3 vector provided by iGEM HQ using Gibson Assembly method.

Final Optimized Sequence: R-Linalool Synthase:

http://www.uniprot.org/uniprot/Q2XSC5
http://www.ebi.ac.uk/ena/data/view/ABB73045

 
 
 

Zingiberene Synthase:

This gene product produces Zingiberene (ginger scent) from (2E,6E)-farnesyl diphosphate (FPP) and is found in the long grass Sorghum Bicolor. http://www.uniprot.org/uniprot/C5YHH7 The sequence was taken from Uniprot, however, incomplete mRNA sequencing meant that we had to use the amino acid sequence of the protein to determine the missing codons via a codon usage table. This sequence was then processed in a similar manner to R-linalool synthase using the same IDT codon optimisation tool and then removing restriction sites using the same process as outlined above for R-linalool Synthase. Final Optimised Sequence:

Zingiberene Synthase:
http://www.uniprot.org/uniprot/C5YHH7

 
 

Online tools used:

1. IDT Oligo Analyzer: https://www.idtdna.com/analyzer/Applications/OligoAnalyzer/

2. IDT Codon Optimization Tool: hhttps://www.idtdna.com/CodonOpt

3. NEB Cutter: http://tools.neb.com/NEBcutter2/

4. Expasy Translate Tool: http://web.expasy.org/translate/

5. Clustal Omega Mulitple Sequence Alignment: http://www.ebi.ac.uk/Tools/msa/clustalo/