Team:UESTC-Software/Validation.html
From 2014.igem.org
Tangying0608 (Talk | contribs) (Created page with "{{CSS/Main}} <html> <head> <title>Hunman Practice | UESTC Software 2014</title> <meta http-equiv=Content-Type content="text/html;charset=utf-8"> <meta http-equiv="X-UA-Compati...") |
Tangying0608 (Talk | contribs) |
||
Line 2: | Line 2: | ||
<html> | <html> | ||
<head> | <head> | ||
- | <title> | + | <title>Validation | UESTC Software 2014</title> |
<meta http-equiv=Content-Type content="text/html;charset=utf-8"> | <meta http-equiv=Content-Type content="text/html;charset=utf-8"> | ||
<meta http-equiv="X-UA-Compatible" content="IE=Edge,chrome=1" > | <meta http-equiv="X-UA-Compatible" content="IE=Edge,chrome=1" > | ||
Line 18: | Line 18: | ||
<div id="vine3" class="vine"></div> | <div id="vine3" class="vine"></div> | ||
</div> | </div> | ||
- | <h1 style="padding-top: 30px"> | + | <h1 style="padding-top: 30px">Validation</h1> |
<div class="parts" style="padding: 20px 50px 20px 100px;"> | <div class="parts" style="padding: 20px 50px 20px 100px;"> | ||
- | <div class="question" id="p1"> | + | <div class="question" id="p1">Complexity Analysis</div> |
- | <p> | + | <h2>1 Notation</h2> |
- | + | <p><span class="serif">n</span> Number of candidate sgRNAs</p> | |
- | + | <p><span class="serif">n<sub>h</sub></span> Number of hits in cache</p> | |
- | + | <p><span class="serif">n<sub>m</sub></span> Number of misses in cache</p> | |
- | + | <p><span class="serif">n<sub>t</sub></span> System will only output n<sub>t</sub> results</p> | |
- | + | <p><span class="serif">m</span> Number of possible-o_target sgRNAs</p> | |
- | + | <p><span class="serif">l</span> Length of sgRNA</p> | |
- | + | <p><span class="serif">t<sub>cd</sub></span> Time cost on connecting database</p> | |
+ | <p><span class="serif">t<sub>rd</sub></span> Approximate time cost on one database operation</p> | ||
+ | <h2>2 Analysis</h2> | ||
+ | <p>1. System will connect to Database, which costs <span class="serif">t<sub>cd</sub></span>.</p> | ||
+ | <p>2. Retrieve data from database, which costs <span class="serif">t<sub>rd</sub>* (n + m)</span>.</p> | ||
+ | <p>3. System calculate each <span class="serif">n</span> candidate sgRNAs' scores. For <span class="serif">n<sub>h</sub></span> sgRNAs which | ||
+ | has already calculated before, system will cost only <span class="serif">O(n<sub>h</sub>)</span>. For n<sub>m</sub> others, | ||
+ | system will calculate the scores. | ||
+ | According to our algorithm, those sgRNAs will compare every <span class="serif">m</span> possible- | ||
+ | o_target sgRNAs. In each comparison, algorithm compare all <span class="serif">l</span> nucleobase, | ||
+ | make a little adds and multiplies, package result to strings and save strings | ||
+ | to database. All <span class="serif">n<sub>m</sub></span> calculation will cost n<sub>m</sub>* (m*l + t<sub>rd</sub>).</p> | ||
+ | <p>4. System will sort all <span class="serif">n</span> result, which cost <span class="serif">O(n log2 n)</span>.</p> | ||
+ | <p>5. Output result costs <span class="serif">O(n<sub>t</sub>)</span>.</p> | ||
+ | <h2>3 Run Time</h2> | ||
+ | <p><span class="serif">t<sub>cd</sub> + t<sub>rd</sub>* (n + m) + O(n<sub>h</sub>) + n<sub>m</sub>* (m* l + t<sub>rd</sub>) + O(n log<sub>2</sub> n) + O(n<sub>t</sub>)</span></p> | ||
</div> | </div> | ||
<div class="parts" style="padding: 20px 50px 20px 100px;"> | <div class="parts" style="padding: 20px 50px 20px 100px;"> | ||
- | <div class="question" id="p2"> | + | <div class="question" id="p2">Validation</div> |
- | <p> | + | <p>In order to confirm our software is consistent with the experimental results, we use the experimental data on the MLE Cleavage with the different mismatches, and we compare our scoring results to corresponding to the experimental data, in addition find the correlation coefficient of them.</p> |
- | <a href="https://static.igem.org/mediawiki/2014/ | + | <h2>(A)Single mismatch</h2> |
- | <p> | + | <p>First, we use aggregate data from single-mismatch guide RNAs for 15 EMX1 targets in literature [1] (It’s relation figure is figure 2C, heat map for relative SpCas9 cleavage efficiency for each possible RNA: DNA base pair).</p> |
- | <a href="https://static.igem.org/mediawiki/2014/ | + | <a href="https://static.igem.org/mediawiki/2014/b/bd/2014-UESTC-Software-Vp1.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/b/bd/2014-UESTC-Software-Vp1.jpg"></a><p style="font-size: 14px;text-align: center;font-weight: 900;margin: 0 auto 20px;">Heat map for relative SpCas9 cleavage efficiency for each possible RNA:DNA base pair</p> |
- | <p> | + | <p>We use this set of data to determine the relationship between our software score with the MLE Cleavage for single mismatch position. MATLAB program is shown below:</p> |
- | <p> | + | <pre style='color:#d1d1d1;background:#000000;font-size:14px;padding:20px 0;'> |
+ | data<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>load</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'E:\matlab\work\igem_data.mat'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#d2cd86; '>[</span>m n<span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>size</span><span style='color:#d2cd86; '>(</span>data<span style='color:#d2cd86; '>.</span>dataigem<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>for</span> i<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n | ||
+ | ave<span style='color:#d2cd86; '>(</span>i<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>mean</span><span style='color:#d2cd86; '>(</span>data<span style='color:#d2cd86; '>.</span>dataigem<span style='color:#d2cd86; '>(</span><span style='color:#d2cd86; '>:</span><span style='color:#d2cd86; '>,</span>i<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>end</span> | ||
+ | M<span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>[</span>0<span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.014</span><span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.395</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.317</span><span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.389</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.079</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.445</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.508</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.613</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.851</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.732</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.828</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.615</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.804</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.685</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.583</span><span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>for</span> j<span style='color:#d2cd86; '>=</span>2<span style='color:#d2cd86; '>:</span>20 | ||
+ | S<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>-</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>(</span>4<span style='color:#d2cd86; '>*</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>/</span><span style='color:#d2cd86; '>(</span><span style='color:#d2cd86; '>(</span>4<span style='color:#d2cd86; '>*</span>j<span style='color:#d2cd86; '>+</span>19<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>/</span>19<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>end</span> | ||
+ | x<span style='color:#d2cd86; '>=</span>19<span style='color:#d2cd86; '>:</span><span style='color:#d2cd86; '>-</span>1<span style='color:#d2cd86; '>:</span>1<span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>figure</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span><span style='color:#9999a9; '>%title('Single mismatch,correlation coefficient=0.8840');</span> | ||
+ | |||
+ | <span style='color:#bb7977; '>subplot</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>stem</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The relaition between the single mismatch location and cleavage activity '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'location/nt'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'cleavage activity'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>subplot</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>stem</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'g'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The figure of the single mismatch location and mismatch score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'location/nt'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | |||
+ | <span style='color:#bb7977; '>figure</span><span style='color:#d2cd86; '>(</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>plot</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'b'</span><span style='color:#d2cd86; '>,</span>x<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'r'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>legend</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'mismatch cleavage activity'</span><span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'mismatch score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The contrast figure of the single mismatch cleavage activity and mismatch score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'location/nt'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'amplitude'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | p<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>polyfit</span><span style='color:#d2cd86; '>(</span>ave<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span> | ||
+ | y<span style='color:#d2cd86; '>=</span>p<span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>*</span>ave<span style='color:#d2cd86; '>+</span>p<span style='color:#d2cd86; '>(</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span><span style='color:#9999a9; '>%Fitting equation of the straight line</span> | ||
+ | <span style='color:#bb7977; '>figure</span><span style='color:#d2cd86; '>(</span>3<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>plot</span><span style='color:#d2cd86; '>(</span>ave<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'b'</span><span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>,</span>y<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'r'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The relation of the single mismatch location and mismatch score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Mismatch cleavage activity'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Mismatch score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>legend</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Relation curve'</span><span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'Fitting straight line'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | |||
+ | R<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>corrcoef</span><span style='color:#d2cd86; '>(</span>ave<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>)</span><span style='color:#9999a9; '>%find the correlation coefficient</span> | ||
+ | </pre> | ||
+ | <p>The result and figures are:</p> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/b/b7/2014-UESTC-Software-Vp2.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/b/b7/2014-UESTC-Software-Vp2.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/e/ed/2014-UESTC-Software-Vp3.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/e/ed/2014-UESTC-Software-Vp3.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/f/f0/2014-UESTC-Software-Vp4.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/f/f0/2014-UESTC-Software-Vp4.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/9/9c/2014-UESTC-Software-Vp5.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/9/9c/2014-UESTC-Software-Vp5.jpg"></a> | ||
+ | <p>Therefore, the correlation coefficient between the single mismatch cleavage activity and CRISPR-X mismatch score is 0.8840. And the fitting equation of the straight line is: Y=0.1872*X+0.0987. (X stands for mismatch score).</p> | ||
+ | <h2>(B) Multiple mismatches</h2> | ||
+ | <p>We next explored the effect between multiple base mismatches on SpCas9 target activity and our mismatch score. We use data of sets of guide RNAs that contained varying combinations of mismatches to investigate the effect of mismatch number, position and spacing on SpCas9 target cleavage activity for four targets within the <i>EMX1</i> gene. [1]</p> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/c/cc/2014-UESTC-Software-Vp6.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/c/cc/2014-UESTC-Software-Vp6.jpg"></a> | ||
+ | <p style="font-size: 14px;text-align: center;font-weight: 900;margin: 0 auto 20px;">SpCas9 target cleavage activity for multiple mismatches [1]</p> | ||
+ | <h3>(a) Two concatenated mismatches</h3> | ||
+ | <p>MATLAB program is shown below:</p> | ||
+ | <pre style='color:#d1d1d1;background:#000000;font-size:14px;padding:20px 0;'> | ||
+ | data<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>load</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'E:\matlab\work\data2misc.mat'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#d2cd86; '>[</span>m n<span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>size</span><span style='color:#d2cd86; '>(</span>data<span style='color:#d2cd86; '>.</span>data2misc<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>for</span> i<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n | ||
+ | ave<span style='color:#d2cd86; '>(</span>i<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>mean</span><span style='color:#d2cd86; '>(</span>data<span style='color:#d2cd86; '>.</span>data2misc<span style='color:#d2cd86; '>(</span><span style='color:#d2cd86; '>:</span><span style='color:#d2cd86; '>,</span>i<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>end</span> | ||
+ | N<span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>[</span>19 20<span style='color:#d2cd86; '>;</span>17 18<span style='color:#d2cd86; '>;</span>15 16<span style='color:#d2cd86; '>;</span>13 14<span style='color:#d2cd86; '>;</span>11 12<span style='color:#d2cd86; '>;</span>9 10<span style='color:#d2cd86; '>;</span>7 8<span style='color:#d2cd86; '>;</span>5 6<span style='color:#d2cd86; '>;</span>3 4<span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>;</span> | ||
+ | M<span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>[</span>0<span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.014</span><span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.395</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.317</span><span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.389</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.079</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.445</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.508</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.613</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.851</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.732</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.828</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.615</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.804</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.685</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.583</span><span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>for</span> j<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n | ||
+ | d0<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>mean</span><span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span><span style='color:#d2cd86; '>:</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | S<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>(</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>+</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>/</span><span style='color:#d2cd86; '>(</span><span style='color:#d2cd86; '>(</span>4<span style='color:#d2cd86; '>*</span>d0<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>+</span>19<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>/</span>19<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>end</span> | ||
+ | x<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n<span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>subplot</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>stem</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The two concatenated mismatches cleavage activity '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The serial number'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'cleavage activity'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>subplot</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>stem</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'g'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The two concatenated mismatches score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The serial number'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | p<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>polyfit</span><span style='color:#d2cd86; '>(</span>S<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span> | ||
+ | y<span style='color:#d2cd86; '>=</span>p<span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>*</span>S<span style='color:#d2cd86; '>+</span>p<span style='color:#d2cd86; '>(</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span><span style='color:#9999a9; '>%Fitting equation of the straight line</span> | ||
+ | <span style='color:#bb7977; '>figure</span><span style='color:#d2cd86; '>(</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>plot</span><span style='color:#d2cd86; '>(</span>S<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'b'</span><span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span>y<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'r'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The relation of the two concatenated mismatches cleavage activity and mismatch score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Mismatch score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Mismatch cleavage activity'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>legend</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Relation curve'</span><span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'Fitting straight line'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | R<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>corrcoef</span><span style='color:#d2cd86; '>(</span>ave<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>)</span> | ||
+ | </pre> | ||
+ | <p>The result and figures are:</p> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/e/e3/2014-UESTC-Software-Vp7.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/e/e3/2014-UESTC-Software-Vp7.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/c/cc/2014-UESTC-Software-Vp8.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/c/cc/2014-UESTC-Software-Vp8.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/8/8a/2014-UESTC-Software-Vp9.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/8/8a/2014-UESTC-Software-Vp9.jpg"></a> | ||
+ | <p>Therefore, the correlation coefficient between the two concatenated mismatches cleavage activity and CRISPR-X mismatch score is 0.8902. And the fitting equation of the straight line is: Y=0.0445*X-0.0103. (X stands for mismatch score).</p> | ||
+ | <h3>(b) Two interspaced mismatches</h3> | ||
+ | <p>MATLAB program is shown below:</p> | ||
+ | <pre style='color:#d1d1d1;background:#000000;font-size:14px;padding:20px 0;'> | ||
+ | data<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>load</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'E:\matlab\work\data2misi.mat'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#d2cd86; '>[</span>m n<span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>size</span><span style='color:#d2cd86; '>(</span>data<span style='color:#d2cd86; '>.</span>data2misi<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>for</span> i<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n | ||
+ | ave<span style='color:#d2cd86; '>(</span>i<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>mean</span><span style='color:#d2cd86; '>(</span>data<span style='color:#d2cd86; '>.</span>data2misi<span style='color:#d2cd86; '>(</span><span style='color:#d2cd86; '>:</span><span style='color:#d2cd86; '>,</span>i<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>end</span> | ||
+ | N<span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>[</span>18 20<span style='color:#d2cd86; '>;</span>15 20<span style='color:#d2cd86; '>;</span>11 20<span style='color:#d2cd86; '>;</span>6 20<span style='color:#d2cd86; '>;</span>1 20<span style='color:#d2cd86; '>;</span>16 18<span style='color:#d2cd86; '>;</span>13 18<span style='color:#d2cd86; '>;</span>9 18<span style='color:#d2cd86; '>;</span>4 18<span style='color:#d2cd86; '>;</span>14 16<span style='color:#d2cd86; '>;</span>11 16<span style='color:#d2cd86; '>;</span>7 16<span style='color:#d2cd86; '>;</span>2 16<span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>;</span> | ||
+ | M<span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>[</span>0<span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.014</span><span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.395</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.317</span><span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.389</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.079</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.445</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.508</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.613</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.851</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.732</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.828</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.615</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.804</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.685</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.583</span><span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>for</span> j<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n | ||
+ | d0<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>mean</span><span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span><span style='color:#d2cd86; '>:</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | S<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>(</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>+</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>/</span><span style='color:#d2cd86; '>(</span><span style='color:#d2cd86; '>(</span>4<span style='color:#d2cd86; '>*</span>d0<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>+</span>19<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>/</span>19<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>end</span> | ||
+ | x<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n<span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>subplot</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>stem</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The two interspaced mismatches cleavage activity '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The serial number'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'cleavage activity'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>subplot</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>stem</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'g'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The two interspaced mismatches score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The serial number'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | p<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>polyfit</span><span style='color:#d2cd86; '>(</span>S<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span> | ||
+ | y<span style='color:#d2cd86; '>=</span>p<span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>*</span>S<span style='color:#d2cd86; '>+</span>p<span style='color:#d2cd86; '>(</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span><span style='color:#9999a9; '>%Fitting equation of the straight line</span> | ||
+ | <span style='color:#bb7977; '>figure</span><span style='color:#d2cd86; '>(</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>plot</span><span style='color:#d2cd86; '>(</span>S<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'b'</span><span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span>y<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'r'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The relation of the two interspaced mismatches cleavage activity and mismatch score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Mismatch score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Mismatch cleavage activity'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>legend</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Relation curve'</span><span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'Fitting straight line'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | R<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>corrcoef</span><span style='color:#d2cd86; '>(</span>ave<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>)</span> | ||
+ | </pre> | ||
+ | <p>The result and figures are:</p> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/7/73/2014-UESTC-Software-Vp10.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/7/73/2014-UESTC-Software-Vp10.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/0/06/2014-UESTC-Software-Vp11.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/0/06/2014-UESTC-Software-Vp11.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/1/1c/2014-UESTC-Software-Vp12.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/1/1c/2014-UESTC-Software-Vp12.jpg"></a> | ||
+ | <p>Therefore, the correlation coefficient between the two interspaced mismatches cleavage activity and CRISPR-X mismatch score is 0.7688. And the fitting equation of the straight line is: Y=0.0866*X-0.0353. (X stands for mismatch score).</p> | ||
+ | <h3>(c) Three concatenated mismatches</h3> | ||
+ | <p>MATLAB program is shown below:</p> | ||
+ | <pre style='color:#d1d1d1;background:#000000;font-size:14px;padding:20px 0;'> | ||
+ | data<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>load</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'E:\matlab\work\data3misc.mat'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#d2cd86; '>[</span>m n<span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>size</span><span style='color:#d2cd86; '>(</span>data<span style='color:#d2cd86; '>.</span>data3misc<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>for</span> i<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n | ||
+ | ave<span style='color:#d2cd86; '>(</span>i<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>mean</span><span style='color:#d2cd86; '>(</span>data<span style='color:#d2cd86; '>.</span>data3misc<span style='color:#d2cd86; '>(</span><span style='color:#d2cd86; '>:</span><span style='color:#d2cd86; '>,</span>i<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>end</span> | ||
+ | N<span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>[</span>17 18 19<span style='color:#d2cd86; '>;</span>14 15 16<span style='color:#d2cd86; '>;</span>11 12 13<span style='color:#d2cd86; '>;</span>8 9 10<span style='color:#d2cd86; '>;</span>5 6 7<span style='color:#d2cd86; '>;</span>2 3 4<span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>;</span> | ||
+ | M<span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>[</span>0<span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.014</span><span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.395</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.317</span><span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.389</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.079</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.445</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.508</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.613</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.851</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.732</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.828</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.615</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.804</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.685</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.583</span><span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>for</span> j<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n | ||
+ | d0<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>mean</span><span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span><span style='color:#d2cd86; '>:</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | S<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span>4<span style='color:#d2cd86; '>/</span>9<span style='color:#d2cd86; '>*</span><span style='color:#d2cd86; '>(</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>+</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>+</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span>3<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>/</span><span style='color:#d2cd86; '>(</span><span style='color:#d2cd86; '>(</span>4<span style='color:#d2cd86; '>*</span>d0<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>+</span>19<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>/</span>19<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>end</span> | ||
+ | x<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n<span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>subplot</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>stem</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The three concatenated mismatches cleavage activity '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The serial number'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'cleavage activity'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>subplot</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>stem</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'g'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The three concatenated mismatches score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The serial number'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | p<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>polyfit</span><span style='color:#d2cd86; '>(</span>S<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span> | ||
+ | y<span style='color:#d2cd86; '>=</span>p<span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>*</span>S<span style='color:#d2cd86; '>+</span>p<span style='color:#d2cd86; '>(</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span><span style='color:#9999a9; '>%Fitting equation of the straight line</span> | ||
+ | <span style='color:#bb7977; '>figure</span><span style='color:#d2cd86; '>(</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>plot</span><span style='color:#d2cd86; '>(</span>S<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'b'</span><span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span>y<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'r'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The relation of the three concatenated mismatches cleavage activity and mismatch score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Mismatch score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Mismatch cleavage activity'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>legend</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Relation curve'</span><span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'Fitting straight line'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | R<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>corrcoef</span><span style='color:#d2cd86; '>(</span>ave<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>)</span> | ||
+ | </pre> | ||
+ | <p>The result and figures are:</p> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/e/e9/2014-UESTC-Software-Vp13.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/e/e9/2014-UESTC-Software-Vp13.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/9/9d/2014-UESTC-Software-Vp14.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/9/9d/2014-UESTC-Software-Vp14.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/7/75/2014-UESTC-Software-Vp15.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/7/75/2014-UESTC-Software-Vp15.jpg"></a> | ||
+ | <p>Therefore, the correlation coefficient between the three concatenated mismatches cleavage activity and CRISPR-X mismatch score is 0.8560. And the fitting equation of the straight line is: Y=0.0175*X-0.0082. (X stands for mismatch score).</p> | ||
+ | <h3>(d) Three interspaced mismatches</h3> | ||
+ | <p>MATLAB program is shown below:</p> | ||
+ | <pre style='color:#d1d1d1;background:#000000;font-size:14px;padding:20px 0;'> | ||
+ | data<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>load</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'E:\matlab\work\data3misi.mat'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#d2cd86; '>[</span>m n<span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>size</span><span style='color:#d2cd86; '>(</span>data<span style='color:#d2cd86; '>.</span>data3misi<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>for</span> i<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n | ||
+ | ave<span style='color:#d2cd86; '>(</span>i<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>mean</span><span style='color:#d2cd86; '>(</span>data<span style='color:#d2cd86; '>.</span>data3misi<span style='color:#d2cd86; '>(</span><span style='color:#d2cd86; '>:</span><span style='color:#d2cd86; '>,</span>i<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>end</span> | ||
+ | N<span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>[</span>16 18 20<span style='color:#d2cd86; '>;</span>14 17 20<span style='color:#d2cd86; '>;</span>12 16 20<span style='color:#d2cd86; '>;</span>10 15 20<span style='color:#d2cd86; '>;</span>14 16 18<span style='color:#d2cd86; '>;</span>12 15 18<span style='color:#d2cd86; '>;</span>10 14 18<span style='color:#d2cd86; '>;</span>8 13 18<span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>;</span><span style='color:#9999a9; '>%The matrix element represents the position of the sgrna</span> | ||
+ | M<span style='color:#d2cd86; '>=</span><span style='color:#d2cd86; '>[</span>0<span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.014</span><span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.395</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.317</span><span style='color:#d2cd86; '>,</span>0<span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.389</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.079</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.445</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.508</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.613</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.851</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.732</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.828</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.615</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.804</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.685</span><span style='color:#d2cd86; '>,</span><span style='color:#009f00; '>0.583</span><span style='color:#d2cd86; '>]</span><span style='color:#d2cd86; '>;</span><span style='color:#9999a9; '>%The weight of each position</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>for</span> j<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n | ||
+ | d0<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>mean</span><span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span><span style='color:#d2cd86; '>:</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | S<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>=</span>4<span style='color:#d2cd86; '>/</span>9<span style='color:#d2cd86; '>*</span><span style='color:#d2cd86; '>(</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>+</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>+</span><span style='color:#bb7977; '>exp</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>-</span>M<span style='color:#d2cd86; '>(</span>N<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>,</span>3<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>/</span><span style='color:#d2cd86; '>(</span><span style='color:#d2cd86; '>(</span>4<span style='color:#d2cd86; '>*</span>d0<span style='color:#d2cd86; '>(</span>j<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>+</span>19<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>/</span>19<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span><span style='color:#9999a9; '>%Find the Smm</span> | ||
+ | <span style='color:#e66170; font-weight:bold; '>end</span> | ||
+ | x<span style='color:#d2cd86; '>=</span>1<span style='color:#d2cd86; '>:</span>n<span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>subplot</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>stem</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The three interspaced mismatches cleavage activity '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The serial number'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'cleavage activity'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>subplot</span><span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>,</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>stem</span><span style='color:#d2cd86; '>(</span>x<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'g'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The three interspaced mismatches score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The serial number'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | p<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>polyfit</span><span style='color:#d2cd86; '>(</span>S<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>,</span>1<span style='color:#d2cd86; '>)</span> | ||
+ | y<span style='color:#d2cd86; '>=</span>p<span style='color:#d2cd86; '>(</span>1<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>*</span>S<span style='color:#d2cd86; '>+</span>p<span style='color:#d2cd86; '>(</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span><span style='color:#9999a9; '>%Fitting equation of the straight line</span> | ||
+ | <span style='color:#bb7977; '>figure</span><span style='color:#d2cd86; '>(</span>2<span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>plot</span><span style='color:#d2cd86; '>(</span>S<span style='color:#d2cd86; '>,</span>ave<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'b'</span><span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>,</span>y<span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'r'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | title<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'The relation of the three interspaced mismatches cleavage activity and mismatch score '</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | xlabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Mismatch score'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span>ylabel<span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Mismatch cleavage activity'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | <span style='color:#bb7977; '>legend</span><span style='color:#d2cd86; '>(</span><span style='color:#b060b0; '>'Relation curve'</span><span style='color:#d2cd86; '>,</span><span style='color:#b060b0; '>'Fitting straight line'</span><span style='color:#d2cd86; '>)</span><span style='color:#d2cd86; '>;</span> | ||
+ | R<span style='color:#d2cd86; '>=</span><span style='color:#bb7977; '>corrcoef</span><span style='color:#d2cd86; '>(</span>ave<span style='color:#d2cd86; '>,</span>S<span style='color:#d2cd86; '>)</span><span style='color:#9999a9; '>%Find the the correlation coefficient</span> | ||
+ | </pre> | ||
+ | <p>The result and figures are:</p> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/f/fd/2014-UESTC-Software-Vp16.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/f/fd/2014-UESTC-Software-Vp16.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/5/56/2014-UESTC-Software-Vp17.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/5/56/2014-UESTC-Software-Vp17.jpg"></a> | ||
+ | <a href="https://static.igem.org/mediawiki/2014/f/f2/2014-UESTC-Software-Vp18.jpg" target="_blank"><img src="https://static.igem.org/mediawiki/2014/f/f2/2014-UESTC-Software-Vp18.jpg"></a> | ||
+ | <p>Therefore, the correlation coefficient between the three interspaced mismatches cleavage activity and CRISPR-X mismatch score is 0.6092. And the fitting equation of the straight line is: Y=0.0065*X-0.0018. (X stands for mismatch score).</p> | ||
+ | <p>In summary, the correlation coefficient of the above-mentioned five different conditions (single mismatch, two concatenated mismatches, two interspaced mismatches, three concatenated mismatches and three interspaced mismatches) respectively are: 0.8840, 0.8902, 0.7688, 0.8566, and 0.6092. The correlation coefficients are all over 0.6, and three correlation coefficients are over 0.85. In some extent, this result demonstrated the validity and availability of our scoring algorithm.</p> | ||
+ | <p><ul><b>Reference:</b> | ||
+ | <li>[1] <i>DNA targeting specificity of RNA-guided Cas9 nucleases, Hsu et al, 2013</i></li> | ||
+ | </ul></p> | ||
</div> | </div> | ||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
<!-- Left Menu --> | <!-- Left Menu --> | ||
<div class="leftMenu"> | <div class="leftMenu"> | ||
- | <div id="go1" class="go" style="width: | + | <div id="go1" class="go" style="width: 120px;text-align: left;">Complexity Analysis</div> |
- | <div id="go2" class="go" style="width: | + | <div id="go2" class="go" style="width: 120px;text-align: left;">Validation</div> |
- | + | ||
- | + | ||
</div> | </div> | ||
Line 133: | Line 326: | ||
$(".UL>li").css("width",logoLeft/3+"px"); | $(".UL>li").css("width",logoLeft/3+"px"); | ||
$(".vines").css("left",winWidth*.2-40+"px"); | $(".vines").css("left",winWidth*.2-40+"px"); | ||
- | $(".leftMenu").css("left",winWidth*.2- | + | $(".leftMenu").css("left",winWidth*.2-170+"px"); |
} | } | ||
$(window).load(function(){ | $(window).load(function(){ |
Revision as of 16:50, 15 October 2014
Validation
1 Notation
n Number of candidate sgRNAs
nh Number of hits in cache
nm Number of misses in cache
nt System will only output nt results
m Number of possible-o_target sgRNAs
l Length of sgRNA
tcd Time cost on connecting database
trd Approximate time cost on one database operation
2 Analysis
1. System will connect to Database, which costs tcd.
2. Retrieve data from database, which costs trd* (n + m).
3. System calculate each n candidate sgRNAs' scores. For nh sgRNAs which has already calculated before, system will cost only O(nh). For nm others, system will calculate the scores. According to our algorithm, those sgRNAs will compare every m possible- o_target sgRNAs. In each comparison, algorithm compare all l nucleobase, make a little adds and multiplies, package result to strings and save strings to database. All nm calculation will cost nm* (m*l + trd).
4. System will sort all n result, which cost O(n log2 n).
5. Output result costs O(nt).
3 Run Time
tcd + trd* (n + m) + O(nh) + nm* (m* l + trd) + O(n log2 n) + O(nt)
In order to confirm our software is consistent with the experimental results, we use the experimental data on the MLE Cleavage with the different mismatches, and we compare our scoring results to corresponding to the experimental data, in addition find the correlation coefficient of them.
(A)Single mismatch
First, we use aggregate data from single-mismatch guide RNAs for 15 EMX1 targets in literature [1] (It’s relation figure is figure 2C, heat map for relative SpCas9 cleavage efficiency for each possible RNA: DNA base pair).
Heat map for relative SpCas9 cleavage efficiency for each possible RNA:DNA base pair
We use this set of data to determine the relationship between our software score with the MLE Cleavage for single mismatch position. MATLAB program is shown below:
data=load('E:\matlab\work\igem_data.mat'); [m n]=size(data.dataigem); for i=1:n ave(i)=mean(data.dataigem(:,i)); end M=[0,0,0.014,0,0,0.395,0.317,0,0.389,0.079,0.445,0.508,0.613,0.851,0.732,0.828,0.615,0.804,0.685,0.583]; for j=2:20 S(j-1)=(4*exp(1-M(j)))/((4*j+19)/19); end x=19:-1:1; figure(1);%title('Single mismatch,correlation coefficient=0.8840'); subplot(1,2,1); stem(x,ave); title('The relaition between the single mismatch location and cleavage activity '); xlabel('location/nt');ylabel('cleavage activity'); subplot(1,2,2); stem(x,S,'g'); title('The figure of the single mismatch location and mismatch score '); xlabel('location/nt');ylabel('score'); figure(2); plot(x,ave,'b',x,S,'r'); legend('mismatch cleavage activity','mismatch score'); title('The contrast figure of the single mismatch cleavage activity and mismatch score '); xlabel('location/nt');ylabel('amplitude'); p=polyfit(ave,S,1) y=p(1)*ave+p(2);%Fitting equation of the straight line figure(3); plot(ave,S,'b',ave,y,'r'); title('The relation of the single mismatch location and mismatch score '); xlabel('Mismatch cleavage activity');ylabel('Mismatch score'); legend('Relation curve','Fitting straight line'); R=corrcoef(ave,S)%find the correlation coefficient
The result and figures are:
Therefore, the correlation coefficient between the single mismatch cleavage activity and CRISPR-X mismatch score is 0.8840. And the fitting equation of the straight line is: Y=0.1872*X+0.0987. (X stands for mismatch score).
(B) Multiple mismatches
We next explored the effect between multiple base mismatches on SpCas9 target activity and our mismatch score. We use data of sets of guide RNAs that contained varying combinations of mismatches to investigate the effect of mismatch number, position and spacing on SpCas9 target cleavage activity for four targets within the EMX1 gene. [1]
SpCas9 target cleavage activity for multiple mismatches [1]
(a) Two concatenated mismatches
MATLAB program is shown below:
data=load('E:\matlab\work\data2misc.mat'); [m n]=size(data.data2misc); for i=1:n ave(i)=mean(data.data2misc(:,i)); end N=[19 20;17 18;15 16;13 14;11 12;9 10;7 8;5 6;3 4]; M=[0,0,0.014,0,0,0.395,0.317,0,0.389,0.079,0.445,0.508,0.613,0.851,0.732,0.828,0.615,0.804,0.685,0.583]; for j=1:n d0(j)=mean(N(j,:)); S(j)=(exp(1-M(N(j,1)))+exp(1-M(N(j,2))))/((4*d0(j)+19)/19); end x=1:n; subplot(1,2,1); stem(x,ave); title('The two concatenated mismatches cleavage activity '); xlabel('The serial number');ylabel('cleavage activity'); subplot(1,2,2); stem(x,S,'g'); title('The two concatenated mismatches score '); xlabel('The serial number');ylabel('score'); p=polyfit(S,ave,1) y=p(1)*S+p(2);%Fitting equation of the straight line figure(2); plot(S,ave,'b',S,y,'r'); title('The relation of the two concatenated mismatches cleavage activity and mismatch score '); xlabel('Mismatch score');ylabel('Mismatch cleavage activity'); legend('Relation curve','Fitting straight line'); R=corrcoef(ave,S)
The result and figures are:
Therefore, the correlation coefficient between the two concatenated mismatches cleavage activity and CRISPR-X mismatch score is 0.8902. And the fitting equation of the straight line is: Y=0.0445*X-0.0103. (X stands for mismatch score).
(b) Two interspaced mismatches
MATLAB program is shown below:
data=load('E:\matlab\work\data2misi.mat'); [m n]=size(data.data2misi); for i=1:n ave(i)=mean(data.data2misi(:,i)); end N=[18 20;15 20;11 20;6 20;1 20;16 18;13 18;9 18;4 18;14 16;11 16;7 16;2 16]; M=[0,0,0.014,0,0,0.395,0.317,0,0.389,0.079,0.445,0.508,0.613,0.851,0.732,0.828,0.615,0.804,0.685,0.583]; for j=1:n d0(j)=mean(N(j,:)); S(j)=(exp(1-M(N(j,1)))+exp(1-M(N(j,2))))/((4*d0(j)+19)/19); end x=1:n; subplot(1,2,1); stem(x,ave); title('The two interspaced mismatches cleavage activity '); xlabel('The serial number');ylabel('cleavage activity'); subplot(1,2,2); stem(x,S,'g'); title('The two interspaced mismatches score '); xlabel('The serial number');ylabel('score'); p=polyfit(S,ave,1) y=p(1)*S+p(2);%Fitting equation of the straight line figure(2); plot(S,ave,'b',S,y,'r'); title('The relation of the two interspaced mismatches cleavage activity and mismatch score '); xlabel('Mismatch score');ylabel('Mismatch cleavage activity'); legend('Relation curve','Fitting straight line'); R=corrcoef(ave,S)
The result and figures are:
Therefore, the correlation coefficient between the two interspaced mismatches cleavage activity and CRISPR-X mismatch score is 0.7688. And the fitting equation of the straight line is: Y=0.0866*X-0.0353. (X stands for mismatch score).
(c) Three concatenated mismatches
MATLAB program is shown below:
data=load('E:\matlab\work\data3misc.mat'); [m n]=size(data.data3misc); for i=1:n ave(i)=mean(data.data3misc(:,i)); end N=[17 18 19;14 15 16;11 12 13;8 9 10;5 6 7;2 3 4]; M=[0,0,0.014,0,0,0.395,0.317,0,0.389,0.079,0.445,0.508,0.613,0.851,0.732,0.828,0.615,0.804,0.685,0.583]; for j=1:n d0(j)=mean(N(j,:)); S(j)=4/9*(exp(1-M(N(j,1)))+exp(1-M(N(j,2)))+exp(1-M(N(j,3))))/((4*d0(j)+19)/19); end x=1:n; subplot(1,2,1); stem(x,ave); title('The three concatenated mismatches cleavage activity '); xlabel('The serial number');ylabel('cleavage activity'); subplot(1,2,2); stem(x,S,'g'); title('The three concatenated mismatches score '); xlabel('The serial number');ylabel('score'); p=polyfit(S,ave,1) y=p(1)*S+p(2);%Fitting equation of the straight line figure(2); plot(S,ave,'b',S,y,'r'); title('The relation of the three concatenated mismatches cleavage activity and mismatch score '); xlabel('Mismatch score');ylabel('Mismatch cleavage activity'); legend('Relation curve','Fitting straight line'); R=corrcoef(ave,S)
The result and figures are:
Therefore, the correlation coefficient between the three concatenated mismatches cleavage activity and CRISPR-X mismatch score is 0.8560. And the fitting equation of the straight line is: Y=0.0175*X-0.0082. (X stands for mismatch score).
(d) Three interspaced mismatches
MATLAB program is shown below:
data=load('E:\matlab\work\data3misi.mat'); [m n]=size(data.data3misi); for i=1:n ave(i)=mean(data.data3misi(:,i)); end N=[16 18 20;14 17 20;12 16 20;10 15 20;14 16 18;12 15 18;10 14 18;8 13 18];%The matrix element represents the position of the sgrna M=[0,0,0.014,0,0,0.395,0.317,0,0.389,0.079,0.445,0.508,0.613,0.851,0.732,0.828,0.615,0.804,0.685,0.583];%The weight of each position for j=1:n d0(j)=mean(N(j,:)); S(j)=4/9*(exp(1-M(N(j,1)))+exp(1-M(N(j,2)))+exp(1-M(N(j,3))))/((4*d0(j)+19)/19);%Find the Smm end x=1:n; subplot(1,2,1); stem(x,ave); title('The three interspaced mismatches cleavage activity '); xlabel('The serial number');ylabel('cleavage activity'); subplot(1,2,2); stem(x,S,'g'); title('The three interspaced mismatches score '); xlabel('The serial number');ylabel('score'); p=polyfit(S,ave,1) y=p(1)*S+p(2);%Fitting equation of the straight line figure(2); plot(S,ave,'b',S,y,'r'); title('The relation of the three interspaced mismatches cleavage activity and mismatch score '); xlabel('Mismatch score');ylabel('Mismatch cleavage activity'); legend('Relation curve','Fitting straight line'); R=corrcoef(ave,S)%Find the the correlation coefficient
The result and figures are:
Therefore, the correlation coefficient between the three interspaced mismatches cleavage activity and CRISPR-X mismatch score is 0.6092. And the fitting equation of the straight line is: Y=0.0065*X-0.0018. (X stands for mismatch score).
In summary, the correlation coefficient of the above-mentioned five different conditions (single mismatch, two concatenated mismatches, two interspaced mismatches, three concatenated mismatches and three interspaced mismatches) respectively are: 0.8840, 0.8902, 0.7688, 0.8566, and 0.6092. The correlation coefficients are all over 0.6, and three correlation coefficients are over 0.85. In some extent, this result demonstrated the validity and availability of our scoring algorithm.
- Reference:
- [1] DNA targeting specificity of RNA-guided Cas9 nucleases, Hsu et al, 2013