Team:HFUT CHINA/Examples.html

From 2014.igem.org

Revision as of 14:54, 17 October 2014 by Jadmjgp (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

HFUT_CHINA | meetup

Results

The algorithms used in Biodesigner could be grouped into two categories: preprocessing part and executing part.

Preprocessing Part

The preprocessing part includes three algorithms. Here we assume there are n devices and each device contains at most m biobrickes.

The first one is to count the probabilities of two successive biobrickes, three successive biobrickes, four successive biobrickes and five successive biobrickes. Here successive biobrickes means the biobrickes are neighbors. Thus the number of two successive biobrickes, three successive biobrickes, four successive biobrickes and five successive biobrickes are n(m-1), n(m-2), n(m-3), n(m-4), respectively. Hash tables are used to store these successive biobrickes. Therefore, the probabilities could be obtained by scanning the device chains four times. Thus, the time complexity of this algorithm is O(2n(m-1)+3n(m-2)+4n(m-3)+5n(m-3))=O(nm).

The second one is to count the probabilities of the first biobrick of each device chain. This work could be done by scanning the first biobrick of each chain. Thus, the complexity of this algorithm is O(n).

The third one is to calculate the hamming distances of each paired biobrickes. Since there are n device chains and each device contains at most m biobrickes. Therefore, there are at most nm biobrickes, which means there are at most nm*nm paired biobrickes. The calculation of hamming distances between two sequences with length l could be done in O(l) time. Thus, the complexity of this algorithm is O(l*nm*nm)=O(ln2m2).

Hence, the time complexity of preprocessing part is O(nm)+O(n)+O(ln2m2) =O(ln2m2).

Executing Part

The executing part contains two algorithms. Here we assume the given device chain contains at most m biobrickes.

The first one is to inquire the probabilities of two successive biobrickes, three successive biobrickes, four successive biobrickes and five successive biobrickes of a given device chain. Since all the data is stored in a hash table and the access of hash table needs O(1) time. Thus, the time complexity of this algorithm is O(m).

The second one is to store the given device chain. This work is done by store the two successive biobrickes, three successive biobrickes, four successive biobrickes and five successive biobrickes of the given device chain into the hash table. Thus, the time complexity of this algorithm is O(m).

Hence, the time complexity of executing part is O(m).

Note, we only need to execute the preprocessing part once. After the execution of preprocessing part, all the data is store in database. When using our software, users do not need to waste their time for the preprocessing part as it has been executed.

Function	Response Time (sec.)
Query of Biobrick information	0.04
Query of Biobrick’s twins	0.03
Ambiguous Query of Biobrick	0.1

Number of Biobricks	Response Time(sec.)
0	0.3
1	0.19
2	0.14
3	0.27
4	0.35

Device Chain	Correctly recommended biobricks	Recommendation ratio
BBa_R0040-BBa_B0034-BBa_J36835-BBa_J36837-BBa_K283009-BBa_B0010-BBa_B0012	BBa_B0034 BBa_J36837 BBa_K283009 BBa_B0010 BBa_B0012	71.4%
BBa_R0010-BBa_B0030-BBa_K233307-BBa_I732005-BBa_E0040-BBa_B0010-BBa_B0012	BBa_B0030 BBa_K233307 BBa_I732005 BBa_E0040 BBa_B0010 BBa_B0012	85.7%
BBa_J23118-BBa_B0030-BBa_K233306-BBa_I732005-BBa_E0040-BBa_B0010-BBa_B0012	BBa_K233306 BBa_I732005 BBa_E0040 BBa_B0010 BBa_B0012	71.4%
BBa_R0040-BBa_B0034-BBa_K255000-BBa_B0010-BBa_B0012-BBa_R0040-BBa_B0034-BBa_E0030-BBa_B0010-BBa_B0012	BBa_B0034 BBa_B0010 BBa_B0012 BBa_R0040 BBa_B0034 BBa_E0030 BBa_B0010 BBa_B0012	80%
BBa_J23119-BBa_B0034-BBa_C0040-BBa_B0034-BBa_E1010-BBa_B0010-BBa_B0012	BBa_B0034 BBa_C0040 BBa_B0034 BBa_E1010 BBa_B0010 BBa_B0012	85.7%
BBa_J23106-BBa_B0034-BBa_C0012-BBa_B0034-BBa_C0062-BBa_B0012-BBa_B0011	BBa_C0012 BBa_B0034 BBa_C0062 BBa_B0012 BBa_B0011	71.4%
BBa_K145150-BBa_B0034-BBa_K145151-BBa_B0034-BBa_E1010-BBa_B0012-BBa_B0011	BBa_K145151 BBa_B0034 BBa_E1010 BBa_B0012 BBa_B0011	71.4%
BBa_R0040-BBa_B0034-BBa_C0179-BBa_B0010-BBa_B0012-BBa_R0079	BBa_B0034 BBa_B0010 BBa_B0012 BBa_R0079	66.7%
BBa_R0040-BBa_K118012-BBa_I742151-BBa_J15001-BBa_K118002	BBa_I742151 BBa_J15001 BBa_K118002	60%
BBa_I14032-BBa_B0034-BBa_C0062-BBa_B0010-BBa_B0012-BBa_R0062-BBa_J61127-BBa_K228000-BBa_B0010-BBa_B0012	BBa_B0034 BBa_C0062 BBa_B0010 BBa_B0012 BBa_R0062 BBa_J61127 BBa_K228000 BBa_B0010 BBa_B0012	90%

Device Chain	Correctly recommended biobricks	Recommendation ratio
BBa_B0034-BBa_C0071-BBa_B0010-BBa_B0012-BBa_R0071-BBa_B0030-BBa_E1010	BBa_B0010 BBa_B0012 BBa_R0071 BBa_E1010	57.1%
BBa_J23100-BBa_B0034-BBa_C0179-BBa_B0010-BBa_B0012-BBa_R0079-BBa_B0030-BBa_E1010	BBa_B0034 BBa_B0010 BBa_B0012 BBa_R0079 BBa_E1010	62.5%
BBa_R0079-BBa_B0034-BBa_E0040-BBa_B0010-BBa_B0012	BBa_B0034 BBa_E0040 BBa_B0010 BBa_B0012	80%
BBa_R0079-BBa_B0030-BBa_E1010-BBa_B0010-BBa_B0012	BBa_E1010 BBa_B0010 BBa_B0012	60%
BBa_J23100-BBa_B0034-BBa_C0179-BBa_B0010-BBa_B0012-BBa_R0079-BBa_B0034-BBa_E0040	BBa_B0034 BBa_B0010 BBa_B0012 BBa_R0079 BBa_B0034 BBa_E0040	75%
BBa_B0034-BBa_C0171-BBa_B0010-BBa_B0012-BBa_R0071-BBa_B0030-BBa_E1010-BBa_B0010-BBa_B0012	BBa_B0010 BBa_B0012 BBa_R0071 BBa_E1010 BBa_B0010 BBa_B0012	66.7%