Team:SJTU-Software/Database

From 2014.igem.org

(Difference between revisions)
Line 487: Line 487:
   <tr><th rowspan="18">BBa_
   <tr><th rowspan="18">BBa_
 <tr><th>A<td>6<td>BBa_A340620
 <tr><th>A<td>6<td>BBa_A340620
 +
  <tr><th>B<td>4<td>BBa_B0034
 +
  <tr><th>C<td>4/5/6<td>BBa_C0053; BBa_C10001
 +
  <tr><th>E<td>4<td>BBa_E5504
 +
  <tr><th>F<td>4<td>BBa_F2622
 +
  <tr><th>G<td>4/5<td>BBa_G0011; BBa_G00500
 +
  <tr><th>I<td>4/5/6<td>BBa_I10018
 +
  <tr><th>J<td>4/5/6<td>BBa_J52100; BBa_J540013
 +
  <tr><th>K<td>6/7<td>BBa_K374013; BBa_K1218016
 +
  <tr><th>M<td>4/5<td>BBa_M1904; BBa_M31000
 +
  <tr><th>P<td>4<td>BBa_P2007
 +
  <tr><th>Q<td>5/6<td>BBa_Q200514
 +
  <tr><th>R<td>4<td>BBa_R4037
 +
  <tr><th>S<td>5<td>BBa_S01297
 +
  <tr><th>T<td>4<td>BBa_T1009
 +
  <tr><th>V<td>4<td>BBa_V1022
 +
  <tr><th>Y<td>5<td>BBa_Y00100
 +
  <tr><th>Z<td>4/5<td>BBa_Z0506; BBa_Z52935
 
 
</table>   
</table>   
-
<p class="text"  style="margin:1% 60px auto"><font face="SimSun" size="1px"  color="#FFFFFF">Table 2.1.1 Sources of the data in the biobrick database, using BBa_B0034 as an example
+
<p class="text"  style="margin:1% 60px auto"><font face="SimSun" size="1px"  color="#FFFFFF"> Z 4/5 BBa_Z0506; BBa_Z52935
 +
Table 2.1.2 Rules in naming biobricks starting with “BBa_”
 +
 
</font></p>  
</font></p>  
        
        

Revision as of 17:39, 15 October 2014




3. Database

Here, we have reconstructed the database of the current existing biobricks. All the necessary information available on the Registry are collected and used to construct our own database. In this part, the source of data, the way of data collection, database structure and some facts of the biobricks in the database are introduced.



3.1.1 Data collection

Data are collected from 5 different websites on the Registry and Google Scholar.
The data needed in the reconstructed biobrick database is available from 5 sources, with four of them from the websites of Registry of Standard Biological Parts and 1 of them from Google Scholar. Addresses of those sources for BBa_B0034 are listed in Table 2.1.1, and the part name could be replaced by any biobrick.

 
Name of WebsitesAddress  
XML formathttp://parts.igem.org/cgi/xml/part.cgi?part=BBa_B0034  
Hard informationhttp://parts.igem.org/cgi/partsdb/part_info.cgi?part_name=BBa_B0034
Get parthttp://parts.igem.org/partsdb/get_part.cgi?part=BBa_B0034
Experiencehttp://parts.igem.org/Part:BBa_B0034:Experience
Google scholarhttp://scholar.google.com.cn/scholar?q=BBa_B0034

Table 2.1.1 Sources of the data in the biobrick database, using BBa_B0034 as an example

PERL scripts are composed to connect to those links and to pick out the information we need. In order to connect to those links, we need to understand the way how biobricks on the website are named. There exist two main ways to name the biobricks, which are illustrated in Table 2.1.2 and Table 2.1.3. “BBa” is used for most biobricks, names of which consist of 3 parts. The first part is “BBa_”; the second part is a capitalized letter; the third part is several digits, the number of which depends on the previous capitalized letter (possible combinations are listed in Table 2.1.2). “pSB” is specially used for plasmids, names of which consist of 4 parts. The first part is “pSB”; the second part is one digit ranging from 1 to 9; the third part is a single pattern or a combination of two patterns listed; the fourth part can be any number from 1 to 29.

 
The First Part of the NameThe Second Part of the NameThe Third Part of the Name (Number of Digits)Examples
BBa_  
A6BBa_A340620
B4BBa_B0034
C4/5/6BBa_C0053; BBa_C10001
E4BBa_E5504
F4BBa_F2622
G4/5BBa_G0011; BBa_G00500
I4/5/6BBa_I10018
J4/5/6BBa_J52100; BBa_J540013
K6/7BBa_K374013; BBa_K1218016
M4/5BBa_M1904; BBa_M31000
P4BBa_P2007
Q5/6BBa_Q200514
R4BBa_R4037
S5BBa_S01297
T4BBa_T1009
V4BBa_V1022
Y5BBa_Y00100
Z4/5BBa_Z0506; BBa_Z52935  

Z 4/5 BBa_Z0506; BBa_Z52935 Table 2.1.2 Rules in naming biobricks starting with “BBa_”



3.1. 2 Database Structure

The structure of the database is presented in Figure 3.1. The primary key in table “Main” is “part name”, with all the other tables linked to it by identical “part name”. There is no primary key in the tables except “Main” because biobricks could belong to more than one category or have more than one twins. Thus, the part name in the rest tables could be repeated for several times, depending on different biobricks.

Figure 3.1 The structure of the reconstructed biobrick database



SJTU-Software

Shanghai Jiao Tong University

800, Dongchuan Road

200240 Shanghai, China

back to top