Team:Harvard/Template:NotebookData

From 2011.igem.org

(Difference between revisions)
Line 1: Line 1:
<div id="606" style="display:none">
<div id="606" style="display:none">
==June 6th==
==June 6th==
-
First day if iGEM.
+
First day if iGEM.</div>
-
</div>
+
-
 
+
<div id="607" style="display:none">
<div id="607" style="display:none">
== June 7th ==
== June 7th ==
Line 12: Line 10:
Procedure: followed Qiagen Kit instructions, each student (8) using 1 mL cell suspension  
Procedure: followed Qiagen Kit instructions, each student (8) using 1 mL cell suspension  
-
Results: DNA reasonably pure (260/280 between 1.8 and 2) and between 25 and 50 ng/µL
+
Results: DNA reasonably pure (260/280 between 1.8 and 2) and between 25 and 50 ng/µL</div>
-
</div>
+
-
 
+
<div id="608" style="display:none">
<div id="608" style="display:none">
-
 
== June 8th ==
== June 8th ==
'''PCR to connect ultramers into OZ052 (Zif268 F2 triplicate, GCCGATGTC)and OZ123 (Zif268 F2 triplicate, GAGTGGTTA):'''
'''PCR to connect ultramers into OZ052 (Zif268 F2 triplicate, GCCGATGTC)and OZ123 (Zif268 F2 triplicate, GAGTGGTTA):'''
Line 45: Line 40:
*5) Repeat 2-4 for 25 cycles
*5) Repeat 2-4 for 25 cycles
*6) 68⁰C for 5 min
*6) 68⁰C for 5 min
-
*7) 4⁰C forever
+
*7) 4⁰C forever</div>
-
</div>
+
-
 
+
<div id="609" style="display:none">
<div id="609" style="display:none">
== June 9th ==
== June 9th ==
Line 215: Line 208:
|}
|}
-
Follow up work on this will be to convert this table to frequencies instead of values: values are less meaningful.
+
Follow up work on this will be to convert this table to frequencies instead of values: values are less meaningful. </div>
-
</div>
+
-
 
+
<div id="610" style="display:none">
<div id="610" style="display:none">
== June 10th ==
== June 10th ==
Line 303: Line 294:
|}
|}
-
Follow up work here is to check more properties, and maybe try individual pairings (ex. phobic-philic, polar-phillic).
+
Follow up work here is to check more properties, and maybe try individual pairings (ex. phobic-philic, polar-phillic).</div>
-
</div>
+
-
 
+
<div id="613" style="display:none">
<div id="613" style="display:none">
== June 13th ==
== June 13th ==
Line 378: Line 367:
:: add pseudo counts (call add_pseudo) and generate a dependent random call for a position (using generate_indep on the adjusted matrix)
:: add pseudo counts (call add_pseudo) and generate a dependent random call for a position (using generate_indep on the adjusted matrix)
-
We finished generate_indep, generate_dep, and add_pseudo today, along with creating a 140x140 matrix of needed values.
+
We finished generate_indep, generate_dep, and add_pseudo today, along with creating a 140x140 matrix of needed values.</div>
-
</div>
+
-
 
+
<div id="614" style="display:none">
<div id="614" style="display:none">
== June 14 ==
== June 14 ==
Line 448: Line 435:
-
Brandon learned basic Python today. Justin created a JavaScript program that recognizes potential binding sites from a given sequence.
+
Brandon learned basic Python today. Justin created a JavaScript program that recognizes potential binding sites from a given sequence.</div>
-
</div>
+
-
 
+
<div id="615" style="display:none">
<div id="615" style="display:none">
==June 15th==
==June 15th==
Line 623: Line 608:
  |GAGGCGTGGC
  |GAGGCGTGGC
  |[http://www.pdb.org/pdb/explore/explore.do?structureId=2WBU]
  |[http://www.pdb.org/pdb/explore/explore.do?structureId=2WBU]
-
  |}
+
  |}</div>
-
</div>
+
-
 
+
<div id="616" style="display:none">
<div id="616" style="display:none">
==June 16th==
==June 16th==
Line 771: Line 754:
*Many of our current programs currently look at overall data or data based on specific DNA triplets (for example: 'GAT' or 'AAA'). However, in order to more easily understand some of the patterns that occur in the datasets, we want to examine broader subsets of data. For example, do different patterns appear when looking at fingers that bind to 'GNN' triplets versus 'NGN' triplets (where 'N' represents any of the 4 bases)?
*Many of our current programs currently look at overall data or data based on specific DNA triplets (for example: 'GAT' or 'AAA'). However, in order to more easily understand some of the patterns that occur in the datasets, we want to examine broader subsets of data. For example, do different patterns appear when looking at fingers that bind to 'GNN' triplets versus 'NGN' triplets (where 'N' represents any of the 4 bases)?
**We added the capability for our programs to accept inputs with the variable 'N' by using regular expressions.  
**We added the capability for our programs to accept inputs with the variable 'N' by using regular expressions.  
-
***We can now create lists of the zinc fingers that bind to any triplet, and create interaction matrices and frequency tables for any triplet input.
+
***We can now create lists of the zinc fingers that bind to any triplet, and create interaction matrices and frequency tables for any triplet input.</div>
-
</div>
+
-
 
+
<div id="617" style="display:none">
<div id="617" style="display:none">
==June 17th==
==June 17th==
Line 832: Line 813:
#OpenWetWareCodonUsage http://openwetware.org/wiki/Escherichia_coli/Codon_usage
#OpenWetWareCodonUsage http://openwetware.org/wiki/Escherichia_coli/Codon_usage
#NIHRareCodonCalculator http://nihserver.mbi.ucla.edu/RACC/
#NIHRareCodonCalculator http://nihserver.mbi.ucla.edu/RACC/
-
</biblio>
+
</biblio></div>
-
</div>
+
-
 
+
-
 
+
<div id="620" style="display:none">
<div id="620" style="display:none">
==June 20th==
==June 20th==
Line 880: Line 858:
| [[File:nnc_probs.png|thumb|left|Probability data for the 262 fingers that bind to '''NNC''' triplets. The position 4 leucine motif remains. There is also a very high (> 0.75) probability of an arginine at position 6.]]
| [[File:nnc_probs.png|thumb|left|Probability data for the 262 fingers that bind to '''NNC''' triplets. The position 4 leucine motif remains. There is also a very high (> 0.75) probability of an arginine at position 6.]]
| [[File:nna_probs.png|thumb|left|Probability data for the 218 fingers that bind to '''NNA''' triplets. The position 4 leucine motif remains. There is also a very high (> 0.75) probability of a glutamine at position -1 and an arginine at position 6.]]
| [[File:nna_probs.png|thumb|left|Probability data for the 218 fingers that bind to '''NNA''' triplets. The position 4 leucine motif remains. There is also a very high (> 0.75) probability of a glutamine at position -1 and an arginine at position 6.]]
-
|}
+
|}</div>
-
</div>
+
-
 
+
-
 
+
<div id="621" style="display:none">
<div id="621" style="display:none">
==June 21st==
==June 21st==
-
 
'''His3 sequencing results:'''
'''His3 sequencing results:'''
Line 1,021: Line 995:
****2 and 1
****2 and 1
****6 and 5
****6 and 5
-
**Because there is not much data for 'CNN' and 'ANN' sequences (with 16 and 29 known fingers that bind to each triplet, respectively), we should use pseudocounts for these sequences, so that our frequency generator is not too biased toward probabilities that may not be significant.
+
**Because there is not much data for 'CNN' and 'ANN' sequences (with 16 and 29 known fingers that bind to each triplet, respectively), we should use pseudocounts for these sequences, so that our frequency generator is not too biased toward probabilities that may not be significant.</div>
-
</div>
+
-
 
+
<div id="622" style="display:none">
<div id="622" style="display:none">
==June 22nd==
==June 22nd==
Line 1,228: Line 1,200:
  | [[File:IGem_logo_ANN_based_on_open.png|thumb|left|WebLogo for 10000 sequences generated for an ANN triplet with our program, when it incorporates only OPEN data.]]
  | [[File:IGem_logo_ANN_based_on_open.png|thumb|left|WebLogo for 10000 sequences generated for an ANN triplet with our program, when it incorporates only OPEN data.]]
  | [[File:IGem_logo_ANN_based_on_open_and_persikov.png|thumb|left|WebLogo for 10000 sequences generated for an ANN triplet with our program, when it incorporates both OPEN and Persikov data.]]
  | [[File:IGem_logo_ANN_based_on_open_and_persikov.png|thumb|left|WebLogo for 10000 sequences generated for an ANN triplet with our program, when it incorporates both OPEN and Persikov data.]]
-
|}
+
|}</div>
-
 
+
-
</div>
+
-
 
+
-
 
+
<div id="623" style="display:none">
<div id="623" style="display:none">
==June 23rd==
==June 23rd==
Line 1,319: Line 1,287:
| Myc-gene Cancer||chr8:128,938,529-128,941,440||981||GGA GAG GGT||style="background:#92D050" | GGC TGG AAA||QANHLSR.RQDNLGR.TRQKLET||EKSHLTR.RREHLTI.#######
| Myc-gene Cancer||chr8:128,938,529-128,941,440||981||GGA GAG GGT||style="background:#92D050" | GGC TGG AAA||QANHLSR.RQDNLGR.TRQKLET||EKSHLTR.RREHLTI.#######
|}
|}
-
*Green cells are our target sequences.
+
*Green cells are our target sequences.</div>
-
</div>
+
-
 
+
<div id="624" style="display:none">
<div id="624" style="display:none">
==June 24==
==June 24==
Line 1,418: Line 1,384:
*Reverse translate fingers avoiding type II restriction enzymes and primers
*Reverse translate fingers avoiding type II restriction enzymes and primers
*Append type II restriction enzyme and primer sequences to each finger
*Append type II restriction enzyme and primer sequences to each finger
-
*Yay
+
*Yay</div>
-
</div>
+
-
 
+
<div id="625" style="display:none">
<div id="625" style="display:none">
==June 25th-26th - Bioinformatics==
==June 25th-26th - Bioinformatics==
Line 1,445: Line 1,409:
-
Additionally, primer tags '''(forward: GTACATGAAACGATGGACGG, reverse:CTGGTATAGTCTCCTCAGCG)''' will be assigned to the 100 control sequences.
+
Additionally, primer tags '''(forward: GTACATGAAACGATGGACGG, reverse:CTGGTATAGTCTCCTCAGCG)''' will be assigned to the 100 control sequences.</div>
-
</div>
+
-
 
+
<div id="627" style="display:none">
<div id="627" style="display:none">
==June 27, Wet lab==
==June 27, Wet lab==
Line 1,517: Line 1,479:
The program appears to run extremely slowly because of the computationally intensive step of checking the reverse translated sequences  
The program appears to run extremely slowly because of the computationally intensive step of checking the reverse translated sequences  
*In addition to checking for the primers and cutsites, we also have to check for 'GGGGGG' because it can lead to undesirable structures forming. In addition, we have to check for the reverse complements for all these undesirable sequences.
*In addition to checking for the primers and cutsites, we also have to check for 'GGGGGG' because it can lead to undesirable structures forming. In addition, we have to check for the reverse complements for all these undesirable sequences.
-
*We decided on a similarity of 0.8 as the maximum acceptable similarity between the sequence the primer bind to and any other part of the generated sequence. If the sequences are too similar, the primer might mishybridize. We originally had a similarity threshold of 0.6 but that made the program run too slowly, so we decided on a '''threshold of 0.8'''.
+
*We decided on a similarity of 0.8 as the maximum acceptable similarity between the sequence the primer bind to and any other part of the generated sequence. If the sequences are too similar, the primer might mishybridize. We originally had a similarity threshold of 0.6 but that made the program run too slowly, so we decided on a '''threshold of 0.8'''.</div>
-
</div>
+
-
 
+
-
 
+
<div id="628" style="display:none">
<div id="628" style="display:none">
==June 28th==
==June 28th==
Line 1,681: Line 1,640:
These subsequent steps will be those that were outlined above, namely cutting out the F1 sequence from each oligo, ligating this F1 into our expression plasmid, putting the expression plasmid into our selection strain, observing colonies which get infused with ZFs that bind to our target site (the "hits"), and sequencing the colonies that get hits to determine which ZF they are expressing.
These subsequent steps will be those that were outlined above, namely cutting out the F1 sequence from each oligo, ligating this F1 into our expression plasmid, putting the expression plasmid into our selection strain, observing colonies which get infused with ZFs that bind to our target site (the "hits"), and sequencing the colonies that get hits to determine which ZF they are expressing.
-
We will be repeating these exact same steps once we get the chip, so if we can perfect our protocols with these practice sequences, we should be golden when the chip comes in.
+
We will be repeating these exact same steps once we get the chip, so if we can perfect our protocols with these practice sequences, we should be golden when the chip comes in.</div>
-
</div>
+
-
 
+
-
 
+
<div id="629" style="display:none">
<div id="629" style="display:none">
==June 29th==
==June 29th==
Line 1,763: Line 1,719:
All of this is on our spec-resistance-containing plasmid.  The above construct replaced the GFP which was present previously on this plasmid.
All of this is on our spec-resistance-containing plasmid.  The above construct replaced the GFP which was present previously on this plasmid.
-
Tomorrow we will begin design of our primers from these SeqBuilder files.
+
Tomorrow we will begin design of our primers from these SeqBuilder files.</div>
-
</div>
+
-
 
+
-
 
+
<div id="630" style="display:none">
<div id="630" style="display:none">
==June 30th==
==June 30th==
Line 1,827: Line 1,780:
*Example:
*Example:
  >1_control
  >1_control
-
  GTACATGAAACGATGGACGGGGTCTCAGCCATTCCAATGTCGTATCTGTATGCGTAATTTTTCACGCAAACACCATTTGGGTCGTCATATCCGTACGCACACGGTGAGACCCGCTGAGGAGACTATACCAG
+
  GTACATGAAACGATGGACGGGGTCTCAGCCATTCCAATGTCGTATCTGTATGCGTAATTTTTCACGCAAACACCATTTGGGTCGTCATATCCGTACGCACACGGTGAGACCCGCTGAGGAGACTATACCAG</div>
-
</div>
+

Revision as of 14:52, 2 August 2011