Team:Harvard/Results/Chip Synthesis

From 2011.igem.org

(Difference between revisions)
(qPCR curves)
 
(29 intermediate revisions not shown)
Line 1: Line 1:
 +
{{:Team:Harvard/Template:CSS}}
{{:Team:Harvard/Template:ResultsBar}}
{{:Team:Harvard/Template:ResultsBar}}
{{:Team:Harvard/Template:ResultsGrayBar}}
{{:Team:Harvard/Template:ResultsGrayBar}}
-
<div class="whitebox">
 
__NOTOC__
__NOTOC__
-
{{Template:Team:Harvard/javascript}}
+
<div class="whitebox">
=qPCR curves=
=qPCR curves=
-
When performing a qPCR, one wants to run the reaction while the growth is still exponential. During this phase, all the oligos are replicated at equal rates. Once growth levels off, the reaction should be stopped as the oligos are now being replicated unequally. Each of the graphs below represents a qPCR for one of our sub-pools.
+
When amplifying the oligos, one wants to perform a qPCR in order to monitor the growth rates of each sub-pool and ensure that growth is happening at equal rates. By plotting the relative flouresence present in each cycle, we can visualize the pogress of the reaction. One wants to run the reaction while the growth is still exponential. During this phase, all the oligos are replicated at equal rates. Once growth begins leveling off, as it does at the end of the graph below, the reaction should be stopped as the oligos are now being replicated unequally. Stopping the reaction at the very end of the exponential growth phase ensures that we do not have significantly more oligos of a particular sequence in our pool, and decreases the bias in our results.  
-
<html>
 
-
<script>
 
-
function show(k)
 
-
{
 
-
elem = document.getElementById('GNN');
 
-
  elem.style.display = 'none';
 
-
if(k==1){
 
-
elem.style.display = 'block';
 
-
}
 
-
elem = document.getElementById('TNN');
 
-
  elem.style.display = 'none';
 
-
if(k==2){
 
-
elem.style.display = 'block';
 
-
}
 
-
elem = document.getElementById('CNN');
 
-
  elem.style.display = 'none';
 
-
if(k==3){
 
-
elem.style.display = 'block';
 
-
}
 
-
elem = document.getElementById('ANN');
 
-
  elem.style.display = 'none';
 
-
if(k==4){
 
-
elem.style.display = 'block';
 
-
}
 
-
elem = document.getElementById('NGN');
 
-
  elem.style.display = 'none';
 
-
if(k==5){
 
-
elem.style.display = 'block';
 
-
}
 
-
elem = document.getElementById('NTN');
 
-
  elem.style.display = 'none';
 
-
if(k==6){
 
-
elem.style.display = 'block';
 
-
}
 
-
elem = document.getElementById('0');
 
-
  elem.style.display = 'none';
 
-
if(k==13){
 
-
elem.style.display = 'block';
 
-
}
 
-
elem = document.getElementById('.01');
 
-
  elem.style.display = 'none';
 
-
if(k==14){
 
-
elem.style.display = 'block';
 
-
}
 
-
elem = document.getElementById('.015');
 
-
  elem.style.display = 'none';
 
-
if(k==15){
 
-
elem.style.display = 'block';
 
-
}
 
-
elem = document.getElementById('.02');
 
-
  elem.style.display = 'none';
 
-
if(k==16){
 
-
elem.style.display = 'block';
 
-
}
 
-
}
 
-
</script>
 
-
<div id="marginbox">
+
Each line on the graph below represents a qPCR for one of our sub-pools (each sequence we are targeting represents a sub-pool: CB top, CB bottom, FH top, FH bottom, Myc 198, Myc 981).
-
<table>
+
-
<tr><td><span id="student" onclick="show(1)">CB Top</span></td></tr>
+
-
<tr><td><span id="student" onclick="show(2)">CB Bot</span></td></tr>
+
-
<tr><td><span id="student" onclick="show(3)">FH Top</span></td></tr>
+
-
<tr><td><span id="student" onclick="show(4)">FH Bot</span></td></tr>
+
-
<tr><td><span id="student" onclick="show(5)">Myc 198</span></td></tr>
+
-
<tr><td><span id="student" onclick="show(6)">Myc 981</span></td></tr>
+
-
</table>
+
-
</div>
+
-
</html>
+
'''Based on the graph and gel below, we knew we successfully amplified each of our target sub-pools.'''
-
<div id="GNN" style="width:900px">[[File:HARVGnn_freqs.png|frame|left|Probability data for the 783 fingers that bind to '''GNN''' triplets. Note the high probability of leucine at position 4 and arginine at position 6.]]</div>
 
-
<div id="TNN" style="display:none">[[File:HARVTnn_probs.png|frame|left|Probability data for the 128 fingers that bind to '''TNN''' triplets. Note the high probability of leucine at position 4.]]</div>
 
-
<div id="CNN" style="display:none">[[File:HARVCnn_probs.png|frame|left|Probability data for the 16 fingers that bind to '''CNN''' triplets. There may not be enough data to consider this information statistically significant]]</div>
 
-
<div id="ANN" style="display:none">[[File:HARVAnn_probs.png|frame|left|Probability data for the 29 fingers that bind to '''ANN''' triplets. There may not be enough data to consider this information statistically significant]]</div>
 
-
<div id="NGN" style="display:none">[[File:HARVNgn_probs.png|frame|left|Probability data for the 298 fingers that bind to '''NGN''' triplets. The position 4 leucine motif remains. There is also a high probability (> 0.5) of a histidine at position 3 and an arginine at position 6.]]</div>
 
-
<div id="NTN" style="display:none"> [[File:HARVNtn_probs.png|frame|left|Probability data for the 177 fingers that bind to '''NTN''' triplets. The position 4 leucine motif remains.]]</div>
 
-
<br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br>
+
Because flourescence grows exponentially and just begins to level off at the end of the graph (where PCR was terminated), we are confident that overamplification did not occur, so individual oligos in each of our sub-pools were amplified at roughly equal rates. This preserves library integrity
 +
[[File:HARVQpcr.png|center|frame|800 px|caption|]]
 +
 +
 +
After a PCR clean up, we ran a gel in order to confirm that the qPCR produced the expected product, of approximately 130 bps. Based on the gel, we could conclude that our qPCR was indeed successful, and our oligo library was ready to use.
 +
[[File:HARV2011.8.,17_chip_library_annotatedZOOM.png|frame|center|]]
</div>
</div>
 +
 +
<div class="whitebox">
=Sequencing results of Library transformation=
=Sequencing results of Library transformation=
==Error rates==
==Error rates==
-
'''''Still being updated'''''
+
A common question that rises when dealing with novel synthesis techniques concerns the error rates. After all, if chip synthesis produced oligos in which errors were ubiquitous, then the technique would be of little practical value, despite the low cost. In order to understand what the error rates of our chip were, we sent out a 96 plate of post-transformation colonies for sequencing. Of those, 77 had sequencing results. We then compared the sequences of these oligos to the Finger 1 oligos that were originally ordered.
-
*Good: 57.8% (22/38)
+
 
-
*Single SNP: 5.3% (2/38)
+
[[File:HARVSeq_results_pie1.png|center]]
-
*Multiple SNPs:  18.4% (7/38)
+
 
-
*Frame shift: 18.4% (7/38)
+
As seen above 44 out of 77, or 57.1%, of our sequences were perfect matches.
 +
 
 +
This means there were two problems we then had to deal with: point mutations and frameshift mutations.  
 +
 
 +
In total, 20.8% of our oligos contained point mutations. However, point mutations in the helix region are unlikely to affect the structure of the the protein. In fact, they add additional variation to our F1 library. It is entirely possible that one of these mutants ends up being a strong binder and allows the cell to survive selection, whereas the original sequence is not. It should also be noted that the actual rate of point mutations is lower than those documented. We simply looked as the nucleotide sequence of the oligos, not the trace files. It is possible that some reported point mutations have bad traces and thus, may not actually represent a SNP.
 +
 
 +
The 17 frame shift mutations (22.1%) are a bigger problem, since that means we have essentially lost an oligo we wanted to test. It is possible there are other copies of the oligo without the error, and that our cell happened to uptake one with a detrimental mutation. In any case, cells that take up plasmids that contain an F1 region with a frameshift will not survive selection.
 +
 
 +
We determined the overall per base pair error rate for this set sequenced to be around 1/200, which includes errors generated by the chip, or generated during PCR and assembly. These are a bit higher than those found by Kosuri, et al., but within a reasonable margin.
==Distributions==
==Distributions==
-
'''''Still being updated'''''
+
Of the 77 samples with good sequencing results, 2 sequences were repeated once. Discounting these, 73 of the 77 sequences, or 94.8%, were unique. This suggests that there is substantial variability within the library, and then we should not be concerned that our library is dominated primarily by a single oligo.
-
Each of the sequenced zinc fingers have original F1 fingers.
+
</div>

Latest revision as of 17:54, 23 October 2011

bar

qPCR curves

When amplifying the oligos, one wants to perform a qPCR in order to monitor the growth rates of each sub-pool and ensure that growth is happening at equal rates. By plotting the relative flouresence present in each cycle, we can visualize the pogress of the reaction. One wants to run the reaction while the growth is still exponential. During this phase, all the oligos are replicated at equal rates. Once growth begins leveling off, as it does at the end of the graph below, the reaction should be stopped as the oligos are now being replicated unequally. Stopping the reaction at the very end of the exponential growth phase ensures that we do not have significantly more oligos of a particular sequence in our pool, and decreases the bias in our results.


Each line on the graph below represents a qPCR for one of our sub-pools (each sequence we are targeting represents a sub-pool: CB top, CB bottom, FH top, FH bottom, Myc 198, Myc 981).


Based on the graph and gel below, we knew we successfully amplified each of our target sub-pools.


Because flourescence grows exponentially and just begins to level off at the end of the graph (where PCR was terminated), we are confident that overamplification did not occur, so individual oligos in each of our sub-pools were amplified at roughly equal rates. This preserves library integrity

HARVQpcr.png


After a PCR clean up, we ran a gel in order to confirm that the qPCR produced the expected product, of approximately 130 bps. Based on the gel, we could conclude that our qPCR was indeed successful, and our oligo library was ready to use.

HARV2011.8.,17 chip library annotatedZOOM.png

Sequencing results of Library transformation

Error rates

A common question that rises when dealing with novel synthesis techniques concerns the error rates. After all, if chip synthesis produced oligos in which errors were ubiquitous, then the technique would be of little practical value, despite the low cost. In order to understand what the error rates of our chip were, we sent out a 96 plate of post-transformation colonies for sequencing. Of those, 77 had sequencing results. We then compared the sequences of these oligos to the Finger 1 oligos that were originally ordered.

HARVSeq results pie1.png

As seen above 44 out of 77, or 57.1%, of our sequences were perfect matches.

This means there were two problems we then had to deal with: point mutations and frameshift mutations.

In total, 20.8% of our oligos contained point mutations. However, point mutations in the helix region are unlikely to affect the structure of the the protein. In fact, they add additional variation to our F1 library. It is entirely possible that one of these mutants ends up being a strong binder and allows the cell to survive selection, whereas the original sequence is not. It should also be noted that the actual rate of point mutations is lower than those documented. We simply looked as the nucleotide sequence of the oligos, not the trace files. It is possible that some reported point mutations have bad traces and thus, may not actually represent a SNP.

The 17 frame shift mutations (22.1%) are a bigger problem, since that means we have essentially lost an oligo we wanted to test. It is possible there are other copies of the oligo without the error, and that our cell happened to uptake one with a detrimental mutation. In any case, cells that take up plasmids that contain an F1 region with a frameshift will not survive selection.

We determined the overall per base pair error rate for this set sequenced to be around 1/200, which includes errors generated by the chip, or generated during PCR and assembly. These are a bit higher than those found by Kosuri, et al., but within a reasonable margin.

Distributions

Of the 77 samples with good sequencing results, 2 sequences were repeated once. Discounting these, 73 of the 77 sequences, or 94.8%, were unique. This suggests that there is substantial variability within the library, and then we should not be concerned that our library is dominated primarily by a single oligo.