|
|
(5 intermediate revisions not shown) |
Line 1: |
Line 1: |
| <div id="621" style="display:none"> | | <div id="621" style="display:none"> |
- | ==June 21st== | + | ==June 21st == |
| + | |
| + | ==Wet Lab== |
| '''His3 sequencing results:''' | | '''His3 sequencing results:''' |
| | | |
Line 14: |
Line 16: |
| ===Persikov Statistics - Graphs=== | | ===Persikov Statistics - Graphs=== |
| {| | | {| |
- | | [[File:Scatterplot of top bottom 20 with SVM polynomial.png|thumb|left|Scatterplot of top/bottom 20 with SVM polynomial]] | + | | [[File:HARVScatterplot of top bottom 20 with SVM polynomial.png|thumb|left|Scatterplot of top/bottom 20 with SVM polynomial]] |
- | | [[File:Sequence by sequence (lin SVM).png|thumb|left|Sequence by sequence (lin SVM)]] | + | | [[File:HARVSequence by sequence (lin SVM).png|thumb|left|Sequence by sequence (lin SVM)]] |
- | | [[File:Top_Bottom_20_ZFs_(SVM_linear).png|thumb|left|Top/Bottom 20 ZFs (SVM linear)]] | + | |} |
- | | [[File:Comparison of polynomial vs linear distribution (polynomial generally higher values).png|thumb|left|Comparison of polynomial vs. linear distribution (polynomial generally higher values)]] | + | {| |
| + | | [[File:HARVTop_Bottom_20_ZFs_(SVM_linear).png|thumb|left|Top/Bottom 20 ZFs (SVM linear)]] |
| + | | [[File:HARVComparison of polynomial vs linear distribution (polynomial generally higher values).png|thumb|left|Comparison of polynomial vs. linear distribution (polynomial generally higher values)]] |
| |} | | |} |
| | | |
Line 138: |
Line 142: |
| | | |
| <div id="622" style="display:none"> | | <div id="622" style="display:none"> |
- | ==June 22nd== | + | ==June 22nd - Wet Lab== |
| '''Preparing media/reagents for selection system:''' | | '''Preparing media/reagents for selection system:''' |
| *Made 0.1M zinc chloride solution, M9 salt, and 1M magnesium sulphate solutions for the amino acid mixture | | *Made 0.1M zinc chloride solution, M9 salt, and 1M magnesium sulphate solutions for the amino acid mixture |
Line 192: |
Line 196: |
| *Ran gel of the above PCR products and imaged below: only the omega and omega+Zif268 reactions seemed to work | | *Ran gel of the above PCR products and imaged below: only the omega and omega+Zif268 reactions seemed to work |
| | | |
- | [[File:2011.06.22.ZFBsBackbone&omega(labeled)|thumb|none|PCR for 3-part ZF assembly]] | + | [[File:HARV2011.06.22.ZFBsBackbone&omega(labeled).png|thumb|none|PCR for 3-part ZF assembly]] |
| | | |
| '''pZE21G backbone:''' | | '''pZE21G backbone:''' |
Line 332: |
Line 336: |
| | | |
| {| | | {| |
- | | [[File:open_logo_NNN.png|thumb|left|WebLogo for the OPEN data.]] | + | | [[File:HARVOpen_logo_NNN.png|thumb|left|WebLogo for the OPEN data.]] |
- | | [[File:persikov_logo_NNN.png|thumb|left|WebLogo for the Persikov data.]] | + | | [[File:HARVPersikov_logo_NNN.png|thumb|left|WebLogo for the Persikov data.]] |
| |- | | |- |
- | | [[File:IGem_logo_NNN_based_on_open.png|thumb|left|WebLogo for 10000 sequences generated with our program, when it incorporates only OPEN data.]] | + | | [[File:HARVIGem_logo_NNN_based_on_open.png|thumb|left|WebLogo for 10000 sequences generated with our program, when it incorporates only OPEN data.]] |
- | | [[File:IGem_logo_NNN_based_on_open_and_persikov.png|thumb|left|WebLogo for 10000 sequences generated with our program, when it incorporates both OPEN and Persikov data.]] | + | | [[File:HARVIGem_logo_NNN_based_on_open_and_persikov.png|thumb|left|WebLogo for 10000 sequences generated with our program, when it incorporates both OPEN and Persikov data.]] |
| |- | | |- |
- | | [[File:open_logo_ANN.png|thumb|left|WebLogo for fingers that bind to ANN according to OPEN data.]] | + | | [[File:HARVOpen_logo_ANN.png|thumb|left|WebLogo for fingers that bind to ANN according to OPEN data.]] |
- | | [[File:IGem_logo_ANN_based_on_open.png|thumb|left|WebLogo for 10000 sequences generated for an ANN triplet with our program, when it incorporates only OPEN data.]] | + | | [[File:HARVIGem_logo_ANN_based_on_open.png|thumb|left|WebLogo for 10000 sequences generated for an ANN triplet with our program, when it incorporates only OPEN data.]] |
- | | [[File:IGem_logo_ANN_based_on_open_and_persikov.png|thumb|left|WebLogo for 10000 sequences generated for an ANN triplet with our program, when it incorporates both OPEN and Persikov data.]] | + | | [[File:HARVIGem_logo_ANN_based_on_open_and_persikov.png|thumb|left|WebLogo for 10000 sequences generated for an ANN triplet with our program, when it incorporates both OPEN and Persikov data.]] |
| |}</div> | | |}</div> |
| <div id="623" style="display:none"> | | <div id="623" style="display:none"> |
- | ==June 23rd== | + | ==June 23rd - Wet Lab== |
| | | |
| '''Ran gel to determine the results of the PCR products''' | | '''Ran gel to determine the results of the PCR products''' |
Line 351: |
Line 355: |
| **PCR for the backbone failed again, even done through gradient PCR | | **PCR for the backbone failed again, even done through gradient PCR |
| | | |
- | [[File:2011.06.23.hisB&pZE21Gbackbone(labeled)|thumb|none|PCR of HisB locus]] | + | [[File:HARV2011.06.23.hisB&pZE21Gbackbone(labeled).png|thumb|none|PCR of HisB locus]] |
| | | |
| | | |
Line 360: |
Line 364: |
| ***In image below the low concentration of desired product can be seen, along with the high concentration of unused primers | | ***In image below the low concentration of desired product can be seen, along with the high concentration of unused primers |
| | | |
- | [[File:2011.06.23.KanZFBHis3Ura3(labeled)|thumb|none|PCR of kan-ZFB-wp-his3-ura3]] | + | [[File:HARV2011.06.23.KanZFBHis3Ura3(labeled).png|thumb|none|PCR of kan-ZFB-wp-his3-ura3]] |
| | | |
| *Ran gel of initial overlap PCR product (undiluted, starting without primers) from June 16th, using whole sample | | *Ran gel of initial overlap PCR product (undiluted, starting without primers) from June 16th, using whole sample |
Line 379: |
Line 383: |
| | | |
| | | |
- | [[File:2011.06.23.pZE21Gminiprep(labeled)|thumb|none|pZE21G miniprep]] | + | [[File:HARV2011.06.23.pZE21Gminiprep(labeled).png|thumb|none|pZE21G miniprep]] |
| | | |
| '''Ran PCR for pZE21G backbone''' | | '''Ran PCR for pZE21G backbone''' |
June 22nd - Wet Lab
Preparing media/reagents for selection system:
- Made 0.1M zinc chloride solution, M9 salt, and 1M magnesium sulphate solutions for the amino acid mixture
- M9 Salt solution (20x)
- 67.8 g of disodium phosphate
- 30 g of monopotassium phosphate
- 5 g of sodium chloride
- 10 g of ammonium chloride
- All in 500 mL of distilled water
- Sterile filtered when done dissolving
Overhang PCR for 3-part assembly of ZFs, omega subunit, and backbone vector (pZE21G, spec resistance)
- Clone out omega+Zif268:
- template: original selection construct plasmid (ZFB, his3, etc.)
- primers: omega_F+homolog, Zif268_R+homolog
- Protocol
- 98 C for 30 sec
- 98 C for 10 sec
- 68 C for 30 sec
- 72 C for 30 sec
- Repeat steps 2-4, 30 times
- 4 C for ever
- Clone out omega only:
- template: original selection construct
- primers: omega_F+homolog, omega_R
- Protocol
- Clone OZ052 with overhang:
- template: OZ052 overhang (overhang currently matches selection construct), 1:10b, 15ng/µL
- primers: OZ052_F+omega homolog, OZ052_R+homolog
- Protocol
- Clone OZ123 with overhang:
- template: OZ123 overhang (overhang currently matches selection construct), 1, 6.3ng/µL
- primers: OZ123_F+omega homolog, OZ123_R+homlog
- Protocol
- clone out pZE21G backbone
- template: pZE21G containing cells from plate, diluted 1:10
- primers: back_F, back_R
- Protocol
- 98 C for 30 sec
- 98 C for 10 sec
- 68 C for 30 sec
- 72 C for 1:30
- Repeat steps 2-4, 30 times
- 72 C for 5 min
- 4 C for ever
- 25µL reaction:
- 12.5µL Phusion mastermix
- 1.25µL each primer
- 1µL of 1ng/µL dilution of template
- 9µL of ddH2O
- Ran gel of the above PCR products and imaged below: only the omega and omega+Zif268 reactions seemed to work
PCR for 3-part ZF assembly
pZE21G backbone:
- 1 colony of pZE21G grown in 3mL LB, 3µL spectinomycin (1000x) until mid-log. Glycerol stock made.
- Miniprep of pZE21G in order to PCR the backbone: used Qiagen kit
- 5ng/µL and didn't seem pure: miniprep (or nanodrop) not working
- another colony used to start an overnight culture for PCR/glycerol stocks
- Ran gradient PCR on the miniprep product in order to obtain backbone
- same protocol as before, with 1µL of template
- 6 tubes spaced so that annealing temp=60,62,64,66,68,70
- Parameters: (program on PCR5, IGEM-> DOGGED)
- 98°C for 30s (initial denaturation)
- 98°C for 10s (denature)
- 60°C to 71°C for 15s (anneal)
- 72°C for 90s (extend)
- Repeat steps 2-4 for 30 cycles total (denature, anneal, extend)
- 72°C for 5 min
- 4°C forever
HisB locus PCR: We repeated the PCR of the selection strain at this locus just to be sure HisB is still present
- grew 1 colony from a new selection strain plate Vatsan brought in 0.5mL LB, 0.5µL tet
- used 1 µL of bacterial suspension for PCR following same procedure as 6/20
Lambda red to make selection system:
- grew ?His3?PyrF?rpoZ+pKD46 to mid-log (0.4 using OD)
- induced lambda red by shaking culture in 42C water bath for 15 min
- spin down 1mL for 1 min, 18000 rcf at 4C
- wash 2x with cold water, removing as much supernatant as possible
- resuspend with 200 ng kan-ZFB-wp-his-ura template (20µL) and water up to 50µL (30µL)
- electroporate using 1mm gap cuvettes adn 1.80KV. Immediately afterward add 1mL LB to cuvette, mix, and transfer to culture tube containing 2 mL more of LB
- recover for 2hrs, 30C
- spread on kanamycin plates: 100µL, 10µL, or 1µL (the last two dilute with 100µL LB to help spread more easily)
- grow overnight at 30C
June 22nd - Bioinformatics
Final target sequences
Our "tentatively" Final DNA Target Sequences (i.e. barring any major objections, we're going with this):
Disease
| Target Range
| Binding Site Location
| Bottom Finger
| Top Finger
| Bottom AA (F3 to F1)
| Top AA (F3 to F1)
|
Colorblindness | chrX:153,402,679-153,408,753 | 256 | GGC TGA GGC | GTA GCT GGG | ESGHLKR.QREHLTT.####### | QSGTLTR.QRSDLTR.KKDHLHR
|
Colorblindness | chrX:153,402,679-153,408,753 | 2067 | GAA GGG GAC | GGG GCT CAC | QDGNLGR.RREHLVR.EEANLRR | RTEHLAR.QRSDLTR.#######
|
Familial Hypercholesterolemia | chr19:11,175,000-11,195,000 | 2707 | GGC TGG ATG | GGC TGG CTC* | ESKHLTR.RREHLTI.####### | ESKHLTR.RREHLTI.#######
|
Pancreatic Cancer | chr7:117,074,084-117,089,556 | 4423 | GCA GAC TGT | GCA GGA AAA | QGNTLTR.DRGNLTR.####### | QDVSLVR.QSAHLKR.#######
|
- Drier was unable to find a ZF that bound specifically to CTC. Instead he found zinc fingers that bound to CTC and other sequences with equal binding affinity.
- Note: The green cells are the target sequences that we are aiming for on our chip.
Finalizing the non-Zif268 backbones
In addition, we locked down the non-Zif268 backbones that we will be using for the chip. We have 10 backbones that are more closely related to Zif268, and 10 that are more distantly related:
More Closely Related Backbones | | More Distantly Related Backbones
|
Name
| Sequence (with helix)
|
| Name
| Sequence (with helix)
|
44GLAS_DROME | FRCPI---CDRRFSQSSSVTTH-MRTH-- | | 56EGR1_HUMAN | FAC---DICGRKFARSDERKRHTKIH---
|
38KRUP_DROME | FTCKI---CSRSFGYKHVLQNH-ERTH-- | | 47MZF1_HUMAN | FVC---GDCGQGFVRSARLEEHRRVH---
|
124EVI1_HUMAN | YRC---KYCDRSFSISSNLQRHVRNIH-- | | 23CF2_DROME | YTC---SYCGKSFTQSNTLKQHTRIH---
|
6HUNB_DROME | YECK---YCDIFFKDAVLYTIHMGY--H- | | 19ZEP2_RAT | YICE---ECGIRCKKPSMLKKHIRTH---
|
16SUHW_DROME | FPCEQ---CDEKFKTEKQLERH-VKTH-- | | 49SDC1_CAEEL | VVC---FHCG-TRCHYTLLHDHLDYCH--
|
125CF2_DROME | YTC---PYCDKRFTQRSALTVHTTKLH-- | | 27SDC1_CAEEL | LTC---AHCDWSFDNVMKLVRH-RGVH--
|
43EVI1_HUMAN | FKCHL---CDRCFGQQTNLDRH-LKKH-- | | 130TTKB_DROME | YRC---KVCSRVYTHISNFCRHYVTSH--
|
118ADR1_YEAST | YPC---GLCNRCFTRRDLLIRHAQKIH-- | | 80ESCA_DROME | YQC---PDCQKSYSTFSGLTKH-QQFH--
|
24EVI1_HUMAN | QECK---ECDQVFPDLQSLEKHMLS--H- | | 20IKZF1_MOUSE | HKCG---YCGRSYKQRSSLEEHKERCH--
|
25SUHW_DROME | MSCKV---CDRVFYRLDNLRSH-LKQH-- | | 127SRYD_DROME | QECTT---CGKVYNSWYQLQKHISEEH--
|
Updated Chip Design
The CODA article produced zinc fingers that bound a GNN or TNN F2 with either a ANN, GNN, or TNN F3. These results lead us to the following distribution of three types of zinc finger backbones (Zif268, similar but not equal to Zif268, and dissimilar to Zif268) across our 6 target DNA sequences. With 55,000 spaces on our chip, each of the 6 target DNA sequences is allotted 9,150 spaces with 100 spaces set aside for control zinc fingers from CODA and OPEN. Note that the values in the table below represent the number of helices inserted into each type of backbone.
Disease
| Target DNA Finger 2
| Target DNA Finger 1
| Helices in Zif268 Backbone
| Helices in Zif268 Closely-Related Backbones
| Helices in Zif268 Distantly-Related Backbones
|
Colorblindness | TNN | GNN | 5150 | 3000 | 1000
|
Colorblindness | GNN | CNN | 3050 | 3050 | 3050
|
Familial Hypercholesterolemia | TNN | ANN | 3050 | 3050 | 3050
|
Familial Hypercholesterolemia | TNN | CNN | 3050 | 3050 | 3050
|
Pancreatic Cancer | GNN | TNN | 5150 | 3000 | 1000
|
Pancreatic Cancer | GNN | ANN | 3050 | 3050 | 3050
|
N.B.: The chip will only be holding our F1 zinc fingers- the F2 and F3 will be on a separate plasmid that we must make ourselves
To Do: The distribution of helices to each backbone set/target sequence needs to be finalized. For example, the program can generate a set of helices for the Zif268 backbone to be applied to the colorblindness target sequence, but should the same set or a completely different helix set be applied to the Zif268 backbones for the familial hypercholesterolemia target sequences?
- If we want to test the effect of the backbone, would need to keep the helices constant-- but we could do this within a single target, to keep all other variables constant
Finishing the generator
We finished and finalized the program that generates zinc finger sequences. The following changes were made today:
- We incorporated the data from Persikov's database into out generator.
- We included the ability to remove duplicate sequences in the output file (with a dictionary).
- We added pseudocounts for fingers that bind to 'ANN' or 'CNN' targets. Because there is not much information for these targets, the data we do have may be biased. Thus, we want to make sure that amino acids that currently have no probability of occurring are bumped up to a minimum (currently 0.01).
- We placed additional weight on non zif-268 backbones. Formerly, the amino acids for positions 1, 4, and 5 were fixed based on zif-268 data, regardless of the original helix sequences on these backbones. Now information from both zif-268 and the original helix sequence is considered when assigning weights to the amino acids.
- We started working with Noah's reverse translate program.
We tested our generator to ensure that the sequences it was producing appeared to be legitimate.
- Jamie looked at the multiple sequence alignments of the fingers generated, so see if the frequencies correlated with what we expected them to be. (?)
- We input a known DNA triplet to see if the program generated sequences known to bind, according to OPEN data. When generating 10000 sequences, about half the known binders for the input triplet were found.
We created [http://weblogo.berkeley.edu/ WebLogos] to more easily visualize how adding the Persikov data affects the sequences we generate. The size of the letters correspond to the frequency of that amino acid in that position. We decided to incorporate the Persikov data so that our generator incorporates more information when generating sequences. Doing so does not drastically change the sequences generated.
WebLogo for the OPEN data.
| WebLogo for the Persikov data.
|
WebLogo for 10000 sequences generated with our program, when it incorporates only OPEN data.
| WebLogo for 10000 sequences generated with our program, when it incorporates both OPEN and Persikov data.
|
WebLogo for fingers that bind to ANN according to OPEN data.
| WebLogo for 10000 sequences generated for an ANN triplet with our program, when it incorporates only OPEN data.
| WebLogo for 10000 sequences generated for an ANN triplet with our program, when it incorporates both OPEN and Persikov data.
|
June 23rd - Wet Lab
Ran gel to determine the results of the PCR products
- Determined the HisB presence in selection strain
- Finalized the presence of hisB through the gel image below
- Determined the success of the pZE21G backbone primers through gel on gradient PCR
- PCR for the backbone failed again, even done through gradient PCR
PCR more Kan-ZFB-His3-Ura3
- Two 50 µL reactions
- Doubled the protocol used on the 16th and used HisUraKan_F and ZFBwpHisUra_R primers
- PCR produced very low concentration of Kan-ZFG-His3-Ura3 because melting temperature of primer was too high, so primers stuck to annealing DNA and did not dissociate
- In image below the low concentration of desired product can be seen, along with the high concentration of unused primers
PCR of kan-ZFB-wp-his3-ura3
- Ran gel of initial overlap PCR product (undiluted, starting without primers) from June 16th, using whole sample
- Performed gel extraction in order to have more Kan-ZFB-His3-Ura3 product for the transformation tomorrow: 8.5ng/µL, 260/280=2.03
Determined success of the selection construct transformation
- Checked the plates all day,and finally came to the conclusion that the transformation did not work
- Discovered that lambda red has a promoter induced by arabinose, not temperature (though the strain is still temperature sensitive). That is why it didn't work--we'll get arabinose and hopefully have a successful recombination.
- Preparing all parts for transformation today and will finish it tomorrow
Oligo Design for MAGE
- Designed 90bp long oligos for OZ052 and OZ123 insertion in the ZFB sites in place of Zif268. Reverse complement taken.
Miniprep pZE21G plasmid for backbone PCR
- Ran miniprep again for pZE21G plasmid: 6ng/µL, 260/280=2
- Worried that the miniprep didn't work: ran gel also on this miniprep and concluded that DNA was present in the sample
Ran PCR for pZE21G backbone
- Used same protocol from June 22 in PCR today for pZE21G backbone
June 23rd - Bioinformatics
Revising Target Sequences
Target DNA
| Cystic Fibrosis
| Familial Hypercholesterolemia
| Retinal Blastoma
| p53
| Myc
| Pancreatic Cancer
|
GNN A | Flank 1 | | | | | ?
|
GNN T | Flank 1 | | | | |
|
GNN C | ? | Flank 2 | | | |
|
TNN G | | Flank 2 | | | X |
|
TNN C | | Flank 3 | | | ? |
|
TNN A | | Flank 3 | | | ? |
|
This is the set of final, final target sequences based on the table above:
Disease
| Target Range
| Binding Site Location
| Bottom Finger
| Top Finger
| Bottom AA (F3 to F1)
| Top AA (F3 to F1)
|
Colorblindness | chrX:153,403,001-153,407,000 | 3627 | GCT GGC TGG | GCG GTA ATG | EGSGLKR.EAHHLSR.####### | RRDDLTR.QRSSLVR.#######
|
Familial Hypercholesterolemia | chr19:11,175,000-11,195,000 | 14001 | GGC TGA GAC | GGA GTC CTG | ESGHLKR.QREHLTT.####### | QTTHLSR.DHSSLKR.#######
|
Myc-gene Cancer | chr8:128,938,529-128,941,440 | 198 | GGT GCA GGG | GGC TGA CTC | VDHHLRR.QSTTLKR.RRAHLQN | ESGHLKR.QREHLTT.#######
|
Myc-gene Cancer | chr8:128,938,529-128,941,440 | 981 | GGA GAG GGT | GGC TGG AAA | QANHLSR.RQDNLGR.TRQKLET | EKSHLTR.RREHLTI.#######
|
- Green cells are our target sequences.