Team:Edinburgh/Sequences

From 2011.igem.org

Revision as of 18:15, 22 July 2011

This page collects annotated nucleotide sequences of the parts and vectors that we use. Some of this information can be retrieved from the Registry, but it is handy to have it here in a more usable form: copying DNA sequences from the Registry with the web browser's normal copy function is strangely broken.

Note that any biobricks we submit to the Registry will still have to be properly documented there.

We have BBa_K523000 to BBa_K523999 available this year.

Biobrick prefix, suffix, scar

  • gaattcgcggccgcttctag - prefix, short
  • gaattcgcggccgcttctagag - prefix, long
  • tactagtagcggccgctgcag - suffix
  • tactag - scar, short
  • tactagag - scar, long
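
As a quick sanity check on a sequence pasted from this page, the following Python sketch tests for the prefix and suffix listed above and returns the insert between them. The function name `strip_biobrick_ends` is made up for illustration. It only matches the exact strings from the list, and it cannot tell the short prefix from the long one if an insert happens to start with `ag`.

```python
# Prefix and suffix strings copied from the list above.
PREFIX_LONG = "gaattcgcggccgcttctagag"
PREFIX_SHORT = "gaattcgcggccgcttctag"
SUFFIX = "tactagtagcggccgctgcag"


def strip_biobrick_ends(seq):
    """Return the insert between prefix and suffix (hypothetical helper).

    Raises ValueError if either end is missing. Whitespace is removed and
    case is ignored, so 60-base blocks pasted from the page work as-is.
    """
    s = "".join(seq.split()).lower()
    # Try the long prefix first, because the short one is a prefix of it.
    for prefix in (PREFIX_LONG, PREFIX_SHORT):
        if s.startswith(prefix):
            break
    else:
        raise ValueError("no BioBrick prefix found")
    if not s.endswith(SUFFIX):
        raise ValueError("no BioBrick suffix found")
    return s[len(prefix):len(s) - len(SUFFIX)]
```

Note that for our constructs the returned insert still begins with `atct`, the part of the BglII site that lies outside the prefix.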

Restriction sites

  • gaattc - EcoRI (prefix 1)
  • tctaga - XbaI (prefix 2)
  • actagt - SpeI (suffix 1)
  • ctgcag - PstI (suffix 2)
  • gcggccgc - NotI (in both prefix and suffix)
  • agatct - BglII (not part of RFC10, we use it internally after the prefix)
  • tacgta - SnaBI (might become relevant)
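
To look for these sites in a sequence, a plain string search is enough, because every site in the list above is palindromic (it reads the same on the reverse strand), so only one strand needs scanning. A minimal sketch; the name `find_sites` is an assumption for illustration and positions are 0-based:

```python
# Recognition sites copied from the list above (all lowercase).
SITES = {
    "EcoRI": "gaattc",
    "XbaI": "tctaga",
    "SpeI": "actagt",
    "PstI": "ctgcag",
    "NotI": "gcggccgc",
    "BglII": "agatct",
    "SnaBI": "tacgta",
}


def find_sites(seq, sites=SITES):
    """Return {enzyme: [0-based start positions]} for every site found."""
    s = "".join(seq.split()).lower()
    hits = {}
    for enzyme, site in sites.items():
        positions = []
        start = s.find(site)
        while start != -1:
            positions.append(start)
            # Step by one so overlapping occurrences are not missed.
            start = s.find(site, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits
```

For example, `find_sites("gaattcgcggccgcttctagagatct")` reports BglII at position 20, two bases before the end of the 22-base long prefix, which is why the BglII site is described as partly in the prefix.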

pSB1C3 vector

  • This is the official sequence from the Registry ([http://partsregistry.org/wiki/index.php?title=Part:pSB1C3 here])
  • Red is CmlR coding sequence ([http://en.wikipedia.org/wiki/Chloramphenicol_acetyltransferase chloramphenicol acetyltransferase])
  • Blue is restriction sites for the 4 main enzymes (SpeI, PstI, EcoRI, XbaI)

tactagtagcggccgctgcagtccggcaaaaaagggcaaggtgtcaccaccctgcccttt ttctttaaaaccgaaaagattacttcgcgttatgcaggcttcctcgctcactgactcgct gcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggtt atccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggc caggaaccgtaaaaaggccgcgttgctggcgtttttccacaggctccgcccccctgacga gcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagata ccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttac cggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctg taggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccc cgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaag acacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgt aggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaagaacagt atttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttg atccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattac gcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctca gtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcac ctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagtaaac ttggtctgacagctcgaggcttggattctcaccaataaaaaacgcccggcggcaaccgag cgttctgaacaaatccagatggagttctgaggtcattactggatctatcaacaggagtcc aagcgagctcgatatcaaattacgccccgccctgccactcatcgcagtactgttgtaatt cattaagcattctgccgacatggaagccatcacaaacggcatgatgaacctgaatcgcca gcggcatcagcaccttgtcgccttgcgtataatatttgcccatggtgaaaacgggggcga agaagttgtccatattggccacgtttaaatcaaaactggtgaaactcacccagggattgg ctgagacgaaaaacatattctcaataaaccctttagggaaataggccaggttttcaccgt aacacgccacatcttgcgaatatatgtgtagaaactgccggaaatcgtcgtggtattcac tccagagcgatgaaaacgtttcagtttgctcatggaaaacggtgtaacaagggtgaacac tatcccatatcaccagctcaccgtctttcattgccatacgaaattccggatgagcattca tcaggcgggcaagaatgtgaataaaggccggataaaacttgtgcttatttttctttacgg tctttaaaaaggccgtaatatccagctgaacggtctggttataggtacattgagcaactg actgaaatgcctcaaaatgttctttacgatgccattgggatatatcaacggtggtatatc cagtgatttttttctccattttagcttccttagctcctgaaaatctcgataactcaaaaa atacgcccggtagtgatcttatttcattatggtgaaagttggaacctcttacgtgcccga 
tcaactcgagtgccacctgacgtctaagaaaccattattatcatgacattaacctataaa aataggcgtatcacgaggcagaatttcagataaaaaaaatccttagctttcgctaaggat
gatttctggaattcgcggccgcttctagag

Plac-LacZα insert (clone 2)

  • Assigned code <partinfo>BBa_K523000</partinfo>
  • A modified version (seems to lack about 45 bases at the start) of [http://partsregistry.org/wiki/index.php?title=Part:BBa_J33207 BBa_J33207]
  • Sanger sequencing seemed to confirm that all was well with clone 2, though in a few regions the chromatograms were poor.
  • Sequence below includes BioBrick prefix and suffix (blue) as well as the BglII site (bold), which is partly in the prefix
  • lacZα is in red.
  • Neither the forward nor the reverse sequencing reaction gave good data for the suffix, so I have simply added the standard suffix. We have therefore not conclusively shown that the SpeI site exists; this should be verified by digest.

gaattcgcggccgcttctagagatctgctggggcaaaccagcgtggaccgcttgctgcaa ctctctcagggccaggcggtgaagggcaatcagctgttgcccgtctcactggtgaaaaga aaaaccaccctggcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcatta atgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaattaa tgtgagttagctcactcattaggcaccccaggctttacactttatgcttccggctcgtat gttgtgtgaaattgtgagcggataacaatttcacacaggaaacagctatgaccatgatta cggattcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaac ttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgca ccgatcgcccttcccaacagttgcgcagcctgaatggcgaatggcgctttgcctggtttc cggcaccagaagcggtgccggaaagctggctggagtgatactagtagcggccgctgcag

malS using primers f1/r1 (clone 1)

  • Assigned code <partinfo>BBa_K523001</partinfo>
  • Expected sequence is [http://www.ncbi.nlm.nih.gov/nuccore/48994873?from=3735520&to=3737550&report=gbwithparts derived from this].
  • Prefix and suffix in blue.
  • Pre-coding sequence in CAPITALS and contains native RBS.
  • BglII site in bold.
  • Double stop codon is present.
  • This has been sequenced and looks good.

gaattcgcggccgcttctagagatctGAAATCGCAGCAATAAGGACTCATCCGCCatgaa actcgccgcctgttttctgacactccttcctggcttcgccgttgccgccagctggacttc tccggggtttcccgcctttagcgaacaggggacaggaacatttgtcagccacgcgcagtt gcccaaaggtacgcgtccactaacgctaaattttgaccaacagtgctggcagcctgcgga tgcgataaaactcaatcagatgctttccctgcaaccttgtagcaacacgccgcctcaatg gcgattgttcagggacggcgaatatacgctgcaaatagacacccgctccggtacgccaac attgatgatttccatccagaacgccgccgaaccggtagcaagcctggtccgtgaatgccc gaaatgggatggattaccgctcacagtggatgtcagcgccactttcccggaaggagccgc cgtacgggattattacagccagcaaattgcgatagtgaagaacggtcaaataatgttaca acccgctgccaccagcaacggtttactcctgctggaacgggcagaaactgacacatccgc ccctttcgactggcataacgccacggtttactttgtgctgacagatcgtttcgaaaacgg cgatcccagtaatgaccagagttacggacgtcataaagacggtatggcggaaattggcac ttttcacggcggcgatttacgcggcctgaccaacaaactggattacctccagcagttggg cgttaatgctttatggataagcgccccatttgagcaaattcacggctgggtcggcggcgg tacaaaaggcgatttcccgcattatgcctaccacggttattacacacaggactggacgaa tcttgatgccaatatgggcaacgaagccgatctacggacgctggttgatagcgcacatca gcgcggtattcgtattctctttgatgtcgtgatgaaccacaccggctatgccacgctggc ggatatgcaggagtatcagtttggcgcgttatatctttctggtgacgaagtgaaaaaatc gctgggtgaacgctggagcgactggaaacctgccgccgggcaaacctggcatagctttaa cgattacattaatttcagcgacaaaacaggctgggataaatggtggggaaaaaactggat cagaacggatatcggcgattacgacaatcctggattcgacgatctcactatgtcgctagc ctttttgccggatatcaaaaccgaatcaactaccgcttctggtctgccggtgttctataa aaacaaaatggatacccacgccaaagccattgacggctatacgccgcgcgattacttaac ccactggttaagtcagtgggtccgcgactatgggattgatggttttcgggtcgataccgc caaacatgttgagttgcccgcctggcagcaactgaaaaccgaagccagcgccgcgcttcg cgaatggaaaaaagctaaccccgacaaagcattagatgacaaacctttctggatgaccgg tgaagcctggggccacggcgtgatgcaaagtgactactatcgccacggcttcgatgcgat gatcaatttcgattatcaggagcaggcggcgaaagcagtcgactgtctggcgcagatgga tacgacctggcagcaaatggcggagaaattgcagggtttcaacgtgttgagctacctctc gtcgcatgatacccgcctgttccgtgaagggggcgacaaagcagcagagttattactatt agcgccaggcgcggtacaaatcttttatggtgatgaatcctcgcgtccgttcggtcctac aggttctgatccgctgcaaggtacacgttcggatatgaactggcaggatgttagcggtaa 
atctgccgccagcgtcgcgcactggcagaaaatcagccagttccgcgcccgccatcccgc aattggcgcgggcaaacaaacgacacttttgctgaagcagggctacggctttgttcgtga gcatggcgacgataaagtgctggtcgtctgggcagggcaacagtaataatactagtagcg
gccgctgcag

malS using primers f2/r2 (clone 1)

  • This is essentially as above, but it starts at the start codon, has no stop codon, and adds two bases (gg) at the end of the BioBrick. Together with the T at the start of the suffix, these code for a glycine when using BioSandwich assembly.
  • Expected sequence is [http://www.ncbi.nlm.nih.gov/nuccore/48994873?from=3735520&to=3737550&report=gbwithparts derived from this].
  • This has been sequenced and looks good.

gaattcgcggccgcttctagagatctatgaaactcgccgcctgttttctgacactccttc ctggcttcgccgttgccgccagctggacttctccggggtttcccgcctttagcgaacagg ggacaggaacatttgtcagccacgcgcagttgcccaaaggtacgcgtccactaacgctaa attttgaccaacagtgctggcagcctgcggatgcgataaaactcaatcagatgctttccc tgcaaccttgtagcaacacgccgcctcaatggcgattgttcagggacggcgaatatacgc tgcaaatagacacccgctccggtacgccaacattgatgatttccatccagaacgccgccg aaccggtagcaagcctggtccgtgaatgcccgaaatgggatggattaccgctcacagtgg atgtcagcgccactttcccggaaggagccgccgtacgggattattacagccagcaaattg cgatagtgaagaacggtcaaataatgttacaacccgctgccaccagcaacggtttactcc tgctggaacgggcagaaactgacacatccgcccctttcgactggcataacgccacggttt actttgtgctgacagatcgtttcgaaaacggcgatcccagtaatgaccagagttacggac gtcataaagacggtatggcggaaattggcacttttcacggcggcgatttacgcggcctga ccaacaaactggattacctccagcagttgggcgttaatgctttatggataagcgccccat ttgagcaaattcacggctgggtcggcggcggtacaaaaggcgatttcccgcattatgcct accacggttattacacacaggactggacgaatcttgatgccaatatgggcaacgaagccg atctacggacgctggttgatagcgcacatcagcgcggtattcgtattctctttgatgtcg tgatgaaccacaccggctatgccacgctggcggatatgcaggagtatcagtttggcgcgt tatatctttctggtgacgaagtgaaaaaatcgctgggtgaacgctggagcgactggaaac ctgccgccgggcaaacctggcatagctttaacgattacattaatttcagcgacaaaacag gctgggataaatggtggggaaaaaactggatcagaacggatatcggcgattacgacaatc ctggattcgacgatctcactatgtcgctagcctttttgccggatatcaaaaccgaatcaa ctaccgcttctggtctgccggtgttctataaaaacaaaatggatacccacgccaaagcca ttgacggctatacgccgcgcgattacttaacccactggttaagtcagtgggtccgcgact atgggattgatggttttcgggtcgataccgccaaacatgttgagttgcccgcctggcagc aactgaaaaccgaagccagcgccgcgcttcgcgaatggaaaaaagctaaccccgacaaag cattagatgacaaacctttctggatgaccggtgaagcctggggccacggcgtgatgcaaa gtgactactatcgccacggcttcgatgcgatgatcaatttcgattatcaggagcaggcgg cgaaagcagtcgactgtctggcgcagatggatacgacctggcagcaaatggcggagaaat tgcagggtttcaacgtgttgagctacctctcgtcgcatgatacccgcctgttccgtgaag ggggcgacaaagcagcagagttattactattagcgccaggcgcggtacaaatcttttatg gtgatgaatcctcgcgtccgttcggtcctacaggttctgatccgctgcaaggtacacgtt cggatatgaactggcaggatgttagcggtaaatctgccgccagcgtcgcgcactggcaga 
aaatcagccagttccgcgcccgccatcccgcaattggcgcgggcaaacaaacgacacttt tgctgaagcagggctacggctttgttcgtgagcatggcgacgataaagtgctggtcgtct gggcagggcaacagggtactagtagcggccgctgcag
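
The glycine claim for the f2/r2 constructs can be checked by reading the junction in frame. A minimal sketch, assuming the last two bases of the part are `gg` (as at the end of the sequence above, just before the suffix) and using a hand-written set of the four glycine codons rather than a full codon table; `junction_codon` is a made-up helper name:

```python
SUFFIX = "tactagtagcggccgctgcag"  # standard suffix, as listed on this page
GLYCINE_CODONS = {"gga", "ggc", "ggg", "ggt"}  # standard genetic code


def junction_codon(part_tail, suffix=SUFFIX):
    """Codon formed by the last two bases of the part plus the suffix's first base."""
    return part_tail[-2:].lower() + suffix[0]


codon = junction_codon("caacaggg")  # tail of the malS f2/r2 part above
print(codon, codon in GLYCINE_CODONS)
```

This only looks at the junction itself; it assumes the coding sequence upstream is in frame with those two added bases.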

bglX using primers f1/r1 (clone 1)

  • Assigned code <partinfo>BBa_K523002</partinfo>
  • Expected sequence is [http://www.ncbi.nlm.nih.gov/nuccore/48994873?from=2217714&to=2220011&report=gbwithparts derived from this].
  • Prefix and suffix in blue.
  • Pre-coding sequence in CAPITALS and contains native RBS.
  • BglII site in bold.
  • Double stop codon is present.
  • The sequence as it stands has an internal PstI site (red), so it is not yet RFC10-compatible.
  • This has been sequenced and looks good. A silent mutation from the expected sequence is indicated as a red capital T.

gaattcgcggccgcttctagagatctGCCACGTCGGGCAACAAAGGAAGAAAAATCCATa tgaaatggctatgttcagtaggaatcgcggtgagtctggccctgcagccagcactggcgg atgatttattcggcaaccatccattaacgcccgaagcgcgggatgcgttcgtcaccgaac tgcttaagaaaatgacagttgatgagaaaattggtcagctgcgcttaatcagcgtcggcc cggataacccgaaagaggcgatccgcgagatgatcaaagacggtcaggttggggcgattt tcaacaccgtaacccgtcaggatatccgcgccatgcaggatcaggtgatggaattaagcc gcctgaaaattcctcttttctttgcttacgacgtgctgcacggtcagcgcacggtgttcc cgattagcctcggtctggcctcgtcttttaacctcgatgcagtgaaaacggtcggacgtg tctctgcttatgaagcggcagatgatggcctgaatatgacctgggcaccgatggtcgatg tctcgcgcgatccgcgctggggacgtgcttccgaaggttttggcgaagatacgtatctca cctcaacaatgggtaaaaccatggtggaagcgatgcagggtaaaagcccggcagatcgct actcggtgatgaccagcgtcaaacactttgccgcatacggcgcggtagaaggcggtaaag agtacaacaccgtcgatatgagtccgcagcgcctgtttaatgattatatgccgccgtaca aagcggggctggacgcaggcagcggcgcggtgatggtggcgctgaactcgctgaacggca cgccagccacctccgattcctggctgctgaaagatgttctgcgcgaccagtggggcttta aaggcatcaccgtttccgatcacggtgcaatcaaagagctgattaaacatggcacggcgg cagacccggaagatgcggtgcgcgtggcgctgaaatccggaatcaacatgagcatgagcg acgagtactactcgaagtatctgcctgggttgattaaatccggcaaagtgacgatggcag agctggacgatgctgcccgccatgtactgaacgttaaatatgatatggggttgtttaacg acccatacagccatttggggccgaaagagtctgacccggtggataccaatgccgaaagcc gcctgcaccgtaaagaagcgcgtgaagtggcgcgcgaaagcttggtgttgctgaaaaacc gtctcgaaacgttaccgctgaaaaaatcggccaccattgcggtggttgggccactggcgg acagtaaacgtgacgtgatgggcagctggtccgcagccggtgttgccgatcaatccgtga ccgtactgaccgggattaaaaatgccgtcggtgaaaacggtaaagtgctgtatgccaaag gggcgaacgttaccagtgacaaaggcattatcgatttcctgaatcagtatgaagaagcgg tcaaagtcgatccgcgttcgccgcaagagatgattgatgaagcggtgcagacggcgaaac aatctgatgtggtggtggctgtagtcggtgaagcacaggggatggcgcacgaagcctcca gccggaccgatatcactattccgcaaagccaacgtgacttgattgcggcgctgaaagcca ccggtaaaccgctggtgctggtgctgatgaacgggcgtccgctggcgctggtgaaagaag atcagcaggctgatgcgattctggaaacctggtttgcggggactgaaggcggtaatgcaa ttgccgatgtattgtttggcgattacaacccgtccggcaagctgccaatgtccttcccgc gttctgtcgggcagatcccggtgtactacagccatctgaataccggtcgcccgtataatg 
ccgacaagccgaacaaatacacttcgcgttattttgatgaagctaacggggcgttgtatc cgttcggctatgggctgagctacaccactttcaccgtctctgatgtgaaactttctgcgc cgaccatgaagcgtgacggcaaagtgactgccagcgtgcaggtgacgaacaccggtaagc gcgagggtgccacggtagtgcagatgtacttgcaggatgtgacTgcttccatgagtcgcc ctgtgaaacagctgaaaggctttgagaaaatcaccctgaaaccgggcgaaactcagactg tcagcttcccgatcgatattgaggcgctgaagttctggaatcaacagatgaaatatgacg ccgagcctggcaagttcaatgtctttatcggcactgattccgcacgcgttaagaaaggcg agtttgagttgctgtaataatactagtagcggccgctgcag

bglX using primers f2/r2 (clone 1)

  • This is essentially as above, but it starts at the start codon, has no stop codon, and adds two bases (gg) at the end of the BioBrick. Together with the T at the start of the suffix, these code for a glycine when using BioSandwich assembly.
  • Expected sequence is [http://www.ncbi.nlm.nih.gov/nuccore/48994873?from=2217714&to=2220011&report=gbwithparts derived from this].
  • This has been sequenced and looks good. A silent mutation from the expected sequence is indicated as a red capital T.

gaattcgcggccgcttctagagatctatgaaatggctatgttcagtaggaatcgcggtga gtctggccctgcagccagcactggcggatgatttattcggcaaccatccattaacgcccg aagcgcgggatgcgttcgtcaccgaactgcttaagaaaatgacagttgatgagaaaattg gtcagctgcgcttaatcagcgtcggcccggataacccgaaagaggcgatccgcgagatga tcaaagacggtcaggttggggcgattttcaacaccgtaacccgtcaggatatccgcgcca tgcaggatcaggtgatggaattaagccgcctgaaaattcctcttttctttgcttacgacg tgctgcacggtcagcgcacggtgttcccgattagcctcggtctggcctcgtcttttaacc tcgatgcagtgaaaacggtcggacgtgtctctgcttatgaagcggcagatgatggcctga atatgacctgggcaccgatggtcgatgtctcgcgcgatccgcgctggggacgtgcttccg aaggttttggcgaagatacgtatctcacctcaacaatgggtaaaaccatggtggaagcga tgcagggtaaaagcccggcagatcgctactcggtgatgaccagcgtcaaacactttgccg catacggcgcggtagaaggcggtaaagagtacaacaccgtcgatatgagtccgcagcgcc tgtttaatgattatatgccgccgtacaaagcggggctggacgcaggcagcggcgcggtga tggtggcgctgaactcgctgaacggcacgccagccacctccgattcctggctgctgaaag atgttctgcgcgaccagtggggctttaaaggcatcaccgtttccgatcacggtgcaatca aagagctgattaaacatggcacggcggcagacccggaagatgcggtgcgcgtggcgctga aatccggaatcaacatgagcatgagcgacgagtactactcgaagtatctgcctgggttga ttaaatccggcaaagtgacgatggcagagctggacgatgctgcccgccatgtactgaacg ttaaatatgatatggggttgtttaacgacccatacagccatttggggccgaaagagtctg acccggtggataccaatgccgaaagccgcctgcaccgtaaagaagcgcgtgaagtggcgc gcgaaagcttggtgttgctgaaaaaccgtctcgaaacgttaccgctgaaaaaatcggcca ccattgcggtggttgggccactggcggacagtaaacgtgacgtgatgggcagctggtccg cagccggtgttgccgatcaatccgtgaccgtactgaccgggattaaaaatgccgtcggtg aaaacggtaaagtgctgtatgccaaaggggcgaacgttaccagtgacaaaggcattatcg atttcctgaatcagtatgaagaagcggtcaaagtcgatccgcgttcgccgcaagagatga ttgatgaagcggtgcagacggcgaaacaatctgatgtggtggtggctgtagtcggtgaag cacaggggatggcgcacgaagcctccagccggaccgatatcactattccgcaaagccaac gtgacttgattgcggcgctgaaagccaccggtaaaccgctggtgctggtgctgatgaacg ggcgtccgctggcgctggtgaaagaagatcagcaggctgatgcgattctggaaacctggt ttgcggggactgaaggcggtaatgcaattgccgatgtattgtttggcgattacaacccgt ccggcaagctgccaatgtccttcccgcgttctgtcgggcagatcccggtgtactacagcc atctgaataccggtcgcccgtataatgccgacaagccgaacaaatacacttcgcgttatt 
ttgatgaagctaacggggcgttgtatccgttcggctatgggctgagctacaccactttca ccgtctctgatgtgaaactttctgcgccgaccatgaagcgtgacggcaaagtgactgcca gcgtgcaggtgacgaacaccggtaagcgcgagggtgccacggtagtgcagatgtacttgc aggatgtgacTgcttccatgagtcgccctgtgaaacagctgaaaggctttgagaaaatca ccctgaaaccgggcgaaactcagactgtcagcttcccgatcgatattgaggcgctgaagt tctggaatcaacagatgaaatatgacgccgagcctggcaagttcaatgtctttatcggca ctgattccgcacgcgttaagaaaggcgagtttgagttgctgggtactagtagcggccgct
gcag

INP in RFC12 format

  • This is <partinfo>BBa_K265008</partinfo>.
  • This came on pSB1AK3 and with an RFC12 prefix and suffix (capitals).
  • This has been sequenced, and our copy matches the expected sequence exactly except for an extra C base in the suffix (red).
  • (Blue) There is a SpeI site in the prefix (not the suffix) and an NheI site in the suffix. These are in frame and can probably be used in a modified BioSandwich protocol.

GAATTCGCGGCCGCACTAGTatgaccctggataaagcgctggtgctgcgcacttgtgcca ataatatggccgaccattgtggcctgatctggcctgcttctggtacagttgaaagccgct attggcagtctacacgtcgtcacgaaaacgggctggttgggctgctgtggggtgccggaa catccgcctttctgagcgttcatgccgatgctcgttggattgtttgtgaggttgctgttg ccgacatcattagcctggaagaaccgggtatggtgaaatttcctcgtgctgaagtggtcc atgttggtgatcgtattagcgcctcccatttcatttcagctcgtcaggctgatccagcaa gcacatcaacaagcacctctaccagtacactgacaccaatgccaaccgctatccctactc ctatgccggctgtagcctctgttactctgcctgtggccgaacaagctcgccacgaagtct ttgacgttgcctctgtgtctgctgctgctgctcctgttaacactctgcctgttaccacgc ctcaaaatctgcaaacggcgacctatggctcaacactgagtggtgataaccattctcgtc tgattgccgggtatggctctaacgaaaccgccggaaaccattctgatctgatcggtggtc acgattgtacgctgatggcaggtgatcaatctcgtctgaccgctggcaaaaattccgttc tgaccgctggggcacgttctaaactgatcgggagcgaaggctctactctgtccgccggag aagattctaccctgatctttcgtctgtgggatggtaaacgttatcgtcagctggtcgctc gtacaggtgaaaatggtgtggaagccgacattccgtattatgtgaatgaggacgatgata ttgtggacaaaccggacgaagatgatgattggatcgaagtgaaaGCTAGCGCGGCCGCcT
GCAG

pVIII

  • Blue is the leader sequence... MKKSLVLKASVAVATLVPMLSFA
  • Expected sequence is [http://www.ncbi.nlm.nih.gov/nuccore/56713234?report=genbank derived from this].

atgaaaaagtctttagtcctcaaagcctctgtagccgttgctaccctcgttccgatgctg tctttcgctgctgagggtgacgatcccgcaaaagcggcctttaactccctgcaagcctca gcgaccgaatatatcggttatgcgtgggcgatggttgttgtcattgtcggcgcaactatc ggtatcaagctgtttaagaaattcacctcgaaagcaagctga
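
The leader peptide quoted above can be reproduced by translating the first 69 bases of the sequence. A small sketch using the standard genetic code; the names `CODON_TABLE` and `translate` are chosen here for illustration and no external library is assumed:

```python
BASES = "tcag"
# Standard genetic code, codons ordered ttt, ttc, tta, ttg, tct, ... ggg.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate a DNA string in frame 0; '*' marks a stop codon."""
    s = "".join(seq.split()).lower()
    usable = len(s) - len(s) % 3  # ignore trailing bases that do not fill a codon
    return "".join(CODON_TABLE[s[n:n + 3]] for n in range(0, usable, 3))


# First 69 bases of the pVIII sequence above (23 codons).
leader_dna = "atgaaaaagtctttagtcctcaaagcctctgtagccgttgctaccctcgttccgatgctgtctttcgct"
print(translate(leader_dna))
```

The same function can be pointed at any of the coding sequences on this page, provided the prefix, the BglII site, and any capitalised pre-coding bases are removed first so that the string starts at the atg.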