Team:Edinburgh/Genetic Instability

From 2011.igem.org

(Difference between revisions)
Line 8: Line 8:
==What is genetic instability?==
==What is genetic instability?==
-
Genetic instability refers to the random reactions affecting DNA. When a lot of similar sequences of DNA are introduced to a cell, this can potentially lead to those sections of DNA recombining in an uncontrolled manner. This will not be a problem in our lab strain of ''E. coli'', JM109, as it lacks an important recombinase enzyme. However, use in industry requires hardy strains that necessarily possess recombinases.
+
 
 +
When a lot of similar sequences of DNA are introduced to a cell, this can potentially lead to those sections of DNA recombining in an uncontrolled manner. This will not be a problem in our lab strain of ''E. coli'', JM109, as it lacks the recombinase enzyme which causes this to happen. However, industriy requires hardy strains of ''E. coli'' that must possess recombinases to deal with their high stress working environment.
'''But there is a solution!'''
'''But there is a solution!'''

Revision as of 18:31, 19 July 2011

OK, so apparently there exists this thing called genetic instability, and we need to counter it...

Contents

What is genetic instability?

When a lot of similar sequences of DNA are introduced to a cell, this can potentially lead to those sections of DNA recombining in an uncontrolled manner. This will not be a problem in our lab strain of E. coli, JM109, as it lacks the recombinase enzyme which causes this to happen. However, industriy requires hardy strains of E. coli that must possess recombinases to deal with their high stress working environment.

But there is a solution!

If we only had many different (as far apart as possible) DNA sequences coding for the same protein, the world would be saved!

Enter Team Synergy's genetic stability tool! It will do exactly that - using DNA's natural property of redundancy, it will find a number of sequences of DNA for you! And they will code for the same protein! Superb!

So, how does it work?

Possible modes

Random generation

This is a very crude mode, in which a random sequence of codons coding for analogous amino acids is generated, e.g. if you had:

atgaaaaagtctttagtcctcaaagcctctgtagccgttgctaccctcgttccgatgctgtctttcgct...
(the leader sequence for pVIII),

the program would translate it to amino acids sequence, i.e.:

MKKSLVLKASVAVATLVPMLSFA...

and then find a random codon for each amino acid, put them together, and spit out the result.

This sound too easy to be right... that's right!

Best codon

In this mode, the tool would choose the best codon per every amino acid. What is 'best codon'? It's the codon which has the most base pairs different from the original one.

For example, Leucine can be coded by UUA, UUG, CUU, CUC, CUA, or CUG. If the original sequence uses UUA, then the best codon method would choose CUU, CUC or CUG. Yeaaaah... . There often are plenty of 'best' codons, so this method would be improved if it looked at the wider context...

Genetic Algorithm

This is a very powerful algorithm, inspired by Computer Scientists observing Biology. It represents a problem using a string of characters - i.e. the 'DNA', then solutions are 'individuals', there is many of them in a 'generation', and they 'cross-over' and 'mutate' their 'genes' at some randomised rate. The most 'fit' individuals get into the next generation.

That means it is perfect for applying it in Biology!

We use this algorithm to give you the DNA sequences which are as far apart from the original as possible. In this mode, the tool also chooses codons which are well-expressed in E.coli. And, it can return more than one sequence of DNA.