Team:UNIPV-Pavia/Modelling03

From 2011.igem.org

(Difference between revisions)
Line 58: Line 58:
-
<a name="The importance of the mathematical model"></a><h2> <span class="mw-headline"> <b>The importance of the mathematical model</b> </span></h2>  
+
<a name="The importance of the mathematical model"></a><h2> <span class="mw-headline"> <b>The importance of mathematical modelling</b> </span></h2>  
<div style='text-align:justify'>The purposes of writing mathematical models for gene networks can be: </div>  
<div style='text-align:justify'>The purposes of writing mathematical models for gene networks can be: </div>  
<br>
<br>

Revision as of 22:56, 9 September 2011

UNIPV TEAM 2011

CTRL + E

Signalling is nothing without control...


Contents



Mathematical modelling: introduction

Mathematical modelling plays nowadays a central role in Synthetic Biology, due to its ability to serve as a crucial link between the concept and realization of a biological circuit: what we propose in this page is a modelling approach to our project, which has proved extremely useful and very helpful before and after the "wet lab".
Thus, immediately at the beginning, when there was little knowledge, a mathematical model based on a system of differential equations was derived and performed using a set of reasonable parameters, so as to validate the feasibility of the project. Once this became clear, starting from the characterization of the single subparts created in the wet lab, some of the parameters of the mathematical model were estimated (the others are known from literature) and they have been implemented in the same model, in order to predict the final behaviour of the whole engineered closed-loop circuit.

Therefore here, after a brief overview about the advantages that modelling engineered circuits can bring, we deeply analyze the system of equation formulas, underlining the role and the function of the parameters involved.
Experimental procedures for parameters estimation are discussed and, finally, a different type of circuit is presented and simulations performed, using ODEs with MATLAB and explaining the difference between a closed-loop model and an open one.


The importance of mathematical modelling

The purposes of writing mathematical models for gene networks can be:

  • Prediction: in the initial steps of the project, a good a-priori identification in silico allows to suppose the kinetics of the enzymes (AiiA, Luxi) and HSL involved in our gene network, basicly to understand if the complex circuit structure and functioning could be achievable and to investigate the range of parameters values ​​for which the behavior is that expected.

  • Parameter identification: using the lsqnonlin function of MATLAB it was possible to get all the parameters involved in the model, in order to perform realistic simulations not only of the single subparts created, but also of the whole final circuit, according to the a-posteriori identification.

  • Modularity: studing and characterizing simple BioBrick Parts can allow to reuse this knowledge in other studies, facing with the same basic modules.

    NOTE1: In order to better investigate the range of dynamics of each subparts, every promoter has been considered with 4 different RBSs, so as to develop more knowledge about the state variables in several configurations of RBS' efficiency. Hereafter, referring to the notation "RBSx" we mean, respectively, RBS30, RBS31, RBS32, RBS34.


    Equations for gene networks




    Equations (1) and (2)

    Equations (1) and (2) have identical structure, differing only in the parameters involved. They represent the synthesis degradation and diluition of both the enzymes of the circuit, LuxI and AiiA, respectively in the first and second equation: in each of them both transcription and translation processes have been condensed.
    These equations are composed of 2 parts:

    1. The first term describes, through Hill's equation formalism, the synthesis rate of the protein of interest (either LuxI or AiiA) depending on the concentration of the inducible protein (anhydrotetracicline -aTc- or HSL respectively). As can be seen in the parameters table (see below), α refers to the maximum activation of the promoter, δ stands for its leakage activity (this means that the promoter is quite active even if there is no induction). In particular, in equation (1), the quite total inhibition of pTet promoter is due to the constitutive production of TetR by our MGZ1 strain, while, in equation (2), pLux is almost repressed in the absence of the complex LuxR-HSL.
      In equation (2) only HSL seems to be the inducer, instead of the complex LuxR-HSL. This is motivated by the fact that our final device offers a constitutive production of LuxR (due to the upstream constitutive promoter pLac), so that, assuming it abundant in the cytoplasm, we can derive the semplification of attributing pLux promoter induction only by HSL: this is the reason why we didn' t consider LuxR in the equations system as well as LuxI and AiiA. Furthermore, in both equations k stands for the dissaciation constant of the promoter from the inducer (respectively aTc and HSL in eq. 1 and 2), while η is the cooperativity constant.

      The second term in equations (1) and (2) is in turn composed of 2 parts. The first one (γ*LuxI or γ*AiiA respectively) describes, with a linear relation, the degradation rate per cell of the protein. The second one (μ*(Nmax-N)/Nmax)*LuxI or μ*(Nmax-N)/Nmax)*AiiA, respectively) takes into account the dilution term due to cell growth and is related to the cell replication process.

    Equation (3)

    Here the kinetics of HSL is modeled, basicly through enzymatic reactions either related to the production or the degradation of HSL: based on the experiments performed, we derived appropriate expressions for HSL synthesis and degradation.
    3 parts have been identified in this equation:

    1. The first term represents the production of HSL due to LuxI expression. We model this process with saturation curve in which Vmax is HSL maximum transcription rate, while KM is the dissociation constant of LuxI from the substrate HSL and it represents the concentration of LuxI at which HSL synthesis rate is Vmax/2.

    2. The second term represents the degradation of HSL due to the AiiA expression. Similarly to LuxI, Kcat represents maximum degradation per unit of HSL concentration, while KM1 is the concentration at which AiiA dependent HSL concentration rate is (Kcat*HSL)/2.

    3. The third term (γHSL*HSL) is similar to the corresponding ones present in the first two equations and describes the intrinsic protein degradation.

      NOTE2: the whole equation, except for the term of intrinsic degradation of HSL, is multiplied by the number of cells N, due to the property of the lactone to diffuse free inside/outside bacteria. Notice that, in system equation, LuxI and AiiA amounts are expressed per cell.

    Equation (4)

    This is the common logistic cell growth, depending on the rate μ and the maximum number NMAX of cells per well reachable.


    Table of parameters and species


    Parameter & Species Description Unit of Measurement Value
    αpTet maximum transcription rate of pTet (related with RBSx efficiency) [(AUr/min)/cell] -
    δpTet leakage factor of promoter pTet basic activity [-] -
    ηpTet Hill coefficient of pTet [-] -
    kpTet dissociation costant of aTc from pTet [nM] -
    αpLux maximum transcription rate of pLux (related with RBSx efficiency) [(AUr/min)/cell] -
    δpLux leakage factor of promoter pLux basic activity [-] -
    ηpLux Hill coefficient of pLux [-] -
    kpLux dissociation costant of HSL from pLux [nM] -
    γpLux LuxI costant degradation [1/min] -
    γAiiA AiiA costant degradation [1/min] -
    γHSL HSL costant degradation [1/min] -
    Vmax maximum transcription rate of LuxI [nM/(min*cell)] -
    KM dissociation costant of LuxI from HSL [AUr/cell] -
    Kcat maximum number of enzymatic reactions catalysed per minute [1/(min*cell)] -
    KM1 dissociation costant of AiiA from HSL [AUr/cell] -
    Nmax maximum number of bacteria per well [cell] -
    μ rate of bacteria growth [1/min] -
    LuxI kinetics of enzyme LuxI [AUrcell] -
    AiiA kinetics of enzyme AiiA [AUrcell] -
    HSL kinetics of HSL [nM(min)] -
    N number of cells cell -


    Parameter estimation

    The philosofy of the model is to predict the behavior of the final closed loop circuit starting from the characterization of single BioBrick parts through a set of well-designed ad hoc experiments. Relating to these, in this section the way parameters of the model have been identified is presented. As explained before in NOTE1, considering a set of 4 RBS for each subpart to caracterize increase their range of dynamics and helps us to understand deeplier the interactions between state variables and parameters.

    Promoter (PTet & pLux)

    These are the first subparts tested. In this phase of the project the target is to learn more about promoter pTet and pLux. Characterizing only promoter BioBrick is quite impossible: for this reason we consider promoter and the respective RBS from RBSx set together. Here a strong and reasonable hypothesis must be pointed out: the number of fluorescent protein produced, due to the concentration of induction (aTc, HSL for Ptet, Plux respectively) is exactly the same as the number given by any other protein that would be expressed instead of the mRFP. In other words, in our hypotesis, if we would substitute the mRFP coding region with a region coding for another protein, we would obtain the same synthesis rate: this is the reason why the strength of the complex promoter-RBSx is expressed in Arbitrary Units [AUr].
    As shown in the box below, we consider a range of induction and we monitor, during the time, absorbance (line1, line2) and fluorescence (line3); the two vertical segments for each figure highlight the exponential phase of bacteria' s groth. Scell (explained few lines below) can be derived as a function of inducer concentration, thereby providing the desired input-output relation (inducer concentration versus promoter+RBS activity), which was modelled as a Hill curve.
    After that, we can calculate the Scell as:
    In the end, plotting Scell VS induction, we obtain the activation Hill curve of the considered promoter.
    As shown in the box above, α as already mentioned, represent the protein maximum synthesis rate, which is reached, in accordance with Hill's formalism, when the inducer concentration tends to infinite, and, more practically, for sufficently high concentrations of inducer, meanwhile the product α*δ stands for the leakage activity (at no induction), liable for protein production (LuxI and AiiA respectively) even in the absence of autoinducer. The paramenter η is the Hill's cooperativity constant and it affects the rapidity and ripidity of the switch like curve relating Scell with the concentration of inducer. Lastly, k stands for the semi-saturation constant and, in case of η=1, it indicates the concentration of substrate at which half the synthesis rate is achieved. The unities of the various parameters can be easily derived considering the Hill equation(for more details see the Table of parameters above).


    AiiA & LuxI

    On a biological level, the ability to control the concentration of a given molecule reveals fundamental in limiting the metabolic burden of the cell; moreover, in the particular case of HSL signalling molecules, this would give the possibility to regulate quorum sensing based population's behaviours.
    In this paragraph are shown the experiments whose target is to learn the degradation mechanism of HSL due to the expression of AiiA, in order to estimate Kkat end KM1 parameters. These tests have been performed using the following BioBrick:
    As said before, we assume that, in the case of same induction, the same amount of protein would be produced, regardless of the gene encoding: knowing quantitively the production of mRFP, we are so able to predict the concentration of the enzyme AiiA. Moreover, it's possible to quantify exactly the concentration of HSL, using the well-characterized BioBrick BBa_T9002.
    Before discussing parameter estimation, it's good to spend few words about this device. It's a biosensor which receives HSL concentration as input and returns GFP intensity (more precisely SCell) as output. According to this, it' s necessary to know very well the reationship input-output: a curve of "calibration" of T9002 is obtain for each test performed, even if, in theory, it should be always the same.

    So, our idea is to control the degradation of HSL in time. aTc activates pTet and, after having waited enough for AiiA to become in stationary phase, a certain concentration of HSL is given. Then, in precise time samples absorbance and HSL concentration are monitored, the last reading the fluorescence of T9002. Kcat represents HSL maximum degradation rate per unit of HSL, reached when AiiA concentration is far above KM1. KM1 is the dissociation constant between AiiA and HSL.
    File:Degradation.jpg
    Now, considering the exponential growth, the concentration of the AiiA is supposed to be constant: after the cell division, fewer enzyme is present into the single bacteria but, on the other hand, the number of cells has increased, and so the enzyme equilibrium is conserved. Due to a well-known induction of aTc, the steady-state level per cell can be calculated:

    Then considering, for the same match of promoter and RBS, several induction of aTc and, for each of it, several samples of HSL concentration during the time, parameters Kcat and KM1 must be estimated, through numerous iterations of an algorithm that includes the functions lsqnonlin and ODE of MATLAB.

    N

    The parameters Nmax and μ can be calculated from the analysis of the OD600 produced by our MGZ1 culture. In particular, μ is derived as the slope of the log(OD600) growth curve. Nmax is determined with a proper procedure. After having reached saturation phase and having retrieved the corresponding OD600, we take a sample of the culture and make serial dilutions of it, then we plate the final diluted culture on a Petri and wait for the formation of colonies. The dilution serves to avoid the growth of too many and too close colonies in the Petri. Finally, we count the number of colonies, which correspond to Nmax.

    Simulations

    Now that we have gone deep into the various aspects of the mathematical model of our closed loop, it's time to explain why it is advantageous with respect to the open loop. In order to see that, we implemented and simulated in Matlab our closed loop circuit and the open loop one, consisting of the same construct without the feedback loop, which is represented by . The table below provides the values for the parameters of the model. disegno, promotore a valle di AiiA sempre acceso. grafici e differenze, mostrare che AiiA in qll ad anello aperto sta vistosamente sopra---> spreco di energia da parte della cellula.

    References

  • Retrieved from "http://2011.igem.org/Team:UNIPV-Pavia/Modelling03"