Computational Systems Biology
Sauro Lab
University of Washington
Home  |   Downloads  |   News  |   Research  |   Papers  |   About Us  |   Contact Us  |   SBW Help   |   Jobs
     


What is SBW 
Research Impact 
Published Papers 
Lab Members 
Available Positions 
CSB Links 
Courses 
Student Projects 
 
 
 
Software Downloads:

1. SBW
2. JDesigner
3. Jarnac
4. WinSCAMP
5. Matlab Interface
6. Optimization
6. Bifurcation
 
maintained by Frank Bergmann
 
 
 

Conservation Analysis for Large Models

Information Provided by Ravi Rao

Update: (5 Dec 2005): This work is now available as a paper, and can be obtained from the Advance Access section of Bioinformatics (See Link: doi:10.1093/bioinformatics/bti800).

What constitutes a large model?

Models of biochemical networks are indicative of the complexity of the system represented. Often, the size of such models tends to increase as more data becomes known and the scope of the model is enhanced. This is even true of models that are built by combining smaller networks. Conservation analysis of such large networks using current set of tools is likely to be fraught with numerical error. This is due to the numerical instability of methods such as Gaussian Elimination (for e.g. used in Jarnac) and LU Decomposition (COPASI; PySCeS). We have developed a new algorithm that is robust for computing the conserved cycles of large systems. This method makes use of the Householder QR factorization to generate the correct conserved cycles.

Description of Models

In the following, four examples are used to show the capability of this method. The robustness and performance of this method is also compared with those used by other tools. These four examples include

  1. iCS291 - H. pylori: 396 Species, 381 Reactions (xls, SBML, Jarnac, PySCeS) Ref: Schilling, C. H., et al., J. Bacteriology, 184(16): pp. 4582-4593 (2002).
  2. iJE660a - E. coli: 537 Species, 739 Reactions (xls, SBML, Jarnac, PySCeS).
  3. iJR904 - E. coli: 764 Species, 931 Reactions (xls, SBML, Jarnac, PySCeS) Ref: Reed, J. L., et al., Genome Biology, 4(9): pp. R54.1-R54.12 (2003).
  4. iND750 - S. cerevisiae: 1072 Species, 1149 Reactions (xls, SBML, Jarnac, PySCeS) Ref: Duarte, N.C., et al., Genome Research, 14 (7), 1298-1309.

The SBML models were built using the Excel spread sheet versions developed by the Systems Biology Group at the University of California, San Diego (see In silico Organisms). The reactions comprising the network were extracted from the Excel spread sheets and used to build a Jarnac file. This Jarnac file was then used to generate the SBML file to be used by our conservation analysis algorithm. We are thankful to Brett Olivier for generating the equivalent PyCSeS formats for two of the models - iJR904 and iND750. (For the other two models, the PyCSeS format was generated by using a script written in PERL. A similar script was written to produce the Jarnac file as well. The PERL script reads the Excel file, after it has been saved as a tab-delimited file and parses the reaction information to build the equivalent Jarnac or Pysces files. These are two bare-bones PERL scripts - build_jarnac.pl and build_pysces.pl - with no documentation, so please contact me if you are interested in using them - and I will email instructions on using them. Please note that the scripts do not handle all special characters in the reactions and reaction names - so the generated scripts have to be checked to eliminate errors).

Results of Conservation Analysis

The results from conservation analysis on these models are listed below. These are saved in files that can be seen or downloaded following the links for each model. The results from our algorithm (Householder QR) are named as modelname_QR.txt while that for our implementation of the LU factorization are named as modelname_LU.txt. Results from Jarnac are given by modelname_JN.txt and those from PySCeS are given as modelname_PY.txt. (note that COPASI results are available as text files as the results could only be viewed, but not saved). The conservation laws obtained from each of these models are in bold letters.

iCS291 iJE660a iJR904 iND750
Householder QR 36
iCS291_QR.txt
19
iJE660a_QR.txt
43
iJR904_QR.txt
100
iND750_QR.txt
LU (full pivoting) 36
iCS291_LUf.txt
19
iJE660a_LUf.txt
42
iJR904_LUf.txt
100
iND750_LUf.txt
LU (partial pivoting) 36
iCS291_LU.txt
19
iJE660a_LU.txt
42 iJR904_LU.txt 100 iND750_LU.txt
Jarnac 36
iCS291_JN.txt
19
iJE660a_JN.txt
41 iJR904_JN.txt 98
iND750_JN.txt
PySCeS 36
iCS291_PY.txt
19
iJE660a_PY.txt
41
iJR904_PY.txt
100
iND750_PY.txt
COPASI 31 16 36 89


Here is a picture of a part of the large network iJE660a showing the complexity involved. (Thanks to Anastasia Deckard for generating a PS file for the whole network) We will try to have similar images of the other large networks in near future.

Simulation Times (in seconds) - 10 Runs

iCS291 i JE660a iJR904 iND750
Method QR LU QR LU QR LU QR LU
Run1 1.485 1.063 6.828 4.094 14.563 8.625 33.735 20.75
Run 2 1.5 1.078 6.875 4.172 14.548 8.594 30.095 16.422
Run 3 1.516 1.079 6.5 4.25 14.657 8.469 30.203 16.641
Run 4 1.5 1.078 6.68 4.218 14.891 8.579 29.844 17.031
Run 5 1.515 1.094 6.875 4.094 14.547 8.766 29.969 16.828
Run 6 1.516 1.062 6.844 4.062 14.453 8.594 29.75 16.937
Run 7 1.484 1.047 6.953 4.11 14.626 8.407 29.767 16.75
Run 8 1.547 1.078 6.953 4.094 14.469 8.437 30.017 16.485
Run 9 1.5 1.062 7.047 4.187 14.672 8.468 29.782 16.641
Run 10 1.609 1.047 7.359 4.156 14.453 8.422 29.719 17
Average 1.51177 1.0688 6.8914 4.1437 14.5879 8.5361 30.2881 17.1485
 
sysbio/conservationanalysis/moredetails/largemodels.txt · Last modified: 2007/08/02 11:29 by mallain
 

    Home  |   About Us  |   Contact Us  |   Statistics
Recent changes RSS feed Creative Commons License Donate Powered by PHP Valid XHTML 1.0 Valid CSS Driven by DokuWiki