Computational Systems Biology 1 6106, 5 cr

Transcription

Computational Systems Biology 1 6106, 5 cr
Computational Systems Biology 1
6106, 5 cr
Olli Yli-Harja
Tampere University of Technology
Institute of Signal Processing
Topics covered on CSB1
Introduction
Systems biology
Role of modeling in biosciences
Quantitative models
Discrete models
Measurement systems in biosciences
Array techniques
Models of measurement systems
Microscopy and image analysis
Data analysis
Microarray data
Supervised / Unsupervised learning
Biological databases and software
CSB1 – and other courses
Course homepage
SGN-6106 COMPUTATIONAL SYSTEMS BIOLOGY I.htm
Lecture and exercise schedule lectures.txt
Prerequisites
Basic vocabulary in biosciences ICSP_Program_2006-07.pdf
Basic skills in computation and signal processing SGN-6106.html
Continuation
CSB2 – Genomic data, models and data analysis in depth
CSB1 required
Some more computational skills required (e.g. statistics etc.)
Other course offerings at TUT/ISP
Computational biosciences
Genomic signal processing, SGN-6206 5cr
Computational models in complex systems, SGN-6467 5cr
Complex systems 1, SGN-6307
Graduate seminars
Seminar on Signal Processing for Systems Biology
Signal Processing Graduate Seminar IV
Other useful courses
Image processing
Pattern recognition
Statistical signal processing
Knowledge mining
First lecture, 6.2.2007
Introduction to the research environment:
TUT / Institute of Signal Processing
Introduction to the CSB research team at TUT/ISP:
Signal Processing for Systems Biology
Teaching provided by the research team
Curriculum in biotechnology
Curriculum in computer science
Research topics, examples
Signal Processing for Systems Biology
Olli Yli-Harja
Tampere University of Technology
Institute of Signal Processing
Research environment
Institute of Signal Processing (TUT/ISP)
research personnel 170, 11 professors
30-40% from abroad - also in faculty
topics:
audio, transforms, image processing, multimedia, medical imaging,
systems biology, cell phones, cellular signals, …
Academy of Finland Centre of Excellence, 1996-2000-2006-2011
Tampere International Centre of Signal Processing
Invites researchers to engage in research at the TUT/ISP
40 visits yearly since 1997
Catalyst of research and international connections
A truly international environment for research
What is systems biology?
Based on the availability of large scale measurement data
Knowledge of DNA & genes, microarray technology, single cell manipulation and
measurement …
Advancing the use of regular scientific research approaches in biology:
Mathematical models, Model-based prediction, …, control, …, design
Biology becoming part of physics! General laws in biology?
Interplay between models and experiments
Why is it necessary?
Amount of measurement data increases
Biological knowledge increases
Interpretation without tools is impossible
Problems
Prohibitive complexity
cell, tissue, organism, environment
Repeatability
Controllability of the experimental setup
How to obtain a comparable set of cells
for a repeat experiment
Huge engineering effort needed in data
integration
What has signal processing to offer
for systems biology?
Signal processing:
In the intersection of computer science, statistics, dynamic systems theory
Manipulation, storing, retrieval, analysis of measurement data
Systematic modeling of observations
Design of optimal algorithms based on mathematical model
Develop novel signal processing methods that can be applied in modern
systems biology
Building on fundamental research of signal processing
Inter-disciplinary collaboration with research partners
Main directions
Tool development
Quantitative models of biological systems
Modelling of novel biosystem measurement technologies
Image processing and analysis in an important role
Applied research in medicine
Data analysis
Basic research in theoretical biology
Discrete network models
Research team
(started in 2001)
Data Analysis (1+1)
Professor Jaakko Astola, Reija Autio
Computational Neuroscience (1+5)
Academy research fellow *Marja-Leena Linne, Tiina Manninen, Antti
Pettinen, Katri Hituri, Eeva Mäkiraatikka, Kalle Leinonen
Computational systems biology (1+2+9+6)
Professor Olli Yli-Harja, Dr. Harri Lähdesmäki, Miika Ahdesmäki,
Tommi Aho, Juha Kesseli, Antti Larjo, Antti Niemistö, Nikhil, Matti
Nykter, Antti Saarinen, *Jenni Seppälä, Olli-Pekka Smolander, KaisaLeena Taattola, Tomi Korpelainen, Antti Ylipää, Sakari Palokangas,
Virpi Kivinen, Timo Erkkilä
Image processing (1+4)
Lecturer Heikki Huttunen, Petri Hirvonen, Antti Lehmussola, Pekka
Ruusuvuori, Jyrki Selinummi
Teaching in computational bioscience
New major in Bio- and environmental engineering
in 2004:
Computational systems biology
for Biotechnology students
New major in Institute of Signal Processing
in 2005:
Computational systems biology
for Computer Science students
Joint projects and joint guidance of students
within Tampere area
Courses offered by the CSB research group
Old courses
Lecture course, Signal processing for systems biology, 40 students,
2003-2004
Seminars in microscopy arranged, 15-20 students, 2001-2005
Weekly research seminar, 20-30 students, since 2001
New courses in 2005-2006:
Introduction to computational systems biology, SGN-6056 5cr
Computational systems biology 1, SGN-6106 5cr
Computational systems biology 2, SGN-6156 5cr
Computational models in complex systems, SGN-6467 5cr
Complex systems 1, SGN-6307 5cr
Graduate seminars
Seminar on Signal Processing for Systems Biology, SGN-6906, 2-3 cr
Signal Processing Graduate Seminar IV, SGN-9406, 3-8 cr
International collaboration
Research visits in 2002-2007
(~ 178 person-months ~ 15 person-years)
Institute of Systems Biology, Seattle: Dr. Harri Lähdesmäki, Dr. Antti Niemistö, Dr. Matti Nykter, Antti Larjo, Jyrki
Selinummi, Miika Ahdesmäki (74m)
University of Texas, M. D. Anderson Cancer Center, Houston: Dr. Harri Lähdesmäki, Dr. Antti Niemistö, Dr. Matti
Nykter, Prof. Olli Yli-Harja, Dr. Daniel Nicorici (27m)
Institute for Biocomplexity and Informatics, University of Calgary: Dr. Pauli Rämö, Juha Kesseli, Tiina Manninen,
Antti Lehmussola, Andre Ribeiro (12m)
NHGRI, Washington & MIT Boston: Dr. Sampsa Hautaniemi (36m)
University of Jena, Germany: Tommi Aho (3m)
Texas A&M University: Pekka Ruusuvuori (3m)
University of Uppsala: Jenni Seppälä (2m)
University of Antwerp: Katri Hituri (3m)
University of York, UK: Kathryn Williams, X (6m)
*Visiting Tampere from University of California in Santa Barbara : John Berger (12m)
TICSP Workshop on Computational Systems
Biology held annually 2003-2006 WCSB06.htm
IEEE Workshop on Genomic Signal Processing in
June 2007 Gensips 2007.htm
Xochicalco, Mexico, 2006
Research directions
Discrete
models
Simulation
(quantitative
models)
Models and tools
for measurement
systems
Image processing
and analysis
Data analysis
and fusion
Simulation
Quantitative models of signalling and metabolic networks
Bioelectrical neuron models:
Linne et al., (2004) A Model Integrating the
Cerebellar Granule Neuron Excitability and Calcium
Signaling Pathways, Neurocomputing
Using stochastic differentian equations
Saarinen et al., (2006) Modeling single neuron
behavior using stochastic differential equations,
Neurocomputing
Manninen et al., (2006) Developing Itô stochastic
differential equation models for neuronal signal
transduction pathways, Computational Biology
and Chemistry
Evaluation of existing simulation tools:
***Pettinen et al., (2005) Simulation Tools for
Biochemical Networks: Evaluation of Performance
and Usability, Bioinformatics
PKC signalling network in mammal neurons
Ordinary and stochastic differential equations
Case studies in mammal neurons, yeast,
bacteria
Main collaborators
Ari Huovila, Institute of Medical
Technology, University of Tampere
Jaakko Puhakka, Institute of Environmental
Engineering and Biotechnology, Tampere
University of Technology
Microbial hydrogen production
Joint research project with Jaakko
Puhakka,
TUT
Bioand
environmental
engineering
Reactor simulation
Performing
experiments
r/y
Experimental
research cycle
Analyzing results
r/min
Computational
research cycle
Designing
experiments
r/min
Proposing new
hypotheses
Models of biosystem measurement systems
Modeling and inversion of the effects
of the measurement system:
Lähdesmäki et al., (2005) In
silico microdissection of
microarray data from
heterogeneous cell populations,
BMC Bioinformatics
***Lähdesmäki et al., (2003)
Estimation and Inversion of the
Effects of Cell Population
Asynchrony in Gene Expression
Time-Series, Signal Processing
Microarray model
Nykter et al., (2005) Simulation
of microarray data with realistic
characteristics, BMC
Bioinformatics
Main collaborators:
Wei Zhang, University of Texas
M. D. Anderson Cancer Center
Stuart Kauffman, Institute for
Biocomplexity and Informatics,
University of Calgary
Signal processing approach:
Systematic modeling of observations
Design of optimal algorithms based on
mathematical models
Validation of models and analysis tools
Integrated image analysis
Effects of Cell Population Asynchrony in Gene
Expression Time-Series
Gene expression is measured from a cell population
Cell population gradually loses its synchrony
This corresponds to a time-varying low-pass filtering
of the “true” gene expression time-series
Problem: Not all genes are synchronized
Lähdesmäki et al., (2003) Estimation and inversion
of the effects of cell population asynchrony in gene
expression time-series. Signal Processing
Subcellular Image Analysis
Analysis of dispersion of the Golgi apparatus after different treatments
(SH-SY5Y neuroblastoma cells)
Jyrki Selinummi, Antti Lehmussola, Jertta-Riina Sarkanen, Jonna Nykky, Tuula O. Jalonen, Olli Yli-Harja
“Automated Analysis of Golgi Apparatus Dispersion in Neuronal Cell Images”
Proc. 4th TICSP Workshop on Computational Systems Biology (WCSB 2006), Tampere, 2006.
Image processing and analysis for biomeasurements
Measurements based on imaging:
Niemistö et al., (2005) Robust quantification of
in vitro angiogenesis through image analysis,
IEEE Transactions on Medical Imaging
Niemistö et al., (2005) Analysis of angiogenesis
using in vitro experiments and stochastic
growth models, Physical Review E
***Selinummi et al.,(2005) Software for
quantification of labeled bacteria from digital
microscope images by automated image
analysis, Biotechniques
CellC-software, freely available
Quantitative measurements based on
microscope images of cell populations
Analysis and synthesis of images
Simulated of microscope images!?
Main collaborators
Wei Zhang, University of Texas,
M. D. Anderson Cancer Center
Jaakko Puhakka, Institute of Environmental
Engineering and Biotechnology, Tampere
University of Technology
Olli-Pekka Kallioniemi, Technical Research Centre
and University of Turku, Finland
v
Motivation for the Use of
Simulation in Validation
Xiaobo Zhou, Stephen T.C. Wong, ”Informatics challenges
of high-throughput microscopy” IEEE Signal Processing
Magazine, May 2006:
”..when we detect or track lots of cells, proteins, and neuron spines, how
can we validate the extracted results of automated analysis since it is
almost impossible to perform similar manual analysis in high-throughput
scale?”
”Manual methods can only be used to tens or at most up to hundreds of
images. They would make lots of counting errors when analyzing hundreds
or thousands of images… Thus, simulation of complex biological processes
and images becomes an urgent issue.”
”Should such simulation model exist, it would be invaluable in validating
HTS image analysis algorithms without relying on laborious and costly
manual validation”
Simulation of Images of Cell Populations
Motivation for simulation of measurement systems
Unlimited amounts of free data
Free of financial and time constraints
Total control of the experiments
Hypothesis testing
Education
Ground-truth information
Valuable information for benchmarking various analysis methods
Reduces requirements for preprocessing and quality control
Antti Lehmussola, Pekka Ruusuvuori, Jyrki Selinummi, Heikki Huttunen, Olli YliHarja “Computational framework for simulating fluorescence microscope
images with cell populations”
Manuscript submitted.
Antti Lehmussola, Jyrki Selinummi, Pekka Ruusuvuori, Antti Niemistö, Olli Yli-Harja
“Simulating fluorescent microscope images of cell populations”
Proceedings of the 27th Annual International Conference of the IEEE Engineering in
Medicine and Biology Society (EMBC'05), Shanghai, China, September 1-4, 2005,
pp. 3153-3156.
Simulation of Images of Cell Populations
Simulated images share properties of real
fluorescence microscope images
Example: real image (A) and synthetic image (B)
Realistic enough for validating e.g. image analysis
algorithms
Comparison of Cell Enumeration Algorithms
Cell enumeration software are compared
Five different clustering conditions
Each image has the same number of cells (1000)
Cells become more clustered Æ more overlapping
Explorative data analysis
Clustering and explorative analysis for biological data:
Lähdesmäki et al., (2004) Distinguishing Key Biological
Pathways Between Primary Breast Cancers and their
Lymph Node Metastases by Gene Function-based
Clustering Analysis, International Journal of
Oncology
D. Nicorici et al., (2006) Finding Large Domains of
Similarly Expressed Genes using MDL Principle, IEEE
Engineering in Medicine and Biology Magazine
Visualization, validation of results
***Nykter et al., (2006) Unsupervised analysis
uncovers changes in histopathologic diagnosis in
supervised genomic studies. Technology in Cancer
Research and Treatment
MDS
of
microarray
data
Labels from
clinical
database
Inter-disciplinary cancer research
Normalization, visualization
Main collaborators:
Wei Zhang, University of Texas,
M. D. Anderson Cancer Center
Olli-Pekka Kallioniemi, Technical Research
Centre and University of Turku, Finland
Jukka Partanen, Finnish blood service
Lauri Aaltonen, University of Helsinki
Discrete large-scale models of biosystems
Annealed Boolean dynamics:
Kesseli et al., (2005) On Spectral Techniques
in Analysis of Boolean Networks, Physica D
Kesseli et al., (2005) Tracking perturbations in
Boolean networks with spectral methods,
Physical Review E
Kesseli et al., (2006) Iterated maps for
annealed Boolean dynamics, Physical
Review E
Criticarity in gene regulatory networks
***Rämö et al., (2006) Perturbation
Avalanches and Criticality in Gene Regulatory
Networks, Journal of Theoretical Biology
Rämö et al., (2005) Stability of Functions in
Boolean Models of Gene Regulatory Networks,
Chaos
x
t +1
Order, chaos and criticality
in genetic regulatory networks
= f (x )
t
Complex systems
General laws in biology
New biological observables
Randomized network ensembles
Main collaborators:
Stuart Kauffman, Institute for Biocomplexity and
Informatics, University of Calgary
Ilya Shmulevich, Institute of Systems Biology, Seattle
Information theoretic approach to biology
Inference of Boolean networks from data:
Lähdesmäki et al., (2003) On Learning Gene
Regulatory Networks Under the Boolean Network
Model, Machine Learning
Lähdesmäki et al., Relationships between
probabilistic Boolean networks and dynamic
Bayesian networks as models of gene regulatory
networks, Signal Processing, accepted.
Information theoretic approach
Rämö et al., Information Propagation in Models of
Gene Regulatory Networks, Physica D,
submitted
***Nykter et al., I. Information flow in complex
networks and evolution: A universal approach,
submitted
Main collaborators:
Ilya Shmulevich, Institute of Systems Biology, Seattle
Stuart Kauffman, Institute for Biocomplexity and
Informatics, University of Calgary