Title: A FRAMEWORK BASED ON WEB SERVICES ORCHESTRATION FOR BIOINFORMATICS WORKFLOW MANAGEMENT
1A FRAMEWORK BASED ON WEB SERVICES ORCHESTRATION
FOR BIOINFORMATICS WORKFLOW MANAGEMENT
- Laboratory for Bioinformatics (LBI), Institute of
Computing (IC) - UNICAMP
2Topics
- Motivation
- Goals
- Related issues
- The framework
- Methodology
- Conclusions
- Future work
3Motivation
- Genome assembly and annotation pipeline
human validation
assembly
assembly OK
annotation
contigs
reads
incremental assembly
new reads generation
4Motivation
- The growth of bioinformatics activities
- Data
- Services
- Data and services dont use public standards
5Goals
- Specification and development of a framework that
allows - Data integration
- Service integration
- Modeling of complex tasks as workflows
- Coordination of workflow execution
6This talk
- Work in progress
- no results yet
7Overview user interaction
8Related issues
- Web services
- a software application identified by a URI,
whose interfaces and bindings are capable of
being defined, described, and discovered as XML
artifacts W3Cwebservices
9Related issues
- Workflows
- Workflows represent a set of activities to be
executed, their interdependencies relations,
inputs and outputs.
activity1
input data1
activity3
output data
activity2
input data2
input data3
activity4
10Related issues
- Service coordination
- Service orchestration is a centralized mechanism
that describes how diverse services can interact.
This interaction includes message exchange,
business logic and order of execution - We are using PBEL4WS as the specification
language for service orchestration
11Related issues
- Bioinformatics tools and data
- Selection of basic bioinformatics tools for
genomic assembly and annotation - Selection of some important data sources
- Use of tools and data ontology.
12Related issues
- Example of part of a tool ontology
Alignment service
Local Alignment
Global Alignment
Heuristic Alignment
Non-Heuristic Alignment
Non-Heuristic Alignment
Heuristic Alignment
13The framework
14The framework
- Service layer
- bioinformatics basic Web services
- assembly,
- matching,
- consensus,
- etc
- Service catalog layer
- stores Web services'
- syntactical description
- semantical description
- URI.
15The framework
- Service discovery
- search by
- functionality,
- context,
- syntax.
- Service request layer
- management of each Web service solicitation
- Sending input data
- Receiving results
- Detecting service failure.
16The framework
- Workflow engine layer
- controls execution of all workflow tasks, via
orchestration. - The main functions
- interpretation of the process (or task)
definition, - creation and management of process instances,
- navigation between activities,
- supervisory functions.
- Workflow design layer
- supports workflow specification and edition.
- The facilities provided are
- graphical interface for workflow edition,
- service list,
- interface description of selected services,
- syntactical check of workflow.
17Methodology
18Methodology
- Services
- bioinformatics basic services
- specification
- development.
- metadata types definition
- Service discovery
- development of techniques for service discovery
and request using syntactic and semantic search
mechanisms.
19Methodology
- Workflow
- specification and development of methods for
workflow design and execution - Design of workflows is being done using WOODS
- Specification and implementation of an
orchestration mechanism.
Seffino, L.A., Medeiros, C.B., Rocha, J.V.R.,
Yi, B. WOODS- a spatial decision support system
based on workflows. Decision Support Systems 27
(1999) 105-123
20Conclusions
- The main contribution is the framework itself
- It allows multi-institutional cooperation,
sharing - Data
- Tools
- Workflows
- It can be the interface among various kinds of
users of different research centers
21Conclusions
- Other contributions lie in
- Scientific workflows specification and publishing
(using Web services as basic units) - Semantic specification of bioinformatics tasks
- Definition of a generic methodology for data and
tools integration.
22Thank you!
Laboratory for Bioinformatics www.lbi.ic.unicamp.b
r Institute of Computation (IC)
www.ic.unicamp.br University of Campinas
(UNICAMP) www.unicamp.br Luciano Antonio
Digiampietri luciano_at_ic.unicamp.br