caBIG Pilot Project Selection Process - PowerPoint PPT Presentation

About This Presentation
Title:

caBIG Pilot Project Selection Process

Description:

September 16, 2003
Created Date: 9/2/2003 2:23:47 PM
Document presentation format: On-screen Show

Slides: 26
Provided by: osgdocdbO

Transcript and Presenter's Notes

1
caBIG and caGrid: Interoperable Computing
Infrastructure for the Nation's and World's
Cancer Research Enterprise
Peter A. Covitz, Ph.D.
Chief Operating Officer
National Cancer Institute Center for Bioinformatics
2
  • The Center for Bioinformatics is the NCI's
    strategic and tactical arm for research
    information management
  • We collaborate with both intramural and
    extramural groups
  • Mission: to integrate and harmonize disparate
    biomedical research data
  • Production, service-oriented organization.
    Evaluated based upon customer and partner
    satisfaction.

3
The Problem
  • 1,372,910 new cancer cases and 570,280 deaths due
    to cancer expected in the U.S. in 2005
  • Jemal et al., CA Cancer J Clin 2005; 55:10-30

4
A National Response
  • Enable investigators and research teams
    nationwide to combine and leverage their findings
    and expertise.
  • Create a scalable, actively managed organization
    that will connect members of the NCI-supported
    cancer enterprise by building a biomedical
    informatics network

The Cancer Biomedical Informatics Grid (caBIG)
5
Scenario from caBIG Strategic Plan
  • A researcher involved in a phase II clinical
    trial of a new targeted therapeutic for brain
    tumors observes that cancers derived from one
    specific tissue progenitor appear to be strongly
    affected.
  • The trial has been generating proteomic and
    microarray data. The researcher would like to
    identify potential biochemical and signaling
    pathways that might be different between this
    cell type and other potential progenitors in
    cancer, deduce whether anything similar has been
    observed in other clinical trials involving
    agents known to affect these specific pathways,
    and identify any studies in model organisms
    involving tissues with similar pathway activity.

6
Interoperability
  • ability of a system to access and use the parts
    or equipment of another system

Semantic interoperability
Syntactic interoperability
7
SYNTACTIC
caBIG Compatibility Guidelines
8
caCORE
  • Model Driven Architecture Computable Semantics
  • Platform for Syntactic and Semantic
    Interoperability

9
(No Transcript)
10
caCORE
11
Bioinformatics Objects
12
Common Data Elements
  • What do all those UML data Classes and Attributes
    actually mean, anyway?
  • UML model components are mapped to semantic
    concepts drawn from Enterprise Vocabulary
    sources, then registered in the Cancer Data
    Standards Repository (caDSR).
  • caDSR is a metadata registry that implements the ISO/IEC
    11179 standard for Common Data Elements (CDEs).
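
The mapping described above (a UML class attribute bound to vocabulary concepts, then registered so its meaning can be looked up) can be sketched roughly as follows. This is a minimal illustration, not the caDSR API: the class names, the in-memory registry, and the concept codes are all hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Concept:
    """A vocabulary concept. The codes used with it here are invented."""
    code: str
    preferred_name: str
    definition: str


@dataclass(frozen=True)
class CommonDataElement:
    """A UML class attribute bound to vocabulary concepts (hypothetical shape)."""
    uml_class: str
    uml_attribute: str
    class_concept: Concept
    attribute_concept: Concept
    datatype: str

    @property
    def key(self) -> str:
        return f"{self.uml_class}.{self.uml_attribute}"


class MetadataRegistry:
    """In-memory stand-in for a metadata registry; an assumption, not caDSR."""

    def __init__(self):
        self._elements = {}  # "Class.attribute" -> CommonDataElement

    def register(self, cde):
        if cde.key in self._elements:
            raise ValueError(f"{cde.key} is already registered")
        self._elements[cde.key] = cde

    def meaning_of(self, uml_class, uml_attribute):
        """Answer 'what does this attribute actually mean?' from bound concepts."""
        cde = self._elements[f"{uml_class}.{uml_attribute}"]
        return (
            f"{cde.class_concept.preferred_name} / "
            f"{cde.attribute_concept.preferred_name}: "
            f"{cde.attribute_concept.definition}"
        )


# Placeholder concepts; real codes would come from the Enterprise Vocabulary.
GENE = Concept("X-0001", "Gene", "A functional unit of heredity.")
SYMBOL = Concept("X-0002", "Symbol", "A short identifier for an entity.")
```

Registering `CommonDataElement("Gene", "symbol", GENE, SYMBOL, "string")` and then calling `meaning_of("Gene", "symbol")` returns a readable definition assembled from the bound concepts, which is the kind of lookup a registered CDE is meant to support.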

13
Description Logic
Enterprise Vocabulary
Concept Code
Relationships
Preferred Name
Definition
Synonyms
14
caCORE Software Development Kit
15
caCORE SDK Components
  • UML Modeling Tool (any with XMI export)
  • Semantic Connector (concept binding utility)
  • UML Loader (model registration in caDSR)
  • Codegen (middleware code generator)
  • Security Adaptor (Common Security Module)

caCORE SDK Generates a caBIG Silver-Compliant
System
16
caCORE Architecture
[Diagram: three tiers labeled Clients, Middleware, and Data.
Clients: HTTP clients, SOAP clients, Perl clients, Java applications.
Middleware: web application server; Java, SOAP, and XML interfaces
exposed through APIs; domain objects (Gene, Disease, Agent, etc.);
data access objects; authorization.
Data: biomedical data, Common Data Elements, Enterprise Vocabulary.]
17
From Silver to Gold: caGrid
18
Use cases not satisfied by caCORE alone
  • Advertisement: the service provider composes service
    metadata describing the service and publishes it to
    the grid.
  • Discovery: the researcher (or application developer)
    specifies search criteria describing a service of
    interest and submits the discovery request to a
    discovery service, which identifies the services
    matching the criteria and returns the list.
  • Invocation: the researcher (or application developer)
    instantiates the grid service and accesses its
    resources.
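
The three use cases above can be sketched as a toy in-process registry. This is only an illustration of the advertise, discover, invoke sequence under stated assumptions: `ServiceMetadata`, `DiscoveryService`, and their fields are hypothetical names, and nothing here reflects the actual caGrid or Globus interfaces.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ServiceMetadata:
    """Metadata a provider publishes about a service (hypothetical fields)."""
    name: str
    service_type: str      # e.g. "data" or "analytical"
    provider: str
    concepts: frozenset    # vocabulary concepts the service deals with


class DiscoveryService:
    """Toy stand-in for a grid index: holds metadata and callable handlers."""

    def __init__(self):
        self._services = {}  # service name -> (metadata, handler)

    def advertise(self, metadata, handler):
        """Advertisement: the provider publishes metadata for its service."""
        self._services[metadata.name] = (metadata, handler)

    def discover(self, service_type=None, concept=None):
        """Discovery: return metadata for services matching the criteria."""
        matches = []
        for metadata, _handler in self._services.values():
            if service_type is not None and metadata.service_type != service_type:
                continue
            if concept is not None and concept not in metadata.concepts:
                continue
            matches.append(metadata)
        return sorted(matches, key=lambda m: m.name)

    def invoke(self, name, query):
        """Invocation: call the chosen service and return its result."""
        _metadata, handler = self._services[name]
        return handler(query)
```

A client would call `discover(service_type="data", concept="GENE")`, pick a name from the returned list, and pass it to `invoke`. In a real grid the handler would be a remote endpoint rather than a local callable.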

19
[Diagram: the grid connecting NCI, five cancer centers, other caBIG
service providers, and other toolkits.]
20
caGrid 1.0 Architecture
[Diagram: caGrid 1.0 architecture.
Layer and function labels: Functions, Quality of Service, Business
Process, Semantic Service, ID Resolution, Workflow, Portal, Security,
Resource Management, Service Registry, Service Description, Service,
Grid Communication Protocol, Transport.
Components named: DORIAN, caDSR, EVS, GME, GTS, GSI, FQE, Introduce,
Grid ID, Index, GLOBUS Toolkit (GT4).]
21
Data Object Semantics, Metadata, and Schemas
  • Object oriented, APIs, well-defined data types
  • Classes defined in UML and converted into ISO/IEC
    11179, registered in the caDSR
  • Definitions drawn from Enterprise Vocabulary
    Services (EVS), relationships semantically
    described
  • XML serialization of objects adheres to XML
    schemas registered in the Global Model Exchange
    (GME)
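
As a rough illustration of the last point, the sketch below serializes a domain object to XML under a namespace and checks that namespace when reading it back. The `Gene` class, the namespace URI, and the helper functions are hypothetical. The namespace check only stands in for looking up and validating against a registered schema, since the Python standard library does not perform XML Schema validation.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, fields

# Hypothetical namespace standing in for a schema registered in a model exchange.
GENE_NAMESPACE = "urn:example:domain/gene"


@dataclass
class Gene:
    """Illustrative domain object with string-valued attributes only."""
    symbol: str
    full_name: str


def to_xml(obj, namespace):
    """Serialize a dataclass instance as one namespaced element with attributes."""
    root = ET.Element(f"{{{namespace}}}{type(obj).__name__}")
    for f in fields(obj):
        root.set(f.name, str(getattr(obj, f.name)))
    return ET.tostring(root, encoding="unicode")


def from_xml(text, cls, namespace):
    """Rebuild an object, rejecting XML that is not in the expected namespace."""
    root = ET.fromstring(text)
    expected_tag = f"{{{namespace}}}{cls.__name__}"
    if root.tag != expected_tag:
        raise ValueError(f"expected {expected_tag}, got {root.tag}")
    return cls(**{f.name: root.get(f.name) for f in fields(cls)})
```

Because both sides agree on the namespace and element layout, a client that never saw the producing code can still reconstruct the object, which is the interoperability property the registered schemas are meant to provide.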

22
Service Data Elements
  • Two types of top-level grid services defined
  • Data Services
  • Analytical Services
  • Service Data Elements (SDEs) describe services so
    clients can discover what they do

23
Integrating with other Grids
  • caGrid intentionally focused on federated data
    and analytic service interoperability, not
    computing power
  • Adoption of standard grid tooling intended to
    facilitate integration with other grids that
    focus on compute power
  • Seeking partnership with established compute
    grids to install caGrid Analytical Service nodes
    that would be transparently available to caGrid
    users

24
Acknowledgements
  • caCORE
  • Denise Warzel
  • George Komatsoulis
  • Avinash Shanbhag
  • Frank Hartel
  • Dianne Reeves
  • Sherri De Coronado
  • Gilberto Fragoso
  • SAIC
  • Terrapin Systems
  • Oracle
  • Ekagra
  • ScenPro
  • Apelon
  • MSD
  • caGrid
  • Avinash Shanbhag, NCI
  • Joel Saltz and colleagues, Ohio State U.
  • Ian Foster and colleagues, U. Chicago/Argonne
  • Booz Allen Hamilton
  • SAIC
  • SemanticBits

25
Links
  • caBIG
  • https://cabig.nci.nih.gov
  • caGrid
  • https://cabig.nci.nih.gov/News_Folder/caGrid_1.0_Beta_Release
  • caCORE
  • http://ncicb.nci.nih.gov/NCICB/infrastructure/cacore_overview