The EGEE project: the new Grid Infrastructure - PowerPoint PPT Presentation

1 / 29
About This Presentation
Title:

The EGEE project: the new Grid Infrastructure

Description:

Science more digital oriented and dominated by data ... Planck satellite), Earth Observation (ozone maps, seismology, climate), Digital ... – PowerPoint PPT presentation

Number of Views:25
Avg rating:3.0/5.0
Slides: 30
Provided by: fab109
Category:

less

Transcript and Presenter's Notes

Title: The EGEE project: the new Grid Infrastructure


1
The EGEE project the new Grid Infrastructure
  • Fabrizio Gagliardi
  • EGEE Project Director

Second Annual RealityGrid Workshop , 15-16 June
2004 Royal Society, London
2
The Grid why now?
  • Networking, commodity computing and distributed
    software tools are ripe for Grid technology
  • Science more digital oriented and dominated by
    data
  • CERN networking land speed record (6.25 Gb/sec
    over 11000 Km) from California to CERN (10000
    times ADSL speed) lt 10 sec to download a DVD

3
We are ready for a new computing paradigm
4
What do we expect from the Grid?
  • Access to a world-wide virtual computing
    laboratory with almost infinite resources
  • Possibility to organize distributed scientific
    communities in VOs
  • Transparent access to distributed data and easy
    workload management
  • Easy to use application interfaces

5
Example of a Grid application Breast Cancer
Screening (I)
  • Breast Screening Programme
  • Access to remote distributed data

Courtesy of Peter Clarke
6
Example Breast Screening (II)
  • Breast Screening Programme in the Grid
  • Requires Gbit/s flows for remote access
  • Will not be possible without scheduled
    guaranteed net-services

Courtesy of Peter Clarke
7
What is EGEE ? (I)
  • EGEE (Enabling Grids for Escience in Europe) is a
    seamless Grid infrastructure for the support of
    scientific research, which
  • Integrates current national, regional and
    thematic Grid efforts, especially in HEP (High
    Energy Physics)
  • Provides researchers in academia and industry
    with round-the-clock access to major computing
    resources, independent of geographic location

8
What is EGEE ? (II)
  • 70 leading institutions in 27 countries,
    federated in regional Grids
  • 32 M Euros EU funding (2004-5), O(100 M) total
    budget
  • an ultimate combined capacity of over 20000 CPUs
    (the largest international Grid infrastructure
    ever assembled)
  • 300 persons

9
What will EGEE provide?
  • Simplified access (access to all the operational
    resources the user needs)
  • On demand computing (fast access to resources by
    allocating them efficiently)
  • Pervasive access (accessible from any geographic
    location)
  • Large scale resources (of a scale that no single
    computer centre can provide)
  • Sharing of software and data (in a transparent
    way)
  • Improved support (use the expertise of all
    partners to offer in-depth support for all key
    applications)

10
EGEE Activities
  • Emphasis on operating a production grid and
    supporting the end-users
  • 48 service activities (Grid Operations, Support
    and Management, Network Resource Provision)
  • 24 middleware re-engineering (Quality
    Assurance, Security, Network Services
    Development)
  • 28 networking (Management, Dissemination and
    Outreach, User Training and Education,
    Application Identification and Support, Policy
    and International Cooperation)

11
EGEE infrastructure
  • Access to networking services provided by GEANT
    and the NRENs
  • Production Service
  • in place (based on HEP LCG-2)
  • for production applications
  • MUST run reliably, runs only proven stable,
    debugged middleware and services
  • Will continue adding new sites in EGEE
    federations
  • Pre-production Service
  • For middleware re-engineering
  • Certification and Training/Demo testbeds

12
First EGEE infrastructure
  • Based on HEP-LCG testbed more than 50 sites
    worldwide

13
EGEE Operations
  • Operation Management Centre
  • located at CERN, coordinates operations and
    management
  • coordinates with other grid projects
  • Core Infrastructure Centres
  • behave as single organisations
  • operate infrastructure services
  • Regional Operation Centres
  • first point of contact for new users and user
    support

14
EGEE Middleware Activity
  • Middleware selected based on requirements of
    Applications and Operations
  • Harden and re-engineer existing middleware
    functionality, leveraging the experience of
    partners
  • Provide robust, supportable components
  • Support components evolution (WS-RF)

15
EGEE Middleware Implementation
  • Activity concentrated in few major centers and
    organized in Software clusters
  • Production grid service running HEP LCG-2 grid
    middleware
  • In parallel develop a next generation grid
    facility
  • migrate LCG-2 to new middleware in 2005

16
EGEE Pilot Applications (I)
  • HEP
  • Have been running large distributed computing
    systems for many years
  • Now focus on computing for LHC ? hence LCG (LHC
    computing grid project)
  • other current HEP experiments use grid technology
    (Babar,CDF,D0..)
  • LHC experiments are currently executing large
    scale data challenges (DCs)

17
EGEE Pilot Applications (II)
  • Biomedics
  • Bioinformatics (gene/proteome databases
    distributions)
  • Medical applications (screening, epidemiology,
    image databases distribution, Parallel algorithms
    for medical image processing, simulation, etc)
  • Interactive application (human supervision or
    simulation)
  • Security/privacy constraints
  • Heterogeneous data formats (genomics, proteomics,
    image formats)
  • Frequent data updates
  • Complex data sets (medical records)
  • Long term archiving requirements

18
Who else will benefit from EGEE?
  • Feed-backs from Astrophysics (EVO and Planck
    satellite), Earth Observation (ozone maps,
    seismology, climate), Digital Libraries (DILIGENT
    Project), Grid Search Engines (GRACE Project),
    Industrial applications (SIMDAT Project)
  • Interest also from Computational Chemistry (Italy
    and Czech Republic), Civil Engineering (Spain),
    and Geophysics (Switzerland and France)
    communities

19
How to access EGEE (I)
  • 0) Review information provided on the EGEE
    website (www.eu-egee.org)
  • 1) Establish contact with the EGEE applications
    group lead by Vincent Breton (breton_at_clermont.in2p
    3.fr)
  • 2) Provide information by completing a
    questionnaire describing your application
  • 3) Applications selected based on scientific
    criteria, Grid added value, effort involved in
    deployment, resources consumed/contributed etc.

20
How to access EGEE (II)
  • 4) Follow a training session
  • 5) Migrate application to EGEE infrastructure
    with the support of EGEE BMI technical experts
  • 6) Initial deployment for testing purposes
  • 7) Production usage (contribute computing
    resources for heavy production demands)

21
Moving your application to EGEE (I)
  • Data Intensive
  • Access to diverse data sources (format,
    read/write, location etc.)
  • Quantity of data
  • Compute Intensive
  • EGEE attracts mostly farms of commodity PCs
  • MPI available for distributed applications at
    many sites
  • Interface to DEISA for application migration is
    under discussion
  • Interfaces
  • Standard interfaces provided (e.g. APIs, GENIUS
    portal)
  • Application specific interfaces can be linked to
    the infrastructure (DEVASPIM, HKIS, BioGrid)
  • Interactivity

22
Moving your application to EGEE (II)
  • Security
  • Infrastructure can help control access to sites,
    data, network and information
  • EGEE sites are administered/owned by different
    organisations
  • Sites have ultimate control over how their
    resources are used
  • Limiting the demands of your application will
    make it acceptable to more sites and hence make
    more resources available to you

23
Security Intellectual Property (I)
  • The existing EGEE grid middleware is distributed
    under an Open Source License developed by EU
    DataGrid
  • No restriction on usage (scientific or
    commercial) beyond acknowledgement
  • Same approach for new middleware
  • Application software maintains its own licensing
    scheme
  • Sites must obtain appropriate licenses before
    installation

24
Security Intellectual Property (II)
  • For applications that must operate in a closed
    environment EGEE middleware can be download and
    installed on closed infrastructures

25
EGEE and Industry
  • Industry as a partner - Through collaboration
    with individual EGEE partners, industry has the
    opportunity to participate in specific
    activities, thereby increasing know-how on Grid
    technologies.
  • Industry as a user - As part of the networking
    activities, specific industrial sectors will be
    targeted as potential users of the installed Grid
    infrastructure, for RD applications.
  • Industry as a provider - Building a production
    quality Grid will require industry involvement
    for long-term maintenance of established Grid
    services, such as call centres, support centres
    and computing resource provider centres

26
EGEE Industry Forum
  • EGEE Industry Forum
  • raise awareness of the project in industry to
    encourage industrial participation in the project
  • foster direct contact of the project partners
    with industry
  • ensure that the project can benefit from
    practical experience of industrial applications
  • For more info
  • www.eu-egee.org

27
EGEE Plans
  • two-year project conceived as part of a four-year
    programme
  • resources and user groups will rapidly expand
    during the course of the project
  • 3000 users active from at least five disciplines
    by the end of the second year
  • from over 3000 CPUs at the outset of the project
    to over 8000 by the end of the second year
  • A second two-year project is anticipated to
    follow on from EGEE, in which industry will
    progressively take up operations and maintenance

28
Conclusions
  • EGEE is the first attempt to build an
    international and worldwide Grid infrastructure
    for data intensive science
  • Similar aim to NSF CyberInfrastructure initiative
    in the US
  • EU playing a pioneering role with a substantial
    first two year funding
  • Important to develop a long term support strategy

29
Further info
  • EU EGEE www.eu-egee.org
  • EU DataGrid www.eu-edg.org
  • Other Grid projects - www.gridstart.org
  • The Grid - www.gridcafe.org
Write a Comment
User Comments (0)
About PowerShow.com