Cyberinfrastructure for Research and Education and its challenges. Dr. Sebastien Goasguen, Dr. Carol Song, Rosen Center for Advanced Computing, Purdue University, carolxsong@purdue.edu, http://www.rcac.purdue.edu - PowerPoint PPT Presentation


Transcript and Presenter's Notes


1
Cyberinfrastructure for Research and Education
and its challenges
Dr. Sebastien Goasguen
Dr. Carol Song
Rosen Center for Advanced Computing
Purdue University
carolxsong@purdue.edu
http://www.rcac.purdue.edu
(Work presented here is supported by OCI-0438246
(NMI nanoHUB) and OCI-0503992 (TeraGrid RP))
2
Highlights
  • Infrastructure building
  • Community clusters, cycle harvesting across campus,
    high-speed network links, storage capacity
  • Data collections
  • System interoperability
  • Integrate computing infrastructures
  • Integrate services
  • Enabling multidisciplinary research and education
  • Most design decisions are guided by this
    principle

3
Outline
  • TeraGrid
  • HPC through community resources and
    interoperability
  • NanoHUB Science Gateway
  • Online Simulations and Education
  • Multidisciplinary Data Management
  • Data source aggregation
  • Data Management
  • Workflow
  • Seamless integration of grids and services
  • Integration with Education (Sakai, podcast,
    Merlot)
  • NanoHUB TeraGrid, OSG
  • TG campus infrastructures

4
150M
(Charlie Catlett, TeraGrid Director, ANL)
5
TeraGrid
  • Grid Infrastructure Group (U Chicago)
  • TG integration, planning, management and
    coordination.
  • Resource Partners
  • 9 partners
  • Provide system resources, user support
  • Provide access to resources through policies,
    software and other mechanisms
  • Individual PIs access TG high performance
    computing resources through unified user support,
    coordinated software and services, and extensive
    documentation and training.

9
TG Internals
  • CTSS software stack: HPC and Grid
  • ssh using standard Unix practices
  • Globus submit, Condor-G submit
  • Wrapper scripts to get work done (compute, store,
    move data)
  • PBS, LSF, etc.: local batch system and schedulers
  • Condor talks to Globus, which talks to the
    scheduler
  • Globus-enabled resources: GT2 or GT4 WSRF
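
The chain described on this slide (Condor-G hands a job to a Globus gatekeeper, which hands it to the local batch scheduler) can be illustrated with a small sketch that only generates the text of a Condor-G submit description. This is an illustration under assumptions, not the configuration used on TeraGrid: the gatekeeper host name, script name and output file names are hypothetical, and the `universe = grid` / `grid_resource = gt2 ...` form is the syntax of current HTCondor releases, which may differ from what was in use when the talk was given.

```python
def condor_g_submit_description(gatekeeper, jobmanager, executable, arguments=""):
    """Return the text of a Condor-G submit description for a GT2 resource.

    gatekeeper: host name of the Globus gatekeeper (hypothetical in the demo)
    jobmanager: local batch system behind the gatekeeper, e.g. "pbs" or "lsf"
    """
    lines = [
        "universe = grid",
        # Condor-G talks to Globus; Globus talks to the local scheduler.
        f"grid_resource = gt2 {gatekeeper}/jobmanager-{jobmanager}",
        f"executable = {executable}",
        f"arguments = {arguments}",
        "output = job.out",
        "error = job.err",
        "log = job.log",
        "queue",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical gatekeeper and wrapper script, for illustration only.
    print(condor_g_submit_description(
        "gatekeeper.example.edu", "pbs", "run_simulation.sh", "--steps 100"))
```

The generated text would normally be saved to a file and passed to `condor_submit`; the sketch stops short of that so it can be run without any grid software installed.
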
10
OSG Internals
  • Production manager, centralized broker,
    metascheduling
  • VO accounts, no ssh access
  • Globus submit, Condor-G submit
  • Wrapper scripts to get work done (compute, store,
    move data)
  • PBS, LSF, etc.: local batch system and schedulers
  • Condor talks to Globus, which talks to the
    scheduler
  • Mostly serial applications
  • OSG software stack: distributed through VDT
11
nanoHUB online simulation and more
12
NanoHUB Middleware
13
easy access
Remote access to simulators and compute power
nanoHUB infrastructure
internet
nanoHUB.org Web site
NMI Cluster
Browser (VNC)
14
Example CNT simulation
15
NanoHUB Learning Modules
16
nanoHUB Sakai Integration for Assessment
Services
  • Assessment of learning impact is a key metric
  • Sakai Service-oriented Assessment Service
    Integration

17
SAKAI Integration Architecture
Web Svcs
nanoHUB
Framework
Application
Session based launch
SakaiLogin.jws SakaiSite.jws
SAKAI
18
Workspaces
19
nanoHUB Internals
Local virtual machines: migratable, isolated from
local infrastructure
VIOLIN Virtual Cluster
Delegated trust
Virtual Infrastructure over WAN
20
Purdue TG Data Management System
  • TeraGrid network: provides HPC and storage
    resources
  • Multidisciplinary scientific data: remote sensing,
    weather, modeling data
  • SRB middleware system developed at SDSC: provides
    distributed data management; logical and system
    attributes
  • Server-side data processing tools: OPeNDAP/THREDDS
    data server
  • Web Services interface: file query, file listing,
    metadata query, file download
  • Purdue TG data portal: JSR-168 compliant portlets,
    based on Gridsphere; uses SRB Jargon API for data
    access
21
LARS Dataset (Laboratory for Applications of
Remote Sensing)
  • Multispectral and Hyperspectral remote sensing
    images for Indiana
  • ERDAS LAN, Leica Geosystems Imagine, GeoTIFF, and
    HDF formats
  • 1972 to 2004
  • IndianaView Glovis web access
  • Part of the AmericaView initiative
  • Funded through USGS
  • Graphical Interface for viewing and downloading
    remote sensing image data
  • http://indianaview.envision.purdue.edu/glovis/index.htm

22
PTO Satellite Data (Purdue Terrestrial
Observatory)
  • GOES-GVAR sensor (L band), 3.7 m fixed antenna,
    Feb. 2005.
  • Terra-MODIS, Aqua-MODIS, NOAA-AVHRR and FY1-MVISR
    sensors (L and X band), 4.27 m tracking
    antenna, April 2006.
  • 10-node cluster data processing and visualization
    server, more than 25 different products.

23
National Weather Service Data
  • Next Generation Radar (NEXRAD) Level II data
  • 159 Weather Surveillance Radar-1988 Doppler
    (WSR-88D) sites
  • Real-time streaming, high-resolution data from
    the national network
  • Reflectivity, mean radial velocity, and spectrum
    width
  • One of the four top-level distributors
  • THREDDS/OPeNDAP data servers

24
CCSM Climate Simulation Data
  • Community Climate System Model (CCSM) to simulate
    climate change on Earth
  • Ocean, Land, and Atmospheric models
  • NetCDF format
  • OPeNDAP server provides post-processing
    functionalities
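
As an illustration of how a client might ask an OPeNDAP server for part of a NetCDF variable rather than the whole file, the sketch below builds a DAP2-style request URL with a hyperslab constraint. The server URL and the variable name `TS` are hypothetical and are not taken from the presentation; only the general `dataset.suffix?variable[start:stride:stop]` form is standard DAP2.

```python
def opendap_subset_url(dataset_url, variable, index_ranges, response="ascii"):
    """Build a DAP2 request URL for a hyperslab of one variable.

    index_ranges: one (start, stride, stop) tuple per dimension of the variable.
    response: DAP2 response suffix such as "dds", "das", "dods" or "ascii".
    """
    hyperslab = "".join(
        f"[{start}:{stride}:{stop}]" for start, stride, stop in index_ranges
    )
    return f"{dataset_url}.{response}?{variable}{hyperslab}"


if __name__ == "__main__":
    # Hypothetical dataset and variable: first time step, a 10x10 block of grid cells.
    url = opendap_subset_url(
        "http://opendap.example.edu/ccsm/run1.nc",
        "TS",
        [(0, 1, 0), (0, 1, 9), (0, 1, 9)],
    )
    print(url)
```

Subsetting on the server in this way is one kind of post-processing: only the requested slice travels over the network.
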

25
Architecture
  • Data Capture
  • Commercial vendor HW, SW
  • Data drivers to harvest and register metadata,
    ingest data to the SRB server, and normalize
    application data to standards
  • SRB (Storage Resource Broker) - SDSC
  • Stores data in logical collections, associated
    with metadata.
  • Stores raw and processed data for access
  • Metadata catalog (MCAT) at SDSC, data servers at
    Purdue.
  • Application layer: integrates applications for
    enhanced data access
  • THREDDS (Thematic Real-time Environmental
    Distributed Data Services) for Doppler radar data
  • OPeNDAP (Open-source Project for a Network Data
    Access Protocol) for climate modeling data
  • Presentation layer
  • Gridsphere-based portlets: browse, search,
    download data.
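
The idea of storing data in logical collections with associated metadata, kept separate from where the bytes physically live, can be sketched with a toy in-memory catalog. This is not the SRB, MCAT or Jargon API; every class, method, path and metadata key below is hypothetical and exists only to illustrate the ingest, list and query steps named on the slide.

```python
from dataclasses import dataclass, field


@dataclass
class DataObject:
    logical_path: str       # name users see, e.g. "/purdue/nexrad/KIND/scan1.raw"
    physical_location: str  # where the bytes actually live
    metadata: dict = field(default_factory=dict)


class ToyCatalog:
    """In-memory stand-in for a metadata catalog over logical collections."""

    def __init__(self):
        self._objects = {}

    def ingest(self, logical_path, physical_location, **metadata):
        """Register an object and the metadata harvested for it."""
        obj = DataObject(logical_path, physical_location, dict(metadata))
        self._objects[logical_path] = obj
        return obj

    def list_collection(self, collection):
        """Return logical paths stored under a collection, sorted."""
        prefix = collection.rstrip("/") + "/"
        return sorted(p for p in self._objects if p.startswith(prefix))

    def query(self, **criteria):
        """Return logical paths whose metadata matches every criterion."""
        return sorted(
            obj.logical_path
            for obj in self._objects.values()
            if all(obj.metadata.get(k) == v for k, v in criteria.items())
        )


if __name__ == "__main__":
    catalog = ToyCatalog()
    catalog.ingest("/purdue/nexrad/KIND/scan1.raw", "disk1:/a/0001",
                   source="NEXRAD", level="II")
    catalog.ingest("/purdue/ccsm/run1.nc", "disk2:/b/0002",
                   source="CCSM", format="NetCDF")
    print(catalog.list_collection("/purdue/nexrad"))
    print(catalog.query(source="CCSM"))
```

A portlet that browses, searches and downloads would sit on top of calls like `list_collection` and `query`, without needing to know the physical location of any object.
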

26
Data Access
  • Command line (SRB S-commands)
  • Sinit, Sls, Sget, Sexit
  • Web interface: MySRB
  • Windows GUI client: inQ
  • OPeNDAP/THREDDS clients
  • Purdue Environmental Data Portal
  • Web Services
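
A command-line session with the four S-commands listed above typically follows the order connect, list, fetch, disconnect. The sketch below only assembles and prints that sequence; it does not run anything. The collection and file names are hypothetical, and the argument order shown for `Sls` and `Sget` is an assumption that should be checked against the help output of the installed SRB client.

```python
import shlex


def scommand_session(collection, object_name, local_path):
    """Return the S-command invocations for fetching one object (dry run only)."""
    srb_object = f"{collection.rstrip('/')}/{object_name}"
    return [
        ["Sinit"],                         # start the SRB session
        ["Sls", collection],               # list the collection
        ["Sget", srb_object, local_path],  # copy the object to local disk
        ["Sexit"],                         # end the session
    ]


if __name__ == "__main__":
    # Hypothetical collection and file names, for illustration only.
    for argv in scommand_session("/home/demo.purdue/lars", "indiana_1972.tif",
                                 "./indiana_1972.tif"):
        print(shlex.join(argv))
```
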

27
Purdue Environmental Data Portal
28
Climate Data Processing Workflow
29
Bioscience Data Applications
30
An interoperable Infrastructure
Desktop/Rappture, other web tech (Gridsphere,
etc.)
?
31
Security Challenges of Interoperable Grid
Infrastructure
Services
SOA
Certificate Delegation Trust level
Services
32
SOA with Authorization
Attribute Server
Services
Certificate Delegation Policies Trust level
Services
Authorization Policy
33
Future nanoHUB Authentication and Authorization
Shibboleth
  1. Integrate Shib Identity Provider
  2. Strengthen authorization
  3. Interoperate with VO based authorization at RP
  4. Potential to delegate authentication

34
nanoHUB Community Credential
TG RP
Username Password
(1)
Username Shibboleth IdP Id
Mambo
(2)
<SAML> grid_proxy_init
PHP scripting
(3)
(5) Globus request
Globus Gatekeeper
Apache Web server
(4)
Exec(Condor_Submit)
Attribute-based policies
SAML authentication assertion
LDAP
Policy Information Point
Virtual Machine
nanoShib
Attribute request
SAML assertion
SAML-enabled attribute handlers for GT4
  • Extract SAML assertion from proxy
  • Query Shib AA based on SAML assertion from proxy
  • Render access control decision based on attributes
    from Shib AA
Shibboleth IdP
(6) Attributes request
(7) SAML authorization assertion
AA
Back end
Front end
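
The last step of the flow on this slide, rendering an access control decision from attributes returned by the attribute authority, can be sketched as a simple attribute-based policy check. This is a toy model, not the GT4 handler code or a SAML library: the policy format and the `nanohubGroup` attribute are hypothetical, and the attributes are assumed to have been extracted from the assertion already.

```python
def access_decision(attributes, policy):
    """Return "Permit" if every attribute the policy requires has an allowed value.

    attributes: attribute name -> list of values asserted for the user
    policy:     attribute name -> set of values that satisfy the policy
    """
    for name, allowed in policy.items():
        values = attributes.get(name, [])
        if not any(value in allowed for value in values):
            return "Deny"
    return "Permit"


if __name__ == "__main__":
    # Hypothetical policy: campus members who belong to a simulation group.
    policy = {
        "eduPersonAffiliation": {"faculty", "student", "staff"},
        "nanohubGroup": {"cnt-simulation"},
    }
    user = {
        "eduPersonAffiliation": ["student", "member"],
        "nanohubGroup": ["cnt-simulation"],
    }
    print(access_decision(user, policy))
```

Because the decision depends only on attributes, the resource provider does not need a local account for each portal user, which is the point of the community credential.
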
35
IT Org supporting 21st Century Science
Rosen Center for Advanced Computing Purdue
University
User Support
Science Gateway
Grid protocols
Security
Systems Group