Distributed Monitoring and Information Services for the Grid - PowerPoint PPT Presentation

1 / 55
About This Presentation
Title:

Distributed Monitoring and Information Services for the Grid

Description:

Defines a subscription/notification interface for accessing resource property information ... WS-N for query and subscription. 2) Common configuration mechanism ... – PowerPoint PPT presentation

Number of Views:75
Avg rating:3.0/5.0
Slides: 56
Provided by: Carl1173
Category:

less

Transcript and Presenter's Notes

Title: Distributed Monitoring and Information Services for the Grid


1
Distributed Monitoring and Information Services
for the Grid
  • Jennifer M. Schopf
  • Argonne National Laboratory
  • NeSC
  • Dec 6, 2005

2
What is a Grid
  • Resource sharing
  • Computers, storage, sensors, networks,
  • Sharing always conditional issues of trust,
    policy, negotiation, payment,
  • Coordinated problem solving
  • Beyond client-server distributed data analysis,
    computation, collaboration,
  • Dynamic, multi-institutional virtual orgs
  • Community overlays on classic org structures
  • Large or small, static or dynamic

3
Why is this hard/different?
  • Lack of central control
  • Where things run
  • When they run
  • Shared resources
  • Contention, variability
  • Communication
  • Different sites implies different sys admins,
    users, institutional goals, and often strong
    personalities

4
So why do it?
  • Computations that need to be done with a time
    limit
  • Data that cant fit on one site
  • Data owned by multiple sites
  • Applications that need to be run bigger, faster,
    more

5
What Is Grid Monitoring?
  • Sharing of community data between sites using a
    standard interface for querying and notification
  • A way to discover what services and resources are
    available to use
  • A way to understand the status/attributes of
    those services
  • A system to warn you when things fail

6
Monitoring Use cases
  • PPGD/GriPhyN/iVDGL monitoring group (2002-2004)
    found roughly 4 categories
  • Health of system (NW, servers, cpus, etc)
  • Resource selection
  • System upgrade evaluation (have systems reached
    capacity)
  • Application-specific progress tracking
  • First three types need roughly the same
    information
  • Fourth is user-specific and application specific
    no general solution yet
  • http//www.mcs.anl.gov/jms/pg-monitoring

7
Health of the SystemIs the Grid up?
  • Brief Description
  • User of a grid replication service finds actions
    are much slower than normal
  • Not sure if problem is with network, disk, CPU
    end points, or something inbetween
  • Need archive data for historical, current
    streaming for comparison
  • Performance events/sensors required
  • Host monitoring - CPU,memory, disk
  • Network path monitoring - bw, lat., traceroute
  • GridFTP monitoring
  • TCP stack monitoring (web 100)
  • Possibly switch/router monitoring
  • May want different data for user vs sys admins

8
Resource Selection
  • Brief Description
  • User/Broker wants to decide where to run a job
  • Sites advertise cluster information for
    grid-level scheduling decisions
  • Also need data about storage locations and access
    speeds
  • Information must be summarized for advertising to
    Grid, scalability is key issue
  • Performance events/sensors required
  • Static number of compute nodes, cpu type and
    speed, OS, installed sw, available storage
    systems
  • DynamicQueue lengths, large file transfer times

9
What shouldmonitoring systems look like?
  • All sensors must be non-intrusive
  • All data is small, and must be as timely as
    possible
  • All data must be kept for a long time (years),
    and must be accessible in many ways
  • No one really knows how many sensors will be
    accessed at one time (or reporting to a higher
    level service), or how often they will be
    accessed
  • Security isnt of concern YET except for job
    data

10
Monitoring Systems (2)
  • Line between monitoring system and higher level
    services isnt always clear
  • Archiving
  • Summary statistics
  • Predictions
  • Error detection
  • Alarms/notification

11
OUTLINE
  • Grid Monitoring and Use Cases
  • MDS4
  • Index Service
  • Trigger Service
  • Information Providers
  • Deployments
  • Metascheduling data for TeraGrid
  • Service failure warning for ESG
  • Performance Numbers

12
What is MDS4?
  • Grid-level monitoring system used most often for
    resource selection
  • Aid user/agent to identify host(s) on which to
    run an application
  • Uses standard interfaces to provide publishing of
    data, discovery, and data access, including
    subscription/notification
  • WS-ResourceProperties, WS-BaseNotification,
    WS-ServiceGroup
  • Part of the Globus Toolkit v4
  • Functions as an hourglass to provide a common
    interface to lower-level monitoring tools

13
Information Users Schedulers, Portals, Warning
Systems, etc.
WS standard interfaces for subscription,
registration, notification
GLUE Schema Attributes (cluster info, queue info,
FS info)
14
Web ServiceResource Framework (WS-RF)
  • Defines standard interfaces and behaviors for
    distributed system integration, especially (for
    us)
  • Standard XML-based service information model
  • Standard interfaces for push and pull mode access
    to service data
  • Notification and subscription

15
MDS4 UsesWeb Service Standards
  • WS-ResourceProperties
  • Defines a mechanism by which Web Services can
    describe and publish resource properties, or sets
    of information about a resource
  • Resource property types defined in services WSDL
  • Resource properties can be retrieved using
    WS-ResourceProperties query operations
  • WS-BaseNotification
  • Defines a subscription/notification interface for
    accessing resource property information
  • WS-ServiceGroup
  • Defines a mechanism for grouping related
    resources and/or services together as service
    groups

16
MDS4 Components
  • Higher level services
  • Index Service a way to aggregate data
  • Trigger Service a way to be notified of changes
  • Both built on common aggregator framework
  • Information providers
  • Monitoring is a part of every WSRF service
  • Non-WS services can also be used
  • Clients
  • WebMDS
  • All of the tool are schema-agnostic, but
    interoperability needs a well-understood common
    language

17
MDS4 Index Service
  • Index Service is both registry and cache
  • Subscribes to information providers
  • Publishes (as resource properties)
  • Datatype and data provider info, like a registry
  • Last value of data, like a cache
  • In memory default approach, DB backing store
    currently being developed to allow for very large
    indexes
  • Soft-state registration
  • Can be set up for a site or set of sites, a
    specific set of project data, or for
    user-specific data only
  • Can be a multi-rooted hierarchy

18
Index Service Facts 1
  • No single global Index provides information about
    every resource on the Grid
  • No person in the world is part of every VO!
  • Hierarchies or special purpose indexs are common
  • Each virtual organization will have different
    policies on who can access its resources
  • The presence of a resource in an Index makes no
    guarantee about the availability of the resource
    for users of that Index
  • Ultimate decision about whether to use the
    resources is left to direct negotiation between
    user and rsc
  • MDS does not need to keep track of policy
    information (something that is hard to do
    concisely)
  • Rscs do not need to reveal their policies publicly

19
Index Service Facts 2
  • MDS has a soft consistency model
  • Published information is recent, but not
    guaranteed to be the absolute latest
  • Load caused by information updates is reduced at
    the expense of having slightly older information
  • Free disk space on a system 5 minutes ago rather
    than 2 seconds ago.
  • Each registration into an Index Service is
    subject to soft-state lifetime management
  • All registrations has expiry times and must be
    periodically renewed
  • Index is self-cleaning, since outdated entries
    disappearing automatically

20
MDS4 Trigger Service
  • Subscribe to a set of resource properties
  • Evaluate that data against a set of
    pre-configured conditions (triggers)
  • When a condition matches, email is sent to
    pre-defined address
  • Similar functionality in Hawkeye

21
Aggregator Framework
  • General framework for building services that
    collect and aggregate data
  • Index and Trigger service both use this
  • 1) Common interface implemention
  • Java class that implements an interface to
    collect XML-formatted data from information
    providers
  • Implements WS-RP and WS-N for query and
    subscription
  • 2) Common configuration mechanism
  • Maintain information about which information
    providers to use and their associated parameters
  • Specify what data to get, and from where
  • 3) Services are self-cleaning
  • Each registration has a lifetime
  • If a registration expires without being
    refreshed, it and its associated data are removed
    from the server

22
Aggregator Framework
  • General framework for building services that
    collect and aggregate data
  • Index and Trigger service both use this
  • 1) Collect information via aggregator sources
    (information providers)
  • Java class that implements an interface to
    collect XML-formatted data
  • Query source uses WS-ResourceProperty mechanisms
    to poll a WSRF service
  • Subscription source collects data from a service
    via WS-Notification subscription/notification
  • Execution source executes an administrator-supplie
    d program to collect information

23
Aggregator Framework (cont)
  • 2) Common configuration mechanism
  • Maintain information about which aggregator
    sources to use and their associated parameters
  • Specify what data to get, and from where
  • 3) Aggregator services are self-cleaning
  • Each registration has a lifetime
  • If a registration expires without being
    refreshed, it and its associated data are removed
    from the server.

24
Aggregator Framework
25
Information Providers
  • Data sources for the higher level services (eg.
    Index, Trigger)
  • WSRF-compliant service
  • WS-ResourceProperty for Query source
  • WS-Notification mechanism for Subscription source
  • Other services/data sources
  • Executable program that obtains data via some
    domain-specific mechanism for Execution source.

26
Information ProvidersCluster and Queue Data
  • Interfaces to Hawkeye, Ganglia, CluMon
  • Not WS so these are Execution Sources
  • Basic host data (name, ID), processor
    information, memory size, OS name and version,
    file system data, processor load data
  • Some condor/cluster specific data
  • Interfaces to PBS, Torque LSF queue system
  • Queue information, number of CPUs available and
    free, job count information, some memory
    statistics and host info for head node of cluster

27
Information ProvidersGT4 Services
  • Every WS built using GT4 core
  • ServiceMetaDataInfo element includes start time,
    version, and service type name
  • Reliable File Transfer Service (RFT)
  • Service status data, number of active transfers,
    transfer status, information about the resource
    running the service
  • Community Authorization Service (CAS)
  • Identifies the VO served by the service instance
  • Replica Location Service (RLS)
  • Note not a WS
  • Location of replicas on physical storage systems
    (based on user registrations) for later queries

28
Sample Deployment
29
WebMDS User Interface
  • Web-based interface to WSRF resource property
    information
  • User-friendly front-end to the Index Service
  • Uses standard resource property requests to query
    resource property data
  • XSLT transforms to format and display them
  • Customized pages are simply done by using HTML
    form options and creating your own XSLT
    transforms
  • Sample page
  • http//mds.globus.org8080/webmds/webmds?infoinde
    xinfoxslservicegroupxsl

30
WebMDS Service
31
(No Transcript)
32
(No Transcript)
33
(No Transcript)
34
Any questions before I walk through two current
deployments?
  • Grid Monitoring and Use Cases
  • MDS4
  • Index Service
  • Trigger Service
  • Information Providers
  • Deployments
  • Metascheduling Data for TeraGrid
  • Service Failure warning for ESG
  • Performance Numbers

35
Working with TeraGrid
  • Large US project across 9 different sites
  • Different hardware, queuing systems and lower
    level monitoring packages
  • Starting to explore MetaScheduling approaches
  • GRMS (Poznan)
  • W. Smith (TACC)
  • K. Yashimoto (SDSC)
  • User Portal
  • Need a common source of data with a standard
    interface for basic scheduling info

36
Cluster Data
  • Provide data at the subcluster level
  • Sys admin defines a subcluster, we query one node
    of it to dynamically retrieve relevant data
  • Can also list per-host details
  • Interfaces to Ganglia, Hawkeye and CluMon
    available now
  • Nagios should be set by late January

37
Cluster Info
  • UniqueID
  • Benchmark/Clock speed
  • Processor
  • MainMemory
  • OperatingSystem
  • Architecture
  • Number of nodes in a cluster/subcluster
  • TG specific Node properties
  • StorageDevice
  • Disk names, mount point, space available

38
Data to collect Queue info
  • Interface to PBS (Pro, Open, Torque), LSF
  • LRMSType
  • LRMSVersion
  • DefaultGRAMVersion and port and host
  • TotalCPUs
  • Status (up/down)
  • TotalJobs (in the queue)
  • RunningJobs
  • WaitingJobs
  • FreeCPUs
  • MaxWallClockTime
  • MaxCPUTime
  • MaxTotalJobs
  • MaxRunningJobs

39
How will the data be accessed?
  • Java and command line APIs to a common TG-wide
    Index server
  • Alternatively each site can be queried directly
  • One common web page for TG
  • http//snipurl.com/j24r
  • Query page is next!

40
(No Transcript)
41
Status
  • Currently have a demo system up
  • Queuing data from SDSC and NCSA
  • Cluster data using CluMon interface at NCSA
  • Basic WebMDS interface
  • Being deployed more widely for TeraGrid this week
  • General patch for 4.0.1 deployments should be
    available next week let me know if youre
    interested!

42
ESG use of MDS4 Trigger Service
  • Need a way to notify system administrators and
    users what the status of their services are
  • In aprticular, interested in
  • Replica Locatoin Service (RLS)
  • Storage Resource Manager service (SRM)
  • OpenDAP
  • Web Server (HTTP)
  • GridFTP fileservers

43
Trigger Service and ESG Cont.
  • The Trigger service periodically checks to see if
    services are up and running
  • If a service is gone down or is unavailable for
    any reason, an action script is executed
  • Sends email to administrators
  • Update portal status page
  • Been in use for over a year (used GT3 version
    previously)

44
(No Transcript)
45
OUTLINE
  • Grid Monitoring and Use Cases
  • MDS4
  • Index Service
  • Trigger Service
  • Information Providers
  • Deployments
  • Metascheduling Data for TeraGrid
  • Service Failure warning for ESG
  • Performance Numbers

46
Index Server Stability 4.0.0
  • Zero-entry index on same server
  • Ran queries against it for 8,338,435 seconds
    (just over 96 days)
  • Server machine needed to be rebuilt for patches
  • Processed 623,395,877requests
  • Avg 74 per second
  • Average query round-trip time of 13ms
  • No noticeable performance or usability
    degradation over the entire duration of the test

47
4.0.1 Index Stability
  • 100-entry index on same server, running just over
    47 days
  • 190K of data has been retrieved
  • Processed over 20 million requests, averaging 5
    per second
  • No noticeable performance or usability
    degradation.

48
Scalability Experiments
  • MDS index
  • Dual 2.4GHz Xeon processors, 3.5 GB RAM
  • Sizes 1, 10, 25, 50, 100
  • Clients
  • 20 nodes also dual 2.6 GHz Xeon, 3.5 GB RAM
  • 1, 2, 3, 4, 5, 6, 7, 8, 16, 32, 64, 128, 256,
    384, 512, 640, 768, 800
  • Nodes connected via 1Gb/s network
  • Each data point is average of 8 minutes
  • Ran for 10 mins but first 2 spent getting clients
    up and running
  • Error bars are SD over 8 mins
  • Experiments by Ioan Raicu, U of Chicago, using
    DiPerf

49
(No Transcript)
50
(No Transcript)
51
Performance
  • Is this enough?
  • We dont know!
  • Currently gathering up usage statistics to find
    out what people need
  • Bottleneck examination
  • In the process of doing in depth performance
    analysis of what happens during a query
  • MDS code, implementation of WS-N, WS-RP, etc
  • Goal- HPDC submission (early January)

52
Summary
  • MDS4 is a WS-based Grid monitoring system that
    uses current standards for interfaces and
    mechanisms
  • Available as part of the GT4 release
  • Currently in use for resource selection and fault
    notification
  • Initial performance results arent awful we
    need to do more work to determine bottlenecks

53
Where do we go next?
  • Extend MDS4 information providers
  • More data from GT4 WS
  • GRAM, RFT, CAS
  • More data from GT4 non-WS components
  • RLS, GridFTP
  • Interface to other data sources
  • Inca, GRASP
  • Interface to archivers
  • PinGER, NetLogger
  • Additional scalability testing and development
  • Additional clients

54
Other Possible HigherLevel Services
  • Archiving service
  • The next high leverl service well build
  • Looking at Xindice as a possibility
  • Site Validation Service (ala Inca)
  • Prediction service (ala NWS)
  • What else do you think we need?

55
Contributing to MDS4
  • Globus is opening up its development environment
    similar to Apache Jakarta
  • MDS4 will be a project in the new scheme
  • Contact me for more details
  • jms_at_mcs.anl.gov
  • http//dev.globus.org

56
Thanks
  • MDS4 Team Mike DArcy (ISI), Laura Pearlman
    (ISI), Neill Miller (UC), Jennifer Schopf (ANL)
  • Students Ioan Raicu, Xuehai Zhang
  • This work was supported in part by the
    Mathematical, Information, and Computational
    Sciences Division subprogram of the Office of
    Advanced Scientific Computing Research, U.S.
    Department of Energy, under contract
    W-31-109-Eng-38, and NSF NMI Award SCI-0438372.
    This work also supported by DOESG SciDAC Grant,
    iVDGL from NSF, and others.

57
For More Information
  • Jennifer Schopf
  • Jms_at_mcs.anl.gov
  • http//www.mcs.anl.gov/jms
  • Globus Toolkit MDS4
  • http//www.globus.org/toolkit/mds
  • Monitoring and Discovery in a Web Services
    Framework Functionality and Performance of the
    Globus Toolkit's MDS4
  • www.mcs.anl.gov/jms/Pubs/mds-sc05.pdf

58
CLADE 2005Challenges of Large Applicationsin
Distributed Environments Workshop
  • In conjunction with HPDC 2005
  • June 19 or 20, Paris (papers due Feb 1)
  • If you have a large scale application and would
    like to report on
  • Results on the development, deployment,
    management and evaluation of aps
  • Ways your application has benefited from
  • Innovative resource management or scheduling
  • The use of extremely large data sets and data
    mgmt
  • Runtime support for intelligent, adaptive systems
  • Portability, quality of service, or
    fault-tolerance Performance analysis, evaluation,
    and prediction of adaptive systems
  • http//www.mcs.anl.gov/bair/CLADE2006/

59
(No Transcript)
60
Site 1
1. Resources at Sites
Rsc 1.a
Site 3
Rsc 2.a
Rsc 3.a
Rsc 2.b
Rsc 1.d
61
Site 1
2. Site Index Setup
Rsc 1.a
Site 3
Rsc 2.a
Rsc 3.a
Rsc 2.b
Rsc 1.d
62
Site 1
3. VO Index Setup
Rsc 1.a
Site 3
Rsc 2.a
Rsc 3.a
Rsc 2.b
Rsc 1.d
63
WebMDS
Trigger Service
Site 1
Rsc 1.a
Site 3
Rsc 2.a
Rsc 3.a
Rsc 2.b
Rsc 1.d
64
WebMDS
4. Application Index Setup
Site 1
Rsc 1.a
Site 3
Rsc 2.a
Rsc 3.a
Rsc 2.b
Rsc 1.d
65
With this deployment, the project can
  • Discover needed data from services in order to
    make job submission or replica selection
    decisions by querying the VO-wide Index
  • Evaluate the status of Grid services by looking
    at the VO-wide WebMDS setup
  • Be notified when disks are full or other error
    conditions happen by being on the list of
    administrators
  • Individual projects can examine the state of the
    resources and services of interest to them
Write a Comment
User Comments (0)
About PowerShow.com