SSS Deployment using OSCAR - PowerPoint PPT Presentation

About This Presentation
Title:

SSS Deployment using OSCAR

Description:

John Mugler, Thomas Naughton & Stephen Scott. May 2005, Argonne, IL SSS ... reduces time/learning curve ... Validation, 3) Durability/Stress, 4) Performance ... – PowerPoint PPT presentation

Number of Views:51
Avg rating:3.0/5.0
Slides: 14
Provided by: csmo
Learn more at: https://www.csm.ornl.gov
Category:

less

Transcript and Presenter's Notes

Title: SSS Deployment using OSCAR


1
SSS Deployment using OSCAR
May 2005, Argonne, IL

SSS Face-to-face meeting
  • John Mugler, Thomas Naughton Stephen Scott

2
OSCAR Cluster Toolkit
  • Framework for cluster management
  • simplifies installation, configuration and
    operation
  • reduces time/learning curve for cluster build
  • requires pre-installed headnode w. supported
    Linux distribution
  • thereafter wizard guides user thru setup/install
    of entire cluster
  • Package-based framework
  • Content Software Configuration, Tests, Docs
  • Types
  • Core SIS, C3, Switcher, ODA, OPD, (Support
    Libs)
  • Non-core selected third-party
  • Access repositories accessible via OPD/OPDer

3
OSCAR Wizard
OSCAR-3.0 release
4
Using OSCAR for SSS
  • Problem Helping users obtain and install SSS
    software.
  • Solution Leverage OSCAR framework to package and
    distribute the SSS suite, sss-oscar.
  • sss-oscar ? A release of OSCAR containing all
    SSS software in single downloadable bundle.

5
OSCAR-ized SSS Components
  • Bamboo Queue/Job Manager
  • BLCR Berkeley Checkpoint/Restart
  • Gold Accounting Allocation Management System
  • LAM/MPI (w/ BLCR) Checkpoint/Restart enabled
    MPI
  • MAUI-SSS Job Scheduler
  • SSSLib SSS Communication library
  • Includes SD, EM, PM, BCM, NSM, NWI
  • Warehouse Distributed System Monitor
  • MPD2 MPI Process Manager

As of May 2005
6
Current Status
  • Released v1.0 at SC04
  • Based on oscar-3.0 (using Red Hat 9/x86)
  • All SSS components represented
  • Testing for v1.1 release
  • Small update release
  • Still oscar-3.0 based
  • Synchronize with OSCAR release schedule
  • oscar-4.1 released
  • Shift to oscar-4.1 in sss-oscar-1.2 release
    (2Q2005)

7
OSCAR v4.1 Highlights
  • SSSs APItest tool integrated into v4.1 release
  • Improved use of DepMan/PackMan abs. layer
  • Distributions supported in v4.1
  • x86 RH 9, FC2, MDK 10.0
  • x86 ia64 RH EL 3
  • Initial work started for Debian ?
  • Not in v4.1 release but working with 4.x devel
    tree

8
TODO SSS
  • Short term
  • Complete testing for v1.1beta release
  • Update SSS documentation
  • Medium term
  • Migrate to new FRE testbed and repository
    (pending approval)
  • New/more Linux distribution/architecture/kernel
    support
  • Longer term
  • Extend SSS component tests
  • 1) Installation, 2) Validation, 3)
    Durability/Stress, 4) Performance
  • Track oscar-4.x releases for v5.0 compatibility
  • Distribute as OSCAR Package Set
  • Pending feature support in OSCAR
  • OPKG ordering within a phase
  • Pending feature support in OSCAR

9
SSS-OSCAR Release Schedule
Add features to Tracker _at_ http//sf.net/projects/s
ss-oscar/
10
Roadmap
  • 1.2 (frz jun, rel jul)
  • Fedor Core 2 / Pkg rebuild
  • BLCR upgrade to linux-2.6
  • Improved install/validation tests
  • oscar-4.1 opkg modifications (updates)
  • Updates to HOWTO as needed
  • Simplify XML meta file
  • Close (most) open tracker issues
  • 2.0 (frz aug, rel sep)
  • LRS change over
  • Fedora Core 4 / Pkg rebuild
  • Improved install/validation tests
  • Add performance/stress tests?
  • oscar-4.x opkg modifications (updates)
  • Updates to HOWTO as needed
  • Meta-scheduler (Silver)?
  • 2.0.1 (frz oct, rel nov) SC05
  • Any bugfixes/minor updates
  • 2.02

11
Goals for sss-oscar-2.0
  • Release v2.0 at SC05
  • Compatible with oscar-5.0
  • Support current Linux distribution(s)
  • Improve interoperability with standard OSCAR
  • Users obtain via SSS OSCAR Pkg Repository
  • Likely leverage Package Sets for logical
    grouping
  • Clarify SSS package dependencies
  • What about outside of SSS-OSCAR?
  • Improved testing
  • Supply thorough installation/validation/performanc
    e tests
  • Documentation
  • Specifications for component interfaces
    (schemas), etc.

12
Comments/Discussion
  • Provide a lower cost of entry
  • Doc to help knit system together
  • Clarify dependencies/interactions
  • Intra-component and inter-component
  • Feedback to help Ron O. for testing/validation
  • Tests to verify against component specs.
  • Ex. The PM specs state X capability it work in
    this build
  • Effectively conformance tests to optional SSS
    specs.
  • What do we need to help coming releases?
  • Louder drum for Thomas?
  • Dedicated integration periods (face-to-face
    and/or virtual)?

13
Resources
  • ORNL test clusters
  • Systems sss-xtorc, test1, test2
  • Access via ORNL SSH Login Server
  • Must do reservations/coordinate use (Note, no
    remote power mgmt)
  • Investigating ORNL FRE (enclaves)
  • Add testX system to alleviate ORNL SSH Login
    Server
  • SSS-OSCAR Project page
  • Hosted at http//sourceforge.net/projects/sss-osca
    r/
  • OSCAR Homepage
  • http//www.OpenClusterGroup.org/OSCAR/
  • Includes HOWTO Create an OSCAR Package document

14
Validation Testing
Working Components and Interfaces (bold) Spr03
Color code
BCWG
PMWG
RMWG
VIWG
N/A
15
OSCAR Features Slated for 4.x/5.0
  • v4.1 (freeze Feb 15)
  • Integrate APItest
  • Smarter RPM uninstall (PackageInUn)
  • oscar-release RPM
  • Intel Compilers OPkg
  • v4.3 (freeze Aug 15)
  • Smarter RPM uninstall (NEST)
  • Support for Package Sets
  • Debian/x86 support?
  • VServer testing harness
  • v4.2 (freeze May 15)
  • New DB schema
  • New DB API/Library (v1)
  • NEST for OPkg mgmt
  • Debian PackMan/DepMan
  • v5.0 (freeze Oct 10)
  • LibOPkg available
  • SGE
  • Fedora Core 3/4, x86, x86-64

This list is speculative and only highlights
items that would likely help/effect SSS releases.
16
Tentative OSCAR Release Schedule
Write a Comment
User Comments (0)
About PowerShow.com