Condor - A Project and a System - PowerPoint PPT Presentation

About This Presentation
Title:

Condor - A Project and a System

Description:

face software/middleware engineering challenges in a UNIX/Linux/Windows/OS X environment, ... Tevfik Kosar went to LSU. Sonny (Sechang) Son went to NetApp ... – PowerPoint PPT presentation

Number of Views:37
Avg rating:3.0/5.0
Slides: 34
Provided by: Miron1
Category:

less

Transcript and Presenter's Notes

Title: Condor - A Project and a System


1
Welcome!!!
Condor Week 2006
2
1986-2006Celebrating 20 years since we first
installed Condor in our department
3
The Condor Project (Established 85)
  • Distributed Computing research performed by a
    team of 40 faculty, full time staff and students
    who
  • face software/middleware engineering challenges
    in a UNIX/Linux/Windows/OS X environment,
  • involved in national and international
    collaborations,
  • interact with users in academia and industry,
  • maintain and support a distributed production
    environment (more than 3800 CPUs at UW),
  • and educate and train students.
  • Funding DOE, NIH, NSF, INTEL,
  • Micron, Microsoft and the UW Graduate School

4
Excellence
S u p p o r t
Software Functionality
Research
5
  • Since the early days of mankind the primary
    motivation for the establishment of communities
    has been the idea that by being part of an
    organized group the capabilities of an individual
    are improved. The great progress in the area of
    inter-computer communication led to the
    development of means by which stand-alone
    processing sub-systems can be integrated into
    multi-computer communities.

Miron Livny, Study of Load Balancing Algorithms
for Decentralized Distributed Processing
Systems., Ph.D thesis, July 1983.
6
Main Threads of Activities
  • Distributed Computing Research develop and
    evaluate new concepts, frameworks and
    technologies
  • The Open Science Grid (OSG) build and operate a
    national distributed computing and storage
    infrastructure
  • Keep Condor flight worthy and support our users
  • The NSF Middleware Initiative (NMI) develop,
    build and operate a national Build and Test
    facility
  • The Grid Laboratory Of Wisconsin (GLOW) build,
    maintain and operate a distributed computing and
    storage infrastructure on the UW campus

7
Downloads per month
900
X86/Linux
600
X86/Windows
8
Condor-Users Messages per month
Condor Team Contributions
9
The past year
  • Two Ph.D students graduated
  • Tevfik Kosar went to LSU
  • Sonny (Sechang) Son went to NetApp
  • Three staff members left to start graduate
    studies
  • Released Condor 6.6.9-.11
  • Released Condor 6.7.6-.18
  • Contributed to the formation of the Open Science
    Grid (OSG) consortium and the OSG Facility
  • Interfaced Condor with BOINC
  • Started the NSF funded CondorDB project
  • Released Virtual Data Toolkit (VDT) 1.3.3-.10
  • Distributed five instances of the NSF Middleware
    Initiative (NMI) Build and Test facility

10
(No Transcript)
11
The search for SUSY
  • Sanjay Padhi is a UW Chancellor Fellow who is
    working at the group of Prof. Sau Lan Wu at CERN
  • Using Condor Technologies he established a grid
    access point in his office at CERN
  • Through this access-point he managed to harness
    in 3 month (12/05-2/06) more that 500 CPU years
    from the LHC Computing Grid (LCG) the Open
    Science Grid (OSG) and UW Condor resources

12
(No Transcript)
13
(No Transcript)
14
(No Transcript)
15
(No Transcript)
16
(No Transcript)
17
(No Transcript)
18
GAMS/Grid CPLEX
  • Work of Prof. Michael Ferris from the
    optimization group in our department
  • Commercial modeling system abundance of real
    life models to solve
  • Any model types allowed
  • Scheduling problems
  • Radiotherapy treatment planning
  • World trade (economic) models
  • New decomposition features facilitate use of
    grid/condor solution
  • Mixed Integer Programs can be extremely hard to
    solve to optimality
  • MIPLIB has 13 unsolved examples

19
Tool and expertise combined
  • Various decomposition schemes coupled with
  • Fastest commercial solver - CPLEX
  • Shared file system / condor_chirp for
    inter-process communication
  • Sophisticated problem domain branching and cuts
  • Takes over 1 year of computation and goes nowhere
    but knowledge gained!
  • Adaptive refinement strategy
  • Dedicated resources
  • Timtab2 and a1c1s1 problems solved to
    optimality (using over 650 machines running tasks
    each of which take between 1 hour and 5 days)

20
Function Shipping,Data Shipping,or maybe
simplyObject Shipping?
21
Customer requestsPlace y_at_S at L!System
delivers.
22
Basic step for y_at_S?L
  1. Allocate size(y) at L,
  2. Allocate resources (disk bandwidth, memory, CPU,
    outgoing network bandwidth) on S
  3. Allocate resources (disk bandwidth, memory, CPU,
    incoming network bandwidth) on L
  4. Match S and L

23
Or in other words, it takes two (or more) to
Tango (or to place data)!
24
When the source plays niceit asks for
permission to place data at destinationin
advance
25
MatchMaker
Match!
Match!
I am S and am looking for L to place a file
I am L and I have what it takes to place a file
26
The SC05 effortJoint with the Globus GridFTP
team
27
Stork controls number of outgoing connections
Destination advertises incoming connections
28
A Master Workerview of the same effort
29
Master
Files
Worker
For Workers
30
When the source does not play nice,
destination must protect itself
31
NeST
  • Manages storage space and connections for a
    GridFTP server with commands like
  • ADD_NEST_USER
  • ADD_USER_TO_LOT
  • ATTACH_LOT_TO_FILE
  • TERMINATE_LOT

32
GridFTP
Chirp
33
Thank you for building such
a wonderful community
Write a Comment
User Comments (0)
About PowerShow.com