1
Infrastructure for CMS Production Runs on
NCSA/Alliance Resources: A Prototype
  • Vladimir Litvin, Caltech HEP
  • Scott Koranda, NCSA and University of
    Wisconsin-Milwaukee

2
Why build this prototype now?
  • NCSA/Alliance wants to help US CMS use Alliance
    resources as efficiently as possible (of course)
  • but a successful distributed run is also an
    important part of the upcoming NCSA proposal for
    the DTF
  • given roughly two weeks, start to finish, to make
    the success story happen!

3
Distributed Terascale Facility
  • Alliance and NPACI proposal to NSF (April 19):
    NCSA (UIUC), SDSC, Argonne, Caltech linked at
    OC-192
  • To deploy a DTF based on Linux clusters,
    large-scale data archives and high-bandwidth
    national networks
  • Atop the DTF hardware, deploy a TeraGrid: a new
    unified model of distributed data analysis,
    computing and communication for science
  • Integration partners: IBM, Intel, Qwest
  • Four complementary foci:
  • Computing-intensive applications (NCSA):
    6 TF IA-64, Myrinet, > 100 TB disk, 1 PB tertiary
  • Data-intensive applications (SDSC):
    4 TF Linux cluster, > 100 TB disk, multi-PB
    tertiary
  • Remote rendering and visualization (Argonne):
    Linux clusters and graphics cards serving remote
    imagery
  • Applications consortia (Caltech)
  • Software: Linux and vendor (IBM) cluster software;
    Globus, Condor and other Grid tools

4
Build on top of Condor/Globus
  • Leverage new Condor-G from Wisc
  • a personal Condor with the ability to submit
    universe = globus jobs
  • use Condor to launch jobs through the Globus
    gatekeeper
  • Includes Condor DAGMan
  • Directed Acyclic Graph Manager, a meta-scheduler
  • graph described using parent-child relationships
    (see the sketch below)
  • pre- and post-scripts for each job
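
For illustration (not from the slides), a minimal DAGMan
input file looks like this; the job and script names are
hypothetical:

    # two-node DAG: B runs only after A succeeds
    JOB A a.sub
    JOB B b.sub
    # pre-script runs before A is submitted,
    # post-script after A exits
    SCRIPT PRE  A stage_inputs.sh
    SCRIPT POST A check_a.sh
    PARENT A CHILD B

When a POST script is present, its exit code determines
whether DAGMan considers the node successful.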

5
What we started with
  • CMSIM already running directly on the Wisc Condor
    pool
  • usually runs of 100 Condor jobs
  • each of approximately 500 events
  • NCSA UniTree running a GSI-enabled FTP server
  • 2 TB disk cache
  • http://www.ncsa.uiuc.edu/SCD/Hardware/UniTree
  • 32 nodes (64 processors) of 1 GHz IBM Linux
    plugged in but never yet used
  • 1 frame of the 1024-processor Platinum cluster
  • friendly-user status next week, full production in
    May
  • login node with Globus 1.1.4

6
Timeline for a Run
  • At Caltech, submit a Condor DAGMan job
  • two jobs in the DAG (see the sketch below)
  • Job 1: launch the Wisc part of the run
  • CMSIM
  • transfer zebra data files to NCSA UniTree
  • Job 2: launch the NCSA part of the run
  • retrieve zebra data files from UniTree
  • ooHits
  • ooDigis (ORCA reconstruction)
  • post-script to run after the Wisc job
  • DAGMan fires up Job 1
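
A sketch of the Caltech-side DAG just described; the
two-node structure and the Job 1 post-script come from the
slides, while the file names are hypothetical:

    # cmsprod.dag (hypothetical name): the whole run
    # Job 1: run the 100-job CMSIM DAG at Wisc and
    # push the zebra files to UniTree
    JOB wisc wisc.sub
    # Job 2: retrieve zebra files at NCSA, run
    # ooHits and ooDigis
    JOB ncsa ncsa.sub
    # post-script checks Job 1 really succeeded
    SCRIPT POST wisc check_wisc.sh
    PARENT wisc CHILD ncsa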

7
Timeline for a Run
  • Job 1 is the Wisc part of the run
  • itself a Condor job:
  • universe = globus
  • globusscheduler = beak.cs.wisc.edu/jobmanager-INTEL-LINUX
  • executable = condor_dagman
  • input = <file including all CMSIM inputs and DAG>
  • output for this job logged to the Caltech machine
  • the DAG run at Wisc is for all 100 CMSIM jobs
  • at the end of each CMSIM job a post-script runs to
    gsincftpput the file to NCSA UniTree
  • an X.509 proxy cert authenticates on the user's
    behalf
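
Reconstructing the attributes above into a Condor-G submit
description gives roughly the following sketch; the host
and jobmanager name come from the slide, the file names
are hypothetical, and condor_dagman's command-line
arguments are omitted:

    # wisc.sub (sketch): submit description for Job 1
    universe        = globus
    globusscheduler = beak.cs.wisc.edu/jobmanager-INTEL-LINUX
    executable      = condor_dagman
    # the <file including all CMSIM inputs and DAG>
    input           = cmsim_run.in
    # stdout/stderr/log come back to Caltech machine
    output          = job1.out
    error           = job1.err
    log             = job1.log
    queue

Each CMSIM node's post-script then pushes its zebra file
to UniTree with something like the line below (ncftpput-
style arguments; the host and paths are hypothetical):

    gsincftpput unitree.ncsa.uiuc.edu /cms/zebra cmsim_042.fz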

8
Timeline for a Run
  • Job 1 at Wisc completes; DAGMan at Caltech runs
    the Job 1 post-script
  • since Globus doesn't (currently!) pass along the
    return value, we need to check whether Job 1
    succeeded
  • the post-script checks a log file on the Caltech
    machine for success (see the sketch below)
  • DAGMan then starts Job 2 at NCSA
  • Job 2 is another Condor job:
  • universe = globus
  • globusscheduler = posic.ncsa.uiuc.edu
  • executable = <script on posic>
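
A minimal sketch of that post-script; the slides say only
that it greps a log on the Caltech machine, so the log
name and success marker here are hypothetical:

    #!/bin/sh
    # check_wisc.sh (sketch): POST script for Job 1.
    # Globus doesn't pass the remote exit status back,
    # so grep the log written back to Caltech for a
    # success marker.
    if grep -q 'All jobs Completed' job1.out
    then
        exit 0   # node succeeds; DAGMan starts Job 2
    else
        exit 1   # node fails; Job 2 is never started
    fi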

9
Timeline for a Run
  • Why not use jobmanager-PBS?
  • it turned out that the PBS installation was a bit
    customized
  • somewhat of a culture issue: Globus admins are not
    always the same as the systems admins
  • the default Globus scripts for submitting PBS jobs
    wouldn't work
  • no time to customize, so punt and use the default
    fork jobmanager (a one-line difference, shown
    below)
  • in the future, definitely fix this!
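
The difference is one line in the Job 2 submit
description, following the Globus jobmanager naming
convention (the PBS variant is what we would have used had
its submit scripts worked on this host):

    # default fork jobmanager on login node (used)
    globusscheduler = posic.ncsa.uiuc.edu
    # PBS jobmanager (wanted, once it works)
    # globusscheduler = posic.ncsa.uiuc.edu/jobmanager-pbs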

10
Timeline for a Run
  • The executable run through the Globus jobmanager
    was a script prepared ahead of time (sketched
    below)
  • In the future:
  • have direct access to the batch system (PBS) for
    better control over the entire process, and to
    leverage more of Condor
  • necessary non-data input files prepared and
    transferred directly from Caltech
  • the obvious goal: no need to connect to a Wisc or
    NCSA machine and do anything by hand
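
A sketch of what that pre-staged script might look like;
the UniTree retrieval, ooHits, and ooDigis steps come from
the slides, while the host, paths, and invocation details
are hypothetical (gsincftpget is assumed to exist as the
companion of the gsincftpput used at Wisc):

    #!/bin/sh
    # run on posic via the fork jobmanager (sketch)
    # 1. pull zebra data files back from NCSA UniTree
    gsincftpget unitree.ncsa.uiuc.edu . /cms/zebra/cmsim_042.fz
    # 2. run the ORCA reconstruction chain
    ooHits  > oohits.log  2>&1
    ooDigis > oodigis.log 2>&1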

11
Future Directions
  • Automatic preparation for launch
  • add the Alliance LosLobos cluster as a resource
  • 512-processor (733 MHz) Linux cluster
  • use it both for CMSIM, by gliding in to the Wisc
    Condor pool, and for ORCA reconstruction
  • add third-party file transfers so Condor-G at
    Caltech manages the Wisc-to-NCSA data transfer
  • more sophisticated monitoring and logging