Title: Information Technology and Computing Infrastructure for U.S. LHC Physics
1Information Technology and Computing
Infrastructure for U.S. LHC Physics
- Lothar A.T. Bauerdick, Fermilab
- Project Manager U.S. CMS Software and Computing
2LHC Physics Discovery through Information
Technology and Computing Infrastructure
- LHC Computing Unprecedented in Scale and
Complexity (and Costs) - ? Advanced Coherent Global Information-Infrastruc
ture - Partnerships International, Interdisciplinary,
Inter-agency!
3LHC Physics Discoveriesby Researchers at U.S.
Universities and Labs
- U.S. LHC is Committed to Empower the U.S.
Scientists - to do Research on LHC Physics Data
-
- This is why we are interested in the Grid as an
Enabling Technology
4Distributed Grid Computing and Data Model
Adapted as the LHC Baseline ModelDistributed
Communities, Distributed Resources Coherent
Global Data Access, Analysis, Management
ExampleUS Atlas Grid Testbed
- Major Successes and Advances of Grid
Infrastructure in the U.S. - RD and Testbeds Prototyping and Hardening of
Grid Infrastructure, Deploying Grid Tools,
Developing and Integrating Grid Applications
5Tier-ed System of Regional Centers
- Developing further the hierarchically organized
fabric of Grid Nodes
6Transition to Production-Quality Grid
- Centers taking part in LHC Grid 2003
- Production Service Around the World ? Around the
Clock!
7towards Dynamic Workspaces for Scientists
- Communities of Scientists Working Locally within
a Global Context - Infrastructure for sharing, consistency of
physics and calibration data, software
New IT Needed!
8LHC Research Program Has Started Strongly!
- Sizable RD efforts and major investments in
Tier-1 centers started - First Grid Infrastructure in place, in
collaboration with the LHC Computing Grid
Project at CERN and elsewhere - U.S. LHC Scientists will profit in major ways
- develop the strong U.S. LHC environment for
Physics Analysis - Address the core issues in U.S. LHC SC
- developing and implementing the distributed
computing model central for success of U.S.
Universities participation - Focus on end-to-end services,
- Focus on distributed data access and management
- Work with Grid Projects like PPDG, NSF projects,
DOE Science Grid etcwork with CERN and other
centers around the world to setup a global
information infrastructure (info-structure) to
enable the U.S. for scientific discovery at the
LHC
9The Goal
- Provide capabilities to individual physicists and
communities of scientists that allow them - To participate as an equal in the LHC research
- To be fully represented in the Global Experiment
Enterprise - To receive on-demand whatever resources and
information they need to explore their science
interest while respecting the collaboration-wide
priorities and needs - Provide massive computing, storage, networking
resources - Including opportunistic use of resources that
are not LHC owned! - Provide full access to dauntingly complex
meta-data - That need to be kept consistent to make sense of
the event data - Collaborative Environment and Info-systems
10These Goals Require Substantial RD
- Global Access and Global Management of Massive
and Complex Data - Location Transparency of Complex Processing
Environments and of Vast Data Collections - Virtual Data, Workflow, Knowledge Management
Technologies - Monitoring, Simulation, Scheduling and
Optimization on a Heterogeneous Grid of
Computing Facilities and Networks - End-to-End Networking Performance, Application
Integration - Management of Virtual Organizations across the
Grid, Technologies and Services for Security,
Privacy, Accounting - Scientific Collaboration over the distance
- Etc
Grids are the Enabling TechnologyLHC Needs are
Pushing the LimitsTechnology and Architecture
still evolvingNew Research and Development in IT
is required
11U.S. LHC Grid Technology Cycles
- Rolling Prototypes evolution of the facility
and data systems - Prototyping and early roll out to production
quality services - Participation in Computing and Physics Data
Challenges - Emphasis on Quality, Documentation,
Dissemination, - Tracking of external practices
12Grid Testbeds And Production Grids
- Grid Testbeds Research, Development and
Dissemination! - LHC Grid Testbeds first real-life large Grid
installations, becoming production quality - Strong Partnership between Labs, Universities,
with Grid (iVDGL, GriPhyN, PPDG) and Middleware
Projects (Condor, Globus) - Strong dissemination component, together with
Grid Projects - E.g. U.S. CMS Testbed
- Caltech, UCSD, U.Florida, UW Madison, Fermilab,
CERN
MIT Rice Minnesota Iowa Princeton
13Example Grid Monitoring and Information Services
- MonALISA Monitoring System (Caltech)deployed in
U.S. CERN Grid Testbed - Dynamic information services and Grid resource
discovery mechanisms using intelligent agents - Use and further developnovel Grid Technologies
and Grid Interfaces - Grid Control Room For LHC Grid
- Technology driver for other projects
14Distributed Collaborative Engineering
- Projectization essential for Software and
Computing Effort of this complexity - Requires expert manpower and engineering
- Physics and Detector Groups at Universities are
the first to profit from this - Example Atlas Detector Geometry Description
Databases - Idea and Concept
- Geometry Modeller based on CDF U.Pittsburg
- Massive Development Effort
- NOVA MySQL Database BNL
- Repository of persistent configuration
information - NOVA Service ANL
- Retrieval of transient C objects from NOVA
Database - Conditions Database Service ANL/Lisbon
- Access to time-varying information based on type,
time, version and key - Used in conjunction with other persistency
services (e.g. NOVA Service) - Interval Of Validity Service LBNL
- Registration of clients retrieval of updated
information when validity expires caching policy
management - Release as scheduled to Detector and Physics
Groups - Prototype at Silicon alignment workshop in
December 2002
15Example Detector Description
- Geometry Modeller, Database, Visualization,
Optimization
Detail from Barrel Liquid Argon(parameterized -
40kB in memory)
Detail from TRT
16Work Packages for LHC Computing
- Facilities and Fabric Infrastructure
- U.S. Tier-1 and Tier-2 centers, U.S. University
infrastructure - Distributed Computing Infrastructure
- Networks, throughput, servers, catalogs
- Grid Services
- Middleware, Virtual Organizations support,
end-to-end and higher level services, trouble
shooting and fault tolerance, distributed science
environment - Experiment Specific Software
- Core software, frameworks, architectures,
applications physics and detector support - Collaboratory Tools and Support
- Communication, conferencing, sharing, Virtual
Control Room - Support Services
- Training, info services, help desk
17 Grid Projects are a Large International Effort
- Map of Grid Projects Directly Related to LHC
- e.g. the U.S. Particle Physics Data Grid (PPDG),
funded by SciDAC
18Partnership for Global Infostructure
I.Gaines, 4-Agency meeting at CERN March 21st,
2003
- Physics Computer Science/Information Technology
Funding Agencies
19DOE Labs have great impact on U.S. LHC
ScienceInter-agency partnership between the
DOE-funded Tier-1 and NSF-funded Tier-2 efforts
Tier-1s at Fermilab and BNL address the
majorGrid Infrastructure expertise and 24x7
Support
- Information Technology and Computing
Infostructurefor LHC Physics Discovery requires - Research, Development, Deployment, Dissemination
- and Sustained Reliable Running of Facilities
and Support Services!
20The U.S. LHC Mission is Physics Discovery at the
Energy Frontier!
- This partnership takes advantage of the
significant strengths of U.S. labs and
universities in the area of CS and IT - and exploit synergy betweenU.S. Universities and
National Labs,Software Professionals and
Physicists,Computer Scientists and High Energy
Physicists - LHC is amongst the first to put a truly
distributed Info-Structure in place,
spearheading important innovations in how we do
science