Title: The%20Cyberinfrastructure%20Movement%20and%20the%20Potential%20for%20Revolutionizing%20Science/Engineering%20Research%20and%20Education:%20A%20Harbinger%20of%20the%20Future%20of%20Scholarship%20(and%20Education)%20in%20the%20Digital%20Age
1The Cyberinfrastructure Movement and the
Potential for Revolutionizing Science/Engineering
Research and EducationA Harbinger of the Future
of Scholarship (and Education) in the Digital Age
- Daniel E. Atkins
- atkins_at_umich.edu
- School of Information Department of EECS
- University of Michigan, U.S.A.
JISC-CNI, Brighton, July 8, 2004
2Perspective from a Blend of Vantage Points
- Researcher
- Dean College of Engineering, School of
Information (www.si.umich.edu) - Chair of NSF Blue Ribbon Panel on
Cyberinfrastructure - Consultant
- Not speaking officially for the US NSF
3Converging Streams of Activity
ACLS-Mellon CI for Humanities
E-science
www.nas.org
Science-driven pilots (not using above labels)
4NSF Blue Ribbon Panel on Cyberinfrastructure
a new age has dawned in scientific and
engineering research, pushed by continuing
progress in computing, information, and
communication technology, and pulled by the
expanding complexity, scope, and scale of
todays challenges. The capacity of this
technology has crossed thresholds that now make
possible a comprehensive cyberinfrastructure on
which to build new types of scientific and
engineering knowledge environments and
organizations and to pursue research in new ways
and with increased efficacy.
- http//www.cise.nsf.gov/sci/reports/toc.cfm
5Cyberinfrastructure-enabled Knowledge
Communities (CKCs)
Virtual teams, communities, organizations,
knowledge communities/environments/ecologies
Cyberinfrastructure Equipment, Software, People,
Institutions
Computation, Storage, Communication and Interface
Technologies
or perhaps Organizations
6Knowledge Communities
research, scholarship
- This phase is shorthand for a group of people
working together to create, disseminate, use
and/or preserve knowledge. - I do not mean a static collection of knowledge
(as in a body of knowledge) -- I mean people
engaged in knowledge-based activities. (Knowing
Communities) - Not one huge knowledge community but rather many
specialized communities, often with overlapping
membership. Individual role may vary among
different knowledge communities.
teaching, learning
7Some Names for CKCs
- Co-laboratory, Collaboratory
- Grid Community
- e-X Community (as in e-science)
- Cyber-X Community (as in cyberscience)
- Community Gateways or Portals
- Virtual Community, Virtual Organizations
8Big Ideas
- Global cyberinfrastructure can become a platform
for routine, effective, computationally
supported, distance-independent activities of
knowledge communities. - Research communities are creating functionally
complete virtual communities that are absolutely
necessary for their next decade of research
aspirations. - Cyberinfrastructure offers new options for what
is done, how it is done, and who participates. - We now have the opportunity (and responsibility)
to help make it real.
9Push and Pull for CKCs
ACP Advanced Cyberinfrastructure Program
10(No Transcript)
11Dimensions of CI RD
usefulness
12Cyberinfrastructure
13Cyberinfrastructure
14(No Transcript)
15Translight Consortia
16Cyberinfrastructure
17NSF Middleware Initiative
http//www.nsf-middleware.org/
18Middleware Globus and Grid-Web Services
Convergence
From Ian Foster
19Core Middleware
- Identity and Identifiers namespaces, identifier
crosswalks, real world levels of assurance, etc. - Authentication campus technologies and
policies, interrealm interoperability via PKI,
Kerberos, etc. - Directories enterprise directory services
architectures and tools, standard objectclasses,
interrealm and registry services - Authorization permissions and access controls,
delegation, privacy management, etc. - Integration Activities open management tools,
application of virtual, federated and
hierarchical trust, enabling common applications
with core middleware
20Mellon Foundation Higher-Ed Open Source Projects
http//rit.mellon.org/twiki/bin/view/Main/Pubcooki
eTwiki
21Open Middleware Infrastructure Institute
22Cyberinfrastructure
23Computation-Simulation
Report of the High-end Computing Revitalization
Task Force (5-04)
Capability not just capacity technology,
policy, tools. Still need some center-based
leadership,super computers. On-demand
supercomputing, not just batch.
24Japanese Earth Simulation Center
25Top 5 Supercomputers
From http//www.top500.org/list/2003/11/
26Virginia Tech Terascale Cluster (1,100 Mac G5s)
http//computing.vt.edu/research_computing/terasca
le/
27Cyberinfrastructure
28Some Implications for the DL Research Community
- Onward to the GII - ubiquitous knowledge
environments and information ether. - Increased variety and scale of information.
- New scholarly communication systems
- Reducing participation overload.
- The openness movement.
- Persistence of access - digital preservation.
See http//www.sis.pitt.edu/dlwkshop/
29Openness Movement
- Open source software and its communities
- Open standards
- Open content open knowledge. Digital
repositories. - Open alliances for creating middleware
- Open intellectual properties more broadly
- Keeping the Internet architecture open
- University as counterbalance to overly
restrictive access (rights management)
30Research Challenges in Digital Archiving and
Long-Term Preservation
http//www.si.umich.edu/digarch/
http//www.digitalpreservation.gov/index.php
Includes Preserving Our Digital Heritage report
Time report.
Research Issues!
31Cyberinfrastructure
32NEESgrid Earthquake Engineering Collaboratory
www.neesgrid.org
33Reconstructions of Dendritic Spines by High
Performance Parallel Electron Tomography
The Spines on these nerve cell dendrites are
where connections are rapidly made and unmade
between cells in the brain.
This 3MeV Electron Microscope in Osaka Japan is
the Biggest in World
34The Initial Facility at Sondrestrom, Greenland
The University of Michigan Upper Atmospheric
Research Collaboratory (UARC)
35Embedded Sensors RD Use
http//www.cens.ucla.edu/index.html
Ocean Research Interactive Observatory Networks
National Ecological Observatory Network (NEON)
http//www.coreocean.org/Dev2Go.web?Anchororion_h
ome_pagernd17953
http//www.nsf.gov/bio/neon/start.htm
36(No Transcript)
37Cyberinfrastructure
38Electronic Visualization Lab
http//www.evl.uic.edu
Tele-Immersive Collaboration in the CAVE Research
Network
39Institute for Creative Technology
http//www.ict.usc.edu/
40Cyberinfrastructure
41Time-Space Collaboration
Time
Different
Same
Physically together... Drop in lab, physical library, museum
Audio, chat, video conference, group applications Email, threaded-discussions, shared files...
Same
Place
Different
distance matters
beyond being there
42Some examples of experimental CKCs
43The Initial Facility at Sondrestrom, Greenland
The University of Michigan Upper Atmospheric
Research Collaboratory (UARC)
44UARC Interface
computational models
Real-time instruments
dynamic work rooms
annotation
team chat
Session replay
Archival data
Journals
45Evolved into a Network of Instruments (one global
instrument)
46UARC Patterns of Communication
1998 Smithsonian Science Award
http//crew.umich.edu/
47Vignettes UARC/SPARC
- Shared, tele-instruments expertise.
- Rapid response, opportunistic campaigns.
- Multi-eyes, complementary expertise.
- Isolated instruments became a global instrument
chain. - Cross-mentoring/training.
- New earlier opportunities/exposure for grad
students.
48- Enhanced participation. Legitimate peripheral
participation. - Support for authentic, inquiry-based learning at
UG and pre-college level. - Distributed workshops for post-campaign data
analysis. - Session re-play for delayed participation.
- Data-theory closure.
- A living specification to stretch vision of
possibilities.
49Cyberinfrastructure is a First-Class Tool for
Science
50The Information Technology Needs of the
NeuroscienceCommunity Provide Major Challenges
for Tomorrows Information Technology
Infrastructure
- BRAIN RESEARCH of the Future will be conducted in
a DISTRIBUTED ENVIRONMENT - The NSF PACIs and Emergence of new Information
Technologies have catalyzed this change - Projects to Federate Neuroscience Data, Build
Data and Computational Grids and Telescience
Tools are creating a Cyber Infrastructure that
will Enable New Science in this and other
disciplines
Enable new understanding of the brain by linking
data about macroscopic brain function to its
molecular and cellular underpinnings
Bioinformatics Research Network (BIRN)
http//www.nbirn.net/
51Crab Nebula in 4 spectral regionsX-ray, optical,
infrared, radio
http//www.us-vo.org/
Virtual Observatory Prototype Produces Surprise
Discovery. Early demo project identifies new
brown dwarf.
http//www.us-vo.org/news/brown-dwarf.html
52Who are the Physicists? GriPhyN/iVDGL Science
Drivers
- US-ATLAS, US-CMS (LHC expts)
- Fundamental nature of matter
- 100s of Petabytes
- LIGO observatory
- Gravitational wave search
- 100s of Terabytes
- Sloan Digital Sky Survey
- Astronomical research
- 10s of Terabytes
a growing number of biologists other
scientists computer scientists needing
experimental apparatus
53Broader Implications
54Cyberinfrastructure for Multi-use
New ACLS Panel of CI for Humanities
55Potential Mission Synergy Shared-Use CKCs
C-CLEAR Cyberinfrastructure-enabled
Collaborative Learning, Engagement, and Research
56The QuarkNet-Trillium collaboration is about
using Grid virtual data tools and methods to
enrich science education
We uploaded the data to the Grid used the
grid analysis tools to find the show
57Sloan DigitalSky Survey/SkyserverJohns Hopkins
UniversityAlex Szalay (Co-PI), Jordan Raddick
- Database of 80 million objects
- Search and Analysis Tools
- Projects for Teachers and Students
58Benefits
- Students who engage in authentic research will
understand the process of science - Be more likely to pursue a career in science
- Grow up to be scientifically literate adults
- Improve science teaching through ongoing teacher
professional development and mentoring - Reconnect with science
- Reduce isolation
- Increase Retention
59Hands On UniverseLawrence Hall of Science,UC
BerkeleyCarl Pennypacker (Co-PI)
- Involve high school teachers and students in
authentic astronomy research - Visualization Tools (IDL)
- Image Archives
- Observation Requests
- Internet Controlled Telescopes
- International Partners
60Beyond being there... Global Examples..
- Global Graduate Seminar on ICT Globalization -
Cogburn, UM/SI - Global Product Design - Dutta UM/ME
61CKCs -- lofty aspirations
- Provide greater equity of access and
participation. - Open up more experiences and increase the
probability of intellectual linking across
disciplinary boundaries. - Enrich the diversity of participation,
perspective, ideas, experiences. - Enable sharing of resources and better
amortization of unique resource/facilities. - Support existing teams and accelerate the
formation of new teams, fields, disciplines. - Support rapid ad-hoc team formation to respond to
unexpected emergencies.
62Comments on the RD Model to Create and Apply
Cyberinfrastructure
63Creating effective CKCs require PQ research models
CKCs can support PQ research models
Creation of knowledge basic, curiosity-driven
research
Application of knowledge
Classic Linear Research Model
64Borromean Ring RD Teams
Behavioral organizational sciences practice
Computer information, science, engineering
practice
Teams learning together - collateral learning
Users, Communities of Practice, Organizations in
Society
Three symmetric, interlocking rings, no two of
which are interlinked. Removing one destroys the
synergy.
65CKCs RD Approach
Experimental prototypes
but robust usable
Iterative Design
Long-term support
B-teams
Human-centered
66(No Transcript)
67Cyberinfrastructure
68And-And CI-E Organizations Activities
- big and small
- local and global
- centralized and decentralized
- learning and research/discovery
- basic and applied (Pasteurs Quadrant)
- changes boundaries between when to compete and
when to cooperate - multi roles expert, student, observer,
policy-maker
cyberinfrastructure-enable
69Budget Recommendation Overview(Incremental,
Recurring)
NSF estimate of current investments in
cyberinfrastructure development use 400M/year
70Emergent Framework
- Science and Engineering Frontiers (Frontiers)
SE opportunities to be realized using CI - Integrating Architectures (IAs) limited number
of common architectures that support
domain-specific applications using a common,
reconfigurable set of open source software tools,
technologies and services. - Computation-intensive
- Information-intensive
- Instrumentation-rich
- Desk-top
71- Core (Core) CI foundation including backbone
networks, widely shared compute and storage
facilities education and workforce development
activities a portfolio of activities aimed at
yielding new knowledge on the science of
cyberinfrastructure, including its human and
social dimensions and rigorous evaluation and
assessment activities. - CI-enabling Research (CI-R) research
investments that will create new information
technology tools and resources to enrich
cyberinfrastructure for the foreseeable future.
72Core
- Education, Training, Community Development
Establish coordinated programmatic activities to - prepare current and future scientists and
engineers to use, develop and support
cyberinfrastructure - catalyze domain SE community efforts to define
unique CI-enabled research and education
opportunities (coordinated with Frontier
activities) - take targeted action to broaden participation of
underrepresented groups and organizations in CI
activities.
73- Networks Undertake comprehensive NSF-wide
planning to - examine current international investments in
network infrastructure to support Frontier
projects now and in the future - develop network infrastructure needs assessment
and gap analysis - identify last-mile connections solutions and
priorities - Informed by the above determine programmatic
strategy to invest in enabling networks both
Frontier and Core approaches as appropriate.
74- Supercomputing Platforms Define complementary
supercomputing investments to be made through
Core, CI-R, and Frontier activities. To inform
these investments - conduct comprehensive assessment of national
supercomputing resources - identify promising interagency strategies to
address supercomputing needs for open-science
community - identify organizational and economic models that
support centralized and/or distributed
supercomputing investments
75- Federated Data Archives and Digital Libraries
Undertake comprehensive NSF-wide planning to - explore the efficacy of creating a national
databank of federated data archives. recognizing
that data assets reside in different locations
and belong to multiple domains, individuals and
organizations - assess domain-specific interests in establishing
and supporting digital libraries - identify organizational and economic models that
support centralized and/or distributed data
archives and digital library investments.
76- Science of Cyberinfrastructure
- Study the impact and use of cyberinfrastructure
in research and education, to develop a better
understanding of its sociological, economic,
technological and societal implications. - Initiate a comprehensive research, evaluation and
assessment activity that seeks to develop new
knowledge on the most effective strategies to
design, develop and use cyberinfrastructure with
emphasis on usability, accessibility and
scientific utility.
77Questions, Discussion