Microsoft Research CERN-Pasadena at 1 GBps (8 Gbps) World Wide Telescope - PowerPoint PPT Presentation

1 / 12
About This Presentation
Title:

Microsoft Research CERN-Pasadena at 1 GBps (8 Gbps) World Wide Telescope

Description:

Bring all scientific literature and data online. Focus on large database issues, ... 10 billon records, 2 TB. Online query to any and all. Also used for education ... – PowerPoint PPT presentation

Number of Views:30
Avg rating:3.0/5.0
Slides: 13
Provided by: spea2
Category:

less

Transcript and Presenter's Notes

Title: Microsoft Research CERN-Pasadena at 1 GBps (8 Gbps) World Wide Telescope


1
Microsoft ResearchCERN-Pasadena at 1 GBps (8
Gbps) World Wide Telescope
  • Jim Gray
  • Researcher
  • Microsoft Research

2
Microsoft Research
  • Organizational goal Advance state of the art
  • More than 700 staff, 55 areas
  • Labs in US, Europe, Asia Internationally
    recognized teams
  • University organizational modelOpen research
    environmentClose ties to universities
  • Close working relations with development.

3
My Research Goal
  • Information at your fingertips
  • Bring all scientific literature and data online
  • Focus on large database issues, and scalable
    servers.

4
Challenge Move Data from CERN to Remote
Centers _at_ 1GBps
  • Disk-to-Disk
  • gigabyte / second data rates
  • 80TB/day
  • 30 petabytes by 2008
  • 1 exabyte by 2014

Graphics courtesy of Harvey Newman _at_ Caltech
5
Current Status CERN ? Pasadena
  • Multi Stream tpc/ip 7.1 Gbps 900 MBps
  • New speed record _at_ http//ultralight.caltech.edu/l
    sr-winhec/
  • Single Stream tpc/ip 6.5 Gbps 800 MBps
  • File Transfer Speed 450 MBps

7,000
6,000
5,000
4,000
mbps per second
3,000
2,000
1,000
0
2000
2001
2002
2003
2004
2005
6
World Wide Telescope
  • Premise Most Astronomy data is online
  • The Internet is the worlds best telescope
  • It has data on every part of the sky
  • In every measured spectral band
  • As deep as the best instruments
  • It is up when you are up.The seeing is always
    great (no working at night, no clouds no moons
    no..).
  • Its a smart telescope links objects and
    data with literature.

7
SkyServer.SDSS.orgBuilt with Johns Hopkins U.
  • A modern archive
  • Raw data in file servers
  • Catalog data (derived objects) in Database
  • 10 billon records, 2 TB
  • Online query to any and all
  • Also used for education
  • 150 hours of online Astronomy
  • Implicitly teaches data analysis
  • Interesting things
  • Based on Web Services
  • Spatial data search
  • Cloned by other surveys (a design template)

8
Service Oriented ArchitectureData Federations of
Web Services
  • Massive datasets live near their owners
  • Near instrument software pipeline, apps
  • Near data knowledge and curation
  • Each Archive publishes a web service
  • Schema documents the data
  • Methods on objects (queries)
  • Uniform access to multiple Archives
  • A common global schema
  • Scientists get personalized extracts

DB
9
Federation SkyQuery.Net
  • Combines 15 archives
  • Send query to portal, portal joins data from
    archives.
  • Problem want to do multi-step data analysis
    (not just single query).
  • Solution Allow personal databases on portal
  • Problem some queries are monsters
  • Solution batch scheduler on portal server,
    Deposits answer in personal database.

10
SkyQuery Structure
  • Each SkyNode publishes
  • Schema Web Service
  • Data Query Web Service
  • Portal
  • Plans Query (2 phase)
  • Integrates answers
  • Is itself a web service

11
Summary
  • Microsoft Research is active inside and outside
    Microsoft.
  • 10Gbps Networking is coming,x-64 is comingand
    we are investing to make them real.
  • World Wide Telescope is coming
  • Exemplifies service oriented architecture
  • Built with web services and databases
  • Has interesting spatial database algorithms
  • Details on my websitehttp//research.microsoft.c
    om/Gray

12
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com