Big Data. New Physics. And Why Geospatial Data is Analytic SuperFood - PowerPoint PPT Presentation

About This Presentation
Title:

Big Data. New Physics. And Why Geospatial Data is Analytic SuperFood

Description:

Big Data. New Physics. And Why Geospatial Data is Analytic SuperFood Jeff Jonas, IBM Distinguished Engineer Chief Scientist, IBM Entity Analytics – PowerPoint PPT presentation

Number of Views:542
Avg rating:3.0/5.0
Slides: 37
Provided by: ibp6
Category:

less

Transcript and Presenter's Notes

Title: Big Data. New Physics. And Why Geospatial Data is Analytic SuperFood


1
Big Data. New Physics.And Why Geospatial Data is
Analytic SuperFood
Jeff Jonas, IBM Distinguished Engineer Chief
Scientist, IBM Entity Analytics JeffJonas_at_us.ibm.c
om January 18th, 2011
2
The data will find the data and the relevance
will find you.
3
My Background
  • Early 80s Founded Systems Research
    Development (SRD), a custom software consultancy
  • 1989 2003 Built numerous systems for Las Vegas
    casinos including a technology known as
    Non-Obvious Relationship Awareness (NORA)
  • 2005 IBM acquires SRD, now chief scientist of
    IBM Entity Analytics
  • Personally designed and deployed /- 100 systems,
    a number of which contained multi-billions of
    transactions describing 100s of millions of
    entities
  • Today My focus is in the area of sensemaking on
    streams with special attention towards privacy
    and civil liberties protections

4
Sensemaking on Streams
  • 1) Evaluate new information against previous
    information as it arrives.
  • 2) Determine if what is being observing is
    relevant.
  • 3) Deliver this relevant, actionable insight fast
    enough to do something about it as its
    happening.
  • 4) Do this with sufficient accuracy and scale to
    really matter.

5
Trend Organizations Are Getting Dumber
Available Observation Space
Context
Computing Power Growth
Sensemaking Algorithms
Time
6
Simply Overwhelming
Every two days now we create as much information
as we did from the dawn of civilization up until
2003. Eric Schmidt, CEO Google
7
Trend Organizations Are Getting Dumber
Available Observation Space
Context
Computing Power Growth
Sensemaking Algorithms
Time
8
Algorithms at Dead End. You Cant Squeeze
Knowledge Out of a Pixel.
9
No Context
10
  • Context, definition
  • Better understanding something by taking into
    account the things around it.

11
Information in Context and Accumulating
12
From Pixels to Pictures to Insight
Relevance
Contextualization
Observations
Consumer (An analyst, a system, the sensor
itself, etc.)
Information in Context
13
The Puzzle Metaphor
  • Imagine an ever-growing pile of puzzle pieces of
    varying sizes, shapes and colors
  • What it represents is unknown (there is no
    picture on hand)
  • Is it one puzzle, 15 puzzles, or 1,500 different
    puzzles?
  • Some pieces are duplicates, missing, incomplete,
    low quality, or have been misinterpreted
  • Some pieces may even be professionally fabricated
    lies
  • Point being Until you take the pieces to the
    table and attempt assembly, you dont know what
    you are dealing with

14
How Context Accumulates
  • With each new observation one of three
    assertions are made 1) Un-associated 2) placed
    near like neighbors or 3) connected
  • Must favor the false negative
  • New observations sometimes reverse earlier
    assertions
  • Some observations produce novel discovery
  • As the working space expands, computational
    effort increases
  • Given sufficient observations, there can come a
    tipping point, at which time 1) confidence
    begins to improve and 2) computational effort
    begins to decrease!

15
One Form of Context Is Expert Counting
  • Is it 5 people each with 1 account or is it 1
    person with 5 accounts?
  • Is it 20 cases of H1N1 in 20 cities or one case
    reported 20 times?
  • If one cannot count one cannot estimate vector
    or velocity (direction and speed).
  • Without vector and velocity prediction is
    nearly impossible.

16
Counting Degrees of Difficulty
Exactly Same
17
Key Features Enable Expert Counting
  • People Cars Router
  • Name Make Device ID
  • Address Model Make
  • Date of Birth Year Model
  • Phone License Plate No. Firmware Vers.
  • Passport VIN Asset ID
  • Nationality Owner Etc.
  • Biometric Etc.
  • Etc.

18
Consider Lying Identical Twins
19
  • The same thing cannot be in two places at the
    same time.
  • Two different things cannot occupy the same space
    at the same time.

20
Space Time Enables Absolute Disambiguation
  • People Cars Router

Name Make Device ID Address Model Make Date
of Birth Year Model Phone License Plate
No. Firmware Vers. Passport VIN Asset
ID Nationality Owner Etc. Biometric Etc. Etc.
When When When Where Where Where
21
Life Arcs Are Also Telling
Bill Smith 4/13/67 Seattle, Washington
Bill Smith 4/13/67 Salem, Oregon
22
Space-Time-Travel
23
Space-Time-Travel
  • Cell phones are generating a staggering amount of
    geo-locational data 600B transactions per day
    being created in the US alone
  • This data is being de-identified and shared
    with third parties in volume and in real-time
  • Your movement quickly reveals where you spend
    your time (e.g., evenings vs. working hours) and
    who you spend your time with
  • Re-identification (figuring out who is who) is
    somewhat trivial

24
Analytic Superfood for Prediction
  • Route suggestions pushed to drivers,
    just-in-time, to avert significant traffic events
  • Search results optimized using personalized life
    arc forecasts
  • A nation able to work right through an extreme
    global pandemic

25
And Other Predictions
  • Prediction with 87 certainty where you will be
    next Thursday at 535pm
  • Names of the top 10 people you co-locate with,
    not at home and not at work
  • The Uberstan intelligence service preempts the
    next mass protest in real-time
  • A political opponent is crushed and resigns two
    days after announcing their candidacy

26
Consequences
  • Space-time-travel data is the ultimate biometric
  • It will enable enormous opportunity
  • It will unravel ones secrets
  • It will challenge existing notions of privacy
  • And, its here now and more to come

27
Surveillance society is irresistible. And you
are doing it. GPS-enhanced search, free email,
Facebook, etc.
28
Responsible innovation Privacy by
design Better data protection Data
anonymization, active audit logs, etc.
29
Closing Thoughts
30
Wish This On The Adversary
Available Observation Space
Context
Computing Power Growth
Sensemaking Algorithms
Time
31
Context Accumulation The Way Forward
Available Observation Space
Context
Context Accumulation
Computing Power Growth
Sensemaking Algorithms
Time
32
Geospatial-Enabled Intelligence ... Today
Geospatial Visualization
Current Focus
Geospatial Analytics
33
Geospatial-Enabled Intelligence Tomorrow
Geospatial Analytics
Future Focus
Geospatial Visualization
34
Big Data. New Physics.
  • More Data Better prediction
  • Less false positives
  • Less false negatives
  • More Data Bad data good
  • More Data Less compute effort

35
Related Blog Posts
  • Algorithms At Dead-End Cannot Squeeze Knowledge
    Out Of A Pixel
  • Puzzling How Observations Are Accumulated Into
    Context
  • Big Data. New Physics.
  • Smart Sensemaking Systems, First and Foremost,
    Must be Expert Counting Systems
  • Your Movements Speak for Themselves Space-Time
    Travel Data is Analytic Super-Food!
  • Big Data Flows vs. Wicked Leaks
  • Data Finds Data
  • Macro Trends The Privacy and Civil Liberties
    Consequences and Comments on Responsible
    Innovation My DHS DPIAC Testimony, September
    2008

36
Big Data. New Physics.And Why Geospatial Data is
Analytic SuperFood
Jeff Jonas, IBM Distinguished Engineer Chief
Scientist, IBM Entity Analytics JeffJonas_at_us.ibm.c
om January 18th, 2011
Write a Comment
User Comments (0)
About PowerShow.com