Dynamic%20Visualization%20of%20Transient%20Data%20Streams - PowerPoint PPT Presentation

About This Presentation
Title:

Dynamic%20Visualization%20of%20Transient%20Data%20Streams

Description:

Dynamic Visualization of Transient Data Streams P. Wong, et al The Pacific Northwest National Laboratory Presented by John Sharko Visualization of Massive Datasets – PowerPoint PPT presentation

Number of Views:106
Avg rating:3.0/5.0
Slides: 24
Provided by: JohnSh168
Learn more at: https://www.cs.uml.edu
Category:

less

Transcript and Presenter's Notes

Title: Dynamic%20Visualization%20of%20Transient%20Data%20Streams


1
Dynamic Visualization of Transient Data Streams
  • P. Wong, et al
  • The Pacific Northwest National Laboratory
  • Presented by John Sharko
  • Visualization of Massive Datasets

2
Characteristics of Data Streams
  • Arrives continuously
  • Arrives unpredictably
  • Arrives unboundedly
  • Arrives without persistent patterns

3
Examples of Data Streams
  • Newswires
  • Internet click streams
  • Network resource management
  • Phone call records
  • Remote sensing imagery

4
Visualization Problem
  • Fusing a large amount of previously analyzed
    information with a small amount of new
    information
  • Reprocess the whole dataset in full detail

5
First Objective
  • Achieve the best understanding of transient data
    when influx rate exceed processing rate
  • Approach Data stratification to reduce data size

6
Second Objective
  • Incremental visualization technique
  • Approach Project new information incrementally
    onto previous data

7
Primary Visualization OutputMultidimensional
Scaling
OJ Simpson trial
Oklahoma bombing
French elections
8
Adaptive Visualization Using Stratification
9
Methods for Adaptive Visualization
  • Vector dimension reduction
  • Vector sampling

10
Vector Dimension Reduction
  • Approach dyadic wavelets (Haar)

200 terms
100 terms
50 terms
11
Results of Vector Dimension Reduction
50
200
100
Dimensions
12
Results of Vector Sampling
3298
824
1649
Number of Documents
13
Scatterplot Similarity Matching
14
Scatterplot Similarity Matching
  • Procrustes Analysis Results

200 100 50
All 0.0 (self) 0.022 0.084
1/2 0.016 0.051 0.111
1/4 0.033 0.062 0.141
15
Incremental Visualization Using Fusion
  • Reprocessing by projecting new items onto
    existing visualization
  • Feature reprocessing the entire dataset is often
    not required

16
Hyperspectral Image Processing
  • Apply MDS to scale pixel vectors
  • K-mean process to assign unique colors
  • Stratify the vectors progressively

17
Robust Eigenvectors
  • Generate three MDS scatter plots for each third
    of the image

18
Robust Eigenvectors (contd)
  • Generate MDS scatterplot for entire dataset

19
Robust Eigenvectors (contd)
  • Extract points from cropped areas

20
Using Multiple Sliding Windows
Sliding Direction
Data Stream
Long Window
Short Window
  • Eigenvectors determined by the long window
  • New vectors are projected using the Eigenvectors
    of the long window

21
Dynamic Visualization Steps
  • 1. When influx rate lt processing rate, use MDS
  • 2. When influx rate gt processing rate, halt MDS
  • 3. Use multiple sliding windows for pre-defined
    number of steps
  • 4. Use stratification approach for fast overview
  • 5. Check for accumulated error using Procrustes
    analysis
  • 6. If error threshold not reached, go to step 3
  • If error threshold reached, go to step 1

22
Conclusions
  • The data stratification approach can
    substantially accelerate visualization process
  • The data fusion approach can provide instant
    updates

23
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com