The Joint Distribution of Internet Flow Sizes and Durations - PowerPoint PPT Presentation

1 / 20
About This Presentation
Title:

The Joint Distribution of Internet Flow Sizes and Durations

Description:

The University of North Carolina ... Data description, scatter plots and density estimation ... (EDA) Different earlier analyses of Internet Flow Sizes ... – PowerPoint PPT presentation

Number of Views:16
Avg rating:3.0/5.0
Slides: 21
Provided by: cwp47
Category:

less

Transcript and Presenter's Notes

Title: The Joint Distribution of Internet Flow Sizes and Durations


1
The Joint Distribution of Internet Flow Sizes
and Durations
  • CHEOLWOO PARK
  • J. STEPHEN MARRON
  • The University of North Carolina at Chapel Hill

2
The Joint Distribution of Internet Flow Sizes and
Durations
  • Motivation of the study
  • Data description, scatter plots and density
    estimation
  • Correlation plots
  • Conclusions and future plans

3
The Joint Distribution of Internet Flow Sizes and
Durations
  • Started from conflict between two papers
  • Extremal Dependence Internet Traffic
    Applications (2002) - Felix Hernandez Campos, J.
    S. Marron, Sidney I. Resnick and Kevin Jeffay
  • On the Characteristics and Origins of Internet
    Flow Rates (2002) - Yin Zhang, Lee Breslau, Vern
    Paxson and Scott Shenker, SIGCOMM02

4
The Joint Distribution of Internet Flow Sizes and
Durations
  • Why interested in this topic?
  • Size and rate are naturally considered as
    independent
  • Users determine sizes of files transferred
    depending on their available bandwidths?
  • Modeling of Internet traffic

5
The Joint Distribution of Internet Flow Sizes and
Durations
  • Different earlier analyses of Internet Flow Sizes
    and Durations

S Size, D Duration, R (S/D) Rate, IR
Inverse Rate
  • Nearly contradictory answers!

6
The Joint Distribution of Internet Flow Sizes and
Durations
  • Why? Possibilities
  • Data from different sources?
  • Different types of data? (HTTP Resp. vs all web
    traces)
  • Different correlation measure?
  • Different threshold values?

7
The Joint Distribution of Internet Flow Sizes and
Durations
  • Threshold values
  • applied thresholding to different variables
  • used different threshold values

8
The Joint Distribution of Internet Flow Sizes and
Durations
  • Motivation of the study
  • Data description, scatter plots and density
    estimation
  • Correlation plots
  • Conclusions and future plans

9
The Joint Distribution of Internet Flow Sizes and
Durations
  • Data
  • HTTP responses
  • Sunday Morning (800 AM 1200 PM)
  • In April 2001
  • From UNC Main Link
  • Variables of Interest
  • S  Size (bytes)
  • D  Duration (time in seconds)
  • R  Rate (throughput, byte/sec)
  • IR  Inverse Rate (sec/byte)

10
The Joint Distribution of Internet Flow Sizes and
Durations
  • Scatterplot log10(Size) vs. log10(Duration)

11
The Joint Distribution of Internet Flow Sizes and
Durations
  • Scatterplot log10(Size) vs. log10(Rate)

12
The Joint Distribution of Internet Flow Sizes and
Durations
  • Scatterplot log10(Duration) vs. log10(Inv. Rate)

13
The Joint Distribution of Internet Flow Sizes and
Durations
  • Motivation of the Study
  • Data description and scatter plots
  • Log-log correlation plots with global
    thresholdings
  • Conclusions and future plans

14
The Joint Distribution of Internet Flow Sizes and
Durations
log10(Size) vs. log10(Duration)
15
The Joint Distribution of Internet Flow Sizes and
Durations
log10(Size) vs. log10(Rate)
16
The Joint Distribution of Internet Flow Sizes and
Durations
log10(Duration) vs. log10(Inv. Rate)
17
The Joint Distribution of Internet Flow Sizes and
Durations
Simulated bivariate normal
log10(Size) vs. log10(Rate)
18
The Joint Distribution of Internet Flow Sizes and
Durations
  • Motivation of the Study
  • Data description and scatter plots
  • Log-log correlation plots with global
    thresholdings
  • Conclusions and future plans

19
The Joint Distribution of Internet Flow Sizes and
Durations
  • Conclusions
  • The blind men and the elephant
  • Thresholding is CRITICAL

20
The Joint Distribution of Internet Flow Sizes and
Durations
  • Deeper investigation
  • What values should we use ?
  • On Size ?
  • On Duration ?
  • On Both ?
  • How to handle 0 durations ?
  • Which methods are robust to thresholding?
Write a Comment
User Comments (0)
About PowerShow.com