Network Traffic Self-Similarity - PowerPoint PPT Presentation

1 / 85
About This Presentation
Title:

Network Traffic Self-Similarity

Description:

Discussion of network traffic self-similarity, definitions, issues, implications. Focuses on the Ethernet LAN traffic paper by Leland et al. – PowerPoint PPT presentation

Number of Views:107
Avg rating:3.0/5.0
Slides: 86
Provided by: CareyWil9
Category:

less

Transcript and Presenter's Notes

Title: Network Traffic Self-Similarity


1
Network Traffic Self-Similarity
  • Carey Williamson

Department of Computer Science University of
Calgary
2
Introduction
  • A recent measurement study has shown that
    aggregate Ethernet LAN traffic is
    self-similar Leland et al 1993
  • A statistical property that is very different
    from the traditional Poisson-based models
  • This presentation definition of network traffic
    self-similarity, Bellcore Ethernet LAN data,
    implications of self-similarity

3
Measurement Methodology
  • Collected lengthy traces of Ethernet LAN traffic
    on Ethernet LAN(s) at Bellcore
  • High resolution time stamps
  • Analyzed statistical properties of the resulting
    time series data
  • Each observation represents the number of packets
    (or bytes) observed per time interval (e.g., 10
    4 8 12 7 2 0 5 17 9 8 8 2...)

4
Self-Similarity The Intuition
  • If you plot the number of packets observed per
    time interval as a function of time, then the
    plot looks the same regardless of what
    interval size you choose
  • E.g., 10 msec, 100 msec, 1 sec, 10 sec,...
  • Same applies if you plot number of bytes observed
    per interval of time

5
Self-Similarity The Intuition
  • In other words, self-similarity implies a
    fractal-like behaviour no matter what time
    scale you use to examine the data, you see
    similar patterns
  • Implications
  • Burstiness exists across many time scales
  • No natural length of a burst
  • Traffic does not necessarilty get smoother
    when you aggregate it (unlike Poisson traffic)

6
Self-Similarity The Mathematics
  • Self-similarity is a rigourous statistical
    property (i.e., a lot more to it than just the
    pretty fractal-like pictures)
  • Assumes you have time series data with finite
    mean and variance (i.e., covariance stationary
    stochastic process)
  • Must be a very long time series
    (infinite is best!)
  • Can test for presence of self-similarity

7
Self-Similarity The Mathematics
  • Self-similarity manifests itself in several
    equivalent fashions
  • Slowly decaying variance
  • Long range dependence
  • Non-degenerate autocorrelations
  • Hurst effect

8
Slowly Decaying Variance
  • The variance of the sample decreases more slowly
    than the reciprocal of the sample size
  • For most processes, the variance of a sample
    diminishes quite rapidly as the sample size is
    increased, and stabilizes soon
  • For self-similar processes, the variance
    decreases very slowly, even when the sample size
    grows quite large

9
Variance-Time Plot
  • The variance-time plot is one means
    to test for the slowly decaying
    variance property
  • Plots the variance of the sample versus the
    sample size, on a log-log plot
  • For most processes, the result is a straight line
    with slope -1
  • For self-similar, the line is much flatter

10
Variance-Time Plot
Variance
m
11
Variance-Time Plot
100.0
10.0
Variance of sample on a logarithmic scale
Variance
0.01
0.001
0.0001
m
12
Variance-Time Plot
Variance
Sample size m on a logarithmic scale
4
5
6
7
m
1
10
100
10
10
10
10
13
Variance-Time Plot
Variance
m
14
Variance-Time Plot
Variance
m
15
Variance-Time Plot
Slope -1 for most processes
Variance
m
16
Variance-Time Plot
Variance
m
17
Variance-Time Plot
Slope flatter than -1 for self-similar process
Variance
m
18
Long Range Dependence
  • Correlation is a statistical measure of the
    relationship, if any, between two random
    variables
  • Positive correlation both behave similarly
  • Negative correlation behave as opposites
  • No correlation behaviour of one is unrelated to
    behaviour of other

19
Long Range Dependence (Contd)
  • Autocorrelation is a statistical measure of the
    relationship, if any, between a random variable
    and itself, at different time lags
  • Positive correlation big observation usually
    followed by another big, or small by small
  • Negative correlation big observation usually
    followed by small, or small by big
  • No correlation observations unrelated

20
Long Range Dependence (Contd)
  • Autocorrelation coefficient can range between 1
    (very high positive correlation) and -1 (very
    high negative correlation)
  • Zero means no correlation
  • Autocorrelation function shows the value of the
    autocorrelation coefficient for different time
    lags k

21
Autocorrelation Function
1
0
Autocorrelation Coefficient
-1
lag k
0
100
22
Autocorrelation Function
1
Maximum possible positive correlation
0
Autocorrelation Coefficient
-1
lag k
0
100
23
Autocorrelation Function
1
0
Autocorrelation Coefficient
Maximum possible negative correlation
-1
lag k
0
100
24
Autocorrelation Function
1
No observed correlation at all
0
Autocorrelation Coefficient
-1
lag k
0
100
25
Autocorrelation Function
1
0
Autocorrelation Coefficient
-1
lag k
0
100
26
Autocorrelation Function
1
Significant positive correlation at short lags
0
Autocorrelation Coefficient
-1
lag k
0
100
27
Autocorrelation Function
1
0
Autocorrelation Coefficient
No statistically significant correlation beyond
this lag
-1
lag k
0
100
28
Long Range Dependence (Contd)
  • For most processes (e.g., Poisson, or compound
    Poisson), the autocorrelation function drops to
    zero very quickly (usually immediately, or
    exponentially fast)
  • For self-similar processes, the autocorrelation
    function drops very slowly (i.e., hyperbolically)
    toward zero, but may never reach zero
  • Non-summable autocorrelation function

29
Autocorrelation Function
1
0
Autocorrelation Coefficient
-1
lag k
0
100
30
Autocorrelation Function
1
Typical short-range dependent process
0
Autocorrelation Coefficient
-1
lag k
0
100
31
Autocorrelation Function
1
0
Autocorrelation Coefficient
-1
lag k
0
100
32
Autocorrelation Function
1
Typical long-range dependent process
0
Autocorrelation Coefficient
-1
lag k
0
100
33
Autocorrelation Function
1
Typical long-range dependent process
0
Autocorrelation Coefficient
Typical short-range dependent process
-1
lag k
0
100
34
Non-Degenerate Autocorrelations
  • For self-similar processes, the autocorrelation
    function for the aggregated process is
    indistinguishable from that of the original
    process
  • If autocorrelation coefficients match for all
    lags k, then called exactly self-similar
  • If autocorrelation coefficients match only for
    large lags k, then called asymptotically
    self-similar

35
Autocorrelation Function
1
0
Autocorrelation Coefficient
-1
lag k
0
100
36
Autocorrelation Function
1
Original self-similar process
0
Autocorrelation Coefficient
-1
lag k
0
100
37
Autocorrelation Function
1
Original self-similar process
0
Autocorrelation Coefficient
-1
lag k
0
100
38
Autocorrelation Function
1
Original self-similar process
0
Autocorrelation Coefficient
Aggregated self-similar process
-1
lag k
0
100
39
Aggregation
  • Aggregation of a time series X(t) means smoothing
    the time series by averaging the observations
    over non-overlapping blocks of size m to get a
    new time series X (t)

m
40
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated series for m 2 is

41
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated series for m 2 is

42
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated series for m 2 is
  • 4.5

43
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated series for m 2 is
  • 4.5 8.0

44
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated series for m 2 is
  • 4.5 8.0 2.5

45
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated series for m 2 is
  • 4.5 8.0 2.5 5.0

46
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated series for m 2 is
  • 4.5 8.0 2.5 5.0 6.0 7.5 7.0 4.0 4.5
    5.0...

47
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated time series for m 5 is

48
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated time series for m 5 is

49
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated time series for m 5 is
  • 6.0

50
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated time series for m 5 is
  • 6.0 4.4

51
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated time series for m 5 is
  • 6.0 4.4 6.4 4.8
    ...

52
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated time series for m 10 is

53
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated time series for m 10 is
  • 5.2

54
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated time series for m 10 is
  • 5.2 5.6

55
Aggregation An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1...
  • Then the aggregated time series for m 10 is
  • 5.2 5.6 ...

56
Autocorrelation Function
1
Original self-similar process
0
Autocorrelation Coefficient
Aggregated self-similar process
-1
lag k
0
100
57
The Hurst Effect
  • For almost all naturally occurring time series,
    the rescaled adjusted range statistic (also
    called the R/S statistic) for sample size n obeys
    the relationship
  • ER(n)/S(n) c n
  • where
  • R(n) max(0, W , ... W ) - min(0, W , ... W )
  • S (n) is the sample variance, and
  • W ??X - k X for k 1, 2, ... n

H
1
1
n
n
2
k
k
i
n
i 1
58
The Hurst Effect (Contd)
  • For models with only short range dependence, H is
    almost always 0.5
  • For self-similar processes, 0.5 lt H lt 1.0
  • This discrepancy is called the Hurst Effect, and
    H is called the Hurst parameter
  • Single parameter to characterize
    self-similar processes

59
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • There are 20 data points in this example

60
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • There are 20 data points in this example
  • For R/S analysis with n 1, you get 20 samples,
    each of size 1

61
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • There are 20 data points in this example
  • For R/S analysis with n 1, you get 20 samples,
    each of size 1
  • Block 1 X 2, W 0, R(n) 0, S(n) 0

n
1
62
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • There are 20 data points in this example
  • For R/S analysis with n 1, you get 20 samples,
    each of size 1
  • Block 2 X 7, W 0, R(n) 0, S(n) 0

n
1
63
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 2, you get 10 samples,
    each of size 2

64
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 2, you get 10 samples,
    each of size 2
  • Block 1 X 4.5, W -2.5, W 0,
  • R(n) 0 - (-2.5) 2.5, S(n) 2.5,
  • R(n)/S(n) 1.0

n
1
2
65
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 2, you get 10 samples,
    each of size 2
  • Block 2 X 8.0, W -4.0, W 0,
  • R(n) 0 - (-4.0) 4.0, S(n) 4.0,
  • R(n)/S(n) 1.0

n
1
2
66
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 3, you get 6 samples,
    each of size 3

67
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 3, you get 6 samples,
    each of size 3
  • Block 1 X 4.3, W -2.3, W 0.3, W 0
  • R(n) 0.3 - (-2.3) 2.6, S(n) 2.05,
  • R(n)/S(n) 1.30

n
1
2
3
68
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 3, you get 6 samples,
    each of size 3
  • Block 2 X 5.7, W 6.3, W 5.7, W 0
  • R(n) 6.3 - (0) 6.3, S(n) 4.92,
  • R(n)/S(n) 1.28

n
1
2
3
69
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 5, you get 4 samples,
    each of size 5

70
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 5, you get 4 samples,
    each of size 4
  • Block 1 X 6.0, W -4.0, W -3.0,
  • W -5.0 , W 1.0 , W 0, S(n) 3.41,
  • R(n) 1.0 - (-5.0) 6.0, R(n)/S(n) 1.76

n
1
2
3
4
5
71
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 5, you get 4 samples,
    each of size 4
  • Block 2 X 4.4, W -4.4, W -0.8,
  • W -3.2 , W 0.4 , W 0, S(n) 3.2,
  • R(n) 0.4 - (-4.4) 4.8, R(n)/S(n) 1.5

n
1
2
3
4
5
72
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 10, you get 2 samples,
    each of size 10

73
R/S Statistic An Example
  • Suppose the original time series X(t) contains
    the following (made up) values
  • 2 7 4 12 5 0 8 2 8 4 6 9 11 3 3 5 7 2 9 1
  • For R/S analysis with n 20, you get 1 sample
    of size 20

74
R/S Plot
  • Another way of testing for self-similarity, and
    estimating the Hurst parameter
  • Plot the R/S statistic for different values of n,
    with a log scale on each axis
  • If time series is self-similar, the resulting
    plot will have a straight line shape with a slope
    H that is greater than 0.5
  • Called an R/S plot, or R/S pox diagram

75
R/S Pox Diagram
R/S Statistic
Block Size n
76
R/S Pox Diagram
R/S statistic R(n)/S(n) on a logarithmic scale
R/S Statistic
Block Size n
77
R/S Pox Diagram
R/S Statistic
Sample size n on a logarithmic scale
Block Size n
78
R/S Pox Diagram
R/S Statistic
Block Size n
79
R/S Pox Diagram
R/S Statistic
Slope 0.5
Block Size n
80
R/S Pox Diagram
R/S Statistic
Slope 0.5
Block Size n
81
R/S Pox Diagram
Slope 1.0
R/S Statistic
Slope 0.5
Block Size n
82
R/S Pox Diagram
Slope 1.0
R/S Statistic
Slope 0.5
Block Size n
83
R/S Pox Diagram
Self- similar process
Slope 1.0
R/S Statistic
Slope 0.5
Block Size n
84
R/S Pox Diagram
Slope H (0.5 lt H lt 1.0) (Hurst parameter)
Slope 1.0
R/S Statistic
Slope 0.5
Block Size n
85
Summary
  • Self-similarity is an important mathematical
    property that has recently been identified as
    present in network traffic measurements
  • Important property burstiness across many time
    scales, traffic does not aggregate well
  • There exist several mathematical methods to test
    for the presence of self-similarity, and to
    estimate the Hurst parameter H
  • There exist models for self-similar traffic
Write a Comment
User Comments (0)
About PowerShow.com