Measuring and monitoring Microsofts enterprise network - PowerPoint PPT Presentation

1 / 18
About This Presentation
Title:

Measuring and monitoring Microsofts enterprise network

Description:

1. Measuring and monitoring Microsoft's enterprise ... Packet encapsulated in shim header plus trailer. IP protocol moved into trailer and header rewritten ... – PowerPoint PPT presentation

Number of Views:42
Avg rating:3.0/5.0
Slides: 19
Provided by: mort98
Category:

less

Transcript and Presenter's Notes

Title: Measuring and monitoring Microsofts enterprise network


1
Measuring and monitoring Microsofts enterprise
network
  • Richard Mortier (mort), Rebecca Isaacs, Laurent
    Massoulié, Peter Key

2
We monitored our network
  • and this is how
  • and this is what we saw
  • How did we monitor it?
  • What did we see?

3
Microsoft CorpNet _at_ MSR Cambridge
CORPNET
EMEA
MSRC
area3
area2
LatinAmerica
area1
Area 0
eBGP
NorthAmerica
AsiaPacific
4
Capture setup
  • MSRC site organized using IP subnets
  • Roughly one per wing plus one for datacenter
  • Datacenter is by far the most active
  • Captured using VLAN spanning
  • 11 mapping between (Ethernet) VLAN and IP subnet
  • Mapped all VLANs to one port (NS trace)
  • except datacenter, mapped to second port (DC
    trace)
  • Also took a capture at one VLANs Ethernet switch
  • Allowed us to estimate amount of traffic not
    captured
  • gt99 traffic is routed (i.e. goes off-VLAN)
  • Missed printer, some subnet broadcast, some SMB

5
Capture setup VLAN spanning
offsite
north side
NS trace
filter on VLAN
port
MSR Cambridge site router
subnet
DC trace
data center
south side
6
Packet processing
  • Assigned packets to application
  • Used port numbers, RPC GUID, signature byte
    strings, server name
  • Assigned applications to category
  • 40 applications ? 10 categories
  • Generated packet and flow records
  • Reduce disk IO, increase performance
  • Still took 10 days per complete run
  • Python scripts processed records

7
Problems with this setup
  • Duplication
  • No DC switch some hosts directly connected to
    router
  • See their packets twice (on the way in and out)
  • Deduplicate both traces careful selection from
    NS trace
  • IPSec transport mode deployment
  • Packet encapsulated in shim header plus trailer
  • IP protocol moved into trailer and header
    rewritten
  • Wrote custom capture tools to unpick
    encapsulation
  • Flow detection
  • Network flow ? transport flow ? application flow
  • Used IP 5-tuple and timeout 90 seconds

8
Flow idle timeout
9
Trace characteristics
10
Traffic classification
11
Protocol distribution
12
Communication patterns
flows src ports suggesting client behaviour
flows use few src ports suggests server behaviour
13
flows src ports suggesting client behaviour
flows use few src ports suggests server behaviour
neither client nor server suggests peer-to-peer
neither client nor server suggests peer-to-peer
14
Communication patterns
15
Traffic dynamics
  • Headlines seasonal, highly volatile
  • Examine through
  • Autocorrelations
  • Variation per-application per-hour
  • Variation per-application per-host
  • Variation in heavy-hitter set

16
Correlograms onsite traffic
17
Correlograms offsite traffic
18
Variation per-application per-hour
  • Onsite (left)
  • Offsite (down)
  • Exponential decay
  • Light-tailed

19
Variation per-application per-host
  • Onsite (left)
  • Offsite (down)
  • Linear decay
  • Heavy-tailed
  • Heavy hitters

20
Implications for modelling
  • Timeseries modelling is hard
  • Tried ARMA, ARIMA models but per-application only
  • Exponentiation leads to large errors in
    forecasting
  • Client/server distinction unclear
  • Tried PCA, projection pursuit method
  • Neither found anything
  • PCA discovered singleton clusters in rank order...

21
Implications for endsystem measurement
  • Heavy hitter tracking a useful approach for
    network monitoring
  • Must be dynamic since heavy hitter set varies
  • between applications and
  • over time per-application
  • but is it possible to define a baseline against
    which to detect (volume) anomalies?

22
Questions?
Write a Comment
User Comments (0)
About PowerShow.com