Lightweight Application Classification for Network Management - PowerPoint PPT Presentation

1 / 14
About This Presentation
Title:

Lightweight Application Classification for Network Management

Description:

A lightweight application classification scheme based on NetFlow data ... (Cisco NetFlow, Juniper cflowd, Huawei NetStream,...) Sampling further reduces the cost ... – PowerPoint PPT presentation

Number of Views:17
Avg rating:3.0/5.0
Slides: 15
Provided by: IBMU393
Category:

less

Transcript and Presenter's Notes

Title: Lightweight Application Classification for Network Management


1
Lightweight Application Classification for
Network Management
  • Hongbo Jiang
  • Case Western Reserve University

Andrew W. Moore University of Cambridge
Zihui Ge Adverplex Inc.
Jia Wang ATT Labs - Research
Shudng Jin Case Western Reserve University
ACM SIGCOMM Workshop on Internet Network
Management (INM) Kyoto, Japan, August 31, 2007
2
Why do Network Traffic Classification?
  • Network planning
  • Traffic engineering
  • Accounting and billing
  • Security profiling

3
Our Contribution
  • A lightweight application classification scheme
    based on NetFlow data
  • Evaluation Sensitivity Analysis
  • Trivial features
  • Derivative features
  • Training-set size
  • Packet sampling

4
Flow-level Traffic Classification
  • Previous traffic classification use features
    derived from streams of packets
  • Can achieve good accuracy (e.g., 95)
  • Have high complexity and cost
  • Commonly available flow-level statistics
  • (Cisco NetFlow, Juniper cflowd, Huawei
    NetStream,)
  • Sampling further reduces the cost

5
Probabilistic Method Example
Training Set
Pr .97
6
Our Approach (cont.)
  • Features ranked by importance
  • Use Symmetric Uncertainty (based on entropy)
  • (See paper and references therein for details.)
  • Ranked features allows for a
  • sensitivity analysis, and the
  • removal of irrelevant and redundant features.

7
Evaluation
  • Dataset (not from ATT!)
  • Full-duplex 1Gbps access-link 1000 researchers
  • Data was hand-classified into a number of
    application classes e.g. web-browsing, email,
    FTP, attack, P2P,
  • Focused on TCP/IP flows only
  • 800,000 simplex TCP/IP application-level flows
  • (97 of traffic by byte-volume)
  • Netflow Generation
  • Software simulation of Cisco NetFlow v5 engine
  • Independent training and test sets
  • Flows randomly assigned to each

8
Baseline and Derivative Features
Category Baseline Derivative Baseline Application Baseline Derivative
Features srcIP/dstIPsrcPort/dstPort ToS sTime/eTime tcpFlag bytes packets Duration pktSize byteRate pktRate tcpFxxx (syn/ack/fin/rst/psh/urg) Low port High port
Accuracy 88.3 89.1 91.4
Comparison Port based 50-70, Packet based 95
9
Highly Relevant Features
Refers to specific privileged services and
protocols
Differentiate Email and FTP from Web-browsing
Compact features
10
Reducing Feature Complexity
Runtime 600x (s)
Runtime 1x (s)
Accuracy remains high even after removing
irrelevant and redundant features.
11
Reducing Training SetSize
More features may lead-to noise (insufficiently
representative)
12
Impact of Packet Sampling
  • NetFlow characteristic
  • Observed flow-count will decrease as sampling
    rate decreases
  • Packet sampling has little impact on accuracy

13
Conclusion Future Works
  • Conclusion
  • Application Classification can be done with
    Flow-level (NetFlow) information
  • Trivially-derived features improve accuracy
  • Packet sampling have minimal impact
  • Future works
  • NetFlow v9??
  • Other M-L methods?

14
Thanks
Write a Comment
User Comments (0)
About PowerShow.com