Comparative Evaluation of Four Different Sensitive Tabular Data Protection Methods Using a Real Life Table Structure of Complex Hierarchies and Links Populated with Artificial Data Ramesh A. Dandekar Energy Information Administration Washington - PowerPoint PPT Presentation

1 / 14
About This Presentation
Title:

Comparative Evaluation of Four Different Sensitive Tabular Data Protection Methods Using a Real Life Table Structure of Complex Hierarchies and Links Populated with Artificial Data Ramesh A. Dandekar Energy Information Administration Washington

Description:

Manchester, ... Manchester, United Kingdom. 7. 1,000 Synthetic Micro Data Records ... Manchester, United Kingdom. 10. Comparative Evaluation of Cell ... – PowerPoint PPT presentation

Number of Views:69
Avg rating:3.0/5.0

less

Transcript and Presenter's Notes

Title: Comparative Evaluation of Four Different Sensitive Tabular Data Protection Methods Using a Real Life Table Structure of Complex Hierarchies and Links Populated with Artificial Data Ramesh A. Dandekar Energy Information Administration Washington


1
Comparative Evaluation of Four Different
Sensitive Tabular Data Protection Methods Using a
Real Life Table Structure of Complex Hierarchies
and LinksPopulated with Artificial DataRamesh
A. DandekarEnergy Information AdministrationWash
ington DC(Ramesh.Dandekar_at_EIA.DOE.GOV)
UNECE2007 17-19 December 2007
2
Tabular Data Protection Methods
  • Classical LP Based Cell Suppression
  • Network Flow Based Cell Suppression (USBC)
  • LP Based Synthetic Tabular Data / CTA (Dandekar
    2001)
  • Micro Data Level Noise Addition (USBC)
  • P 10 Rule Used
  • Uses Proprietary Research Tools

3
Two Three Dimensional HYPOTHETICALTablesLinked
in Four Dimensional Space1st Table Volumes by
Grade, Sales Type, PAD District, and State ,
and 2nd Table Volumes by Formulation, Sales
Type, PAD District, and State
4
(No Transcript)
5
1st Table 2nd Table Grades
Formulations
  • Regular
  • Midgrade
  • Premium
  • Total All Grades
  • Conventional
  • Oxygenated
  • Reformulated
  • Total All Formulations

6
Grades
MISSING PORTION
Formulations
2nd Table By Formulations
1st Table By Grades
Total

Four Layers 1) DTW 2) Rack 3) Bulk 4)
Total Corresponding to each PAD, State and US
Level Cell
7
1,000 Synthetic Micro Data Records Containing Six
Variables
  • Four Categorical Variables
  • 51 States
  • 3 Grade Types
  • 3 Sale Types
  • 3 Formulation Types
  • One Magnitude Variable
  • One Sample Weight Variable

8
(No Transcript)
9
(No Transcript)
10
Comparative Evaluation of Cell Suppression Methods
  • Classical Cell Suppression
  • 294 Suppressions
  • Sensitive Cells Fully Protected
  • Network Flow Method
  • 479 Suppressions
  • Sensitive Cells Fully Protected
  • 3 exact Disclosures of non-sensitive cells

11
CTA vs NOISE - TABULAR DATA QUALITY
633
1432
12
(No Transcript)
13
(No Transcript)
14
THANK YOU!ADDITIONAL INFORMATION
FROMhttp//mysite.verizon.net/vze7w8vk/
Write a Comment
User Comments (0)
About PowerShow.com