A selective editing method considering both suspicion and potential impact, developed and applied to the Swedish foreign trade statistics Topic (ii), WP 12 - PowerPoint PPT Presentation

1 / 27
About This Presentation
Title:

A selective editing method considering both suspicion and potential impact, developed and applied to the Swedish foreign trade statistics Topic (ii), WP 12

Description:

A selective editing method considering both suspicion and potential impact, ... Homogenous groups modest demand on number of observations ... – PowerPoint PPT presentation

Number of Views:32
Avg rating:3.0/5.0
Slides: 28
Provided by: Ntet8
Learn more at: https://unece.org
Category:

less

Transcript and Presenter's Notes

Title: A selective editing method considering both suspicion and potential impact, developed and applied to the Swedish foreign trade statistics Topic (ii), WP 12


1
A selective editing method considering both
suspicion and potential impact, developed and
applied to the Swedish foreign trade
statisticsTopic (ii), WP 12
Anders Jäder and Anders Norberg, Statistics Sweden
2
The data
  • Main variables collected monthly
  • Commodity code (8-digit CN codes)
  • Country of dispatch/arrival
  • Quantity (weight and supplementary unit)
  • Invoiced Value
  • 350 000 observations per month

3
Score function
  • Computed as a weighted geometric mean of measures
    of Suspicion and Potential impact

4
Selective editing
  • The 1,500 observations with the highest scores
    are flagged

5
Suspicion
  • The difference between Unit price and the
    lower/upper quartile, divided by inter-quartiles
    distance. Logarithmic scale
  • (Euro/Kg)

6
Potential Impact
  • The difference between Invoiced Value and the
    median of Unit price multiplied by
    Quantity(Euro)

7
(No Transcript)
8
(No Transcript)
9
(No Transcript)
10
(No Transcript)
11
(No Transcript)
12
(No Transcript)
13
Hit rate 30
14
Hit rate46
Impact65
15
Hit rate30
Impact80
16
Hit rate34Impact81 Best!
17
Potential impact
The 8-digit commodity codes can be aggregated to
6, 4 and 2-digit commodity codes (CN6, CN4, CN2)
and other classifications , e.g. the SITC
classification. ? Over 10,000 estimates to be
computed
18
Potential impact
  • We have developed a formula with which the impact
    of an error on the statistics on all aggregation
    levels and sizes of estimates can be expressed in
    one single variable.

19
Potential impact
  • Excel demonstration

20
Potential impact
21
Strategy
  • SCB has saved raw and corrected data for all
    months since 2000. We analyzed them
  • New system with parameters
  • Produce monthly process data for a continuous
    search of best parameter values

Will we be misled when we analyze data that has
been flagged by the old method ???
22
Study
  • We need many months of historical data current
    data is not enough
  • Homogenous groups modest demand on number of
    observations
  • Computation of median and quartiles weighted by
    Quantity
  • Suspicion versus probability of error
    transformation of Suspicion

23
Suspicion versus probability of error
Suspicion
24
Experiences from production
Hit rate by variable
25
Experiences from production
Impact by variable
26
Experiences from production
- Impact on variable invoiced value
27
Thank You!
Write a Comment
User Comments (0)
About PowerShow.com