Title: Probabilistic QPFs for the Indian Monsoon using Reforecasts
1Probabilistic QPFs for the Indian Monsoon using
Reforecasts
NOAA Earth System Research Laboratory
- Tom Hamill
- NOAA / ESRL
- tom.hamill_at_noaa.gov
2Outline
- Background
- Why reforecasting?
- NOAAs reforecast data set.
- How skill can be overestimated using the
conventional method of applying metrics - Monsoon climatology
- Logistic regression review
- Results
- Stepwise elimination logistic regression
results - Brier skill scores and forecast reliability,
before - and after calibration using logistic regression
- Some examples of actual calibrated forecasts
3- Problem with current ensemble forecast systems
- Forecasts may be biased and/or deficient in
spread, - so that probabilities are mis-estimated.
Calibration (statistical correction) needed.
Heavy rain in an area where none of the ensemble
members predicted it.
http//www.spc.noaa.gov/exper/sref/
4This article on reforecasting in the Bulletin of
the American Meteorological Society is a
good place to start for an overview.
5NOAAs reforecast data set
- Model T62L28 NCEP GFS, circa 1998
- Initial States NCEP-NCAR Reanalysis II plus 7
/- bred modes. - Duration 15-day integrations every day at 00Z
from 19781101 to now. (http//www.cdc.noaa.gov/peo
ple/jeffrey.s.whitaker/refcst/week2). - Data Selected fields (winds, hgt, temp on 5
press levels, precip, t2m, u10m, v10m, pwat,
prmsl, rh700, heating). NCEP/NCAR reanalysis
verifying fields included (Web form to download
at http//www.cdc.noaa.gov/reforecast). Data
saved on 2.5-degree grid. - Experimental precipitation forecast products
http//www.cdc.noaa.gov/reforecast/narr .
6Reforecast data was archived on global domain.
For this experiment we saved forecast total
precipitation, column precipitable water, and
sea-level pressure tendency) on coarse and fine
grids, as shown, for May 15 - Oct 15, 1979-2007.
1.0
2.5
7Overestimating skill a review of the Brier
Skill Score
Brier Score Mean-squared error of probabilistic
forecasts.
Brier Skill Score Skill relative to some
reference, like climatology. 1.0 perfect
forecast, 0.0 skill of reference.
8Overestimating skill another example
5-mm threshold
Location A Pf 0.05, Pclim 0.05, Obs 0
Location B Pf 0.05, Pclim 0.25, Obs 0
why not 0.48?
Locations A and B
9An alternative BSS
- Say m overall samples, and k categories where
climatological event probabilities are similar in
this category. ns(k) samples assigned to this
category. Then form BSS from weighted average of
skills in the categories.
(for more details on all of this, see Hamill and
Juras, QJRMS, October C, 2006)
10Monsoon precipitation climatology
11Monsoon precipitation climatology
12Monsoon precipitation climatology
13Monsoon precipitation climatology
14Logistic regression
where x1, xn are model predictors, betas are
fitted regression coefficients. Used NAG library
routine.
- Predictors tested v(ensemble-mean precip),
precipitable water, SLP tendency - Observed data Indian precipitation analyses on
1-degree grid, 1979-2004 - Stepwise elimination to determine which
predictors are useful. - Train with data /- 10 days around date of
interest. Cross validated, so, for example,
regression coefficients for 1979 were trained on
1980-2004 data. - Test June 1 - October 1, 1979 - 2004.
15Which predictors in logistic regression with
stepwise elimination? Day 1
For every day of the monsoon season, a stepwise
linear regression was run to determine which
predictors provided a reduction in error. As
shown, a power-transformed ensemble- mean
forecast precipitation was uniformly selected as
an important predictor. Precipitable water was
occasionally selected, and sea-level pressure
change was virtually never selected. Based on
these results, all subsequent logistic
regression analyses will be based on using only
one predictor, the power- transformed
ensemble-mean precipitation amount.
16Which predictors in logistic regression with
stepwise elimination? Day 3
The same conclusion is reached when considering
other forecast leads.
17Brier Skill Scores
Confidence intervals are so small they dont show
up on the plot.
18Reliability, Ens. Relative Frequency, 1 and 5 mm
solid lines frequency distribution of
climatology
5, 95 percent confidence intervals via block
bootstrap.
19Reliability, Ens. Relative Frequency, 10 and 25 mm
20Reliability, Logistic Regression, 1 and 5 mm
21Reliability, Logistic Regression, 10 and 25 mm
22Map of Logistic Regression BSS, Day 1
Note This method of calculating BSS lumps all
samples at a particular grid point together
for dates between 1 May and 1 October. To
the extent that the climatological
event probability varies over this range of
dates, the BSS may be somewhat inflated. Please
read Hamill and Juras, QJRMS, Oct (c) 2006.
23Map of Logistic Regression BSS, Day 2
24Map of Logistic Regression BSS, Day 3
25Map of Logistic Regression BSS, Day 4
26Map of Logistic Regression BSS, Day 5
27Monthly variations of skill, 1 mm
28Monthly variations of skill, 10 mm
More skill later in the monsoon season.
29Logistic regression forecast example 1, 1-day
lead
30Logistic regression forecast example 1, 3-day
lead
31Logistic regression forecast example 2, 1-day
lead
32Logistic regression forecast example 2, 3-day
lead
33Logistic regression forecast example 3, 1-day
lead
34Logistic regression forecast example 3, 3-day
lead
35Logistic regression forecast example 4, 1-day
lead
36Logistic regression forecast example 4, 3-day
lead
37Conclusions
- Precipitation probabilities estimated directly
from the ensemble are very unreliable and
unskillful because of substantial model
deficiencies (coarse resolution, sub-optimal
model physics, methods of generating ensemble,
limited ensemble size). - With a large data set of past forecasts
(reforecasts) using the same model that is run
operationally (here, a 1998 version of NCEPs
GFS), the model forecasts can be post-processed
to yield reliable and somewhat skillful
probabilistic forecasts. - The 1st-generation NOAA reforecast model is out
of date, but there is growing interest worldwide
in producing reforecast data sets with current
models (e.g., ECMWF will produce limited
reforecasts starting in 2008).
38References(downloadable from www.cdc.noaa.gov/peo
ple/tom.hamill/cv.html)
- Hamill, T. M., J. S. Whitaker, and X. Wei, 2004
Ensemble re-forecasting improving medium-range
forecast skill using retrospective forecasts.
Mon. Wea. Rev., 132, 1434-1447. - Hamill, T. M., J. S. Whitaker, and S. L. Mullen,
2006 Reforecasts, an important dataset for
improving weather predictions. Bull. Amer.
Meteor. Soc., 87, 33-46. - Hamill, T. M., and J. S. Whitaker, 2006
Probabilistic quantitative precipitation
forecasts based on reforecast analogs theory and
application. Mon. Wea. Rev.,134, 3209-3229. - Hamill, T. M., and J. Juras, 2006 Measuring
forecast skill is it real skill or is it the
varying climatology? Quart. J. Royal Meteor.
Soc., 132, 2905-2923. - Wilks, D. S., and T. M. Hamill, 2007 Comparison
of ensemble-MOS methods using GFS reforecasts.
Mon. Wea. Rev., 135, 2379-2390. - Hamill, T. M., and J. S. Whitaker, 2006 Ensemble
calibration of 500 hPa geopotential height and
850 hPa and 2-meter temperatures using
reforecasts. Mon. Wea. Rev., 135, 3273-3280 - Hagedorn, R, T. M. Hamill, and J. S. Whitaker,
2007 Probabilistic forecast calibration using
ECMWF and GFS ensemble forecasts. Part I 2-meter
temperature. Mon. Wea. Rev., accepted. - Hamill, T. M., R. Hagedorn, and J. S. Whitaker,
2007 Probabilistic forecast calibration using
ECMWF and GFS ensemble forecasts. Part II
precipitation. Mon. Wea. Rev., accepted.