Some Remarks on the Reliability of Categorical Probability ForecastsSource: Monthly Weather Review:;2008:;volume( 136 ):;issue: 011::page 4488Author:Bröcker, Jochen
DOI: 10.1175/2008MWR2329.1Publisher: American Meteorological Society
Abstract: Studies on forecast evaluation often rely on estimating limiting observed frequencies conditioned on specific forecast probabilities (the reliability diagram or calibration function). Obviously, statistical estimates of the calibration function are based on only limited amounts of data and therefore contain residual errors. Although errors and variations of calibration function estimates have been studied previously, either they are often assumed to be small or unimportant, or they are ignored altogether. It is demonstrated how these errors can be described in terms of bias and variance, two concepts well known in the statistics literature. Bias and variance adversely affect estimates of the reliability and sharpness terms of the Brier score, recalibration of forecasts, and the assessment of forecast reliability through reliability diagram plots. Ways to communicate and appreciate these errors are presented. It is argued that these errors can become quite substantial if individual sample points have too large influence on the estimate, which can be avoided by using regularization techniques. As an illustration, it is discussed how to choose an appropriate bin size in the binning and counting method, and an appropriate bandwidth parameter for kernel estimates.
|
Collections
Show full item record
contributor author | Bröcker, Jochen | |
date accessioned | 2017-06-09T16:26:00Z | |
date available | 2017-06-09T16:26:00Z | |
date copyright | 2008/11/01 | |
date issued | 2008 | |
identifier issn | 0027-0644 | |
identifier other | ams-67796.pdf | |
identifier uri | http://onlinelibrary.yabesh.ir/handle/yetl/4209282 | |
description abstract | Studies on forecast evaluation often rely on estimating limiting observed frequencies conditioned on specific forecast probabilities (the reliability diagram or calibration function). Obviously, statistical estimates of the calibration function are based on only limited amounts of data and therefore contain residual errors. Although errors and variations of calibration function estimates have been studied previously, either they are often assumed to be small or unimportant, or they are ignored altogether. It is demonstrated how these errors can be described in terms of bias and variance, two concepts well known in the statistics literature. Bias and variance adversely affect estimates of the reliability and sharpness terms of the Brier score, recalibration of forecasts, and the assessment of forecast reliability through reliability diagram plots. Ways to communicate and appreciate these errors are presented. It is argued that these errors can become quite substantial if individual sample points have too large influence on the estimate, which can be avoided by using regularization techniques. As an illustration, it is discussed how to choose an appropriate bin size in the binning and counting method, and an appropriate bandwidth parameter for kernel estimates. | |
publisher | American Meteorological Society | |
title | Some Remarks on the Reliability of Categorical Probability Forecasts | |
type | Journal Paper | |
journal volume | 136 | |
journal issue | 11 | |
journal title | Monthly Weather Review | |
identifier doi | 10.1175/2008MWR2329.1 | |
journal fristpage | 4488 | |
journal lastpage | 4502 | |
tree | Monthly Weather Review:;2008:;volume( 136 ):;issue: 011 | |
contenttype | Fulltext |