Alternatives to the Chi-Square Test for Evaluating Rank Histograms from Ensemble Forecasts

Elmore, Kimberly L.

Source: Weather and Forecasting:;2005:;volume( 020 ):;issue: 005::page 789

Author:

DOI: 10.1175/WAF884.1

Publisher: American Meteorological Society

Abstract: Rank histograms are a commonly used tool for evaluating an ensemble forecasting system?s performance. Because the sample size is finite, the rank histogram is subject to statistical fluctuations, so a goodness-of-fit (GOF) test is employed to determine if the rank histogram is uniform to within some statistical certainty. Most often, the ?2 test is used to test whether the rank histogram is indistinguishable from a discrete uniform distribution. However, the ?2 test is insensitive to order and so suffers from troubling deficiencies that may render it unsuitable for rank histogram evaluation. As shown by examples in this paper, more powerful tests, suitable for small sample sizes, and very sensitive to the particular deficiencies that appear in rank histograms are available from the order-dependent Cramér?von Mises family of statistics, in particular, the Watson and Anderson?Darling statistics.

Download: (201.2Kb)
Show Full MetaData Hide Full MetaData
Item Order
Go To Publisher
Price: 5000 Rial
Statistics

Alternatives to the Chi-Square Test for Evaluating Rank Histograms from Ensemble Forecasts

URI

http://yetl.yabesh.ir/yetl1/handle/yetl/4231252

Collections

Weather and Forecasting

contributor author	Elmore, Kimberly L.
date accessioned	2017-06-09T17:35:01Z
date available	2017-06-09T17:35:01Z
date copyright	2005/10/01
date issued	2005
identifier issn	0882-8156
identifier other	ams-87569.pdf
identifier uri	http://onlinelibrary.yabesh.ir/handle/yetl/4231252
description abstract	Rank histograms are a commonly used tool for evaluating an ensemble forecasting system?s performance. Because the sample size is finite, the rank histogram is subject to statistical fluctuations, so a goodness-of-fit (GOF) test is employed to determine if the rank histogram is uniform to within some statistical certainty. Most often, the ?2 test is used to test whether the rank histogram is indistinguishable from a discrete uniform distribution. However, the ?2 test is insensitive to order and so suffers from troubling deficiencies that may render it unsuitable for rank histogram evaluation. As shown by examples in this paper, more powerful tests, suitable for small sample sizes, and very sensitive to the particular deficiencies that appear in rank histograms are available from the order-dependent Cramér?von Mises family of statistics, in particular, the Watson and Anderson?Darling statistics.
publisher	American Meteorological Society
title	Alternatives to the Chi-Square Test for Evaluating Rank Histograms from Ensemble Forecasts
type	Journal Paper
journal volume	20
journal issue	5
journal title	Weather and Forecasting
identifier doi	10.1175/WAF884.1
journal fristpage	789
journal lastpage	795
tree	Weather and Forecasting:;2005:;volume( 020 ):;issue: 005
contenttype	Fulltext

YaBeSH Engineering and Technology Library

Archive