Alternatives to the Chi-Square Test for Evaluating Rank Histograms from Ensemble ForecastsSource: Weather and Forecasting:;2005:;volume( 020 ):;issue: 005::page 789Author:Elmore, Kimberly L.
DOI: 10.1175/WAF884.1Publisher: American Meteorological Society
Abstract: Rank histograms are a commonly used tool for evaluating an ensemble forecasting system?s performance. Because the sample size is finite, the rank histogram is subject to statistical fluctuations, so a goodness-of-fit (GOF) test is employed to determine if the rank histogram is uniform to within some statistical certainty. Most often, the ?2 test is used to test whether the rank histogram is indistinguishable from a discrete uniform distribution. However, the ?2 test is insensitive to order and so suffers from troubling deficiencies that may render it unsuitable for rank histogram evaluation. As shown by examples in this paper, more powerful tests, suitable for small sample sizes, and very sensitive to the particular deficiencies that appear in rank histograms are available from the order-dependent Cramér?von Mises family of statistics, in particular, the Watson and Anderson?Darling statistics.
|
Collections
Show full item record
contributor author | Elmore, Kimberly L. | |
date accessioned | 2017-06-09T17:35:01Z | |
date available | 2017-06-09T17:35:01Z | |
date copyright | 2005/10/01 | |
date issued | 2005 | |
identifier issn | 0882-8156 | |
identifier other | ams-87569.pdf | |
identifier uri | http://onlinelibrary.yabesh.ir/handle/yetl/4231252 | |
description abstract | Rank histograms are a commonly used tool for evaluating an ensemble forecasting system?s performance. Because the sample size is finite, the rank histogram is subject to statistical fluctuations, so a goodness-of-fit (GOF) test is employed to determine if the rank histogram is uniform to within some statistical certainty. Most often, the ?2 test is used to test whether the rank histogram is indistinguishable from a discrete uniform distribution. However, the ?2 test is insensitive to order and so suffers from troubling deficiencies that may render it unsuitable for rank histogram evaluation. As shown by examples in this paper, more powerful tests, suitable for small sample sizes, and very sensitive to the particular deficiencies that appear in rank histograms are available from the order-dependent Cramér?von Mises family of statistics, in particular, the Watson and Anderson?Darling statistics. | |
publisher | American Meteorological Society | |
title | Alternatives to the Chi-Square Test for Evaluating Rank Histograms from Ensemble Forecasts | |
type | Journal Paper | |
journal volume | 20 | |
journal issue | 5 | |
journal title | Weather and Forecasting | |
identifier doi | 10.1175/WAF884.1 | |
journal fristpage | 789 | |
journal lastpage | 795 | |
tree | Weather and Forecasting:;2005:;volume( 020 ):;issue: 005 | |
contenttype | Fulltext |