
contributor author: Fovell, Robert G.
date accessioned: 2017-06-09T15:35:21Z
date available: 2017-06-09T15:35:21Z
date copyright: 1997/06/01
date issued: 1997
identifier issn: 0894-8755
identifier other: ams-4793.pdf
identifier uri: http://onlinelibrary.yabesh.ir/handle/yetl/4187211
description abstract: A "consensus clustering" strategy is applied to long-term temperature and precipitation time series data for the purpose of delineating climate zones of the conterminous United States in a "data-driven" (as opposed to "rule-driven") fashion. Cluster analysis simplifies a dataset by arranging "objects" (here, climate divisions or stations) into a smaller number of relatively homogeneous groups or clusters on the basis of interobject dissimilarities computed using the identified "attributes" (here, temperature and precipitation measurements recorded for the objects). The results demonstrate the spatial scales associated with climatic variability and may suggest climatically justified ways in which the number of objects in a dataset may be reduced. Implicit in this work is the arguable contention that temperature and precipitation data are both necessary and sufficient for the delineation of climatic zones. In prior work, the temperature and precipitation data were mixed during the computation of the interobject dissimilarities. This allowed the clusters to jointly reflect temperature and precipitation distinctions, but also had inherent problems relating to arbitrary attribute scaling and information redundancy that proved difficult to resolve. In the present approach, the temperature and precipitation data are clustered separately and then categorically intersected to forge consensus clusters. The consensus outcome may be viewed as having identified the temperature subzones of precipitation clusters (or vice versa) or as representing distinct groupings that are relatively homogeneous with respect to both attribute types simultaneously. The dissimilarity measure employed herein is the Euclidean distance.
As it employs only continuous time series data representing a single information type (temperature or precipitation), the consensus approach has the advantage of allowing an attractively simple interpretation of the total Euclidean distance between object pairs. The total squared distance may be subdivided into three components representing object dissimilarity with respect to temporal mean (level), seasonality (variability), and coseasonality (relative temporal phasing). Therefore, concerns about redundancy or arbitrary scaling problems are neutralized. This is seen as the chief advantage of consensus clustering. The consensus strategy has several disadvantages. It is possible for two (or more) relatively general, undetailed clusterings to produce a very complex and fragmented clustering following categorical intersection. Further, the fact that the analyst chooses the clustering levels of the separate, contributing clusterings means that he or she has considerable freedom in fashioning the consensus outcome, which makes it difficult (if not impossible) to argue that true, "natural" clusters have been identified. The latter often applies to cluster analysis in general, however. It is believed that the consensus approach merits consideration owing to its advantages. Two consensus outcomes are presented: a lower-order solution with 14 clusters and a higher-order solution with 26 clusters. The sensitivity of these clusterings to perturbations in the input data is assessed. The regionalizations are compared with those presented in prior work.
publisher: American Meteorological Society
title: Consensus Clustering of U.S. Temperature and Precipitation Data
type: Journal Paper
journal volume: 10
journal issue: 6
journal title: Journal of Climate
identifier doi: 10.1175/1520-0442(1997)010<1405:CCOUST>2.0.CO;2
journal firstpage: 1405
journal lastpage: 1427
tree: Journal of Climate; 1997; volume 010; issue 006
contenttype: Fulltext
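
The two mechanisms named in the abstract can be illustrated with a minimal sketch. It is not the paper's code, and the abstract does not give the paper's formulas. The three-way split below is the standard algebraic identity for the squared Euclidean distance between two series of length n, assumed here to correspond to the level, variability, and phasing components: n(mean_x - mean_y)^2 + n(s_x - s_y)^2 + 2n(s_x s_y - cov_xy), with population (divide-by-n) statistics. The function names distance_components and consensus_labels, and the toy label lists, are hypothetical.

```python
import math


def distance_components(x, y):
    """Split the squared Euclidean distance between two equal-length
    series into level, variability, and phasing terms (assumed form)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    anom_x = [v - mean_x for v in x]
    anom_y = [v - mean_y for v in y]
    # Population standard deviations and covariance (divide by n).
    sd_x = math.sqrt(sum(a * a for a in anom_x) / n)
    sd_y = math.sqrt(sum(b * b for b in anom_y) / n)
    cov = sum(a * b for a, b in zip(anom_x, anom_y)) / n
    level = n * (mean_x - mean_y) ** 2        # difference in temporal mean
    variability = n * (sd_x - sd_y) ** 2      # difference in amplitude
    phasing = 2 * n * (sd_x * sd_y - cov)     # zero when perfectly correlated
    return level, variability, phasing


def consensus_labels(temp_labels, precip_labels):
    """Categorically intersect two clusterings: objects share a consensus
    cluster only if they share both a temperature and a precipitation cluster."""
    ids = {}
    out = []
    for pair in zip(temp_labels, precip_labels):
        if pair not in ids:
            ids[pair] = len(ids)  # number consensus clusters in order seen
        out.append(ids[pair])
    return out


# Toy data: the three components add up to the total squared distance.
x = [1.0, 2.0, 3.0, 4.0]
y = [2.0, 2.0, 5.0, 3.0]
total = sum((a - b) ** 2 for a, b in zip(x, y))
assert math.isclose(sum(distance_components(x, y)), total)

# Two 2-cluster solutions intersect into 4 consensus clusters.
print(consensus_labels([0, 0, 1, 1, 1], [0, 1, 1, 1, 0]))  # [0, 1, 2, 2, 3]
```

The toy intersection also shows the fragmentation the abstract warns about: two coarse two-cluster solutions already yield four consensus clusters for five objects.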

