Examination of Numerical Results from Tangent Linear and Adjoint of Discontinuous Nonlinear Models

Zhang, S.; Zou, X.; Ahlquist, Jon E.

Source: Monthly Weather Review:;2001:;volume( 129 ):;issue: 011::page 2791

Author:

Zhang, S.

Zou, X.

Ahlquist, Jon E.

DOI: 10.1175/1520-0493(2001)129<2791:EONRFT>2.0.CO;2

Publisher: American Meteorological Society

Abstract: The forward model solution and its functional (e.g., the cost function in 4DVAR) are discontinuous with respect to the model's control variables if the model contains discontinuous physical processes that occur during the assimilation window. In such a case, the tangent linear model (the first-order approximation of a finite perturbation) is unable to represent the sharp jumps of the nonlinear model solution. Also, the first-order approximation provided by the adjoint model is unable to represent a finite perturbation of the cost function when the introduced perturbation in the control variables crosses discontinuous points. Using an idealized simple model and the Arakawa?Schubert cumulus parameterization scheme, the authors examined the behavior of a cost function and its gradient obtained by the adjoint model with discontinuous model physics. Numerical results show that a cost function involving discontinuous physical processes is zeroth-order discontinuous, but piecewise differentiable. The maximum possible number of involved discontinuity points of a cost function increases exponentially as 2kn, where k is the total number of thresholds associated with on?off switches, and n is the total number of time steps in the assimilation window. A backward adjoint model integration with the proper forcings added at various time steps, similar to the backward adjoint model integration that provides the gradient of the cost function at a continuous point, produces a one-sided gradient (called a subgradient and denoted as ?sJ) at a discontinuous point. An accuracy check of the gradient shows that the adjoint-calculated gradient is computed exactly on either side of a discontinuous surface. While a cost function evaluated using a small interval in the control variable space oscillates, the distribution of the gradient calculated at the same resolution not only shows a rather smooth variation, but also is consistent with the general convexity of the original cost function. The gradients of discontinuous cost functions are observed roughly smooth since the adjoint integration correctly computes the one-sided gradient at either side of discontinuous surface. This implies that, although (?sJ)Tδx may not approximate δJ = J(x + δx) ? J(x) well near the discontinuous surface, the subgradient calculated by the adjoint of discontinuous physics may still provide useful information for finding the search directions in a minimization procedure. While not eliminating the possible need for the use of a nondifferentiable optimization algorithm for 4DVAR with discontinuous physics, consistency between the computed gradient by adjoints and the convexity of the cost function may explain why a differentiable limited-memory quasi-Newton algorithm still worked well in many 4DVAR experiments that use a diabatic assimilation model with discontinuous physics.

Download: (236.4Kb)
Show Full MetaData Hide Full MetaData
Item Order
Go To Publisher
Price: 5000 Rial
Statistics

Examination of Numerical Results from Tangent Linear and Adjoint of Discontinuous Nonlinear Models

URI

http://yetl.yabesh.ir/yetl1/handle/yetl/4204868

Collections

Monthly Weather Review

Show full item record

contributor author	Zhang, S.
contributor author	Zou, X.
contributor author	Ahlquist, Jon E.
date accessioned	2017-06-09T16:14:01Z
date available	2017-06-09T16:14:01Z
date copyright	2001/11/01
date issued	2001
identifier issn	0027-0644
identifier other	ams-63822.pdf
identifier uri	http://onlinelibrary.yabesh.ir/handle/yetl/4204868
description abstract	The forward model solution and its functional (e.g., the cost function in 4DVAR) are discontinuous with respect to the model's control variables if the model contains discontinuous physical processes that occur during the assimilation window. In such a case, the tangent linear model (the first-order approximation of a finite perturbation) is unable to represent the sharp jumps of the nonlinear model solution. Also, the first-order approximation provided by the adjoint model is unable to represent a finite perturbation of the cost function when the introduced perturbation in the control variables crosses discontinuous points. Using an idealized simple model and the Arakawa?Schubert cumulus parameterization scheme, the authors examined the behavior of a cost function and its gradient obtained by the adjoint model with discontinuous model physics. Numerical results show that a cost function involving discontinuous physical processes is zeroth-order discontinuous, but piecewise differentiable. The maximum possible number of involved discontinuity points of a cost function increases exponentially as 2kn, where k is the total number of thresholds associated with on?off switches, and n is the total number of time steps in the assimilation window. A backward adjoint model integration with the proper forcings added at various time steps, similar to the backward adjoint model integration that provides the gradient of the cost function at a continuous point, produces a one-sided gradient (called a subgradient and denoted as ?sJ) at a discontinuous point. An accuracy check of the gradient shows that the adjoint-calculated gradient is computed exactly on either side of a discontinuous surface. While a cost function evaluated using a small interval in the control variable space oscillates, the distribution of the gradient calculated at the same resolution not only shows a rather smooth variation, but also is consistent with the general convexity of the original cost function. The gradients of discontinuous cost functions are observed roughly smooth since the adjoint integration correctly computes the one-sided gradient at either side of discontinuous surface. This implies that, although (?sJ)Tδx may not approximate δJ = J(x + δx) ? J(x) well near the discontinuous surface, the subgradient calculated by the adjoint of discontinuous physics may still provide useful information for finding the search directions in a minimization procedure. While not eliminating the possible need for the use of a nondifferentiable optimization algorithm for 4DVAR with discontinuous physics, consistency between the computed gradient by adjoints and the convexity of the cost function may explain why a differentiable limited-memory quasi-Newton algorithm still worked well in many 4DVAR experiments that use a diabatic assimilation model with discontinuous physics.
publisher	American Meteorological Society
title	Examination of Numerical Results from Tangent Linear and Adjoint of Discontinuous Nonlinear Models
type	Journal Paper
journal volume	129
journal issue	11
journal title	Monthly Weather Review
identifier doi	10.1175/1520-0493(2001)129<2791:EONRFT>2.0.CO;2
journal fristpage	2791
journal lastpage	2804
tree	Monthly Weather Review:;2001:;volume( 129 ):;issue: 011
contenttype	Fulltext

YaBeSH Engineering and Technology Library

Archive