Sign function

imported>Jacobolus
m Basic properties: "product of its absolute value and its sign function" -> "product of its absolute value and its sign"; the former is somewhere between confusing and wrong
 

Latest revision as of 00:11, 9 January 2025


Figure: the signum function $y = \operatorname{sgn} x$.

In mathematics, the sign function or signum function (from signum, Latin for "sign") is a function that has the value $+1$, $-1$ or $0$ according to whether a given real number is positive, negative, or zero. In mathematical notation the sign function is often represented as $\operatorname{sgn} x$ or $\operatorname{sgn}(x)$.[1]

Definition

The signum function of a real number $x$ is a piecewise function defined as follows:[1]
$$\operatorname{sgn} x := \begin{cases} -1 & \text{if } x < 0, \\ 0 & \text{if } x = 0, \\ 1 & \text{if } x > 0. \end{cases}$$

The law of trichotomy states that every real number must be positive, negative or zero. The signum function denotes which unique category a number falls into by mapping it to one of the values $+1$, $-1$ or $0$, which can then be used in mathematical expressions or further calculations.

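For example: $\operatorname{sgn}(2) = +1$, $\operatorname{sgn}(\pi) = +1$, $\operatorname{sgn}(-8) = -1$, $\operatorname{sgn}(-\tfrac{1}{2}) = -1$, $\operatorname{sgn}(0) = 0$.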
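The piecewise definition translates directly into code. The following is a minimal sketch in Python; the function name `sgn` is an illustrative choice, not a standard-library function.

```python
import math

def sgn(x: float) -> int:
    """Return -1, 0 or +1 according to the sign of the real number x."""
    if x < 0:
        return -1
    if x > 0:
        return 1
    return 0  # x == 0 (a NaN input also lands here; handle it separately if needed)

values = [2, math.pi, -8, -0.5, 0]
print([sgn(v) for v in values])  # prints [1, 1, -1, -1, 0]
```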

Basic properties

Any real number can be expressed as the product of its absolute value and its sign: $x = |x| \operatorname{sgn} x$.

It follows that whenever $x$ is not equal to $0$ we have $\operatorname{sgn} x = \dfrac{x}{|x|} = \dfrac{|x|}{x}$.

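Similarly, for any real number $x$, $|x| = x \operatorname{sgn} x$. The sign function is also multiplicative: $\operatorname{sgn}(xy) = (\operatorname{sgn} x)(\operatorname{sgn} y)$, and so $\operatorname{sgn}(x^n) = (\operatorname{sgn} x)^n$.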
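These identities can be spot-checked numerically. The sketch below assumes a small hand-picked list of sample values (`samples` is illustrative only) and is a sanity check rather than a proof.

```python
import itertools

def sgn(x):
    # Booleans act as 0/1 in arithmetic, so this yields -1, 0 or +1.
    return (x > 0) - (x < 0)

samples = [-3.5, -1, 0, 2, 7.25]

for x in samples:
    assert x == abs(x) * sgn(x)          # x = |x| sgn x
    assert abs(x) == x * sgn(x)          # |x| = x sgn x
    if x != 0:
        assert sgn(x) == x / abs(x) == abs(x) / x

for x, y in itertools.product(samples, repeat=2):
    assert sgn(x * y) == sgn(x) * sgn(y)  # multiplicativity

for x in samples:
    for n in range(1, 5):
        assert sgn(x ** n) == sgn(x) ** n  # sgn(x^n) = (sgn x)^n
```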

Some algebraic identities

The signum can also be written using the Iverson bracket notation: $\operatorname{sgn} x = -[x < 0] + [x > 0]$.

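The signum can also be written using the floor and the absolute value functions: $\operatorname{sgn} x = \left\lfloor \dfrac{x}{|x| + 1} \right\rfloor - \left\lfloor \dfrac{-x}{|x| + 1} \right\rfloor$. If $0^0$ is accepted to be equal to $1$, the signum can also be written for all real numbers as $\operatorname{sgn} x = 0^{(-x + |x|)} - 0^{(x + |x|)}$.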
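The three formulas can be compared side by side. In this sketch the helper names are hypothetical, and the last one relies on Python evaluating `0 ** 0` as `1`. The floor form is only reliable for moderate magnitudes: for very large floating-point $|x|$, the quotient $x / (|x| + 1)$ rounds to exactly $\pm 1$ and the formula breaks down.

```python
import math

def sgn_iverson(x):
    # int(condition) plays the role of the Iverson bracket [condition].
    return -int(x < 0) + int(x > 0)

def sgn_floor(x):
    return math.floor(x / (abs(x) + 1)) - math.floor(-x / (abs(x) + 1))

def sgn_zero_power(x):
    # Uses the convention 0 ** 0 == 1, which Python follows.
    return 0 ** (-x + abs(x)) - 0 ** (x + abs(x))

for x in [-4, -0.5, 0, 0.5, 4]:
    assert sgn_iverson(x) == sgn_floor(x) == sgn_zero_power(x)
```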

Properties in mathematical analysis

Discontinuity at zero

The sign function is not continuous at $x = 0$.

Although the sign function takes the value $-1$ when $x$ is negative, the ringed point $(0, -1)$ in the plot of $\operatorname{sgn} x$ indicates that this is not the case when $x = 0$. Instead, the value jumps abruptly to the solid point at $(0, 0)$, where $\operatorname{sgn}(0) = 0$. There is then a similar jump to $\operatorname{sgn}(x) = +1$ when $x$ is positive. Either jump demonstrates visually that the sign function $\operatorname{sgn} x$ is discontinuous at zero, even though it is continuous at any point where $x$ is either positive or negative.

These observations are confirmed by any of the various equivalent formal definitions of continuity in mathematical analysis. A function $f(x)$, such as $\operatorname{sgn}(x)$, is continuous at a point $x = a$ if the value $f(a)$ can be approximated arbitrarily closely by the sequence of values $f(a_1), f(a_2), f(a_3), \dots$, where the $a_n$ make up any infinite sequence which becomes arbitrarily close to $a$ as $n$ becomes sufficiently large. In the notation of mathematical limits, continuity of $f$ at $a$ requires that $f(a_n) \to f(a)$ as $n \to \infty$ for any sequence $(a_n)_{n=1}^{\infty}$ for which $a_n \to a$. The arrow symbol can be read to mean approaches, or tends to, and it applies to the sequence as a whole.

This criterion fails for the sign function at $a = 0$. For example, we can choose $a_n$ to be the sequence $1, \tfrac{1}{2}, \tfrac{1}{3}, \tfrac{1}{4}, \dots$, which tends towards zero as $n$ increases towards infinity. In this case, $a_n \to a$ as required, but $\operatorname{sgn}(a) = 0$ and $\operatorname{sgn}(a_n) = +1$ for each $n$, so that $\operatorname{sgn}(a_n) \to 1 \neq \operatorname{sgn}(a)$. This counterexample confirms more formally the discontinuity of $\operatorname{sgn} x$ at zero that is visible in the plot.
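The counterexample can be illustrated with the first few terms of the sequence. This is only a finite sketch (ten terms, an arbitrary choice), so it illustrates the argument rather than proving it.

```python
def sgn(x):
    return (x > 0) - (x < 0)

# First ten terms of the sequence 1, 1/2, 1/3, ...
a = [1 / n for n in range(1, 11)]

print([sgn(t) for t in a])  # every term of the sequence maps to 1
print(sgn(0))               # but the limit point a = 0 maps to 0
```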

Despite the sign function having a very simple form, the step change at zero causes difficulties for traditional calculus techniques, which are quite stringent in their requirements. Continuity is a frequent constraint. One solution can be to approximate the sign function by a smooth continuous function; others might involve less stringent approaches that build on classical methods to accommodate larger classes of function.

Smooth approximations and limits

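The signum function coincides with the limits $\operatorname{sgn} x = \lim_{n \to \infty} \dfrac{1 - 2^{-nx}}{1 + 2^{-nx}}$ and $\operatorname{sgn} x = \lim_{n \to \infty} \dfrac{2}{\pi} \arctan(nx) = \lim_{n \to \infty} \dfrac{2}{\pi} \tan^{-1}(nx)$, as well as

$\operatorname{sgn} x = \lim_{n \to \infty} \tanh(nx)$. Here, $\tanh(x)$ is the hyperbolic tangent, and the superscript $-1$ on $\tan$ is shorthand notation for the inverse of the trigonometric tangent function.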

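For $k > 1$, a smooth approximation of the sign function is $\operatorname{sgn} x \approx \tanh kx$, which becomes sharper as $k$ increases. Another approximation is $\operatorname{sgn} x \approx \dfrac{x}{\sqrt{x^2 + \varepsilon^2}}$, which gets sharper as $\varepsilon \to 0$; note that this is the derivative of $\sqrt{x^2 + \varepsilon^2}$. This is inspired by the fact that the expression is exactly equal to $\operatorname{sgn} x$ for all nonzero $x$ if $\varepsilon = 0$, and it has the advantage of simple generalization to higher-dimensional analogues of the sign function (for example, the partial derivatives of $\sqrt{x^2 + y^2}$).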
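Both smooth approximations are straightforward to evaluate. In the sketch below, the function names and the default values `k=10.0` and `eps=1e-3` are illustrative assumptions; larger `k` or smaller `eps` gives a sharper transition near zero.

```python
import math

def sgn_tanh(x, k=10.0):
    """Smooth approximation tanh(k x)."""
    return math.tanh(k * x)

def sgn_sqrt(x, eps=1e-3):
    """Smooth approximation x / sqrt(x^2 + eps^2)."""
    return x / math.sqrt(x * x + eps * eps)

# Both are exactly 0 at x = 0 and approach -1 or +1 away from it.
for x in (-1.0, -0.1, 0.0, 0.1, 1.0):
    print(x, sgn_tanh(x), sgn_sqrt(x))
```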


Differentiation

The signum function $\operatorname{sgn} x$ is differentiable everywhere except when $x = 0$. Its derivative is zero when $x$ is non-zero: $\dfrac{\mathrm{d}(\operatorname{sgn} x)}{\mathrm{d}x} = 0$ for $x \neq 0$.

This follows from the differentiability of any constant function, for which the derivative is always zero on its domain of definition. The signum $\operatorname{sgn} x$ acts as a constant function when it is restricted to the negative open region $x < 0$, where it equals $-1$. It can similarly be regarded as a constant function within the positive open region $x > 0$, where the corresponding constant is $+1$. Although these are two different constant functions, their derivative is equal to zero in each case.

It is not possible to define a classical derivative at $x = 0$, because there is a discontinuity there.

Although it is not differentiable at $x = 0$ in the ordinary sense, under the generalized notion of differentiation in distribution theory, the derivative of the signum function is two times the Dirac delta function. This can be demonstrated using the identity[2] $\operatorname{sgn} x = 2H(x) - 1$, where $H(x)$ is the Heaviside step function using the standard $H(0) = \tfrac{1}{2}$ formalism. Using this identity, it is easy to derive the distributional derivative:[3] $\dfrac{\mathrm{d} \operatorname{sgn} x}{\mathrm{d}x} = 2 \dfrac{\mathrm{d}H(x)}{\mathrm{d}x} = 2\delta(x)$.
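The same result can be sketched directly from the definition of a distributional derivative. Here $\varphi$ is an assumed test function (smooth, with compact support), a symbol introduced only for this illustration; the derivative is moved onto $\varphi$ and the integral is split at zero.

```latex
\begin{aligned}
\left\langle \frac{\mathrm{d}\operatorname{sgn}}{\mathrm{d}x}, \varphi \right\rangle
  &= -\int_{-\infty}^{\infty} \operatorname{sgn}(x)\, \varphi'(x)\, \mathrm{d}x \\
  &= \int_{-\infty}^{0} \varphi'(x)\, \mathrm{d}x - \int_{0}^{\infty} \varphi'(x)\, \mathrm{d}x \\
  &= \bigl(\varphi(0) - 0\bigr) - \bigl(0 - \varphi(0)\bigr) \\
  &= 2\varphi(0) = \langle 2\delta, \varphi \rangle .
\end{aligned}
```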

Integration

The signum function has a definite integral between any pair of finite values $a$ and $b$, even when the interval of integration includes zero. The resulting integral for $a$ and $b$ is then equal to the difference between their absolute values: $\displaystyle\int_a^b (\operatorname{sgn} x)\, \mathrm{d}x = |b| - |a|$.
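The formula can be checked numerically. The sketch below uses a plain midpoint rule (the helper `integrate_sgn` and its `steps` parameter are hypothetical names), so the two printed values agree only up to a discretization error of roughly one step width.

```python
def sgn(x):
    return (x > 0) - (x < 0)

def integrate_sgn(a, b, steps=100_000):
    """Midpoint-rule approximation of the integral of sgn over [a, b]."""
    h = (b - a) / steps  # negative when b < a, which flips the sign as expected
    return h * sum(sgn(a + (i + 0.5) * h) for i in range(steps))

# Compare the numerical integral with |b| - |a| on a few intervals,
# including ones that straddle zero and one with reversed limits.
for a, b in [(-2.0, 3.0), (1.0, 4.0), (-5.0, -1.0), (2.0, -3.0)]:
    print(a, b, integrate_sgn(a, b), abs(b) - abs(a))
```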

In fact, the signum function is the derivative of the absolute value function, except where there is an abrupt change in gradient at zero: $\dfrac{\mathrm{d}|x|}{\mathrm{d}x} = \operatorname{sgn} x$ for $x \neq 0$.

We can understand this as before by considering the definition of the absolute value $|x|$ on the separate regions $x > 0$ and $x < 0$. For example, the absolute value function is identical to $x$ in the region $x > 0$, whose derivative is the constant value $+1$, which equals the value of $\operatorname{sgn} x$ there.

Because the absolute value is a convex function, there is at least one subderivative at every point, including at the origin. Everywhere except zero, the resulting subdifferential consists of a single value, equal to the value of the sign function. In contrast, there are many subderivatives at zero, with just one of them taking the value $\operatorname{sgn}(0) = 0$. A subderivative value $0$ occurs here because the absolute value function is at a minimum. The full family of valid subderivatives at zero constitutes the subdifferential interval $[-1, 1]$, which might be thought of informally as "filling in" the graph of the sign function with a vertical line through the origin, making it continuous as a two-dimensional curve.

In integration theory, the signum function is a weak derivative of the absolute value function. Weak derivatives are equivalent if they are equal almost everywhere, making them impervious to isolated anomalies at a single point. This includes the change in gradient of the absolute value function at zero, which prohibits there being a classical derivative.

Fourier transform

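The Fourier transform of the signum function is[4] $\displaystyle \operatorname{PV} \int_{-\infty}^{\infty} (\operatorname{sgn} x)\, e^{-ikx}\, \mathrm{d}x = \frac{2}{ik}$ for $k \neq 0$, where $\operatorname{PV}$ means taking the Cauchy principal value.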

Generalizations

Complex signum

The signum function can be generalized to complex numbers as $\operatorname{sgn} z = \dfrac{z}{|z|}$ for any complex number $z$ except $z = 0$. The signum of a given complex number $z$ is the point on the unit circle of the complex plane that is nearest to $z$. Then, for $z \neq 0$, $\operatorname{sgn} z = e^{i \arg z}$, where $\arg$ is the complex argument function.

For reasons of symmetry, and to keep this a proper generalization of the signum function on the reals, one usually also defines in the complex domain, for $z = 0$: $\operatorname{sgn}(0 + 0i) = 0$.

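Another generalization of the sign function for real and complex expressions is $\operatorname{csgn}$,[5] which is defined as
$$\operatorname{csgn} z = \begin{cases} 1 & \text{if } \operatorname{Re}(z) > 0, \\ -1 & \text{if } \operatorname{Re}(z) < 0, \\ \operatorname{sgn} \operatorname{Im}(z) & \text{if } \operatorname{Re}(z) = 0, \end{cases}$$
where $\operatorname{Re}(z)$ is the real part of $z$ and $\operatorname{Im}(z)$ is the imaginary part of $z$.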

We then have (for $z \neq 0$): $\operatorname{csgn} z = \dfrac{z}{\sqrt{z^2}} = \dfrac{\sqrt{z^2}}{z}$.
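Both complex generalizations can be sketched with Python's built-in `complex` type and the standard `cmath` module. The function names `sgn_complex`, `sgn_real` and `csgn` are illustrative, and the comparison with $z / \sqrt{z^2}$ assumes the principal branch of the square root, which is what `cmath.sqrt` computes.

```python
import cmath

def sgn_complex(z: complex) -> complex:
    """z / |z| for nonzero z, and 0 for z == 0."""
    return z / abs(z) if z != 0 else 0j

def sgn_real(x):
    return (x > 0) - (x < 0)

def csgn(z) -> int:
    """Sign of Re(z), falling back to the sign of Im(z) when Re(z) == 0."""
    z = complex(z)
    if z.real != 0:
        return sgn_real(z.real)
    return sgn_real(z.imag)

z = 3 - 4j
print(sgn_complex(z))                   # point on the unit circle in the direction of z
print(cmath.exp(1j * cmath.phase(z)))   # e^{i arg z}: the same point, up to rounding
print(csgn(z), csgn(-2j), csgn(0))      # prints 1 -1 0
print(z / cmath.sqrt(z * z))            # agrees with csgn(z), up to rounding
```

Near the branch cut of the square root (purely imaginary $z$), the floating-point value of $z / \sqrt{z^2}$ can depend on the sign of a zero component, so the piecewise definition is the safer one to implement.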

Polar decomposition of matrices

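Thanks to the polar decomposition theorem, a matrix $\boldsymbol{A} \in \mathbb{K}^{n \times n}$ ($n \in \mathbb{N}$ and $\mathbb{K} \in \{\mathbb{R}, \mathbb{C}\}$) can be decomposed as a product $\boldsymbol{Q}\boldsymbol{P}$, where $\boldsymbol{Q}$ is a unitary matrix and $\boldsymbol{P}$ is a self-adjoint, or Hermitian, positive definite matrix, both in $\mathbb{K}^{n \times n}$. If $\boldsymbol{A}$ is invertible then such a decomposition is unique and $\boldsymbol{Q}$ plays the role of $\boldsymbol{A}$'s signum. A dual construction is given by the decomposition $\boldsymbol{A} = \boldsymbol{S}\boldsymbol{R}$, where $\boldsymbol{R}$ is unitary but generally different from $\boldsymbol{Q}$. This leads to each invertible matrix having a unique left-signum $\boldsymbol{Q}$ and right-signum $\boldsymbol{R}$.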

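In the special case where $\mathbb{K} = \mathbb{R}$, $n = 2$, and the (invertible) matrix $\boldsymbol{A} = \begin{bmatrix} a & -b \\ b & a \end{bmatrix}$ identifies with the (nonzero) complex number $a + ib = c$, the signum matrices satisfy $\boldsymbol{Q} = \boldsymbol{R} = \begin{bmatrix} a & -b \\ b & a \end{bmatrix} / |c|$ and identify with the complex signum of $c$, $\operatorname{sgn} c = c / |c|$. In this sense, polar decomposition generalizes to matrices the signum-modulus decomposition of complex numbers.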

Signum as a generalized function

At real values of $x$, it is possible to define a generalized-function version of the signum function, $\varepsilon(x)$, such that $\varepsilon(x)^2 = 1$ everywhere, including at the point $x = 0$, unlike $\operatorname{sgn}$, for which $(\operatorname{sgn} 0)^2 = 0$. This generalized signum allows construction of the algebra of generalized functions, but the price of such generalization is the loss of commutativity. In particular, the generalized signum anticommutes with the Dirac delta function:[6] $\varepsilon(x)\delta(x) + \delta(x)\varepsilon(x) = 0$. In addition, $\varepsilon(x)$ cannot be evaluated at $x = 0$, and the special name $\varepsilon$ is necessary to distinguish it from the function $\operatorname{sgn}$. ($\varepsilon(0)$ is not defined, but $\operatorname{sgn} 0 = 0$.)

See also

Notes