Stress majorization

Template:Short description Stress majorization is an optimization strategy used in multidimensional scaling (MDS) where, for a set of $n$ $m$ -dimensional data items, a configuration $X$ of $n$ points in $r$ $(≪ m)$ -dimensional space is sought that minimizes the so-called stress function $σ (X)$ . Usually $r$ is $2$ or $3$ , i.e. the $(n \times r)$ matrix $X$ lists points in $2 -$ or $3 -$ dimensional Euclidean space so that the result may be visualised (i.e. an MDS plot). The function $σ$ is a cost or loss function that measures the squared differences between ideal ( $m$ -dimensional) distances and actual distances in r-dimensional space. It is defined as:

σ (X) = \sum_{i < j \leq n} w_{i j} (d_{i j} (X) - δ_{i j})^{2}

where $w_{i j} \geq 0$ is a weight for the measurement between a pair of points $(i, j)$ , $d_{i j} (X)$ is the euclidean distance between $i$ and $j$ and $δ_{i j}$ is the ideal distance between the points (their separation) in the $m$ -dimensional data space. Note that $w_{i j}$ can be used to specify a degree of confidence in the similarity between points (e.g. 0 can be specified if there is no information for a particular pair).

A configuration $X$ which minimizes $σ (X)$ gives a plot in which points that are close together correspond to points that are also close together in the original $m$ -dimensional data space.

There are many ways that $σ (X)$ could be minimized. For example, Kruskal^[1] recommended an iterative steepest descent approach. However, a significantly better (in terms of guarantees on, and rate of, convergence) method for minimizing stress was introduced by Jan de Leeuw.^[2] De Leeuw's iterative majorization method at each step minimizes a simple convex function which both bounds $σ$ from above and touches the surface of $σ$ at a point $Z$ , called the supporting point. In convex analysis such a function is called a majorizing function. This iterative majorization process is also referred to as the SMACOF algorithm ("Scaling by MAjorizing a COmplicated Function").

The SMACOF algorithm

The stress function $σ$ can be expanded as follows:

σ (X) = \sum_{i < j \leq n} w_{i j} (d_{i j} (X) - δ_{i j})^{2} = \sum_{i < j} w_{i j} δ_{i j}^{2} + \sum_{i < j} w_{i j} d_{i j}^{2} (X) - 2 \sum_{i < j} w_{i j} δ_{i j} d_{i j} (X)

Note that the first term is a constant $C$ and the second term is quadratic in $X$ (i.e. for the Hessian matrix $V$ the second term is equivalent to tr $X^{'} V X$ ) and therefore relatively easily solved. The third term is bounded by:

\sum_{i < j} w_{i j} δ_{i j} d_{i j} (X) = tr X^{'} B (X) X \geq tr X^{'} B (Z) Z

where $B (Z)$ has:

b_{i j} = - \frac{w_{i j} δ_{i j}}{d_{i j} (Z)}

for

d_{i j} (Z) \neq 0, i \neq j

and $b_{i j} = 0$ for $d_{i j} (Z) = 0, i \neq j$

and $b_{i i} = - \sum_{j = 1, j \neq i}^{n} b_{i j}$ .

Proof of this inequality is by the Cauchy-Schwarz inequality, see Borg^[3] (pp. 152–153).

Thus, we have a simple quadratic function $τ (X, Z)$ that majorizes stress:

σ (X) = C + tr X^{'} V X - 2 tr X^{'} B (X) X

\leq C + tr X^{'} V X - 2 tr X^{'} B (Z) Z = τ (X, Z)

The iterative minimization procedure is then:

at the $k^{t h}$ step we set $Z \leftarrow X^{k - 1}$
$X^{k} \leftarrow \min_{X} τ (X, Z)$
stop if $σ (X^{k - 1}) - σ (X^{k}) < ϵ$ otherwise repeat.

This algorithm has been shown to decrease stress monotonically (see de Leeuw^[2]).

Use in graph drawing

Stress majorization and algorithms similar to SMACOF also have application in the field of graph drawing.^[4]^[5] That is, one can find a reasonably aesthetically appealing layout for a network or graph by minimizing a stress function over the positions of the nodes in the graph. In this case, the $δ_{i j}$ are usually set to the graph-theoretic distances between nodes $i$ and $j$ and the weights $w_{i j}$ are taken to be $δ_{i j}^{- α}$ . Here, $α$ is chosen as a trade-off between preserving long- or short-range ideal distances. Good results have been shown for $α = 2$ .^[6]

References

Template:Reflist

[1] Template:Citation.

[de_Leeuw-2] 2.0 ^2.1 Template:Citation.

[borg-3] Template:Citation.

[4] Template:Citation.

[5] Template:Citation.

[6] Template:Citation.

[1]

[2]

[3]

[4]

[5]

[6]

Stress majorization

The SMACOF algorithm

Use in graph drawing

References

Navigation menu

Search