Getis–Ord statistics

Template:Short description

Getis–Ord statistics, also known as G_i^*, are used in spatial analysis to measure the local and global spatial autocorrelation. Developed by statisticians Arthur Getis and J. Keith Ord they are commonly used for Hot Spot Analysis^[1]^[2] to identify where features with high or low values are spatially clustered in a statistically significant way. Getis-Ord statistics are available in a number of software libraries such as CrimeStat, GeoDa, ArcGIS, PySAL^[3] and R.^[4]^[5]

Local statistics

There are two different versions of the statistic, depending on whether the data point at the target location $i$ is included or not^[6]

G_{i} = \frac{\sum_{j \neq i} w_{i j} x_{j}}{\sum_{j \neq i} x_{j}}

G_{i}^{*} = \frac{\sum_{j} w_{i j} x_{j}}{\sum_{j} x_{j}}

Here $x_{i}$ is the value observed at the $i^{t h}$ spatial site and $w_{i j}$ is the spatial weight matrix which constrains which sites are connected to one another. For $G_{i}^{*}$ the denominator is constant across all observations.

A value larger (or smaller) than the mean suggests a hot (or cold) spot corresponding to a high-high (or low-low) cluster. Statistical significance can be estimated using analytical approximations as in the original work^[7]^[8] however in practice permutation testing is used to obtain more reliable estimates of significance for statistical inference.^[6]

Global statistics

The Getis-Ord statistics of overall spatial association are^[7]^[9]

G = \frac{\sum_{i j, i \neq j} w_{i j} x_{i} x_{j}}{\sum_{i j, i \neq j} x_{i} x_{j}}

G^{*} = \frac{\sum_{i j} w_{i j} x_{i} x_{j}}{\sum_{i j} x_{i} x_{j}}

The local and global $G^{*}$ statistics are related through the weighted average

\frac{\sum_{i} x_{i} G_{i}^{*}}{\sum_{i} x_{i}} = \frac{\sum_{i j} x_{i} w_{i j} x_{j}}{\sum_{i} x_{i} \sum_{j} x_{j}} = G^{*}

The relationship of the $G$ and $G_{i}$ statistics is more complicated due to the dependence of the denominator of $G_{i}$ on $i$ .

Relation to Moran's I

Moran's I is another commonly used measure of spatial association defined by

I = \frac{N}{W} \frac{\sum_{i j} w_{i j} (x_{i} - \bar{x}) (x_{j} - \bar{x})}{\sum_{i} (x_{i} - \bar{x})^{2}}

where $N$ is the number of spatial sites and $W = \sum_{i j} w_{i j}$ is the sum of the entries in the spatial weight matrix. Getis and Ord show^[7] that

I = (K_{1} / K_{2}) G - K_{2} \bar{x} \sum_{i} (w_{i \cdot} + w_{\cdot i}) x_{i} + K_{2} {\bar{x}}^{2} W

Where $w_{i \cdot} = \sum_{j} w_{i j}$ , $w_{\cdot i} = \sum_{j} w_{j i}$ , $K_{1} = {(\sum_{i j, i \neq j} x_{i} x_{j})}^{- 1}$ and $K_{2} = \frac{W}{N} {(\sum_{i} (x_{i} - \bar{x})^{2})}^{- 1}$ . They are equal if $w_{i j} = w$ is constant, but not in general.

Ord and Getis^[8] also show that Moran's I can be written in terms of $G_{i}^{*}$

I = \frac{1}{W} (\sum_{i} z_{i} V_{i} G_{i}^{*} - N)

where $z_{i} = (x_{i} - \bar{x}) / s$ , $s$ is the standard deviation of $x$ and

V_{i}^{2} = \frac{1}{N - 1} \sum_{j} {(w_{i j} - \frac{1}{N} \sum_{k} w_{i k})}^{2}

is an estimate of the variance of $w_{i j}$ .

References

Template:Reflist

[1] Template:Cite web

[2] Template:Cite web

[3] ttps://pysal.org/

[4] Template:Cite web

[5] Template:Cite journal

[geoda-6] 6.0 ^6.1 Template:Cite web

[go1-7] 7.0 ^7.1 ^7.2 Template:Cite journal

[go2-8] 8.0 ^8.1 Template:Cite journal

[9] Template:Cite web

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

Getis–Ord statistics

Contents

Local statistics

Global statistics

Relation to Moran's I

See also

References

Navigation menu

Getis–Ord statistics

Local statistics

Global statistics

Relation to Moran's I

See also

References

Navigation menu

Search