Getis–Ord statistics

From testwiki
Jump to navigation Jump to search

Template:Short description

Getis–Ord statistics, also known as Gi*, are used in spatial analysis to measure the local and global spatial autocorrelation. Developed by statisticians Arthur Getis and J. Keith Ord they are commonly used for Hot Spot Analysis[1][2] to identify where features with high or low values are spatially clustered in a statistically significant way. Getis-Ord statistics are available in a number of software libraries such as CrimeStat, GeoDa, ArcGIS, PySAL[3] and R.[4][5]

Local statistics

Hot spot map showing hot and cold spots in the 2020 USA Contiguous Unemployment Rate, calculated using Getis Ord Gi*

There are two different versions of the statistic, depending on whether the data point at the target location i is included or not[6]

Gi=jiwijxjjixj
Gi=jwijxjjxj

Here xi is the value observed at the ith spatial site and wij is the spatial weight matrix which constrains which sites are connected to one another. For Gi the denominator is constant across all observations.

A value larger (or smaller) than the mean suggests a hot (or cold) spot corresponding to a high-high (or low-low) cluster. Statistical significance can be estimated using analytical approximations as in the original work[7][8] however in practice permutation testing is used to obtain more reliable estimates of significance for statistical inference.[6]

Global statistics

The Getis-Ord statistics of overall spatial association are[7][9]

G=ij,ijwijxixjij,ijxixj
G=ijwijxixjijxixj

The local and global G statistics are related through the weighted average

ixiGiixi=ijxiwijxjixijxj=G

The relationship of the G and Gi statistics is more complicated due to the dependence of the denominator of Gi on i.

Relation to Moran's I

Moran's I is another commonly used measure of spatial association defined by

I=NWijwij(xix¯)(xjx¯)i(xix¯)2

where N is the number of spatial sites and W=ijwij is the sum of the entries in the spatial weight matrix. Getis and Ord show[7] that

I=(K1/K2)GK2x¯i(wi+wi)xi+K2x¯2W

Where wi=jwij, wi=jwji, K1=(ij,ijxixj)1 and K2=WN(i(xix¯)2)1. They are equal if wij=w is constant, but not in general.

Ord and Getis[8] also show that Moran's I can be written in terms of Gi

I=1W(iziViGiN)

where zi=(xix¯)/s, s is the standard deviation of x and

Vi2=1N1j(wij1Nkwik)2

is an estimate of the variance of wij.

See also

References

Template:Reflist