Getis–Ord statistics

From testwiki
Revision as of 06:29, 9 May 2024 by imported>WikiCleanerBot (v2.05b - Bot T20 CW#61 - Fix errors for CW project (Reference before punctuation))
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Template:Short description

Getis–Ord statistics, also known as Gi*, are used in spatial analysis to measure the local and global spatial autocorrelation. Developed by statisticians Arthur Getis and J. Keith Ord they are commonly used for Hot Spot Analysis[1][2] to identify where features with high or low values are spatially clustered in a statistically significant way. Getis-Ord statistics are available in a number of software libraries such as CrimeStat, GeoDa, ArcGIS, PySAL[3] and R.[4][5]

Local statistics

Hot spot map showing hot and cold spots in the 2020 USA Contiguous Unemployment Rate, calculated using Getis Ord Gi*

There are two different versions of the statistic, depending on whether the data point at the target location i is included or not[6]

Gi=jiwijxjjixj
Gi*=jwijxjjxj

Here xi is the value observed at the ith spatial site and wij is the spatial weight matrix which constrains which sites are connected to one another. For Gi* the denominator is constant across all observations.

A value larger (or smaller) than the mean suggests a hot (or cold) spot corresponding to a high-high (or low-low) cluster. Statistical significance can be estimated using analytical approximations as in the original work[7][8] however in practice permutation testing is used to obtain more reliable estimates of significance for statistical inference.[6]

Global statistics

The Getis-Ord statistics of overall spatial association are[7][9]

G=ij,ijwijxixjij,ijxixj
G*=ijwijxixjijxixj

The local and global G* statistics are related through the weighted average

ixiGi*ixi=ijxiwijxjixijxj=G*

The relationship of the G and Gi statistics is more complicated due to the dependence of the denominator of Gi on i.

Relation to Moran's I

Moran's I is another commonly used measure of spatial association defined by

I=NWijwij(xix¯)(xjx¯)i(xix¯)2

where N is the number of spatial sites and W=ijwij is the sum of the entries in the spatial weight matrix. Getis and Ord show[7] that

I=(K1/K2)GK2x¯i(wi+wi)xi+K2x¯2W

Where wi=jwij, wi=jwji, K1=(ij,ijxixj)1 and K2=WN(i(xix¯)2)1. They are equal if wij=w is constant, but not in general.

Ord and Getis[8] also show that Moran's I can be written in terms of Gi*

I=1W(iziViGi*N)

where zi=(xix¯)/s, s is the standard deviation of x and

Vi2=1N1j(wij1Nkwik)2

is an estimate of the variance of wij.

See also

References

Template:Reflist