Scatter matrix

From testwiki
Revision as of 17:30, 15 January 2024 by imported>BD2412 (Definition: clean up spacing around commas and other punctuation fixes, replaced: ,C → , C (2), ,X → , X, , → , (2))
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Template:Short description

For the notion in quantum mechanics, see scattering matrix.

In multivariate statistics and probability theory, the scatter matrix is a statistic that is used to make estimates of the covariance matrix, for instance of the multivariate normal distribution.

Definition

Given n samples of m-dimensional data, represented as the m-by-n matrix, X=[𝐱1,𝐱2,,𝐱n], the sample mean is

𝐱=1nj=1n𝐱j

where 𝐱j is the j-th column of X.[1]

The scatter matrix is the m-by-m positive semi-definite matrix

S=j=1n(𝐱j𝐱)(𝐱j𝐱)T=j=1n(𝐱j𝐱)(𝐱j𝐱)=(j=1n𝐱j𝐱jT)n𝐱𝐱T

where ()T denotes matrix transpose,[2] and multiplication is with regards to the outer product. The scatter matrix may be expressed more succinctly as

S=XCnXT

where Cn is the n-by-n centering matrix.

Application

The maximum likelihood estimate, given n samples, for the covariance matrix of a multivariate normal distribution can be expressed as the normalized scatter matrix

CML=1nS.[3]

When the columns of X are independently sampled from a multivariate normal distribution, then S has a Wishart distribution.

See also

References

Template:Reflist


Template:Statistics-stub Template:Matrix-stub