Common spatial pattern

Two sets of overlapping data used to illustrate how CSP can separate the data.
Two sets of data after rotation by CSP to maximize the ratio of the variances along the two axes.

Common spatial pattern (CSP) is a mathematical procedure used in signal processing for separating a multivariate signal into additive subcomponents which have maximum differences in variance between two windows.[1]

Details

Let $\mathbf{X}_1$ of size $(n, t_1)$ and $\mathbf{X}_2$ of size $(n, t_2)$ be two windows of a multivariate signal, where $n$ is the number of signals and $t_1$ and $t_2$ are the respective numbers of samples.

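The CSP algorithm determines the component $\mathbf{w}^\mathsf{T}$ such that the ratio of variance (or second-order moment) is maximized between the two windows, where $\mathbf{w}$ is a row vector of size $(1, n)$:

$$\mathbf{w} = \arg\max_{\mathbf{w}} \frac{\|\mathbf{w}\mathbf{X}_1\|^2}{\|\mathbf{w}\mathbf{X}_2\|^2}$$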

The solution is given by computing the two covariance matrices:

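$$\mathbf{R}_1 = \frac{\mathbf{X}_1\mathbf{X}_1^\mathsf{T}}{t_1}$$
$$\mathbf{R}_2 = \frac{\mathbf{X}_2\mathbf{X}_2^\mathsf{T}}{t_2}$$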

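Then the two matrices are simultaneously diagonalized (a procedure also called generalized eigenvalue decomposition). We find the matrix of eigenvectors $\mathbf{P} = [\mathbf{p}_1 \cdots \mathbf{p}_n]$ and the diagonal matrix $\mathbf{D}$ of eigenvalues $\{\lambda_1, \cdots, \lambda_n\}$, sorted in decreasing order, such that:

$$\mathbf{P}^\mathsf{T}\mathbf{R}_1\mathbf{P} = \mathbf{D}$$

and

$$\mathbf{P}^\mathsf{T}\mathbf{R}_2\mathbf{P} = \mathbf{I}_n$$

with $\mathbf{I}_n$ the identity matrix.

This is equivalent to the eigendecomposition of $\mathbf{R}_2^{-1}\mathbf{R}_1$:

$$\mathbf{R}_2^{-1}\mathbf{R}_1 = \mathbf{P}\mathbf{D}\mathbf{P}^{-1}$$

$\mathbf{w}^\mathsf{T}$ corresponds to the first column of $\mathbf{P}$:

$$\mathbf{w} = \mathbf{p}_1^\mathsf{T}$$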
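
The steps above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: it assumes NumPy and SciPy are available, the function name `csp`, the mixing matrix `A` and the synthetic windows are hypothetical, and it relies on `scipy.linalg.eigh` solving the generalized symmetric problem with eigenvectors normalized against the second matrix.

```python
import numpy as np
from scipy.linalg import eigh


def csp(X1, X2):
    """Return (P, eigenvalues) for windows X1 of size (n, t1) and X2 of size (n, t2)."""
    # Covariance (second-order moment) matrices of the two windows.
    R1 = X1 @ X1.T / X1.shape[1]
    R2 = X2 @ X2.T / X2.shape[1]
    # Generalized eigenproblem R1 p = lambda R2 p. eigh returns eigenvalues
    # in ascending order and scales eigenvectors so that P.T @ R2 @ P = I.
    eigvals, P = eigh(R1, R2)
    # Re-sort in decreasing order, as in the text.
    order = np.argsort(eigvals)[::-1]
    return P[:, order], eigvals[order]


# Synthetic data (assumption): four sources mixed by a random matrix A,
# with the first source stronger in window 1 than in window 2.
rng = np.random.default_rng(0)
n, t1, t2 = 4, 2000, 1500
A = rng.standard_normal((n, n))
scales = np.array([[3.0], [1.0], [1.0], [0.5]])
X1 = A @ (scales * rng.standard_normal((n, t1)))
X2 = A @ rng.standard_normal((n, t2))

P, lam = csp(X1, X2)
w = P[:, 0]  # first column of P, i.e. w^T in the notation above
```

Solving the generalized problem directly avoids forming the inverse of the second covariance matrix explicitly, which is usually preferable numerically when that matrix is poorly conditioned.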


Discussion

Relation between variance ratio and eigenvalue

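The eigenvectors composing $\mathbf{P}$ are components whose variance ratio between the two windows is equal to their corresponding eigenvalue. Since $\mathbf{p}_i^\mathsf{T}\mathbf{R}_1\mathbf{p}_i = \lambda_i$ and $\mathbf{p}_i^\mathsf{T}\mathbf{R}_2\mathbf{p}_i = 1$:

$$\lambda_i = \frac{\mathbf{p}_i^\mathsf{T}\mathbf{R}_1\mathbf{p}_i}{\mathbf{p}_i^\mathsf{T}\mathbf{R}_2\mathbf{p}_i} = \frac{\|\mathbf{p}_i^\mathsf{T}\mathbf{X}_1\|^2 / t_1}{\|\mathbf{p}_i^\mathsf{T}\mathbf{X}_2\|^2 / t_2}$$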
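
As a numerical check of this relation, the sketch below compares each eigenvalue with the ratio of per-sample second-order moments (squared norm divided by the number of samples) of the corresponding component. The synthetic windows and variable names are assumptions made for illustration; the two windows deliberately have different lengths.

```python
import numpy as np
from scipy.linalg import eigh

rng = np.random.default_rng(1)
# Hypothetical windows with different numbers of samples.
X1 = rng.standard_normal((3, 500)) * np.array([[2.0], [1.0], [0.5]])
X2 = rng.standard_normal((3, 800))

R1 = X1 @ X1.T / X1.shape[1]
R2 = X2 @ X2.T / X2.shape[1]

# Generalized eigendecomposition, re-sorted in decreasing order.
lam, P = eigh(R1, R2)
lam, P = lam[::-1], P[:, ::-1]

# Per-sample second-order moment of every component in each window.
m1 = np.mean((P.T @ X1) ** 2, axis=1)
m2 = np.mean((P.T @ X2) ** 2, axis=1)
ratios = m1 / m2  # should agree with lam up to rounding error
```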

Other components

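The vector subspace $E_i$ spanned by the first $i$ eigenvectors $[\mathbf{p}_1 \cdots \mathbf{p}_i]$ is the $i$-dimensional subspace that maximizes the smallest variance ratio among the components belonging to it:

$$E_i = \arg\max_{E} \left( \min_{\mathbf{p} \in E} \frac{\|\mathbf{p}^\mathsf{T}\mathbf{X}_1\|^2}{\|\mathbf{p}^\mathsf{T}\mathbf{X}_2\|^2} \right)$$

In the same way, the vector subspace $F_j$ spanned by the last $j$ eigenvectors $[\mathbf{p}_{n-j+1} \cdots \mathbf{p}_n]$ is the $j$-dimensional subspace that minimizes the largest variance ratio among the components belonging to it:

$$F_j = \arg\min_{F} \left( \max_{\mathbf{p} \in F} \frac{\|\mathbf{p}^\mathsf{T}\mathbf{X}_1\|^2}{\|\mathbf{p}^\mathsf{T}\mathbf{X}_2\|^2} \right)$$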

Variance or second-order moment

CSP can be applied after mean subtraction (a.k.a. "mean centering") of the signals, in which case it optimizes a ratio of variances. Otherwise, CSP optimizes the ratio of second-order moments.
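
The difference between the two cases can be illustrated as follows. This is a small sketch with hypothetical helper names (`second_moment`, `center`) and synthetic data in which one signal has a large offset: without centering, the squared mean is added to the matrix that CSP works with.

```python
import numpy as np


def second_moment(X):
    """Second-order moment matrix X X^T / t of a window X of size (n, t)."""
    return X @ X.T / X.shape[1]


def center(X):
    """Subtract the temporal mean of each signal (mean centering)."""
    return X - X.mean(axis=1, keepdims=True)


rng = np.random.default_rng(2)
# Hypothetical window: the first signal has an offset of about 5.
X = rng.standard_normal((2, 1000)) + np.array([[5.0], [0.0]])

R_raw = second_moment(X)               # includes the contribution of the mean
R_centered = second_moment(center(X))  # covariance matrix

# The two differ exactly by the outer product of the mean vector.
m = X.mean(axis=1)
gap = R_raw - R_centered
```

Here `gap` equals `np.outer(m, m)` up to rounding, so a signal with a large offset can dominate the uncentered ratio even if its fluctuations are small.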

Choice of windows X1 and X2

  • The standard use consists of choosing the windows to correspond to two periods of time with different activation of sources (e.g. during rest and during a specific task).
  • It is also possible to choose the two windows to correspond to two different frequency bands in order to find components with a specific frequency pattern.[2] The frequency bands can be selected either in the time domain or in the frequency domain. Since the matrix $\mathbf{P}$ depends only on the covariance matrices, the same results are obtained if the processing is applied to the Fourier transform of the signals.
  • Y. Wang[3] proposed a particular choice for the first window $\mathbf{X}_1$ in order to extract components that have a specific period: $\mathbf{X}_1$ was the mean of the different periods of the examined signals.
  • If there is only one window, $\mathbf{R}_2$ can be taken to be the identity matrix, and CSP then corresponds to principal component analysis.
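
The last case in the list can be checked numerically. The sketch below, on hypothetical synthetic data, solves the generalized problem with the identity in place of the second covariance matrix and compares the result with an ordinary eigendecomposition of the covariance matrix, which is how principal component analysis is commonly computed.

```python
import numpy as np
from scipy.linalg import eigh

rng = np.random.default_rng(3)
# Single hypothetical window with three signals of different variance.
X1 = rng.standard_normal((3, 1000)) * np.array([[3.0], [1.0], [0.3]])
X1 = X1 - X1.mean(axis=1, keepdims=True)  # mean centering
R1 = X1 @ X1.T / X1.shape[1]

# CSP with the second covariance matrix replaced by the identity.
lam_csp, P_csp = eigh(R1, np.eye(3))

# PCA: plain eigendecomposition of the covariance matrix.
lam_pca, P_pca = np.linalg.eigh(R1)

# Eigenvectors are only defined up to sign, so compare absolute overlaps.
overlap = np.abs(P_csp.T @ P_pca)
```

With distinct eigenvalues, `overlap` is the identity matrix up to rounding, meaning both procedures return the same directions.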

Relation between LDA and CSP

Linear discriminant analysis (LDA) and CSP apply in different circumstances. LDA separates data that have different means, by finding a rotation that maximizes the (normalized) distance between the centers of the two sets of data. CSP, on the other hand, ignores the means. CSP is therefore well suited, for example, to separating the signal from the noise in an event-related potential (ERP) experiment, because both distributions have zero mean and there is no difference in means for LDA to exploit. CSP finds a projection that makes the variance of the components of the average ERP as large as possible, so that the signal stands out above the noise.
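
A toy example of this situation is sketched below, under the assumption of two synthetic zero-mean data sets that differ only in the direction of largest variance. The distance between the class means is close to zero, so a mean-based criterion has little to work with, while the generalized eigendecomposition used by CSP still finds a direction with a large variance ratio.

```python
import numpy as np
from scipy.linalg import eigh

rng = np.random.default_rng(4)
t = 5000
# Two hypothetical zero-mean sets: set 1 is wide along the first axis,
# set 2 is wide along the second axis.
X1 = np.vstack([3.0 * rng.standard_normal(t), rng.standard_normal(t)])
X2 = np.vstack([rng.standard_normal(t), 3.0 * rng.standard_normal(t)])

# Distance between the two centers: small, since both means are near zero.
mean_gap = np.linalg.norm(X1.mean(axis=1) - X2.mean(axis=1))

# CSP direction: eigenvector with the largest generalized eigenvalue.
R1 = X1 @ X1.T / t
R2 = X2 @ X2.T / t
lam, P = eigh(R1, R2)  # eigenvalues in ascending order
w = P[:, -1]

# Variance ratio of the projected data between the two sets.
ratio = np.var(w @ X1) / np.var(w @ X2)
```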

Applications

The CSP method can be applied to multivariate signals in general, but it is most commonly applied to electroencephalographic (EEG) signals. In particular, the method is often used in brain–computer interfaces to retrieve the component signals which best reflect the cerebral activity for a specific task (e.g. hand movement).[4] It can also be used to separate artifacts from EEG signals.[2]

CSP can also be adapted for the analysis of event-related potentials.[5]

References

  1. Zoltan J. Koles, Michael S. Lazar and Steven Z. Zhou, "Spatial patterns underlying population differences in the background EEG", Brain Topography, Vol. 2 (4), pp. 275-284, 1990
  2. S. Boudet, "Filtrage d'artefacts par analyse multicomposantes de l'électroencéphalogramme de patients épileptiques", PhD thesis, Université de Lille 1, 07/2008
  3. Y. Wang, "Reduction of cardiac artifacts in magnetoencephalogram", Proc. of the 12th Int. Conf. on Biomagnetism, 2000
  4. G. Pfurtscheller, C. Guger and H. Ramoser, "EEG-based brain-computer interface using subject-specific spatial filters", Engineering Applications of Bio-Inspired Artificial Neural Networks, Lecture Notes in Computer Science, Vol. 1607, pp. 248-254, 1999
  5. M. Congedo, L. Korczowski, A. Delorme and F. Lopes da Silva, "Spatio-temporal common pattern: A companion method for ERP analysis in the time domain", Journal of Neuroscience Methods, Vol. 267, pp. 74-88, 2016