Biweight midcorrelation

Template:Short description In statistics, biweight midcorrelation (also called bicor) is a measure of similarity between samples. It is median-based, rather than mean-based, thus is less sensitive to outliers, and can be a robust alternative to other similarity metrics, such as Pearson correlation or mutual information.^[1]

Derivation

Here we find the biweight midcorrelation of two vectors $x$ and $y$ , with $i = 1, 2, \dots, m$ items, representing each item in the vector as $x_{1}, x_{2}, \dots, x_{m}$ and $y_{1}, y_{2}, \dots, y_{m}$ . First, we define $med (x)$ as the median of a vector $x$ and $mad (x)$ as the median absolute deviation (MAD), then define $u_{i}$ and $v_{i}$ as,

\begin{matrix} u_{i} & = \frac{x_{i} - med (x)}{9 mad (x)}, \\ v_{i} & = \frac{y_{i} - med (y)}{9 mad (y)} . \end{matrix}

Now we define the weights $w_{i}^{(x)}$ and $w_{i}^{(y)}$ as,

\begin{matrix} w_{i}^{(x)} & = {(1 - u_{i}^{2})}^{2} I (1 - | u_{i} |) \\ w_{i}^{(y)} & = {(1 - v_{i}^{2})}^{2} I (1 - | v_{i} |) \end{matrix}

where $I$ is the identity function where,

I (x) = {\begin{matrix} 1, & if x > 0 \\ 0, & otherwise \end{matrix}

Then we normalize so that the sum of the weights is 1:

\begin{matrix} {\tilde{x}}_{i} & = \frac{(x_{i} - med (x)) w_{i}^{(x)}}{\sqrt{\sum_{j = 1}^{m} {[(x_{j} - med (x)) w_{j}^{(x)}]}^{2}}} \\ {\tilde{y}}_{i} & = \frac{(y_{i} - med (y)) w_{i}^{(y)}}{\sqrt{\sum_{j = 1}^{m} {[(y_{j} - med (y)) w_{j}^{(y)}]}^{2}}} . \end{matrix}

Finally, we define biweight midcorrelation as,

b i c o r (x, y) = \sum_{i = 1}^{m} {\tilde{x}}_{i} {\tilde{y}}_{i}

Applications

Biweight midcorrelation has been shown to be more robust in evaluating similarity in gene expression networks,^[2] and is often used for weighted correlation network analysis.

Implementations

Biweight midcorrelation has been implemented in the R statistical programming language as the function bicor as part of the WGCNA package^[3]

Also implemented in the Raku programming language as the function bi_cor_coef as part of the Statistics module.^[4]

References

Template:Reflist

[1] Template:Cite book

[2] Template:Cite journal

[3] Template:Cite web

[4] Template:Cite web

[1]

[2]

[3]

[4]

Biweight midcorrelation

Contents

Derivation

Applications

Implementations

See also

References

Navigation menu

Biweight midcorrelation

Derivation

Applications

Implementations

See also

References

Navigation menu

Search