MINQUE

From testwiki
Jump to navigation Jump to search

Template:Short description In statistics, the theory of minimum norm quadratic unbiased estimation (MINQUE)[1][2][3] was developed by C. R. Rao. MINQUE is a theory alongside other estimation methods in estimation theory, such as the method of moments or maximum likelihood estimation. Similar to the theory of best linear unbiased estimation, MINQUE is specifically concerned with linear regression models.[1] The method was originally conceived to estimate heteroscedastic error variance in multiple linear regression.[1] MINQUE estimators also provide an alternative to maximum likelihood estimators or restricted maximum likelihood estimators for variance components in mixed effects models.[3] MINQUE estimators are quadratic forms of the response variable and are used to estimate a linear function of the variances.

Principles

We are concerned with a mixed effects model for the random vector ๐˜โˆˆโ„n with the following linear structure.

๐˜=๐—๐œท+๐”1๐ƒ1+โ‹ฏ+๐”k๐ƒk

Here, ๐—โˆˆโ„nร—m is a design matrix for the fixed effects, ๐œทโˆˆโ„m represents the unknown fixed-effect parameters, ๐”iโˆˆโ„nร—ci is a design matrix for the i-th random-effect component, and ๐ƒiโˆˆโ„ci is a random vector for the i-th random-effect component. The random effects are assumed to have zero mean (๐”ผ[๐ƒi]=๐ŸŽ) and be uncorrelated (๐•[๐ƒi]=ฯƒi2๐ˆci). Furthermore, any two random effect vectors are also uncorrelated (๐•[๐ƒi,๐ƒj]=๐ŸŽโˆ€iโ‰ j). The unknown variances ฯƒ12,โ‹ฏ,ฯƒk2 represent the variance components of the model.

This is a general model that captures commonly used linear regression models.

  1. Gauss-Markov Model[3]: If we consider a one-component model where ๐”1=๐ˆn, then the model is equivalent to the Gauss-Markov model ๐˜=๐—๐œท+๐ with ๐”ผ[๐]=๐ŸŽ and ๐•[๐]=ฯƒ12๐ˆn.
  2. Heteroscedastic Model[1]: Each set of random variables in ๐˜ that shares a common variance can be modeled as an individual variance component with an appropriate ๐”i.

A compact representation for the model is the following, where ๐”=[๐”1โ‹ฏ๐”k] and ๐ƒโŠค=[๐ƒ1โŠคโ‹ฏ๐ƒkโŠค].

๐˜=๐—๐œท+๐”๐ƒ

Note that this model makes no distributional assumptions about ๐˜ other than the first and second moments.[3]

๐”ผ[๐˜]=๐—๐œท

๐•[๐˜]=ฯƒ12๐”1๐”1โŠค+โ‹ฏ+ฯƒk2๐”k๐”kโŠคโ‰กฯƒ12๐•1+โ‹ฏ+ฯƒk2๐•k

The goal in MINQUE is to estimate ฮธ=โˆ‘i=1kpiฯƒi2 using a quadratic form ฮธ^=๐˜โŠค๐€๐˜. MINQUE estimators are derived by identifying a matrix ๐€ such that the estimator has some desirable properties,[2][3] described below.

Optimal Estimator Properties to Constrain MINQUE

Invariance to translation of the fixed effects

Consider a new fixed-effect parameter ๐œธ=๐œทโˆ’๐œท0, which represents a translation of the original fixed effect. The new, equivalent model is now the following.

๐˜โˆ’๐—๐œท0=๐—๐œธ+๐”๐ƒ

Under this equivalent model, the MINQUE estimator is now (๐˜โˆ’๐—๐œท0)โŠค๐€(๐˜โˆ’๐—๐œท0). Rao argued that since the underlying models are equivalent, this estimator should be equal to ๐˜โŠค๐€๐˜.[2][3] This can be achieved by constraining ๐€ such that ๐€๐—=๐ŸŽ, which ensures that all terms other than ๐˜โŠค๐€๐˜ in the expansion of the quadratic form are zero.

Unbiased estimation

Suppose that we constrain ๐€๐—=๐ŸŽ, as argued in the section above. Then, the MINQUE estimator has the following form

ฮธ^=๐˜โŠค๐€๐˜=(๐—๐œท+๐”๐ƒ)โŠค๐€(๐—๐œท+๐”๐ƒ)=๐ƒโŠค๐”โŠค๐€๐”๐ƒ

To ensure that this estimator is unbiased, the expectation of the estimator ๐”ผ[ฮธ^] must equal the parameter of interest, ฮธ. Below, the expectation of the estimator can be decomposed for each component since the components are uncorrelated with each other. Furthermore, the cyclic property of the trace is used to evaluate the expectation with respect to ๐ƒi.

๐”ผ[ฮธ^]=๐”ผ[๐ƒโŠค๐”โŠค๐€๐”๐ƒ]=โˆ‘i=1k๐”ผ[๐ƒiโŠค๐”iโŠค๐€๐”i๐ƒi]=โˆ‘i=1kฯƒi2Tr[๐”iโŠค๐€๐”i]

To ensure that this estimator is unbiased, Rao suggested setting โˆ‘i=1kฯƒi2Tr[๐”iโŠค๐€๐”i]=โˆ‘i=1kpiฯƒi2, which can be accomplished by constraining ๐€ such that Tr[๐”iโŠค๐€๐”i]=Tr[๐€๐•i]=pi for all components.[3]

Minimum Norm

Rao argues that if ๐ƒ were observed, a "natural" estimator for ฮธ would be the following[2][3] since ๐”ผ[๐ƒiโŠค๐ƒi]=ciฯƒi2. Here, ๐œŸ is defined as a diagonal matrix.

p1c1๐ƒ1โŠค๐ƒ1+โ‹ฏ+pkck๐ƒkโŠค๐ƒk=๐ƒโŠค[diag(p1ci,โ‹ฏ,pkck)]๐ƒโ‰ก๐ƒโŠค๐œŸ๐ƒ

The difference between the proposed estimator and the natural estimator is ๐ƒโŠค(๐”โŠค๐€๐”โˆ’๐œŸ)๐ƒ. This difference can be minimized by minimizing the norm of the matrix โ€–๐”โŠค๐€๐”โˆ’๐œŸโ€–.

Procedure

Given the constraints and optimization strategy derived from the optimal properties above, the MINQUE estimator ฮธ^ for ฮธ=โˆ‘i=1kpiฯƒi2 is derived by choosing a matrix ๐€ that minimizes โ€–๐”โŠค๐€๐”โˆ’๐œŸโ€–, subject to the constraints

  1. ๐€๐—=๐ŸŽ, and
  2. Tr[๐€๐•i]=pi.

Examples of Estimators

Standard Estimator for Homoscedastic Error

In the Gauss-Markov model, the error variance ฯƒ2 is estimated using the following.

s2=1nโˆ’m(๐˜โˆ’๐—๐œท^)โŠค(๐˜โˆ’๐—๐œท^)

This estimator is unbiased and can be shown to minimize the Euclidean norm of the form โ€–๐”โŠค๐€๐”โˆ’๐œŸโ€–.[1] Thus, the standard estimator for error variance in the Gauss-Markov model is a MINQUE estimator.

Random Variables with Common Mean and Heteroscedastic Error

For random variables Y1,โ‹ฏ,Yn with a common mean and different variances ฯƒ12,โ‹ฏ,ฯƒn2, the MINQUE estimator for ฯƒi2 is nnโˆ’2(Yiโˆ’Yโ€พ)2โˆ’s2nโˆ’2, where Yโ€พ=1nโˆ‘i=1nYi and s2=1nโˆ’1โˆ‘i=1n(Yiโˆ’Yโ€พ)2.[1]

Estimator for Variance Components

Rao proposed a MINQUE estimator for the variance components model based on minimizing the Euclidean norm.[2] The Euclidean norm โ€–โ‹…โ€–2 is the square root of the sum of squares of all elements in the matrix. When evaluating this norm below, ๐•=๐•1+โ‹ฏ+๐•k=๐”๐”โŠค. Furthermore, using the cyclic property of traces, Tr[๐”โŠค๐€๐”๐œŸ]=Tr[๐€๐”๐œŸ๐”โŠค]=Tr[โˆ‘i=1kpici๐€๐•i]=Tr[๐œŸ๐œŸ].

โ€–๐”โŠค๐€๐”โˆ’๐œŸโ€–22=(๐”โŠค๐€๐”โˆ’๐œŸ)โŠค(๐”โŠค๐€๐”โˆ’๐œŸ)=Tr[๐”โŠค๐€๐”๐”๐€๐”โŠค]โˆ’Tr[2๐”โŠค๐€๐”๐œŸ]+Tr[๐œŸ๐œŸ]=Tr[๐€๐•๐€๐•]โˆ’Tr[๐œŸ๐œŸ]

Note that since Tr[๐œŸ๐œŸ] does not depend on ๐€, the MINQUE with the Euclidean norm is obtained by identifying the matrix ๐€ that minimizes Tr[๐€๐•๐€๐•], subject to the MINQUE constraints discussed above.

Rao showed that the matrix ๐€ that satisfies this optimization problem is

๐€โ‹†=โˆ‘i=1kฮปi๐‘๐•i๐‘,

where ๐‘=๐•โˆ’1(๐ˆโˆ’๐), ๐=๐—(๐—โŠค๐•โˆ’1๐—)โˆ’๐—โŠค๐•โˆ’1 is the projection matrix into the column space of ๐—, and (โ‹…)โˆ’ represents the generalized inverse of a matrix.

Therefore, the MINQUE estimator is the following, where the vectors ๐€ and ๐ are defined based on the sum.

ฮธ^=๐˜โŠค๐€โ‹†๐˜=โˆ‘i=1kฮปi๐˜โŠค๐‘๐•i๐‘๐˜โ‰กโˆ‘i=1kฮปiQiโ‰ก๐€โŠค๐

The vector ๐€ is obtained by using the constraint Tr[๐€โ‹†๐•i]=pi. That is, the vector represents the solution to the following system of equations โˆ€jโˆˆ{1,โ‹ฏ,k}.

Tr[๐€โ‹†๐•j]=pjTr[โˆ‘i=1kฮปi๐‘๐•i๐‘๐•j]=pjโˆ‘i=1kฮปiTr[๐‘๐•i๐‘๐•j]=pj

This can be written as a matrix product ๐’๐€=๐ฉ, where ๐ฉ=[p1โ‹ฏpk]โŠค and ๐’ is the following.

๐’=[Tr[๐‘๐•1๐‘๐•1]โ‹ฏTr[๐‘๐•k๐‘๐•1]โ‹ฎโ‹ฑโ‹ฎTr[๐‘๐•1๐‘๐•k]โ‹ฏTr[๐‘๐•k๐‘๐•k]]

Then, ๐€=๐’โˆ’๐ฉ. This implies that the MINQUE is ฮธ^=๐€โŠค๐=๐ฉโŠค(๐’โˆ’)โŠค๐=๐ฉโŠค๐’โˆ’๐. Note that ฮธ=โˆ‘i=1kpiฯƒi2=๐ฉโŠค๐ˆ, where ๐ˆ=[ฯƒ12โ‹ฏฯƒk2]โŠค. Therefore, the estimator for the variance components is ๐ˆ^=๐’โˆ’๐.

Extensions

MINQUE estimators can be obtained without the invariance criteria, in which case the estimator is only unbiased and minimizes the norm.[2] Such estimators have slightly different constraints on the minimization problem.

The model can be extended to estimate covariance components.[3] In such a model, the random effects of a component are assumed to have a common covariance structure ๐•[๐ƒi]=๐œฎ. A MINQUE estimator for a mixture of variance and covariance components was also proposed.[3] In this model, ๐•[๐ƒi]=๐œฎ for iโˆˆ{1,โ‹ฏ,s} and ๐•[๐ƒi]=ฯƒi2๐ˆci for iโˆˆ{s+1,โ‹ฏ,k}.

Template:Stats-stub

References