Tracy–Widom distribution

From testwiki
Jump to navigation Jump to search

Template:Short description

Densities of Tracy–Widom distributions for β = 1, 2, 4

The Tracy–Widom distribution is a probability distribution from random matrix theory introduced by Template:Harvs. It is the distribution of the normalized largest eigenvalue of a random Hermitian matrix. The distribution is defined as a Fredholm determinant.

In practical terms, Tracy–Widom is the crossover function between the two phases of weakly versus strongly coupled components in a system.[1] It also appears in the distribution of the length of the longest increasing subsequence of random permutations,Template:Sfnp as large-scale statistics in the Kardar-Parisi-Zhang equation,[2] in current fluctuations of the asymmetric simple exclusion process (ASEP) with step initial condition,[3] and in simplified mathematical models of the behavior of the longest common subsequence problem on random inputs.Template:Sfnp See Template:Harvtxt and Template:Harvtxt for experimental testing (and verifying) that the interface fluctuations of a growing droplet (or substrate) are described by the TW distribution F2 (or F1) as predicted by Template:Harvtxt.

The distribution F1 is of particular interest in multivariate statistics.[4] For a discussion of the universality of Fβ, β=1,2,4, see Template:Harvtxt. For an application of F1 to inferring population structure from genetic data see Template:Harvtxt. In 2017 it was proved that the distribution F is not infinitely divisible.Template:Sfnp

Definition as a law of large numbers

The empirical distribution of the largest eigenvalue of matrices sampled from the Gaussian ensembles, for increasingly large matrix sizes. They converge to their respective Tracy–Widom distributions.

Let Fβ denote the cumulative distribution function of the Tracy–Widom distribution with given β. It can be defined as a law of large numbers, similar to the central limit theorem.

There are typically three Tracy–Widom distributions, Fβ, with β{1,2,4}. They correspond to the three gaussian ensembles: orthogonal (β=1), unitary (β=2), and symplectic (β=4).

In general, consider a gaussian ensemble with beta value β, with its diagonal entries having variance 1, and off-diagonal entries having variance σ2, and let FN,β(s) be probability that an N×N matrix sampled from the ensemble have maximal eigenvalue s, then define[5]Fβ(x)=limNFN,β(σ(2N1/2+N1/6x))=limNPr(N1/6(λmax/σ2N1/2)x)where λmax denotes the largest eigenvalue of the random matrix. The shift by 2σN1/2 centers the distribution, since at the limit, the eigenvalue distribution converges to the semicircular distribution with radius 2σN1/2. The multiplication by N1/6 is used because the standard deviation of the distribution scales as N1/6 (first derived in [6]).

For example:Template:Sfnp

F2(x)=limNProb((λmax4N)N1/6x),

where the matrix is sampled from the gaussian unitary ensemble with off-diagonal variance 1.

The definition of the Tracy–Widom distributions Fβ may be extended to all β>0 (Slide 56 in Template:Harvtxt, Template:Harvtxt).

One may naturally ask for the limit distribution of second-largest eigenvalues, third-largest eigenvalues, etc. They are known.[7][5]

Functional forms

Fredholm determinant

F2 can be given as the Fredholm determinant

F2(s)=det(IAs)=1+n=1(1)nn!(s,)ndeti,j=1,...,n[As(xi,xj)]dx1dxn

of the kernel As ("Airy kernel") on square integrable functions on the half line (s,), given in terms of Airy functions Ai by

As(x,y)={Ai(x)Ai(y)Ai(x)Ai(y)xyif xyAi(x)2x(Ai(x))2if x=y

Painlevé transcendents

F2 can also be given as an integral

F2(s)=exp(s(xs)q2(x)dx)

in terms of a solution[note 1] of a Painlevé equation of type II

q(s)=sq(s)+2q(s)3

with boundary condition q(s)Ai(s),s. This function q is a Painlevé transcendent.

Other distributions are also expressible in terms of the same q:Template:Sfnp

F1(s)=exp(12sq(x)dx)(F2(s))1/2F4(s/2)=cosh(12sq(x)dx)(F2(s))1/2.

Functional equations

Define F(x)=exp(12x(yx)q(y)2dy)E(x)=exp(12xq(y)dy)then[5]F1(x)=E(x)F(x),F2(x)=F(x)2,F4(x2)=12(E(x)+1E(x))F(x)

Occurrences

Other than in random matrix theory, the Tracy–Widom distributions occur in many other probability problems.[8]

Let ln be the length of the longest increasing subsequence in a random permutation sampled uniformly from Sn, the permutation group on n elements. Then the cumulative distribution function of ln2N1/2N1/6 converges to F2.[9]

Third-order phase transition

The Tracy–Widom distribution exhibits a third-order phase transition in the large deviation behavior of the largest eigenvalue of a random matrix. This transition occurs at the edge of the Wigner semicircle distribution, where the probability density of the largest eigenvalue follows distinct scaling laws depending on whether it deviates to the left or right of the edge.

Let Φ(w) denote the rate function governing the large deviations of the largest eigenvalue λmax. For a Gaussian unitary ensemble, the probability density function of λmax satisfies, for large N,

P(λmax=w)exp(βNpΦ(w)),

where p=2 for deviations to the left of the spectral edge and p=1 for deviations to the right. When the small-deviation parts of the probability density are included, we haveP(λmax=w,N){exp[βN2Φ(w)],w<2&|w2|𝒪(1)2N23β(2N23(w2)),exp[βNΦ+(w)],w>2&|w2|𝒪(N23)

Rate function of the Tracy–Widom large deviation, and its leading approximation near the critical point.

The rate function Φ(w) is given by separate expressions for w<2 and w>2. Φ(w)=1108[36w2w4(15w+w3)w2+6+27(ln182ln(w+w2+6))],w<2.

Φ+(w)=12ww22+ln[ww222]

Near the critical point w=2, the leading order behavior is

Φ(w)162(2w)3,w2.
Φ+(w)27/43(w2)3/2,w2+.

The third derivative of Φ(w) is discontinuous at w=2, which classifies this as a third-order phase transition. This type of transition is analogous to the Gross-Witten-Wadia phase transition in lattice gauge theory and the Douglas-Kazakov phase transition in two-dimensional quantum chromodynamics. The discontinuity in the third derivative of the free energy marks a fundamental change in the behavior of the system, where fluctuations transition between different scaling regimes.

This third-order transition has also been observed in problems related to the maximal height of non-intersecting Brownian excursions, conductance fluctuations in mesoscopic systems, and entanglement entropy in random pure states.[8]

To interpret this as a third-order transition in statistical mechanics, define the (generalized) free energy density of the system as(w)=1N2lnP(λmax=w,N)then at the N limit, (w)={Φ(w)w<20w>2 has continuous first and second derivatives at the critical point w=2, but a discontinuous third derivative.

The lnPN2 lower end can be interpreted as the strongly interacting regime, where N objects are interacting strongly pairwise, so the total energy is proportional to N2. The lnPN upper end can be interpreted as the weakly interacting regime, where the objects are basically not interacting, so the total energy is proportional to N. The Tracy–Widom distribution phase transition then occurs at the point as the system switches from strongly to weakly interacting.

Coulomb gas model

Coulomb Gas distribution for various wall positions.

This can be visualized in the Coulomb gas model by considering a gas of electric charges in a V(x)=12x2 potential well. The distribution of the charges is the same as the distribution of the matrix eigenvalues. This gives the Wigner semi-circle law. To find the distribution of the largest eigenvalue, we take a wall and push against the Coulomb gas. If the wall is above +2, then most of the gas remains unaffected, and we are in the weak interaction regime. If the wall is below +2, then the entire bulk of the Coulomb gas is affected, and we are in the strong interaction regime.

The minimal Coulomb gas distribution is explicitly solvable asρw*(λ)={1π2λ2, with 2λ2 for w>2λ+L(w)2πwλ[w+L(w)2λ] with L(w)λw for w<2where w is the position of the wall, and L(w)=2w2+6w3.

Asymptotics

Probability density function

Let fβ(x)=Fβ(x) be the probability density function for the distribution, then[8]fβ(x){eβ24|x|3,xe2β3|x|3/2,x+In particular, we see that it is severely skewed to the right: it is much more likely for λmax to be much larger than 2σN than to be much smaller. This could be intuited by seeing that the limit distribution is the semicircle law, so there is "repulsion" from the bulk of the distribution, forcing λmax to be not much smaller than 2σN.

At the x limit, a more precise expression is (equation 49 [8])fβ(x)τβ|x|(β2+46β)/16βexp[β|x|324+2β26|x|3/2]for some positive number τβ that depends on β.

Cumulative distribution function

At the x+ limit,[10]F(x)=1e43x3/232πx3/2(13524x3/2+𝒪(x3)),E(x)=1e23x3/24πx3/2(14148x3/2+𝒪(x3))and at the x limit,F(x)=21/48e12ζ(1)e124|x|3|x|1/16(1+327|x|3+O(|x|6))E(x)=121/4e132|x|3/2(11242|x|3/2+𝒪(|x|3)).where ζ is the Riemann zeta function, and ζ(1)=0.1654211437.

This allows derivation of x± behavior of Fβ. For example,1F2(x)=132πx3/2e4x3/2/3(1+O(x3/2)),F2(x)=21/24eζ(1)x1/8ex3/12(1+326x3+O(x6)).

Painlevé transcendent

The Painlevé transcendent has asymptotic expansion at x (equation 4.1 of [11])q(x)=x2(1+18x373128x6+106571024x9+O(x12))This is necessary for numerical computations, as the qx/2 solution is unstable: any deviation from it tends to drop it to the qx/2 branch instead.[12]

Numerics

Numerical techniques for obtaining numerical solutions to the Painlevé equations of the types II and V, and numerically evaluating eigenvalue distributions of random matrices in the beta-ensembles were first presented by Template:Harvtxt using MATLAB. These approximation techniques were further analytically justified in Template:Harvtxt and used to provide numerical evaluation of Painlevé II and Tracy–Widom distributions (for β=1,2,4) in S-PLUS. These distributions have been tabulated in Template:Harvtxt to four significant digits for values of the argument in increments of 0.01; a statistical table for p-values was also given in this work. Template:Harvtxt gave accurate and fast algorithms for the numerical evaluation of Fβ and the density functions fβ(s)=dFβ/ds for β=1,2,4. These algorithms can be used to compute numerically the mean, variance, skewness and excess kurtosis of the distributions Fβ.[13]

β Mean Variance Skewness Excess kurtosis
1 −1.2065335745820 1.607781034581 0.29346452408 0.1652429384
2 −1.771086807411 0.8131947928329 0.224084203610 0.0934480876
4 −2.306884893241 0.5177237207726 0.16550949435 0.0491951565

Functions for working with the Tracy–Widom laws are also presented in the R package 'RMTstat' by Template:Harvtxt and MATLAB package 'RMLab' by Template:Harvtxt.

For a simple approximation based on a shifted gamma distribution see Template:Harvtxt.

Template:Harvtxt developed a spectral algorithm for the eigendecomposition of the integral operator As, which can be used to rapidly evaluate Tracy–Widom distributions, or, more generally, the distributions of the kth largest level at the soft edge scaling limit of Gaussian ensembles, to machine accuracy.

Tracy-Widom and KPZ universality

The Tracy-Widom distribution appears as a limit distribution in the universality class of the KPZ equation. For example it appears under t1/3 scaling of the one-dimensional KPZ equation with fixed time.[14]

See also

Footnotes

Template:Reflist

Template:ReflistTemplate:Reflist

References

Further reading

Template:ProbDistributions


Cite error: <ref> tags exist for a group named "note", but no corresponding <references group="note"/> tag was found