Wilks's lambda distribution

Probability distribution used in multivariate hypothesis testing

In statistics, Wilks' lambda distribution (named for Samuel S. Wilks), is a probability distribution used in multivariate hypothesis testing, especially with regard to the likelihood-ratio test and multivariate analysis of variance (MANOVA).

Definition

Wilks' lambda distribution is defined from two independent Wishart distributed variables as the ratio distribution of their determinants,[1]

given

A W p ( Σ , m ) B W p ( Σ , n ) {\displaystyle \mathbf {A} \sim W_{p}(\Sigma ,m)\qquad \mathbf {B} \sim W_{p}(\Sigma ,n)}

independent and with m p {\displaystyle m\geq p}

λ = det ( A ) det ( A + B ) = 1 det ( I + A 1 B ) Λ ( p , m , n ) {\displaystyle \lambda ={\frac {\det(\mathbf {A} )}{\det(\mathbf {A+B} )}}={\frac {1}{\det(\mathbf {I} +\mathbf {A} ^{-1}\mathbf {B} )}}\sim \Lambda (p,m,n)}

where p is the number of dimensions. In the context of likelihood-ratio tests m is typically the error degrees of freedom, and n is the hypothesis degrees of freedom, so that n + m {\displaystyle n+m} is the total degrees of freedom.[1]

Approximations

Computations or tables of the Wilks' distribution for higher dimensions are not readily available and one usually resorts to approximations. One approximation is attributed to M. S. Bartlett and works for large m[2] allows Wilks' lambda to be approximated with a chi-squared distribution

( p n + 1 2 m ) log Λ ( p , m , n ) χ n p 2 . {\displaystyle \left({\frac {p-n+1}{2}}-m\right)\log \Lambda (p,m,n)\sim \chi _{np}^{2}.} [1]

Another approximation is attributed to C. R. Rao.[1][3]

Properties

There is a symmetry among the parameters of the Wilks distribution,[1]

Λ ( p , m , n ) Λ ( n , m + n p , p ) {\displaystyle \Lambda (p,m,n)\sim \Lambda (n,m+n-p,p)}

Related distributions

The distribution can be related to a product of independent beta-distributed random variables

u i B ( m + i p 2 , p 2 ) {\displaystyle u_{i}\sim B\left({\frac {m+i-p}{2}},{\frac {p}{2}}\right)}
i = 1 n u i Λ ( p , m , n ) . {\displaystyle \prod _{i=1}^{n}u_{i}\sim \Lambda (p,m,n).}

As such it can be regarded as a multivariate generalization of the beta distribution.

It follows directly that for a one-dimension problem, when the Wishart distributions are one-dimensional with p = 1 {\displaystyle p=1} (i.e., chi-squared-distributed), then the Wilks' distribution equals the beta-distribution with a certain parameter set,

Λ ( 1 , m , n ) B ( m 2 , n 2 ) . {\displaystyle \Lambda (1,m,n)\sim B\left({\frac {m}{2}},{\frac {n}{2}}\right).}

From the relations between a beta and an F-distribution, Wilks' lambda can be related to the F-distribution when one of the parameters of the Wilks lambda distribution is either 1 or 2, e.g.,[1]

1 Λ ( p , m , 1 ) Λ ( p , m , 1 ) p m p + 1 F p , m p + 1 , {\displaystyle {\frac {1-\Lambda (p,m,1)}{\Lambda (p,m,1)}}\sim {\frac {p}{m-p+1}}F_{p,m-p+1},}

and

1 Λ ( p , m , 2 ) Λ ( p , m , 2 ) p m p + 1 F 2 p , 2 ( m p + 1 ) . {\displaystyle {\frac {1-{\sqrt {\Lambda (p,m,2)}}}{\sqrt {\Lambda (p,m,2)}}}\sim {\frac {p}{m-p+1}}F_{2p,2(m-p+1)}.}

See also

References

  1. ^ a b c d e f Kanti Mardia, John T. Kent and John Bibby (1979). Multivariate Analysis. Academic Press. ISBN 0-12-471250-9.
  2. ^ M. S. Bartlett (1954). "A Note on the Multiplying Factors for Various χ 2 {\displaystyle \chi ^{2}} Approximations". J R Stat Soc Ser B. 16 (2): 296–298. JSTOR 2984057.
  3. ^ C. R. Rao (1951). "An Asymptotic Expansion of the Distribution of Wilks' Criterion". Bulletin de l'Institut International de Statistique. 33: 177–180.
  • v
  • t
  • e
Discrete
univariate
with finite
support
with infinite
support
Continuous
univariate
supported on a
bounded interval
supported on a
semi-infinite
interval
supported
on the whole
real line
with support
whose type varies
Mixed
univariate
continuous-
discrete
Multivariate
(joint)DirectionalDegenerate
and singular
Degenerate
Dirac delta function
Singular
Cantor
Families
  • Category
  • Commons
  • v
  • t
  • e
Continuous data
Center
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
  • Z-test (normal)
  • Student's t-test
  • F-test
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
  • Category
  • icon Mathematics portal
  • Commons
  • WikiProject