Kent distribution

Three points sets sampled from the Kent distribution. The mean directions are shown with arrows. The κ {\displaystyle \kappa \,} parameter is highest for the red set.

In directional statistics, the Kent distribution, also known as the 5-parameter Fisher–Bingham distribution (named after John T. Kent, Ronald Fisher, and Christopher Bingham), is a probability distribution on the unit sphere (2-sphere S2 in 3-space R3). It is the analogue on S2 of the bivariate normal distribution with an unconstrained covariance matrix. The Kent distribution was proposed by John T. Kent in 1982, and is used in geology as well as bioinformatics.

Definition

The probability density function f ( x ) {\displaystyle f(\mathbf {x} )\,} of the Kent distribution is given by:

f ( x ) = 1 c ( κ , β ) exp { κ γ 1 T x + β [ ( γ 2 T x ) 2 ( γ 3 T x ) 2 ] } {\displaystyle f(\mathbf {x} )={\frac {1}{{\textrm {c}}(\kappa ,\beta )}}\exp \left\{\kappa {\boldsymbol {\gamma }}_{1}^{T}\cdot \mathbf {x} +\beta [({\boldsymbol {\gamma }}_{2}^{T}\cdot \mathbf {x} )^{2}-({\boldsymbol {\gamma }}_{3}^{T}\cdot \mathbf {x} )^{2}]\right\}}

where x {\displaystyle \mathbf {x} \,} is a three-dimensional unit vector, ( ) T {\displaystyle (\cdot )^{T}} denotes the transpose of ( ) {\displaystyle (\cdot )} , and the normalizing constant c ( κ , β ) {\displaystyle {\textrm {c}}(\kappa ,\beta )\,} is:

c ( κ , β ) = 2 π j = 0 Γ ( j + 1 2 ) Γ ( j + 1 ) β 2 j ( 1 2 κ ) 2 j 1 2 I 2 j + 1 2 ( κ ) {\displaystyle c(\kappa ,\beta )=2\pi \sum _{j=0}^{\infty }{\frac {\Gamma (j+{\frac {1}{2}})}{\Gamma (j+1)}}\beta ^{2j}\left({\frac {1}{2}}\kappa \right)^{-2j-{\frac {1}{2}}}I_{2j+{\frac {1}{2}}}(\kappa )}

Where I v ( κ ) {\displaystyle I_{v}(\kappa )} is the modified Bessel function and Γ ( ) {\displaystyle \Gamma (\cdot )} is the gamma function. Note that c ( 0 , 0 ) = 4 π {\displaystyle c(0,0)=4\pi } and c ( κ , 0 ) = 4 π ( κ 1 ) sinh ( κ ) {\displaystyle c(\kappa ,0)=4\pi (\kappa ^{-1})\sinh(\kappa )} , the normalizing constant of the Von Mises–Fisher distribution.

The parameter κ {\displaystyle \kappa \,} (with κ > 0 {\displaystyle \kappa >0\,} ) determines the concentration or spread of the distribution, while β {\displaystyle \beta \,} (with 0 2 β < κ {\displaystyle 0\leq 2\beta <\kappa } ) determines the ellipticity of the contours of equal probability. The higher the κ {\displaystyle \kappa \,} and β {\displaystyle \beta \,} parameters, the more concentrated and elliptical the distribution will be, respectively. Vector γ 1 {\displaystyle \gamma _{1}\,} is the mean direction, and vectors γ 2 , γ 3 {\displaystyle \gamma _{2},\gamma _{3}\,} are the major and minor axes. The latter two vectors determine the orientation of the equal probability contours on the sphere, while the first vector determines the common center of the contours. The 3×3 matrix ( γ 1 , γ 2 , γ 3 ) {\displaystyle (\gamma _{1},\gamma _{2},\gamma _{3})\,} must be orthogonal.

Generalization to higher dimensions

The Kent distribution can be easily generalized to spheres in higher dimensions. If x {\displaystyle x} is a point on the unit sphere S p 1 {\displaystyle S^{p-1}} in R p {\displaystyle \mathbb {R} ^{p}} , then the density function of the p {\displaystyle p} -dimensional Kent distribution is proportional to

exp { κ γ 1 T x + j = 2 p β j ( γ j T x ) 2 }   , {\displaystyle \exp \left\{\kappa {\boldsymbol {\gamma }}_{1}^{T}\cdot \mathbf {x} +\sum _{j=2}^{p}\beta _{j}({\boldsymbol {\gamma }}_{j}^{T}\cdot \mathbf {x} )^{2}\right\}\ ,}

where j = 2 p β j = 0 {\displaystyle \sum _{j=2}^{p}\beta _{j}=0} and 0 2 | β j | < κ {\displaystyle 0\leq 2|\beta _{j}|<\kappa } and the vectors { γ j j = 1 , , p } {\displaystyle \{{\boldsymbol {\gamma }}_{j}\mid j=1,\ldots ,p\}} are orthonormal. However, the normalization constant becomes very difficult to work with for p > 3 {\displaystyle p>3} .

See also

References

  • Boomsma, W., Kent, J.T., Mardia, K.V., Taylor, C.C. & Hamelryck, T. (2006) Graphical models and directional statistics capture protein structure Archived 2021-05-07 at the Wayback Machine. In S. Barber, P.D. Baxter, K.V.Mardia, & R.E. Walls (Eds.), Interdisciplinary Statistics and Bioinformatics, pp. 91–94. Leeds, Leeds University Press.
  • Hamelryck T, Kent JT, Krogh A (2006) Sampling Realistic Protein Conformations Using Local Structural Bias. PLoS Comput Biol 2(9): e131
  • Kent, J. T. (1982) The Fisher–Bingham distribution on the sphere., J. Royal. Stat. Soc., 44:71–80.
  • Kent, J. T., Hamelryck, T. (2005). Using the Fisher–Bingham distribution in stochastic models for protein structure Archived 2021-05-07 at the Wayback Machine. In S. Barber, P.D. Baxter, K.V.Mardia, & R.E. Walls (Eds.), Quantitative Biology, Shape Analysis, and Wavelets, pp. 57–60. Leeds, Leeds University Press.
  • Mardia, K. V. M., Jupp, P. E. (2000) Directional Statistics (2nd edition), John Wiley and Sons Ltd. ISBN 0-471-95333-4
  • Peel, D., Whiten, WJ., McLachlan, GJ. (2001) Fitting mixtures of Kent distributions to aid in joint set identification. J. Am. Stat. Ass., 96:56–63
  • v
  • t
  • e
Discrete
univariate
with finite
support
with infinite
support
Continuous
univariate
supported on a
bounded interval
supported on a
semi-infinite
interval
supported
on the whole
real line
with support
whose type varies
Mixed
univariate
continuous-
discrete
Multivariate
(joint)DirectionalDegenerate
and singular
Degenerate
Dirac delta function
Singular
Cantor
Families
  • Category
  • Commons