Wrapped distribution

In probability theory and directional statistics, a wrapped probability distribution is a continuous probability distribution that describes data points that lie on a unit n-sphere. In one dimension, a wrapped distribution consists of points on the unit circle. If ϕ {\displaystyle \phi } is a random variate in the interval ( , ) {\displaystyle (-\infty ,\infty )} with probability density function (PDF) p ( ϕ ) {\displaystyle p(\phi )} , then z = e i ϕ {\displaystyle z=e^{i\phi }} is a circular variable distributed according to the wrapped distribution p w z ( θ ) {\displaystyle p_{wz}(\theta )} and θ = arg ( z ) {\displaystyle \theta =\arg(z)} is an angular variable in the interval ( π , π ] {\displaystyle (-\pi ,\pi ]} distributed according to the wrapped distribution p w ( θ ) {\displaystyle p_{w}(\theta )} .

Any probability density function p ( ϕ ) {\displaystyle p(\phi )} on the line can be "wrapped" around the circumference of a circle of unit radius.[1] That is, the PDF of the wrapped variable

θ = ϕ mod 2 π {\displaystyle \theta =\phi \mod 2\pi } in some interval of length 2 π {\displaystyle 2\pi }

is

p w ( θ ) = k = p ( θ + 2 π k ) {\displaystyle p_{w}(\theta )=\sum _{k=-\infty }^{\infty }{p(\theta +2\pi k)}}

which is a periodic sum of period 2 π {\displaystyle 2\pi } . The preferred interval is generally ( π < θ π ) {\displaystyle (-\pi <\theta \leq \pi )} for which ln ( e i θ ) = arg ( e i θ ) = θ {\displaystyle \ln(e^{i\theta })=\arg(e^{i\theta })=\theta } .

Theory

In most situations, a process involving circular statistics produces angles ( ϕ {\displaystyle \phi } ) which lie in the interval ( , ) {\displaystyle (-\infty ,\infty )} , and are described by an "unwrapped" probability density function p ( ϕ ) {\displaystyle p(\phi )} . However, a measurement will yield an angle θ {\displaystyle \theta } which lies in some interval of length 2 π {\displaystyle 2\pi } (for example, 0 to 2 π {\displaystyle 2\pi } ). In other words, a measurement cannot tell whether the true angle ϕ {\displaystyle \phi } or a wrapped angle θ = ϕ + 2 π a {\displaystyle \theta =\phi +2\pi a} , where a {\displaystyle a} is some unknown integer, has been measured.

If we wish to calculate the expected value of some function of the measured angle it will be:

f ( θ ) = p ( ϕ ) f ( ϕ + 2 π a ) d ϕ {\displaystyle \langle f(\theta )\rangle =\int _{-\infty }^{\infty }p(\phi )f(\phi +2\pi a)d\phi } .

We can express the integral as a sum of integrals over periods of 2 π {\displaystyle 2\pi } :

f ( θ ) = k = 2 π k 2 π ( k + 1 ) p ( ϕ ) f ( ϕ + 2 π a ) d ϕ {\displaystyle \langle f(\theta )\rangle =\sum _{k=-\infty }^{\infty }\int _{2\pi k}^{2\pi (k+1)}p(\phi )f(\phi +2\pi a)d\phi } .

Changing the variable of integration to θ = ϕ 2 π k {\displaystyle \theta '=\phi -2\pi k} and exchanging the order of integration and summation, we have

f ( θ ) = 0 2 π p w ( θ ) f ( θ + 2 π a ) d θ {\displaystyle \langle f(\theta )\rangle =\int _{0}^{2\pi }p_{w}(\theta ')f(\theta '+2\pi a')d\theta '}

where p w ( θ ) {\displaystyle p_{w}(\theta ')} is the PDF of the wrapped distribution and a {\displaystyle a'} is another unknown integer ( a = a + k ) {\displaystyle (a'=a+k)} . The unknown integer a {\displaystyle a'} introduces an ambiguity into the expected value of f ( θ ) {\displaystyle f(\theta )} , similar to the problem of calculating angular mean. This can be resolved by introducing the parameter z = e i θ {\displaystyle z=e^{i\theta }} , since z {\displaystyle z} has an unambiguous relationship to the true angle ϕ {\displaystyle \phi } :

z = e i θ = e i ϕ {\displaystyle z=e^{i\theta }=e^{i\phi }} .

Calculating the expected value of a function of z {\displaystyle z} will yield unambiguous answers:

f ( z ) = 0 2 π p w ( θ ) f ( e i θ ) d θ {\displaystyle \langle f(z)\rangle =\int _{0}^{2\pi }p_{w}(\theta ')f(e^{i\theta '})d\theta '} .

For this reason, the z {\displaystyle z} parameter is preferred over measured angles θ {\displaystyle \theta } in circular statistical analysis. This suggests that the wrapped distribution function may itself be expressed as a function of z {\displaystyle z} such that:

f ( z ) = p w z ( z ) f ( z ) d z {\displaystyle \langle f(z)\rangle =\oint p_{wz}(z)f(z)\,dz}

where p w ( z ) {\displaystyle p_{w}(z)} is defined such that p w ( θ ) | d θ | = p w z ( z ) | d z | {\displaystyle p_{w}(\theta )\,|d\theta |=p_{wz}(z)\,|dz|} . This concept can be extended to the multivariate context by an extension of the simple sum to a number of F {\displaystyle F} sums that cover all dimensions in the feature space:

p w ( θ ) = k 1 , . . . , k F = p ( θ + 2 π k 1 e 1 + + 2 π k F e F ) {\displaystyle p_{w}({\vec {\theta }})=\sum _{k_{1},...,k_{F}=-\infty }^{\infty }{p({\vec {\theta }}+2\pi k_{1}\mathbf {e} _{1}+\dots +2\pi k_{F}\mathbf {e} _{F})}}

where e k = ( 0 , , 0 , 1 , 0 , , 0 ) T {\displaystyle \mathbf {e} _{k}=(0,\dots ,0,1,0,\dots ,0)^{\mathsf {T}}} is the k {\displaystyle k} th Euclidean basis vector.

Expression in terms of characteristic functions

A fundamental wrapped distribution is the Dirac comb, which is a wrapped Dirac delta function:

Δ 2 π ( θ ) = k = δ ( θ + 2 π k ) {\displaystyle \Delta _{2\pi }(\theta )=\sum _{k=-\infty }^{\infty }{\delta (\theta +2\pi k)}} .

Using the delta function, a general wrapped distribution can be written

p w ( θ ) = k = p ( θ ) δ ( θ θ + 2 π k ) d θ {\displaystyle p_{w}(\theta )=\sum _{k=-\infty }^{\infty }\int _{-\infty }^{\infty }p(\theta ')\delta (\theta -\theta '+2\pi k)\,d\theta '} .

Exchanging the order of summation and integration, any wrapped distribution can be written as the convolution of the unwrapped distribution and a Dirac comb:

p w ( θ ) = p ( θ ) Δ 2 π ( θ θ ) d θ {\displaystyle p_{w}(\theta )=\int _{-\infty }^{\infty }p(\theta ')\Delta _{2\pi }(\theta -\theta ')\,d\theta '} .

The Dirac comb may also be expressed as a sum of exponentials, so we may write:

p w ( θ ) = 1 2 π p ( θ ) n = e i n ( θ θ ) d θ {\displaystyle p_{w}(\theta )={\frac {1}{2\pi }}\,\int _{-\infty }^{\infty }p(\theta ')\sum _{n=-\infty }^{\infty }e^{in(\theta -\theta ')}\,d\theta '} .

Again exchanging the order of summation and integration:

p w ( θ ) = 1 2 π n = p ( θ ) e i n ( θ θ ) d θ {\displaystyle p_{w}(\theta )={\frac {1}{2\pi }}\,\sum _{n=-\infty }^{\infty }\int _{-\infty }^{\infty }p(\theta ')e^{in(\theta -\theta ')}\,d\theta '} .

Using the definition of ϕ ( s ) {\displaystyle \phi (s)} , the characteristic function of p ( θ ) {\displaystyle p(\theta )} yields a Laurent series about zero for the wrapped distribution in terms of the characteristic function of the unwrapped distribution:

p w ( θ ) = 1 2 π n = ϕ ( n ) e i n θ {\displaystyle p_{w}(\theta )={\frac {1}{2\pi }}\,\sum _{n=-\infty }^{\infty }\phi (n)\,e^{-in\theta }}

or

p w z ( z ) = 1 2 π n = ϕ ( n ) z n {\displaystyle p_{wz}(z)={\frac {1}{2\pi }}\,\sum _{n=-\infty }^{\infty }\phi (n)\,z^{-n}}

Analogous to linear distributions, ϕ ( m ) {\displaystyle \phi (m)} is referred to as the characteristic function of the wrapped distribution (or more accurately, the characteristic sequence).[2] This is an instance of the Poisson summation formula, and it can be seen that the coefficients of the Fourier series for the wrapped distribution are simply the coefficients of the Fourier transform of the unwrapped distribution at integer values.

Moments

The moments of the wrapped distribution p w ( z ) {\displaystyle p_{w}(z)} are defined as:

z m = p w z ( z ) z m d z {\displaystyle \langle z^{m}\rangle =\oint p_{wz}(z)z^{m}\,dz} .

Expressing p w ( z ) {\displaystyle p_{w}(z)} in terms of the characteristic function and exchanging the order of integration and summation yields:

z m = 1 2 π n = ϕ ( n ) z m n d z {\displaystyle \langle z^{m}\rangle ={\frac {1}{2\pi }}\sum _{n=-\infty }^{\infty }\phi (n)\oint z^{m-n}\,dz} .

From the residue theorem we have

z m n d z = 2 π δ m n {\displaystyle \oint z^{m-n}\,dz=2\pi \delta _{m-n}}

where δ k {\displaystyle \delta _{k}} is the Kronecker delta function. It follows that the moments are simply equal to the characteristic function of the unwrapped distribution for integer arguments:

z m = ϕ ( m ) {\displaystyle \langle z^{m}\rangle =\phi (m)} .

Generation of random variates

If X {\displaystyle X} is a random variate drawn from a linear probability distribution P {\displaystyle P} , then Z = e i X {\displaystyle Z=e^{iX}} is a circular variate distributed according to the wrapped P {\displaystyle P} distribution, and θ = arg ( Z ) {\displaystyle \theta =\arg(Z)} is the angular variate distributed according to the wrapped P {\displaystyle P} distribution, with π < θ π {\displaystyle -\pi <\theta \leq \pi } .

Entropy

The information entropy of a circular distribution with probability density p w ( θ ) {\displaystyle p_{w}(\theta )} is defined as:

H = Γ p w ( θ ) ln ( p w ( θ ) ) d θ {\displaystyle H=-\int _{\Gamma }p_{w}(\theta )\,\ln(p_{w}(\theta ))\,d\theta }

where Γ {\displaystyle \Gamma } is any interval of length 2 π {\displaystyle 2\pi } .[1] If both the probability density and its logarithm can be expressed as a Fourier series (or more generally, any integral transform on the circle), the orthogonal basis of the series can be used to obtain a closed form expression for the entropy.

The moments of the distribution ϕ ( n ) {\displaystyle \phi (n)} are the Fourier coefficients for the Fourier series expansion of the probability density:

p w ( θ ) = 1 2 π n = ϕ n e i n θ {\displaystyle p_{w}(\theta )={\frac {1}{2\pi }}\sum _{n=-\infty }^{\infty }\phi _{n}e^{-in\theta }} .

If the logarithm of the probability density can also be expressed as a Fourier series:

ln ( p w ( θ ) ) = m = c m e i m θ {\displaystyle \ln(p_{w}(\theta ))=\sum _{m=-\infty }^{\infty }c_{m}e^{im\theta }}

where

c m = 1 2 π Γ ln ( p w ( θ ) ) e i m θ d θ {\displaystyle c_{m}={\frac {1}{2\pi }}\int _{\Gamma }\ln(p_{w}(\theta ))e^{-im\theta }\,d\theta } .

Then, exchanging the order of integration and summation, the entropy may be written as:

H = 1 2 π m = n = c m ϕ n Γ e i ( m n ) θ d θ {\displaystyle H=-{\frac {1}{2\pi }}\sum _{m=-\infty }^{\infty }\sum _{n=-\infty }^{\infty }c_{m}\phi _{n}\int _{\Gamma }e^{i(m-n)\theta }\,d\theta } .

Using the orthogonality of the Fourier basis, the integral may be reduced to:

H = n = c n ϕ n {\displaystyle H=-\sum _{n=-\infty }^{\infty }c_{n}\phi _{n}} .

For the particular case when the probability density is symmetric about the mean, c m = c m {\displaystyle c_{-m}=c_{m}} and the logarithm may be written:

ln ( p w ( θ ) ) = c 0 + 2 m = 1 c m cos ( m θ ) {\displaystyle \ln(p_{w}(\theta ))=c_{0}+2\sum _{m=1}^{\infty }c_{m}\cos(m\theta )}

and

c m = 1 2 π Γ ln ( p w ( θ ) ) cos ( m θ ) d θ {\displaystyle c_{m}={\frac {1}{2\pi }}\int _{\Gamma }\ln(p_{w}(\theta ))\cos(m\theta )\,d\theta }

and, since normalization requires that ϕ 0 = 1 {\displaystyle \phi _{0}=1} , the entropy may be written:

H = c 0 2 n = 1 c n ϕ n {\displaystyle H=-c_{0}-2\sum _{n=1}^{\infty }c_{n}\phi _{n}} .

See also

References

  1. ^ a b Mardia, Kantilal; Jupp, Peter E. (1999). Directional Statistics. Wiley. ISBN 978-0-471-95333-3. Retrieved 19 July 2011.
  2. ^ Mardia, K. (1972). Statistics of Directional Data. New York: Academic press.
  • Borradaile, Graham (2003). Statistics of Earth Science Data. Springer. ISBN 978-3-540-43603-4.
  • Fisher, N. I. (1996). Statistical Analysis of Circular Data. Cambridge University Press. ISBN 978-0-521-56890-6.

External links

  • Circular Values Math and Statistics with C++11, A C++11 infrastructure for circular values (angles, time-of-day, etc.) mathematics and statistics
  • v
  • t
  • e
Discrete
univariate
with finite
support
with infinite
support
Continuous
univariate
supported on a
bounded interval
supported on a
semi-infinite
interval
supported
on the whole
real line
with support
whose type varies
Mixed
univariate
continuous-
discrete
Multivariate
(joint)DirectionalDegenerate
and singular
Degenerate
Dirac delta function
Singular
Cantor
Families
  • Category
  • Commons