Wrapped Cauchy distribution

Wrapped Cauchy
Probability density function
Plot of the wrapped Cauchy PDF, '"`UNIQ--postMath-00000001-QINU`"'
The support is chosen to be [-π,π)
Cumulative distribution function
Plot of the wrapped Cauchy CDF '"`UNIQ--postMath-00000002-QINU`"'
The support is chosen to be [-π,π)
Parameters μ {\displaystyle \mu } Real
γ > 0 {\displaystyle \gamma >0}
Support π θ < π {\displaystyle -\pi \leq \theta <\pi }
PDF 1 2 π sinh ( γ ) cosh ( γ ) cos ( θ μ ) {\displaystyle {\frac {1}{2\pi }}\,{\frac {\sinh(\gamma )}{\cosh(\gamma )-\cos(\theta -\mu )}}}
CDF {\displaystyle \,}
Mean μ {\displaystyle \mu } (circular)
Variance 1 e γ {\displaystyle 1-e^{-\gamma }} (circular)
Entropy ln ( 2 π ( 1 e 2 γ ) ) {\displaystyle \ln(2\pi (1-e^{-2\gamma }))} (differential)
CF e i n μ | n | γ {\displaystyle e^{in\mu -|n|\gamma }}

In probability theory and directional statistics, a wrapped Cauchy distribution is a wrapped probability distribution that results from the "wrapping" of the Cauchy distribution around the unit circle. The Cauchy distribution is sometimes known as a Lorentzian distribution, and the wrapped Cauchy distribution may sometimes be referred to as a wrapped Lorentzian distribution.

The wrapped Cauchy distribution is often found in the field of spectroscopy where it is used to analyze diffraction patterns (e.g. see Fabry–Pérot interferometer).

Description

The probability density function of the wrapped Cauchy distribution is:[1]

f W C ( θ ; μ , γ ) = n = γ π ( γ 2 + ( θ μ + 2 π n ) 2 ) π < θ < π {\displaystyle f_{WC}(\theta ;\mu ,\gamma )=\sum _{n=-\infty }^{\infty }{\frac {\gamma }{\pi (\gamma ^{2}+(\theta -\mu +2\pi n)^{2})}}\qquad -\pi <\theta <\pi }

where γ {\displaystyle \gamma } is the scale factor and μ {\displaystyle \mu } is the peak position of the "unwrapped" distribution. Expressing the above pdf in terms of the characteristic function of the Cauchy distribution yields:

f W C ( θ ; μ , γ ) = 1 2 π n = e i n ( θ μ ) | n | γ = 1 2 π sinh γ cosh γ cos ( θ μ ) {\displaystyle f_{WC}(\theta ;\mu ,\gamma )={\frac {1}{2\pi }}\sum _{n=-\infty }^{\infty }e^{in(\theta -\mu )-|n|\gamma }={\frac {1}{2\pi }}\,\,{\frac {\sinh \gamma }{\cosh \gamma -\cos(\theta -\mu )}}}

The PDF may also be expressed in terms of the circular variable z = e and the complex parameter ζ = ei(μ+)

f W C ( z ; ζ ) = 1 2 π 1 | ζ | 2 | z ζ | 2 {\displaystyle f_{WC}(z;\zeta )={\frac {1}{2\pi }}\,\,{\frac {1-|\zeta |^{2}}{|z-\zeta |^{2}}}}

where, as shown below, ζ = ⟨z⟩.

In terms of the circular variable z = e i θ {\displaystyle z=e^{i\theta }} the circular moments of the wrapped Cauchy distribution are the characteristic function of the Cauchy distribution evaluated at integer arguments:

z n = Γ e i n θ f W C ( θ ; μ , γ ) d θ = e i n μ | n | γ . {\displaystyle \langle z^{n}\rangle =\int _{\Gamma }e^{in\theta }\,f_{WC}(\theta ;\mu ,\gamma )\,d\theta =e^{in\mu -|n|\gamma }.}

where Γ {\displaystyle \Gamma \,} is some interval of length 2 π {\displaystyle 2\pi } . The first moment is then the average value of z, also known as the mean resultant, or mean resultant vector:

z = e i μ γ {\displaystyle \langle z\rangle =e^{i\mu -\gamma }}

The mean angle is

θ = A r g z = μ {\displaystyle \langle \theta \rangle =\mathrm {Arg} \langle z\rangle =\mu }

and the length of the mean resultant is

R = | z | = e γ {\displaystyle R=|\langle z\rangle |=e^{-\gamma }}

yielding a circular variance of 1 − R.

Estimation of parameters

A series of N measurements z n = e i θ n {\displaystyle z_{n}=e^{i\theta _{n}}} drawn from a wrapped Cauchy distribution may be used to estimate certain parameters of the distribution. The average of the series z ¯ {\displaystyle {\overline {z}}} is defined as

z ¯ = 1 N n = 1 N z n {\displaystyle {\overline {z}}={\frac {1}{N}}\sum _{n=1}^{N}z_{n}}

and its expectation value will be just the first moment:

z ¯ = e i μ γ {\displaystyle \langle {\overline {z}}\rangle =e^{i\mu -\gamma }}

In other words, z ¯ {\displaystyle {\overline {z}}} is an unbiased estimator of the first moment. If we assume that the peak position μ {\displaystyle \mu } lies in the interval [ π , π ) {\displaystyle [-\pi ,\pi )} , then Arg ( z ¯ ) {\displaystyle ({\overline {z}})} will be a (biased) estimator of the peak position μ {\displaystyle \mu } .

Viewing the z n {\displaystyle z_{n}} as a set of vectors in the complex plane, the R ¯ 2 {\displaystyle {\overline {R}}^{2}} statistic is the length of the averaged vector:

R ¯ 2 = z ¯ z ¯ = ( 1 N n = 1 N cos θ n ) 2 + ( 1 N n = 1 N sin θ n ) 2 {\displaystyle {\overline {R}}^{2}={\overline {z}}\,{\overline {z^{*}}}=\left({\frac {1}{N}}\sum _{n=1}^{N}\cos \theta _{n}\right)^{2}+\left({\frac {1}{N}}\sum _{n=1}^{N}\sin \theta _{n}\right)^{2}}

and its expectation value is

R ¯ 2 = 1 N + N 1 N e 2 γ . {\displaystyle \langle {\overline {R}}^{2}\rangle ={\frac {1}{N}}+{\frac {N-1}{N}}e^{-2\gamma }.}

In other words, the statistic

R e 2 = N N 1 ( R ¯ 2 1 N ) {\displaystyle R_{e}^{2}={\frac {N}{N-1}}\left({\overline {R}}^{2}-{\frac {1}{N}}\right)}

will be an unbiased estimator of e 2 γ {\displaystyle e^{-2\gamma }} , and ln ( 1 / R e 2 ) / 2 {\displaystyle \ln(1/R_{e}^{2})/2} will be a (biased) estimator of γ {\displaystyle \gamma } .

Entropy

The information entropy of the wrapped Cauchy distribution is defined as:[1]

H = Γ f W C ( θ ; μ , γ ) ln ( f W C ( θ ; μ , γ ) ) d θ {\displaystyle H=-\int _{\Gamma }f_{WC}(\theta ;\mu ,\gamma )\,\ln(f_{WC}(\theta ;\mu ,\gamma ))\,d\theta }

where Γ {\displaystyle \Gamma } is any interval of length 2 π {\displaystyle 2\pi } . The logarithm of the density of the wrapped Cauchy distribution may be written as a Fourier series in θ {\displaystyle \theta \,} :

ln ( f W C ( θ ; μ , γ ) ) = c 0 + 2 m = 1 c m cos ( m θ ) {\displaystyle \ln(f_{WC}(\theta ;\mu ,\gamma ))=c_{0}+2\sum _{m=1}^{\infty }c_{m}\cos(m\theta )}

where

c m = 1 2 π Γ ln ( sinh γ 2 π ( cosh γ cos θ ) ) cos ( m θ ) d θ {\displaystyle c_{m}={\frac {1}{2\pi }}\int _{\Gamma }\ln \left({\frac {\sinh \gamma }{2\pi (\cosh \gamma -\cos \theta )}}\right)\cos(m\theta )\,d\theta }

which yields:

c 0 = ln ( 1 e 2 γ 2 π ) {\displaystyle c_{0}=\ln \left({\frac {1-e^{-2\gamma }}{2\pi }}\right)}

(c.f. Gradshteyn and Ryzhik[2] 4.224.15) and

c m = e m γ m f o r m > 0 {\displaystyle c_{m}={\frac {e^{-m\gamma }}{m}}\qquad \mathrm {for} \,m>0}

(c.f. Gradshteyn and Ryzhik[2] 4.397.6). The characteristic function representation for the wrapped Cauchy distribution in the left side of the integral is:

f W C ( θ ; μ , γ ) = 1 2 π ( 1 + 2 n = 1 ϕ n cos ( n θ ) ) {\displaystyle f_{WC}(\theta ;\mu ,\gamma )={\frac {1}{2\pi }}\left(1+2\sum _{n=1}^{\infty }\phi _{n}\cos(n\theta )\right)}

where ϕ n = e | n | γ {\displaystyle \phi _{n}=e^{-|n|\gamma }} . Substituting these expressions into the entropy integral, exchanging the order of integration and summation, and using the orthogonality of the cosines, the entropy may be written:

H = c 0 2 m = 1 ϕ m c m = ln ( 1 e 2 γ 2 π ) 2 m = 1 e 2 n γ n {\displaystyle H=-c_{0}-2\sum _{m=1}^{\infty }\phi _{m}c_{m}=-\ln \left({\frac {1-e^{-2\gamma }}{2\pi }}\right)-2\sum _{m=1}^{\infty }{\frac {e^{-2n\gamma }}{n}}}

The series is just the Taylor expansion for the logarithm of ( 1 e 2 γ ) {\displaystyle (1-e^{-2\gamma })} so the entropy may be written in closed form as:

H = ln ( 2 π ( 1 e 2 γ ) ) {\displaystyle H=\ln(2\pi (1-e^{-2\gamma }))\,}

Circular Cauchy distribution

If X is Cauchy distributed with median μ and scale parameter γ, then the complex variable

Z = X i X + i {\displaystyle Z={\frac {X-i}{X+i}}}

has unit modulus and is distributed on the unit circle with density:[3]

f C C ( θ , μ , γ ) = 1 2 π 1 | ζ | 2 | e i θ ζ | 2 {\displaystyle f_{CC}(\theta ,\mu ,\gamma )={\frac {1}{2\pi }}{\frac {1-|\zeta |^{2}}{|e^{i\theta }-\zeta |^{2}}}}

where

ζ = ψ i ψ + i {\displaystyle \zeta ={\frac {\psi -i}{\psi +i}}}

and ψ expresses the two parameters of the associated linear Cauchy distribution for x as a complex number:

ψ = μ + i γ {\displaystyle \psi =\mu +i\gamma \,}

It can be seen that the circular Cauchy distribution has the same functional form as the wrapped Cauchy distribution in z and ζ (i.e. fWC(z,ζ)). The circular Cauchy distribution is a reparameterized wrapped Cauchy distribution:

f C C ( θ , m , γ ) = f W C ( e i θ , m + i γ i m + i γ + i ) {\displaystyle f_{CC}(\theta ,m,\gamma )=f_{WC}\left(e^{i\theta },\,{\frac {m+i\gamma -i}{m+i\gamma +i}}\right)}

The distribution f C C ( θ ; μ , γ ) {\displaystyle f_{CC}(\theta ;\mu ,\gamma )} is called the circular Cauchy distribution[3][4] (also the complex Cauchy distribution[3]) with parameters μ and γ. (See also McCullagh's parametrization of the Cauchy distributions and Poisson kernel for related concepts.)

The circular Cauchy distribution expressed in complex form has finite moments of all orders

E [ Z n ] = ζ n , E [ Z ¯ n ] = ζ ¯ n {\displaystyle \operatorname {E} [Z^{n}]=\zeta ^{n},\quad \operatorname {E} [{\bar {Z}}^{n}]={\bar {\zeta }}^{n}}

for integer n ≥ 1. For |φ| < 1, the transformation

U ( z , ϕ ) = z ϕ 1 ϕ ¯ z {\displaystyle U(z,\phi )={\frac {z-\phi }{1-{\bar {\phi }}z}}}

is holomorphic on the unit disk, and the transformed variable U(Z, φ) is distributed as complex Cauchy with parameter U(ζ, φ).

Given a sample z1, ..., zn of size n > 2, the maximum-likelihood equation

n 1 U ( z , ζ ^ ) = n 1 U ( z j , ζ ^ ) = 0 {\displaystyle n^{-1}U\left(z,{\hat {\zeta }}\right)=n^{-1}\sum U\left(z_{j},{\hat {\zeta }}\right)=0}

can be solved by a simple fixed-point iteration:

ζ ( r + 1 ) = U ( n 1 U ( z , ζ ( r ) ) , ζ ( r ) ) {\displaystyle \zeta ^{(r+1)}=U\left(n^{-1}U(z,\zeta ^{(r)}),\,-\zeta ^{(r)}\right)\,}

starting with ζ(0) = 0. The sequence of likelihood values is non-decreasing, and the solution is unique for samples containing at least three distinct values.[5]

The maximum-likelihood estimate for the median ( μ ^ {\displaystyle {\hat {\mu }}} ) and scale parameter ( γ ^ {\displaystyle {\hat {\gamma }}} ) of a real Cauchy sample is obtained by the inverse transformation:

μ ^ ± i γ ^ = i 1 + ζ ^ 1 ζ ^ . {\displaystyle {\hat {\mu }}\pm i{\hat {\gamma }}=i{\frac {1+{\hat {\zeta }}}{1-{\hat {\zeta }}}}.}

For n ≤ 4, closed-form expressions are known for ζ ^ {\displaystyle {\hat {\zeta }}} .[6] The density of the maximum-likelihood estimator at t in the unit disk is necessarily of the form:

1 4 π p n ( χ ( t , ζ ) ) ( 1 | t | 2 ) 2 , {\displaystyle {\frac {1}{4\pi }}{\frac {p_{n}(\chi (t,\zeta ))}{(1-|t|^{2})^{2}}},}

where

χ ( t , ζ ) = | t ζ | 2 4 ( 1 | t | 2 ) ( 1 | ζ | 2 ) {\displaystyle \chi (t,\zeta )={\frac {|t-\zeta |^{2}}{4(1-|t|^{2})(1-|\zeta |^{2})}}} .

Formulae for p3 and p4 are available.[7]

See also

References

  1. ^ a b Mardia, Kantilal; Jupp, Peter E. (1999). Directional Statistics. Wiley. ISBN 978-0-471-95333-3.
  2. ^ a b Gradshteyn, Izrail Solomonovich; Ryzhik, Iosif Moiseevich; Geronimus, Yuri Veniaminovich; Tseytlin, Michail Yulyevich (February 2007). Jeffrey, Alan; Zwillinger, Daniel (eds.). Table of Integrals, Series, and Products. Translated by Scripta Technica, Inc. (7 ed.). Academic Press, Inc. ISBN 0-12-373637-4. LCCN 2010481177.
  3. ^ a b c McCullagh, Peter (June 1992). "Conditional inference and Cauchy models" (PDF). Biometrika. 79 (2): 247–259. doi:10.1093/biomet/79.2.247. Retrieved 26 January 2016.
  4. ^ K.V. Mardia (1972). Statistics of Directional Data. Academic Press.[page needed]
  5. ^ J. Copas (1975). "On the unimodality of the likelihood function for the Cauchy distribution". Biometrika. 62 (3): 701–704. doi:10.1093/biomet/62.3.701.
  6. ^ Ferguson, Thomas S. (1978). "Maximum Likelihood Estimates of the Parameters of the Cauchy Distribution for Samples of Size 3 and 4". Journal of the American Statistical Association. 73 (361): 211–213. doi:10.1080/01621459.1978.10480031. JSTOR 2286549.
  7. ^ P. McCullagh (1996). "Möbius transformation and Cauchy parameter estimation". Annals of Statistics. 24 (2): 786–808. JSTOR 2242674.
  • Borradaile, Graham (2003). Statistics of Earth Science Data. Springer. ISBN 978-3-540-43603-4. Retrieved 31 Dec 2009.
  • Fisher, N. I. (1996). Statistical Analysis of Circular Data. Cambridge University Press. ISBN 978-0-521-56890-6. Retrieved 2010-02-09.
  • v
  • t
  • e
Discrete
univariate
with finite
support
with infinite
support
Continuous
univariate
supported on a
bounded interval
supported on a
semi-infinite
interval
supported
on the whole
real line
with support
whose type varies
Mixed
univariate
continuous-
discrete
Multivariate
(joint)DirectionalDegenerate
and singular
Degenerate
Dirac delta function
Singular
Cantor
Families
  • Category
  • Commons