Laplace distribution

Probability distribution
Laplace
Probability density function
Probability density plots of Laplace distributions
Cumulative distribution function
Cumulative distribution plots of Laplace distributions
Parameters μ {\displaystyle \mu } location (real)
b > 0 {\displaystyle b>0} scale (real)
Support R {\displaystyle \mathbb {R} }
PDF 1 2 b exp ( | x μ | b ) {\displaystyle {\frac {1}{2b}}\exp \left(-{\frac {|x-\mu |}{b}}\right)}
CDF { 1 2 exp ( x μ b ) if  x μ 1 1 2 exp ( x μ b ) if  x μ {\displaystyle {\begin{cases}{\frac {1}{2}}\exp \left({\frac {x-\mu }{b}}\right)&{\text{if }}x\leq \mu \\[8pt]1-{\frac {1}{2}}\exp \left(-{\frac {x-\mu }{b}}\right)&{\text{if }}x\geq \mu \end{cases}}}
Quantile { μ + b ln ( 2 F ) if  F 1 2 μ b ln ( 2 2 F ) if  F 1 2 {\displaystyle {\begin{cases}\mu +b\ln \left(2F\right)&{\text{if }}F\leq {\frac {1}{2}}\\[8pt]\mu -b\ln \left(2-2F\right)&{\text{if }}F\geq {\frac {1}{2}}\end{cases}}}
Mean μ {\displaystyle \mu }
Median μ {\displaystyle \mu }
Mode μ {\displaystyle \mu }
Variance 2 b 2 {\displaystyle 2b^{2}}
MAD b {\displaystyle b}
Skewness 0 {\displaystyle 0}
Excess kurtosis 6 {\displaystyle 6}
Entropy ln ( 2 b e ) {\displaystyle \ln(2be)}
MGF exp ( μ t ) 1 b 2 t 2  for  | t | < 1 / b {\displaystyle {\frac {\exp(\mu t)}{1-b^{2}t^{2}}}{\text{ for }}|t|<1/b}
CF exp ( μ i t ) 1 + b 2 t 2 {\displaystyle {\frac {\exp(\mu it)}{1+b^{2}t^{2}}}}
Expected shortfall { μ + b ( p 1 p ) ( 1 ln ( 2 p ) ) , p < .5 μ + b ( 1 ln ( 2 ( 1 p ) ) ) , p .5 {\displaystyle {\begin{cases}\mu +b\left({\frac {p}{1-p}}\right)(1-\ln(2p))&,p<.5\\\mu +b\left(1-\ln \left(2(1-p)\right)\right)&,p\geq .5\end{cases}}} [1]

In probability theory and statistics, the Laplace distribution is a continuous probability distribution named after Pierre-Simon Laplace. It is also sometimes called the double exponential distribution, because it can be thought of as two exponential distributions (with an additional location parameter) spliced together along the abscissa, although the term is also sometimes used to refer to the Gumbel distribution. The difference between two independent identically distributed exponential random variables is governed by a Laplace distribution, as is a Brownian motion evaluated at an exponentially distributed random time[citation needed]. Increments of Laplace motion or a variance gamma process evaluated over the time scale also have a Laplace distribution.

Definitions

Probability density function

A random variable has a Laplace ( μ , b ) {\displaystyle \operatorname {Laplace} (\mu ,b)} distribution if its probability density function is

f ( x μ , b ) = 1 2 b exp ( | x μ | b ) , {\displaystyle f(x\mid \mu ,b)={\frac {1}{2b}}\exp \left(-{\frac {|x-\mu |}{b}}\right),}

were μ {\displaystyle \mu } is a location parameter, and b > 0 {\displaystyle b>0} , which is sometimes referred to as the "diversity", is a scale parameter. If μ = 0 {\displaystyle \mu =0} and b = 1 {\displaystyle b=1} , the positive half-line is exactly an exponential distribution scaled by 1/2.

The probability density function of the Laplace distribution is also reminiscent of the normal distribution; however, whereas the normal distribution is expressed in terms of the squared difference from the mean μ {\displaystyle \mu } , the Laplace density is expressed in terms of the absolute difference from the mean. Consequently, the Laplace distribution has fatter tails than the normal distribution. It is a special case of the generalized normal distribution and the hyperbolic distribution. Continuous symmetric distributions that have exponential tails, like the Laplace distribution, but which have probability density functions that are differentiable at the mode include the logistic distribution, hyperbolic secant distribution, and the Champernowne distribution.

Cumulative distribution function

The Laplace distribution is easy to integrate (if one distinguishes two symmetric cases) due to the use of the absolute value function. Its cumulative distribution function is as follows:

F ( x ) = x f ( u ) d u = { 1 2 exp ( x μ b ) if  x < μ 1 1 2 exp ( x μ b ) if  x μ = 1 2 + 1 2 sgn ( x μ ) ( 1 exp ( | x μ | b ) ) . {\displaystyle {\begin{aligned}F(x)&=\int _{-\infty }^{x}\!\!f(u)\,\mathrm {d} u={\begin{cases}{\frac {1}{2}}\exp \left({\frac {x-\mu }{b}}\right)&{\mbox{if }}x<\mu \\1-{\frac {1}{2}}\exp \left(-{\frac {x-\mu }{b}}\right)&{\mbox{if }}x\geq \mu \end{cases}}\\&={\tfrac {1}{2}}+{\tfrac {1}{2}}\operatorname {sgn}(x-\mu )\left(1-\exp \left(-{\frac {|x-\mu |}{b}}\right)\right).\end{aligned}}}

The inverse cumulative distribution function is given by

F 1 ( p ) = μ b sgn ( p 0.5 ) ln ( 1 2 | p 0.5 | ) . {\displaystyle F^{-1}(p)=\mu -b\,\operatorname {sgn}(p-0.5)\,\ln(1-2|p-0.5|).}

Properties

Moments

μ r = ( 1 2 ) k = 0 r [ r ! ( r k ) ! b k μ ( r k ) { 1 + ( 1 ) k } ] . {\displaystyle \mu _{r}'={\bigg (}{\frac {1}{2}}{\bigg )}\sum _{k=0}^{r}{\bigg [}{\frac {r!}{(r-k)!}}b^{k}\mu ^{(r-k)}\{1+(-1)^{k}\}{\bigg ]}.}

Related distributions

  • If X Laplace ( μ , b ) {\displaystyle X\sim {\textrm {Laplace}}(\mu ,b)} then k X + c Laplace ( k μ + c , | k | b ) {\displaystyle kX+c\sim {\textrm {Laplace}}(k\mu +c,|k|b)} .
  • If X Laplace ( 0 , 1 ) {\displaystyle X\sim {\textrm {Laplace}}(0,1)} then b X Laplace ( 0 , b ) {\displaystyle bX\sim {\textrm {Laplace}}(0,b)} .
  • If X Laplace ( 0 , b ) {\displaystyle X\sim {\textrm {Laplace}}(0,b)} then | X | Exponential ( b 1 ) {\displaystyle \left|X\right|\sim {\textrm {Exponential}}\left(b^{-1}\right)} (exponential distribution).
  • If X , Y Exponential ( λ ) {\displaystyle X,Y\sim {\textrm {Exponential}}(\lambda )} then X Y Laplace ( 0 , λ 1 ) {\displaystyle X-Y\sim {\textrm {Laplace}}\left(0,\lambda ^{-1}\right)}
  • If X Laplace ( μ , b ) {\displaystyle X\sim {\textrm {Laplace}}(\mu ,b)} then | X μ | Exponential ( b 1 ) {\displaystyle \left|X-\mu \right|\sim {\textrm {Exponential}}(b^{-1})} .
  • If X Laplace ( μ , b ) {\displaystyle X\sim {\textrm {Laplace}}(\mu ,b)} then X EPD ( μ , b , 1 ) {\displaystyle X\sim {\textrm {EPD}}(\mu ,b,1)} (exponential power distribution).
  • If X 1 , . . . , X 4 N ( 0 , 1 ) {\displaystyle X_{1},...,X_{4}\sim {\textrm {N}}(0,1)} (normal distribution) then X 1 X 2 X 3 X 4 Laplace ( 0 , 1 ) {\displaystyle X_{1}X_{2}-X_{3}X_{4}\sim {\textrm {Laplace}}(0,1)} and ( X 1 2 X 2 2 + X 3 2 X 4 2 ) / 2 Laplace ( 0 , 1 ) {\displaystyle (X_{1}^{2}-X_{2}^{2}+X_{3}^{2}-X_{4}^{2})/2\sim {\textrm {Laplace}}(0,1)} .
  • If X i Laplace ( μ , b ) {\displaystyle X_{i}\sim {\textrm {Laplace}}(\mu ,b)} then 2 b i = 1 n | X i μ | χ 2 ( 2 n ) {\displaystyle {\frac {\displaystyle 2}{b}}\sum _{i=1}^{n}|X_{i}-\mu |\sim \chi ^{2}(2n)} (chi-squared distribution).
  • If X , Y Laplace ( μ , b ) {\displaystyle X,Y\sim {\textrm {Laplace}}(\mu ,b)} then | X μ | | Y μ | F ( 2 , 2 ) {\displaystyle {\tfrac {|X-\mu |}{|Y-\mu |}}\sim \operatorname {F} (2,2)} . (F-distribution)
  • If X , Y U ( 0 , 1 ) {\displaystyle X,Y\sim {\textrm {U}}(0,1)} (uniform distribution) then log ( X / Y ) Laplace ( 0 , 1 ) {\displaystyle \log(X/Y)\sim {\textrm {Laplace}}(0,1)} .
  • If X Exponential ( λ ) {\displaystyle X\sim {\textrm {Exponential}}(\lambda )} and Y Bernoulli ( 0.5 ) {\displaystyle Y\sim {\textrm {Bernoulli}}(0.5)} (Bernoulli distribution) independent of X {\displaystyle X} , then X ( 2 Y 1 ) Laplace ( 0 , λ 1 ) {\displaystyle X(2Y-1)\sim {\textrm {Laplace}}\left(0,\lambda ^{-1}\right)} .
  • If X Exponential ( λ ) {\displaystyle X\sim {\textrm {Exponential}}(\lambda )} and Y Exponential ( ν ) {\displaystyle Y\sim {\textrm {Exponential}}(\nu )} independent of X {\displaystyle X} , then λ X ν Y Laplace ( 0 , 1 ) {\displaystyle \lambda X-\nu Y\sim {\textrm {Laplace}}(0,1)}
  • If X {\displaystyle X} has a Rademacher distribution and Y Exponential ( λ ) {\displaystyle Y\sim {\textrm {Exponential}}(\lambda )} then X Y Laplace ( 0 , 1 / λ ) {\displaystyle XY\sim {\textrm {Laplace}}(0,1/\lambda )} .
  • If V Exponential ( 1 ) {\displaystyle V\sim {\textrm {Exponential}}(1)} and Z N ( 0 , 1 ) {\displaystyle Z\sim N(0,1)} independent of V {\displaystyle V} , then X = μ + b 2 V Z L a p l a c e ( μ , b ) {\displaystyle X=\mu +b{\sqrt {2V}}Z\sim \mathrm {Laplace} (\mu ,b)} .
  • If X GeometricStable ( 2 , 0 , λ , 0 ) {\displaystyle X\sim {\textrm {GeometricStable}}(2,0,\lambda ,0)} (geometric stable distribution) then X Laplace ( 0 , λ ) {\displaystyle X\sim {\textrm {Laplace}}(0,\lambda )} .
  • The Laplace distribution is a limiting case of the hyperbolic distribution.
  • If X | Y N ( μ , Y 2 ) {\displaystyle X|Y\sim {\textrm {N}}(\mu ,Y^{2})} with Y Rayleigh ( b ) {\displaystyle Y\sim {\textrm {Rayleigh}}(b)} (Rayleigh distribution) then X Laplace ( μ , b ) {\displaystyle X\sim {\textrm {Laplace}}(\mu ,b)} . Note that if Y Rayleigh ( b ) {\displaystyle Y\sim {\textrm {Rayleigh}}(b)} , then Y 2 Gamma ( 1 , 2 b 2 ) {\displaystyle Y^{2}\sim {\textrm {Gamma}}(1,2b^{2})} with E ( Y 2 ) = 2 b 2 {\displaystyle {\textrm {E}}(Y^{2})=2b^{2}} , which in turn equals the exponential distribution Exp ( 1 / ( 2 b 2 ) ) {\displaystyle {\textrm {Exp}}(1/(2b^{2}))} .
  • Given an integer n 1 {\displaystyle n\geq 1} , if X i , Y i Γ ( 1 n , b ) {\displaystyle X_{i},Y_{i}\sim \Gamma \left({\frac {1}{n}},b\right)} (gamma distribution, using k , θ {\displaystyle k,\theta } characterization), then i = 1 n ( μ n + X i Y i ) Laplace ( μ , b ) {\displaystyle \sum _{i=1}^{n}\left({\frac {\mu }{n}}+X_{i}-Y_{i}\right)\sim {\textrm {Laplace}}(\mu ,b)} (infinite divisibility)[2]
  • If X has a Laplace distribution, then Y = eX has a log-Laplace distribution; conversely, if X has a log-Laplace distribution, then its logarithm has a Laplace distribution.

Probability of a Laplace being greater than another

Let X , Y {\displaystyle X,Y} be independent laplace random variables: X Laplace ( μ X , b X ) {\displaystyle X\sim {\textrm {Laplace}}(\mu _{X},b_{X})} and Y Laplace ( μ Y , b Y ) {\displaystyle Y\sim {\textrm {Laplace}}(\mu _{Y},b_{Y})} , and we want to compute P ( X > Y ) {\displaystyle P(X>Y)} .

The probability of P ( X > Y ) {\displaystyle P(X>Y)} can be reduced (using the properties below) to P ( μ + b Z 1 > Z 2 ) {\displaystyle P(\mu +bZ_{1}>Z_{2})} , where Z 1 , Z 2 Laplace ( 0 , 1 ) {\displaystyle Z_{1},Z_{2}\sim {\textrm {Laplace}}(0,1)} . This probability is equal to

P ( μ + b Z 1 > Z 2 ) = { b 2 e μ / b e μ 2 ( b 2 1 ) , when  μ < 0 1 b 2 e μ / b e μ 2 ( b 2 1 ) , when  μ > 0 {\displaystyle P(\mu +bZ_{1}>Z_{2})={\begin{cases}{\frac {b^{2}e^{\mu /b}-e^{\mu }}{2(b^{2}-1)}},&{\text{when }}\mu <0\\1-{\frac {b^{2}e^{-\mu /b}-e^{-\mu }}{2(b^{2}-1)}},&{\text{when }}\mu >0\\\end{cases}}}

When b = 1 {\displaystyle b=1} , both expressions are replaced by their limit as b 1 {\displaystyle b\to 1} :

P ( μ + Z 1 > Z 2 ) = { e μ ( 2 μ ) 4 , when  μ < 0 1 e μ ( 2 + μ ) 4 , when  μ > 0 {\displaystyle P(\mu +Z_{1}>Z_{2})={\begin{cases}e^{\mu }{\frac {(2-\mu )}{4}},&{\text{when }}\mu <0\\1-e^{-\mu }{\frac {(2+\mu )}{4}},&{\text{when }}\mu >0\\\end{cases}}}

To compute the case for μ > 0 {\displaystyle \mu >0} , note that P ( μ + Z 1 > Z 2 ) = 1 P ( μ + Z 1 < Z 2 ) = 1 P ( μ Z 1 > Z 2 ) = 1 P ( μ + Z 1 > Z 2 ) {\displaystyle P(\mu +Z_{1}>Z_{2})=1-P(\mu +Z_{1}<Z_{2})=1-P(-\mu -Z_{1}>-Z_{2})=1-P(-\mu +Z_{1}>Z_{2})}

since Z Z {\displaystyle Z\sim -Z} when Z Laplace ( 0 , 1 ) {\displaystyle Z\sim {\textrm {Laplace}}(0,1)}

Relation to the exponential distribution

A Laplace random variable can be represented as the difference of two independent and identically distributed (iid) exponential random variables.[2] One way to show this is by using the characteristic function approach. For any set of independent continuous random variables, for any linear combination of those variables, its characteristic function (which uniquely determines the distribution) can be acquired by multiplying the corresponding characteristic functions.

Consider two i.i.d random variables X , Y Exponential ( λ ) {\displaystyle X,Y\sim {\textrm {Exponential}}(\lambda )} . The characteristic functions for X , Y {\displaystyle X,-Y} are

λ i t + λ , λ i t + λ {\displaystyle {\frac {\lambda }{-it+\lambda }},\quad {\frac {\lambda }{it+\lambda }}}

respectively. On multiplying these characteristic functions (equivalent to the characteristic function of the sum of the random variables X + ( Y ) {\displaystyle X+(-Y)} ), the result is

λ 2 ( i t + λ ) ( i t + λ ) = λ 2 t 2 + λ 2 . {\displaystyle {\frac {\lambda ^{2}}{(-it+\lambda )(it+\lambda )}}={\frac {\lambda ^{2}}{t^{2}+\lambda ^{2}}}.}

This is the same as the characteristic function for Z Laplace ( 0 , 1 / λ ) {\displaystyle Z\sim {\textrm {Laplace}}(0,1/\lambda )} , which is

1 1 + t 2 λ 2 . {\displaystyle {\frac {1}{1+{\frac {t^{2}}{\lambda ^{2}}}}}.}

Sargan distributions

Sargan distributions are a system of distributions of which the Laplace distribution is a core member. A p {\displaystyle p} th order Sargan distribution has density[3][4]

f p ( x ) = 1 2 exp ( α | x | ) 1 + j = 1 p β j α j | x | j 1 + j = 1 p j ! β j , {\displaystyle f_{p}(x)={\tfrac {1}{2}}\exp(-\alpha |x|){\frac {\displaystyle 1+\sum _{j=1}^{p}\beta _{j}\alpha ^{j}|x|^{j}}{\displaystyle 1+\sum _{j=1}^{p}j!\beta _{j}}},}

for parameters α 0 , β j 0 {\displaystyle \alpha \geq 0,\beta _{j}\geq 0} . The Laplace distribution results for p = 0 {\displaystyle p=0} .

Statistical inference

Given n {\displaystyle n} independent and identically distributed samples x 1 , x 2 , . . . , x n {\displaystyle x_{1},x_{2},...,x_{n}} , the maximum likelihood (MLE) estimator of μ {\displaystyle \mu } is the sample median,[5]

μ ^ = m e d ( x ) . {\displaystyle {\hat {\mu }}=\mathrm {med} (x).}

The MLE estimator of b {\displaystyle b} is the mean absolute deviation from the median,[citation needed]

b ^ = 1 n i = 1 n | x i μ ^ | . {\displaystyle {\hat {b}}={\frac {1}{n}}\sum _{i=1}^{n}|x_{i}-{\hat {\mu }}|.}

revealing a link between the Laplace distribution and least absolute deviations. A correction for small samples can be applied as follows:

b ^ = b ^ n / ( n 2 ) {\displaystyle {\hat {b}}^{*}={\hat {b}}\cdot n/(n-2)}

(see: exponential distribution#Parameter estimation).

Occurrence and applications

The Laplacian distribution has been used in speech recognition to model priors on DFT coefficients [6] and in JPEG image compression to model AC coefficients [7] generated by a DCT.

  • The addition of noise drawn from a Laplacian distribution, with scaling parameter appropriate to a function's sensitivity, to the output of a statistical database query is the most common means to provide differential privacy in statistical databases.
Fitted Laplace distribution to maximum one-day rainfalls [8]
The Laplace distribution, being a composite or double distribution, is applicable in situations where the lower values originate under different external conditions than the higher ones so that they follow a different pattern.[12]

Random variate generation

Given a random variable U {\displaystyle U} drawn from the uniform distribution in the interval ( 1 / 2 , 1 / 2 ) {\displaystyle \left(-1/2,1/2\right)} , the random variable

X = μ b sgn ( U ) ln ( 1 2 | U | ) {\displaystyle X=\mu -b\,\operatorname {sgn}(U)\,\ln(1-2|U|)}

has a Laplace distribution with parameters μ {\displaystyle \mu } and b {\displaystyle b} . This follows from the inverse cumulative distribution function given above.

A Laplace ( 0 , b ) {\displaystyle {\textrm {Laplace}}(0,b)} variate can also be generated as the difference of two i.i.d. Exponential ( 1 / b ) {\displaystyle {\textrm {Exponential}}(1/b)} random variables. Equivalently, Laplace ( 0 , 1 ) {\displaystyle {\textrm {Laplace}}(0,1)} can also be generated as the logarithm of the ratio of two i.i.d. uniform random variables.

History

This distribution is often referred to as "Laplace's first law of errors". He published it in 1774, modeling the frequency of an error as an exponential function of its magnitude once its sign was disregarded. Laplace would later replace this model with his "second law of errors", based on the normal distribution, after the discovery of the central limit theorem.[13][14]

Keynes published a paper in 1911 based on his earlier thesis wherein he showed that the Laplace distribution minimised the absolute deviation from the median.[15]

See also

References

  1. ^ a b Norton, Matthew; Khokhlov, Valentyn; Uryasev, Stan (2019). "Calculating CVaR and bPOE for common probability distributions with application to portfolio optimization and density estimation" (PDF). Annals of Operations Research. 299 (1–2). Springer: 1281–1315. doi:10.1007/s10479-019-03373-1. Retrieved 2023-02-27.
  2. ^ a b Kotz, Samuel; Kozubowski, Tomasz J.; Podgórski, Krzysztof (2001). The Laplace distribution and generalizations: a revisit with applications to Communications, Economics, Engineering and Finance. Birkhauser. pp. 23 (Proposition 2.2.2, Equation 2.2.8). ISBN 9780817641665.
  3. ^ Everitt, B.S. (2002) The Cambridge Dictionary of Statistics, CUP. ISBN 0-521-81099-X
  4. ^ Johnson, N.L., Kotz S., Balakrishnan, N. (1994) Continuous Univariate Distributions, Wiley. ISBN 0-471-58495-9. p. 60
  5. ^ Robert M. Norton (May 1984). "The Double Exponential Distribution: Using Calculus to Find a Maximum Likelihood Estimator". The American Statistician. 38 (2). American Statistical Association: 135–136. doi:10.2307/2683252. JSTOR 2683252.
  6. ^ Eltoft, T.; Taesu Kim; Te-Won Lee (2006). "On the multivariate Laplace distribution" (PDF). IEEE Signal Processing Letters. 13 (5): 300–303. doi:10.1109/LSP.2006.870353. S2CID 1011487. Archived from the original (PDF) on 2013-06-06. Retrieved 2012-07-04.
  7. ^ Minguillon, J.; Pujol, J. (2001). "JPEG standard uniform quantization error modeling with applications to sequential and progressive operation modes" (PDF). Journal of Electronic Imaging. 10 (2): 475–485. doi:10.1117/1.1344592. hdl:10609/6263.
  8. ^ CumFreq for probability distribution fitting
  9. ^ Pardo, Scott (2020). Statistical Analysis of Empirical Data Methods for Applied Sciences. Springer. p. 58. ISBN 978-3-030-43327-7.
  10. ^ Kou, S.G. (August 8, 2002). "A Jump-Diffusion Model for Option Pricing". Management Science. 48 (8): 1086–1101. doi:10.1287/mnsc.48.8.1086.166. JSTOR 822677. Retrieved 2022-03-01.
  11. ^ Chen, Jian (2018). General Equilibrium Option Pricing Method: Theoretical and Empirical Study. Springer. p. 70. ISBN 9789811074288.
  12. ^ A collection of composite distributions
  13. ^ Laplace, P-S. (1774). Mémoire sur la probabilité des causes par les évènements. Mémoires de l’Academie Royale des Sciences Presentés par Divers Savan, 6, 621–656
  14. ^ Wilson, Edwin Bidwell (1923). "First and Second Laws of Error". Journal of the American Statistical Association. 18 (143). Informa UK Limited: 841–851. doi:10.1080/01621459.1923.10502116. ISSN 0162-1459. Public Domain This article incorporates text from this source, which is in the public domain.
  15. ^ Keynes, J. M. (1911). "The Principal Averages and the Laws of Error which Lead to Them". Journal of the Royal Statistical Society. 74 (3). JSTOR: 322–331. doi:10.2307/2340444. ISSN 0952-8385. JSTOR 2340444.

External links

  • v
  • t
  • e
Discrete
univariate
with finite
support
with infinite
support
Continuous
univariate
supported on a
bounded interval
supported on a
semi-infinite
interval
supported
on the whole
real line
with support
whose type varies
Mixed
univariate
continuous-
discrete
Multivariate
(joint)DirectionalDegenerate
and singular
Degenerate
Dirac delta function
Singular
Cantor
Families
  • Category
  • Commons