Location–scale family

Family of probability distributions

In probability theory, especially in mathematical statistics, a location–scale family is a family of probability distributions parametrized by a location parameter and a non-negative scale parameter. For any random variable X {\displaystyle X} whose probability distribution function belongs to such a family, the distribution function of Y = d a + b X {\displaystyle Y{\stackrel {d}{=}}a+bX} also belongs to the family (where = d {\displaystyle {\stackrel {d}{=}}} means "equal in distribution"—that is, "has the same distribution as").

In other words, a class Ω {\displaystyle \Omega } of probability distributions is a location–scale family if for all cumulative distribution functions F Ω {\displaystyle F\in \Omega } and any real numbers a R {\displaystyle a\in \mathbb {R} } and b > 0 {\displaystyle b>0} , the distribution function G ( x ) = F ( a + b x ) {\displaystyle G(x)=F(a+bx)} is also a member of Ω {\displaystyle \Omega } .

  • If X {\displaystyle X} has a cumulative distribution function F X ( x ) = P ( X x ) {\displaystyle F_{X}(x)=P(X\leq x)} , then Y = a + b X {\displaystyle Y{=}a+bX} has a cumulative distribution function F Y ( y ) = F X ( y a b ) {\displaystyle F_{Y}(y)=F_{X}\left({\frac {y-a}{b}}\right)} .
  • If X {\displaystyle X} is a discrete random variable with probability mass function p X ( x ) = P ( X = x ) {\displaystyle p_{X}(x)=P(X=x)} , then Y = a + b X {\displaystyle Y{=}a+bX} is a discrete random variable with probability mass function p Y ( y ) = p X ( y a b ) {\displaystyle p_{Y}(y)=p_{X}\left({\frac {y-a}{b}}\right)} .
  • If X {\displaystyle X} is a continuous random variable with probability density function f X ( x ) {\displaystyle f_{X}(x)} , then Y = a + b X {\displaystyle Y{=}a+bX} is a continuous random variable with probability density function f Y ( y ) = 1 b f X ( y a b ) {\displaystyle f_{Y}(y)={\frac {1}{b}}f_{X}\left({\frac {y-a}{b}}\right)} .

Moreover, if X {\displaystyle X} and Y {\displaystyle Y} are two random variables whose distribution functions are members of the family, and assuming existence of the first two moments and X {\displaystyle X} has zero mean and unit variance, then Y {\displaystyle Y} can be written as Y = d μ Y + σ Y X {\displaystyle Y{\stackrel {d}{=}}\mu _{Y}+\sigma _{Y}X} , where μ Y {\displaystyle \mu _{Y}} and σ Y {\displaystyle \sigma _{Y}} are the mean and standard deviation of Y {\displaystyle Y} .

In decision theory, if all alternative distributions available to a decision-maker are in the same location–scale family, and the first two moments are finite, then a two-moment decision model can apply, and decision-making can be framed in terms of the means and the variances of the distributions.[1][2][3]

Examples

Often, location–scale families are restricted to those where all members have the same functional form. Most location–scale families are univariate, though not all. Well-known families in which the functional form of the distribution is consistent throughout the family include the following:

Converting a single distribution to a location–scale family

The following shows how to implement a location–scale family in a statistical package or programming environment where only functions for the "standard" version of a distribution are available. It is designed for R but should generalize to any language and library.

The example here is of the Student's t-distribution, which is normally provided in R only in its standard form, with a single degrees of freedom parameter df. The versions below with _ls appended show how to generalize this to a generalized Student's t-distribution with an arbitrary location parameter m and scale parameter s.

Probability density function (PDF): dt_ls(x, df, m, s) = 1/s * dt((x - m) / s, df)
Cumulative distribution function (CDF): pt_ls(x, df, m, s) = pt((x - m) / s, df)
Quantile function (inverse CDF): qt_ls(prob, df, m, s) = qt(prob, df) * s + m
Generate a random variate: rt_ls(df, m, s) = rt(df) * s + m

Note that the generalized functions do not have standard deviation s since the standard t distribution does not have standard deviation of 1.

References

  1. ^ Meyer, Jack (1987). "Two-Moment Decision Models and Expected Utility Maximization". American Economic Review. 77 (3): 421–430. JSTOR 1804104.
  2. ^ Mayshar, J. (1978). "A Note on Feldstein's Criticism of Mean-Variance Analysis". Review of Economic Studies. 45 (1): 197–199. JSTOR 2297094.
  3. ^ Sinn, H.-W. (1983). Economic Decisions under Uncertainty (Second English ed.). North-Holland.

External links

  • http://www.randomservices.org/random/special/LocationScale.html
  • v
  • t
  • e
Continuous data
Center
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
  • Z-test (normal)
  • Student's t-test
  • F-test
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
  • Category
  • icon Mathematics portal
  • Commons
  • WikiProject
  • v
  • t
  • e
Discrete
univariate
with finite
support
with infinite
support
Continuous
univariate
supported on a
bounded interval
supported on a
semi-infinite
interval
supported
on the whole
real line
with support
whose type varies
Mixed
univariate
continuous-
discrete
Multivariate
(joint)
Directional
Degenerate
and singular
Degenerate
Dirac delta function
Singular
Cantor
Families
  • Category
  • Commons