Erlang distribution

Article snapshot taken from Wikipedia under the Creative Commons Attribution-ShareAlike license.
Family of continuous probability distributions.

This article is about the mathematical and statistical distribution. For other uses, see Erlang.
Erlang
[Figure: probability density plots of Erlang distributions]
[Figure: cumulative distribution plots of Erlang distributions]
Parameters: $k \in \{1, 2, 3, \ldots\}$, shape; $\lambda \in (0, \infty)$, rate; alternatively $\beta = 1/\lambda$, scale
Support: $x \in [0, \infty)$
PDF: $\frac{\lambda^{k} x^{k-1} e^{-\lambda x}}{(k-1)!}$
CDF: $P(k, \lambda x) = \frac{\gamma(k, \lambda x)}{(k-1)!} = 1 - \sum_{n=0}^{k-1} \frac{1}{n!} e^{-\lambda x} (\lambda x)^{n}$
Mean: $\frac{k}{\lambda}$
Median: no simple closed form
Mode: $\frac{1}{\lambda}(k-1)$
Variance: $\frac{k}{\lambda^{2}}$
Skewness: $\frac{2}{\sqrt{k}}$
Excess kurtosis: $\frac{6}{k}$
Entropy: $(1-k)\psi(k) + \ln\left[\frac{\Gamma(k)}{\lambda}\right] + k$
MGF: $\left(1 - \frac{t}{\lambda}\right)^{-k}$ for $t < \lambda$
CF: $\left(1 - \frac{it}{\lambda}\right)^{-k}$

The Erlang distribution is a two-parameter family of continuous probability distributions with support $x \in [0, \infty)$. The two parameters are:

  • a positive integer $k$, the "shape", and
  • a positive real number $\lambda$, the "rate". The "scale", $\beta$, the reciprocal of the rate, is sometimes used instead.

The Erlang distribution is the distribution of a sum of $k$ independent exponential variables with mean $1/\lambda$ each. Equivalently, it is the distribution of the time until the kth event of a Poisson process with a rate of $\lambda$. The Erlang and Poisson distributions are complementary: the Poisson distribution counts the events that occur in a fixed amount of time, while the Erlang distribution measures the amount of time until the occurrence of a fixed number of events. When $k = 1$, the distribution simplifies to the exponential distribution. The Erlang distribution is a special case of the gamma distribution in which the shape parameter is restricted to positive integers.

The Erlang distribution was developed by A. K. Erlang to examine the number of telephone calls that might be made at the same time to the operators of the switching stations. This work on telephone traffic engineering has been expanded to consider waiting times in queueing systems in general. The distribution is also used in the field of stochastic processes.

Characterization

Probability density function

The probability density function of the Erlang distribution is

$$f(x; k, \lambda) = \frac{\lambda^{k} x^{k-1} e^{-\lambda x}}{(k-1)!} \quad \text{for } x \geq 0,\ \lambda > 0.$$

The parameter $k$ is called the shape parameter, and the parameter $\lambda$ is called the rate parameter.

An alternative, but equivalent, parametrization uses the scale parameter $\beta$, which is the reciprocal of the rate parameter (i.e., $\beta = 1/\lambda$):

$$f(x; k, \beta) = \frac{x^{k-1} e^{-x/\beta}}{\beta^{k} (k-1)!} \quad \text{for } x \geq 0,\ \beta > 0.$$

When the scale parameter $\beta$ equals 2, the distribution simplifies to the chi-squared distribution with $2k$ degrees of freedom. It can therefore be regarded as a generalized chi-squared distribution for even numbers of degrees of freedom.
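
As a minimal sketch, the rate form of the density can be evaluated with the Python standard library and compared with the chi-squared density for scale 2 (rate 1/2). The helper names erlang_pdf and chi2_pdf are illustrative and do not come from any library.

```python
import math

def erlang_pdf(x: float, k: int, lam: float) -> float:
    """Erlang density in the rate parametrization (illustrative helper)."""
    if k < 1 or lam <= 0:
        raise ValueError("k must be a positive integer and lam must be positive")
    if x < 0:
        return 0.0
    return lam**k * x**(k - 1) * math.exp(-lam * x) / math.factorial(k - 1)

def chi2_pdf(x: float, dof: int) -> float:
    """Chi-squared density, written out directly for comparison."""
    if x < 0:
        return 0.0
    half = dof / 2
    return x**(half - 1) * math.exp(-x / 2) / (2**half * math.gamma(half))

# Scale beta = 2 corresponds to rate lam = 1/2; the two columns should agree.
k = 3
for x in (0.5, 2.0, 7.5):
    print(x, erlang_pdf(x, k, 0.5), chi2_pdf(x, 2 * k))
```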

Cumulative distribution function (CDF)

The cumulative distribution function of the Erlang distribution is

$$F(x; k, \lambda) = P(k, \lambda x) = \frac{\gamma(k, \lambda x)}{\Gamma(k)} = \frac{\gamma(k, \lambda x)}{(k-1)!},$$

where $\gamma$ is the lower incomplete gamma function and $P$ is the lower regularized gamma function. The CDF may also be expressed as

$$F(x; k, \lambda) = 1 - \sum_{n=0}^{k-1} \frac{1}{n!} e^{-\lambda x} (\lambda x)^{n}.$$
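
Because the sum is finite, the CDF can be evaluated without any special functions. The following is a minimal sketch; the function name erlang_cdf is an illustrative choice.

```python
import math

def erlang_cdf(x: float, k: int, lam: float) -> float:
    """Erlang CDF via the finite sum 1 - sum_{n=0}^{k-1} e^{-lam x} (lam x)^n / n!."""
    if x <= 0:
        return 0.0
    term = math.exp(-lam * x)  # n = 0 term of the sum
    total = term
    for n in range(1, k):
        term *= lam * x / n    # turns the (n-1)th term into the nth term
        total += term
    return 1.0 - total

# For k = 1 this reduces to the exponential CDF 1 - e^{-lam x}.
print(erlang_cdf(1.0, 1, 2.0), 1.0 - math.exp(-2.0))
print(erlang_cdf(1.0, 2, 1.0), 1.0 - 2.0 / math.e)
```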

Erlang-k

The Erlang-k distribution (where k is a positive integer), $E_{k}(\lambda)$, is obtained by fixing the value of k in the PDF of the Erlang distribution. For instance, the Erlang-2 distribution is $E_{2}(\lambda) = \lambda^{2} x e^{-\lambda x}$ for $x \geq 0$, $\lambda > 0$, which is the same as $f(x; 2, \lambda)$.

Median

An asymptotic expansion is known for the median of an Erlang distribution, for which coefficients can be computed and bounds are known. An approximation is $\frac{k}{\lambda}\left(1 - \frac{1}{3k + 0.2}\right)$, which lies below the mean $\frac{k}{\lambda}$.
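
The approximation can be compared with a numerically computed median. The sketch below finds the median by bisection on the finite-sum form of the CDF, using the fact that the median lies below the mean to bracket the root. The function names are illustrative, and bisection is only one of several possible root-finding choices.

```python
import math

def erlang_cdf(x: float, k: int, lam: float) -> float:
    """Erlang CDF via the finite-sum form."""
    if x <= 0:
        return 0.0
    term = math.exp(-lam * x)
    total = term
    for n in range(1, k):
        term *= lam * x / n
        total += term
    return 1.0 - total

def erlang_median(k: int, lam: float, tol: float = 1e-12) -> float:
    """Median by bisection on [0, k/lam]; the median lies below the mean."""
    lo, hi = 0.0, k / lam
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if erlang_cdf(mid, k, lam) < 0.5:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def median_approx(k: int, lam: float) -> float:
    """The approximation (k/lam) * (1 - 1/(3k + 0.2)) quoted in the text."""
    return (k / lam) * (1.0 - 1.0 / (3.0 * k + 0.2))

for k in (1, 2, 5, 10):
    print(k, erlang_median(k, 1.0), median_approx(k, 1.0))
```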

Generating Erlang-distributed random variates

Erlang-distributed random variates can be generated from $k$ independent uniformly distributed random numbers ($U_i \in [0, 1]$) using the following formula:

$$E(k, \lambda) = -\frac{1}{\lambda} \ln \prod_{i=1}^{k} U_{i} = -\frac{1}{\lambda} \sum_{i=1}^{k} \ln U_{i}$$
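
A minimal sketch of this formula using Python's standard random module follows. The function name erlang_variate is illustrative. As an implementation choice that is not part of the formula, each uniform draw u is replaced by 1 - u, which has the same distribution but lies in (0, 1], so the logarithm is never taken at zero.

```python
import math
import random

def erlang_variate(k: int, lam: float, rng=random) -> float:
    """Draw one Erlang(k, lam) variate as -(1/lam) * sum of ln(U_i)."""
    # 1 - rng.random() lies in (0, 1], so math.log never receives 0.
    return -sum(math.log(1.0 - rng.random()) for _ in range(k)) / lam

rng = random.Random(12345)  # fixed seed so the run is reproducible
k, lam = 3, 2.0
samples = [erlang_variate(k, lam, rng) for _ in range(100_000)]

mean = sum(samples) / len(samples)
var = sum((s - mean) ** 2 for s in samples) / (len(samples) - 1)
print("sample mean", mean, "theoretical", k / lam)
print("sample variance", var, "theoretical", k / lam**2)
```

The sample mean and variance should be close to k/λ and k/λ² respectively, up to sampling error.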

Applications

Waiting times

Events that occur independently with some average rate are modeled with a Poisson process. The waiting time until the kth occurrence of the event is Erlang distributed. (The related question of the number of events in a given amount of time is described by the Poisson distribution.)

The Erlang distribution, which measures the time between incoming calls, can be used in conjunction with the expected duration of incoming calls to produce information about the traffic load measured in erlangs. This can be used to determine the probability of packet loss or delay, according to various assumptions made about whether blocked calls are aborted (Erlang B formula) or queued until served (Erlang C formula). The Erlang-B and C formulae are still in everyday use for traffic modeling for applications such as the design of call centers.
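
The Erlang B formula itself is not stated above. As a hedged illustration, the sketch below assumes its standard recursive form, B(A, 0) = 1 and B(A, n) = A·B(A, n−1) / (n + A·B(A, n−1)), where A is the offered load in erlangs and n is the number of lines. The function name erlang_b is illustrative.

```python
import math

def erlang_b(offered_load: float, servers: int) -> float:
    """Blocking probability under the (assumed) standard Erlang B recursion."""
    b = 1.0  # with zero servers every call is blocked
    for n in range(1, servers + 1):
        b = offered_load * b / (n + offered_load * b)
    return b

def erlang_b_direct(offered_load: float, servers: int) -> float:
    """Same quantity from the closed form, used here only as a cross-check."""
    terms = [offered_load**i / math.factorial(i) for i in range(servers + 1)]
    return terms[-1] / sum(terms)

for n in (1, 2, 5, 10):
    print(n, erlang_b(2.0, n), erlang_b_direct(2.0, n))
```

The recursion avoids the large powers and factorials of the closed form, which matters when the number of lines is large.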

Other applications

The age distribution of cancer incidence often follows the Erlang distribution, with the shape and scale parameters predicting, respectively, the number of driver events and the time interval between them. More generally, the Erlang distribution has been suggested as a good approximation of the cell cycle time distribution, as a result of multi-stage models.

Kinesin is a molecular machine with two "feet" that "walks" along a filament. The waiting time between each step is exponentially distributed. When green fluorescent protein is attached to one foot of the kinesin, the green dot visibly moves only when that foot steps, so the waiting time between its movements follows an Erlang distribution with k = 2.

It has also been used in marketing for describing interpurchase times.

Properties

  • If $X \sim \operatorname{Erlang}(k, \lambda)$ then $a \cdot X \sim \operatorname{Erlang}\left(k, \frac{\lambda}{a}\right)$ for any real $a > 0$
  • If $X \sim \operatorname{Erlang}(k_{1}, \lambda)$ and $Y \sim \operatorname{Erlang}(k_{2}, \lambda)$ then $X + Y \sim \operatorname{Erlang}(k_{1} + k_{2}, \lambda)$ if $X, Y$ are independent
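
Both properties can be checked numerically at the level of densities. In the sketch below, the density of a·X at y is f(y/a)/a, and the density of X + Y is the convolution of the two densities, approximated with the trapezoidal rule. The function names and the step count are illustrative choices.

```python
import math

def erlang_pdf(x: float, k: int, lam: float) -> float:
    """Erlang density in the rate parametrization."""
    if x < 0:
        return 0.0
    return lam**k * x**(k - 1) * math.exp(-lam * x) / math.factorial(k - 1)

def scaled_pdf(y: float, a: float, k: int, lam: float) -> float:
    """Density of a*X at y for X ~ Erlang(k, lam) and a > 0 (change of variables)."""
    return erlang_pdf(y / a, k, lam) / a

def convolved_pdf(x: float, k1: int, k2: int, lam: float, steps: int = 20_000) -> float:
    """Density of X + Y at x by trapezoidal integration of the convolution."""
    h = x / steps
    total = 0.0
    for i in range(steps + 1):
        t = i * h
        weight = 0.5 if i in (0, steps) else 1.0
        # max(...) guards against a tiny negative argument from rounding.
        total += weight * erlang_pdf(t, k1, lam) * erlang_pdf(max(x - t, 0.0), k2, lam)
    return total * h

print(scaled_pdf(3.0, 2.5, 4, 1.5), erlang_pdf(3.0, 4, 1.5 / 2.5))
print(convolved_pdf(2.0, 2, 3, 1.5), erlang_pdf(2.0, 5, 1.5))
```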

Related distributions

  • The Erlang distribution is the distribution of the sum of k independent and identically distributed random variables, each having an exponential distribution. The long-run rate at which events occur is the reciprocal of the expectation of $X$, that is, $\lambda/k$. The (age-specific event) rate of the Erlang distribution is, for $k > 1$, monotonic in $x$, increasing from 0 at $x = 0$ to $\lambda$ as $x$ tends to infinity.
    • That is: if $X_{i} \sim \operatorname{Exponential}(\lambda)$, then $\sum_{i=1}^{k} X_{i} \sim \operatorname{Erlang}(k, \lambda)$
  • Because of the factorial function in the denominator of the PDF and CDF, the Erlang distribution is only defined when the parameter k is a positive integer. In fact, this distribution is sometimes called the Erlang-k distribution (e.g., an Erlang-2 distribution is an Erlang distribution with $k = 2$). The gamma distribution generalizes the Erlang distribution by allowing k to be any positive real number, using the gamma function instead of the factorial function.
    • That is: if k is an integer and $X \sim \operatorname{Gamma}(k, \lambda)$, then $X \sim \operatorname{Erlang}(k, \lambda)$
  • If $U \sim \operatorname{Exponential}(\lambda)$ and $V \sim \operatorname{Erlang}(n, \lambda)$ are independent, then $\frac{U}{V} + 1 \sim \operatorname{Pareto}(1, n)$
  • The Erlang distribution is a special case of the Pearson type III distribution
  • The Erlang distribution is related to the chi-squared distribution. If $X \sim \operatorname{Erlang}(k, \lambda)$, then $2\lambda X \sim \chi_{2k}^{2}$.
  • The Erlang distribution is related to the Poisson distribution by the Poisson process: if $S_{n} = \sum_{i=1}^{n} X_{i}$ such that $X_{i} \sim \operatorname{Exponential}(\lambda)$, then $S_{n} \sim \operatorname{Erlang}(n, \lambda)$ and $\Pr(N(x) \leq n-1) = \Pr(S_{n} > x) = 1 - F_{X}(x; n, \lambda) = \sum_{k=0}^{n-1} \frac{1}{k!} e^{-\lambda x} (\lambda x)^{k}$. Taking the differences over $n$ gives the Poisson distribution.
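
The last relation can be illustrated numerically. Since Pr(N(x) = n) = F(x; n, λ) − F(x; n+1, λ), differences of Erlang CDFs over n should reproduce the Poisson probabilities with mean λx. To keep the check from being a purely algebraic identity, the sketch below computes the CDF by Simpson integration of the density instead of using the finite sum. The function names and the step count are illustrative.

```python
import math

def erlang_pdf(x: float, k: int, lam: float) -> float:
    """Erlang density in the rate parametrization."""
    if x < 0:
        return 0.0
    return lam**k * x**(k - 1) * math.exp(-lam * x) / math.factorial(k - 1)

def erlang_cdf_numeric(x: float, k: int, lam: float, steps: int = 2000) -> float:
    """CDF by composite Simpson integration of the density (steps must be even)."""
    h = x / steps
    total = erlang_pdf(0.0, k, lam) + erlang_pdf(x, k, lam)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * erlang_pdf(i * h, k, lam)
    return total * h / 3.0

def poisson_pmf(n: int, mu: float) -> float:
    """Poisson probability of exactly n events when the mean count is mu."""
    return math.exp(-mu) * mu**n / math.factorial(n)

lam, x = 1.5, 2.0
for n in range(1, 6):
    # Pr(N(x) = n) = Pr(S_n <= x) - Pr(S_{n+1} <= x)
    diff = erlang_cdf_numeric(x, n, lam) - erlang_cdf_numeric(x, n + 1, lam)
    print(n, diff, poisson_pmf(n, lam * x))
```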

Notes

  1. "h1.pdf" (PDF).
  2. Choi, K. P. (1994). "On the medians of gamma distributions and an equation of Ramanujan". Proceedings of the American Mathematical Society. 121: 245–251. doi:10.1090/S0002-9939-1994-1195477-8. JSTOR 2160389.
  3. Adell, J. A.; Jodrá, P. (2010). "On a Ramanujan equation connected with the median of the gamma distribution". Transactions of the American Mathematical Society. 360 (7): 3631. doi:10.1090/S0002-9947-07-04411-X.
  4. Jodrá, P. (2012). "Computing the Asymptotic Expansion of the Median of the Erlang Distribution". Mathematical Modelling and Analysis. 17 (2): 281–292. doi:10.3846/13926292.2012.664571.
  5. Banneheka, BMSG; Ekanayake, GEMUPD (2009). "A new point estimator for the median of gamma distribution". Viyodaya J Science. 14: 95–103.
  6. Resa. "Statistical Distributions - Erlang Distribution - Random Number Generator". www.xycoon.com. Retrieved 4 April 2018.
  7. Belikov, Aleksey V. (22 September 2017). "The number of key carcinogenic events can be predicted from cancer incidence". Scientific Reports. 7 (1). doi:10.1038/s41598-017-12448-7. PMC 5610194. PMID 28939880.
  8. Belikov, Aleksey V.; Vyatkin, Alexey; Leonov, Sergey V. (2021-08-06). "The Erlang distribution approximates the age distribution of incidence of childhood and young adulthood cancers". PeerJ. 9: e11976. doi:10.7717/peerj.11976. ISSN 2167-8359. PMC 8351573. PMID 34434669.
  9. Yates, Christian A. (21 April 2017). "A Multi-stage Representation of Cell Proliferation as a Markov Process". Bulletin of Mathematical Biology. 79 (1): 2905–2928. doi:10.1007/s11538-017-0356-4. PMC 5709504.
  10. Gavagnin, Enrico (21 November 2019). "The invasion speed of cell migration models with realistic cell cycle time distributions". Journal of Theoretical Biology. 481: 91–99. arXiv:1806.03140. doi:10.1016/j.jtbi.2018.09.010.
  11. Yildiz, Ahmet; Forkey, Joseph N.; McKinney, Sean A.; Ha, Taekjip; Goldman, Yale E.; Selvin, Paul R. (2003-06-27). "Myosin V Walks Hand-Over-Hand: Single Fluorophore Imaging with 1.5-nm Localization". Science. 300 (5628): 2061–2065. doi:10.1126/science.1084398. ISSN 0036-8075.
  12. Chatfield, C.; Goodhardt, G.J. (December 1973). "A Consumer Purchasing Model with Erlang Interpurchase Times". Journal of the American Statistical Association. 68: 828–835. doi:10.1080/01621459.1973.10481432.
  13. Cox, D.R. (1967) Renewal Theory, p20, Methuen.
