Misplaced Pages

Noncentral t-distribution

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
(Redirected from Non-central t-distribution) Probability distribution
Noncentral Student's t
Probability density function
Parameters ν > 0 degrees of freedom
μ {\displaystyle \mu \in \Re \,\!} noncentrality parameter
Support x ( ; + ) {\displaystyle x\in (-\infty ;+\infty )\,\!}
PDF see text
CDF see text
Mean see text
Mode see text
Variance see text
Skewness see text
Excess kurtosis see text

The noncentral t-distribution generalizes Student's t-distribution using a noncentrality parameter. Whereas the central probability distribution describes how a test statistic t is distributed when the difference tested is null, the noncentral distribution describes how t is distributed when the null is false. This leads to its use in statistics, especially calculating statistical power. The noncentral t-distribution is also known as the singly noncentral t-distribution, and in addition to its primary use in statistical inference, is also used in robust modeling for data.

Definitions

If Z is a standard normal random variable, and V is a chi-squared distributed random variable with ν degrees of freedom that is independent of Z, then

T = Z + μ V / ν {\displaystyle T={\frac {Z+\mu }{\sqrt {V/\nu }}}}

is a noncentral t-distributed random variable with ν degrees of freedom and noncentrality parameter μ ≠ 0. Note that the noncentrality parameter may be negative.

Cumulative distribution function

The cumulative distribution function of noncentral t-distribution with ν degrees of freedom and noncentrality parameter μ can be expressed as

F ν , μ ( x ) = { F ~ ν , μ ( x ) , if  x 0 ; 1 F ~ ν , μ ( x ) , if  x < 0 , {\displaystyle F_{\nu ,\mu }(x)={\begin{cases}{\tilde {F}}_{\nu ,\mu }(x),&{\mbox{if }}x\geq 0;\\1-{\tilde {F}}_{\nu ,-\mu }(x),&{\mbox{if }}x<0,\end{cases}}}

where

F ~ ν , μ ( x ) = Φ ( μ ) + 1 2 j = 0 [ p j I y ( j + 1 2 , ν 2 ) + q j I y ( j + 1 , ν 2 ) ] , {\displaystyle {\tilde {F}}_{\nu ,\mu }(x)=\Phi (-\mu )+{\frac {1}{2}}\sum _{j=0}^{\infty }\left,}
I y ( a , b ) {\displaystyle I_{y}\,\!(a,b)} is the regularized incomplete beta function,
y = x 2 x 2 + ν , {\displaystyle y={\frac {x^{2}}{x^{2}+\nu }},}
p j = 1 j ! exp { μ 2 2 } ( μ 2 2 ) j , {\displaystyle p_{j}={\frac {1}{j!}}\exp \left\{-{\frac {\mu ^{2}}{2}}\right\}\left({\frac {\mu ^{2}}{2}}\right)^{j},}
q j = μ 2 Γ ( j + 3 / 2 ) exp { μ 2 2 } ( μ 2 2 ) j , {\displaystyle q_{j}={\frac {\mu }{{\sqrt {2}}\Gamma (j+3/2)}}\exp \left\{-{\frac {\mu ^{2}}{2}}\right\}\left({\frac {\mu ^{2}}{2}}\right)^{j},}

and Φ is the cumulative distribution function of the standard normal distribution.

Alternatively, the noncentral t-distribution CDF can be expressed as:

F v , μ ( x ) = { 1 2 j = 0 1 j ! ( μ 2 ) j e μ 2 2 Γ ( j + 1 2 ) π I ( v v + x 2 ; v 2 , j + 1 2 ) , x 0 1 1 2 j = 0 1 j ! ( μ 2 ) j e μ 2 2 Γ ( j + 1 2 ) π I ( v v + x 2 ; v 2 , j + 1 2 ) , x < 0 {\displaystyle F_{v,\mu }(x)={\begin{cases}{\frac {1}{2}}\sum _{j=0}^{\infty }{\frac {1}{j!}}(-\mu {\sqrt {2}})^{j}e^{\frac {-\mu ^{2}}{2}}{\frac {\Gamma ({\frac {j+1}{2}})}{\sqrt {\pi }}}I\left({\frac {v}{v+x^{2}}};{\frac {v}{2}},{\frac {j+1}{2}}\right),&x\geq 0\\1-{\frac {1}{2}}\sum _{j=0}^{\infty }{\frac {1}{j!}}(-\mu {\sqrt {2}})^{j}e^{\frac {-\mu ^{2}}{2}}{\frac {\Gamma ({\frac {j+1}{2}})}{\sqrt {\pi }}}I\left({\frac {v}{v+x^{2}}};{\frac {v}{2}},{\frac {j+1}{2}}\right),&x<0\end{cases}}}

where Γ is the gamma function and I is the regularized incomplete beta function.

Although there are other forms of the cumulative distribution function, the first form presented above is very easy to evaluate through recursive computing. In statistical software R, the cumulative distribution function is implemented as pt.

Probability density function

The probability density function (pdf) for the noncentral t-distribution with ν > 0 degrees of freedom and noncentrality parameter μ can be expressed in several forms.

The confluent hypergeometric function form of the density function is

f ( x ) = Γ ( ν + 1 2 ) ν π Γ ( ν 2 ) ( 1 + x 2 ν ) ν + 1 2 StudentT ( x ; μ = 0 ) exp ( μ 2 2 ) { A ν ( x ; μ ) + B ν ( x ; μ ) } , {\displaystyle f(x)=\underbrace {{\frac {\Gamma ({\frac {\nu +1}{2}})}{{\sqrt {\nu \pi }}\Gamma ({\frac {\nu }{2}})}}\left(1+{\frac {x^{2}}{\nu }}\right)^{-{\tfrac {\nu +1}{2}}}} _{{\text{StudentT}}(x\,;\,\mu =0)}\exp {\big (}-{\tfrac {\mu ^{2}}{2}}{\big )}{\Big \{}A_{\nu }(x\,;\,\mu )+B_{\nu }(x\,;\,\mu ){\Big \}},}

where

A ν ( x ; μ ) = 1 F 1 ( ν + 1 2 ; 1 2 ; μ 2 x 2 2 ( x 2 + ν ) ) , B ν ( x ; μ ) = 2 μ x x 2 + ν Γ ( ν 2 + 1 ) Γ ( ν + 1 2 ) 1 F 1 ( ν 2 + 1 ; 3 2 ; μ 2 x 2 2 ( x 2 + ν ) ) , {\displaystyle {\begin{aligned}A_{\nu }(x\,;\,\mu )&={_{1}F}_{1}\left({\frac {\nu +1}{2}}\,;\,{\frac {1}{2}}\,;\,{\frac {\mu ^{2}x^{2}}{2(x^{2}+\nu )}}\right),\\B_{\nu }(x\,;\,\mu )&={\frac {{\sqrt {2}}\mu x}{\sqrt {x^{2}+\nu }}}{\frac {\Gamma ({\frac {\nu }{2}}+1)}{\Gamma ({\frac {\nu +1}{2}})}}{_{1}F}_{1}\left({\frac {\nu }{2}}+1\,;\,{\frac {3}{2}}\,;\,{\frac {\mu ^{2}x^{2}}{2(x^{2}+\nu )}}\right),\end{aligned}}}

and where 1F1 is a confluent hypergeometric function.

An alternative integral form is

f ( x ) = ν ν 2 exp ( ν μ 2 2 ( x 2 + ν ) ) π Γ ( ν 2 ) 2 ν 1 2 ( x 2 + ν ) ν + 1 2 0 y ν exp ( 1 2 ( y μ x x 2 + ν ) 2 ) d y . {\displaystyle f(x)={\frac {\nu ^{\frac {\nu }{2}}\exp \left(-{\frac {\nu \mu ^{2}}{2(x^{2}+\nu )}}\right)}{{\sqrt {\pi }}\Gamma ({\frac {\nu }{2}})2^{\frac {\nu -1}{2}}(x^{2}+\nu )^{\frac {\nu +1}{2}}}}\int _{0}^{\infty }y^{\nu }\exp \left(-{\frac {1}{2}}\left(y-{\frac {\mu x}{\sqrt {x^{2}+\nu }}}\right)^{2}\right)dy.}

A third form of the density is obtained using its cumulative distribution functions, as follows.

f ( x ) = { ν x { F ν + 2 , μ ( x 1 + 2 ν ) F ν , μ ( x ) } , if  x 0 ; Γ ( ν + 1 2 ) π ν Γ ( ν 2 ) exp ( μ 2 2 ) , if  x = 0. {\displaystyle f(x)={\begin{cases}{\frac {\nu }{x}}\left\{F_{\nu +2,\mu }\left(x{\sqrt {1+{\frac {2}{\nu }}}}\right)-F_{\nu ,\mu }(x)\right\},&{\mbox{if }}x\neq 0;\\{\frac {\Gamma ({\frac {\nu +1}{2}})}{{\sqrt {\pi \nu }}\Gamma ({\frac {\nu }{2}})}}\exp \left(-{\frac {\mu ^{2}}{2}}\right),&{\mbox{if }}x=0.\end{cases}}}

This is the approach implemented by the dt function in R.

Properties

Moments of the noncentral t-distribution

In general, the kth raw moment of the noncentral t-distribution is

E [ T k ] = { ( ν 2 ) k 2 Γ ( ν k 2 ) Γ ( ν 2 ) exp ( μ 2 2 ) d k d μ k exp ( μ 2 2 ) , if  ν > k ; Does not exist , if  ν k . {\displaystyle {\mbox{E}}\left={\begin{cases}\left({\frac {\nu }{2}}\right)^{\frac {k}{2}}{\frac {\Gamma \left({\frac {\nu -k}{2}}\right)}{\Gamma \left({\frac {\nu }{2}}\right)}}{\mbox{exp}}\left(-{\frac {\mu ^{2}}{2}}\right){\frac {d^{k}}{d\mu ^{k}}}{\mbox{exp}}\left({\frac {\mu ^{2}}{2}}\right),&{\mbox{if }}\nu >k;\\{\mbox{Does not exist}},&{\mbox{if }}\nu \leq k.\\\end{cases}}}

In particular, the mean and variance of the noncentral t-distribution are

E [ T ] = { μ ν 2 Γ ( ( ν 1 ) / 2 ) Γ ( ν / 2 ) , if  ν > 1 ; Does not exist , if  ν 1 , Var [ T ] = { ν ( 1 + μ 2 ) ν 2 μ 2 ν 2 ( Γ ( ( ν 1 ) / 2 ) Γ ( ν / 2 ) ) 2 , if  ν > 2 ; Does not exist , if  ν 2. {\displaystyle {\begin{aligned}{\mbox{E}}\left&={\begin{cases}\mu {\sqrt {\frac {\nu }{2}}}{\frac {\Gamma ((\nu -1)/2)}{\Gamma (\nu /2)}},&{\mbox{if }}\nu >1;\\{\mbox{Does not exist}},&{\mbox{if }}\nu \leq 1,\\\end{cases}}\\{\mbox{Var}}\left&={\begin{cases}{\frac {\nu (1+\mu ^{2})}{\nu -2}}-{\frac {\mu ^{2}\nu }{2}}\left({\frac {\Gamma ((\nu -1)/2)}{\Gamma (\nu /2)}}\right)^{2},&{\mbox{if }}\nu >2;\\{\mbox{Does not exist}},&{\mbox{if }}\nu \leq 2.\\\end{cases}}\end{aligned}}}

An excellent approximation to ν 2 Γ ( ( ν 1 ) / 2 ) Γ ( ν / 2 ) {\displaystyle {\sqrt {\frac {\nu }{2}}}{\frac {\Gamma ((\nu -1)/2)}{\Gamma (\nu /2)}}} is ( 1 3 4 ν 1 ) 1 {\displaystyle \left(1-{\frac {3}{4\nu -1}}\right)^{-1}} , which can be used in both formulas.

Asymmetry

The non-central t-distribution is asymmetric unless μ is zero, i.e., a central t-distribution. In addition, the asymmetry becomes smaller the larger degree of freedom. The right tail will be heavier than the left when μ > 0, and vice versa. However, the usual skewness is not generally a good measure of asymmetry for this distribution, because if the degrees of freedom is not larger than 3, the third moment does not exist at all. Even if the degrees of freedom is greater than 3, the sample estimate of the skewness is still very unstable unless the sample size is very large.


Mode

The noncentral t-distribution is always unimodal and bell shaped, but the mode is not analytically available, although for μ ≠ 0 we have

ν ν + ( 5 / 2 ) < m o d e μ < ν ν + 1 {\displaystyle {\sqrt {\frac {\nu }{\nu +(5/2)}}}<{\frac {\mathrm {mode} }{\mu }}<{\sqrt {\frac {\nu }{\nu +1}}}}

In particular, the mode always has the same sign as the noncentrality parameter μ. Moreover, the negative of the mode is exactly the mode for a noncentral t-distribution with the same number of degrees of freedom ν but noncentrality parameter −μ.

The mode is strictly increasing with μ (it always moves in the same direction as μ is adjusted in). In the limit, when μ → 0, the mode is approximated by

ν 2 Γ ( ν + 2 2 ) Γ ( ν + 3 2 ) μ ; {\displaystyle {\sqrt {\frac {\nu }{2}}}{\frac {\Gamma \left({\frac {\nu +2}{2}}\right)}{\Gamma \left({\frac {\nu +3}{2}}\right)}}\mu ;\,}

and when μ → ∞, the mode is approximated by

ν ν + 1 μ . {\displaystyle {\sqrt {\frac {\nu }{\nu +1}}}\mu .}

Related distributions

  • Central t-distribution: the central t-distribution can be converted into a location/scale family. This family of distributions is used in data modeling to capture various tail behaviors. The location/scale generalization of the central t-distribution is a different distribution from the noncentral t-distribution discussed in this article. In particular, this approximation does not respect the asymmetry of the noncentral t-distribution. However, the central t-distribution can be used as an approximation to the noncentral t-distribution.
  • If T is noncentral t-distributed with ν degrees of freedom and noncentrality parameter μ and F = T, then F has a noncentral F-distribution with 1 numerator degree of freedom, ν denominator degrees of freedom, and noncentrality parameter μ.
  • If T is noncentral t-distributed with ν degrees of freedom and noncentrality parameter μ and Z = lim ν T {\displaystyle Z=\lim _{\nu \rightarrow \infty }T} , then Z has a normal distribution with mean μ and unit variance.
  • When the denominator noncentrality parameter of a doubly noncentral t-distribution is zero, then it becomes a noncentral t-distribution.

Special cases

Occurrence and applications

Use in power analysis

Suppose we have an independent and identically distributed sample X1, ..., Xn each of which is normally distributed with mean θ and variance σ, and we are interested in testing the null hypothesis θ = 0 vs. the alternative hypothesis θ ≠ 0. We can perform a one sample t-test using the test statistic

T = X ¯ σ ^ / n = X ¯ θ ( σ / n ) + θ ( σ / n ) ( σ ^ 2 σ 2 / ( n 1 ) ) / ( n 1 ) {\displaystyle T={\frac {\bar {X}}{{\hat {\sigma }}/{\sqrt {n}}}}={\frac {{\frac {{\bar {X}}-\theta }{(\sigma /{\sqrt {n}})}}+{\frac {\theta }{(\sigma /{\sqrt {n}})}}}{\sqrt {\left.\left({\frac {{\hat {\sigma }}^{2}}{\sigma ^{2}/(n-1)}}\right)\right/(n-1)}}}}

where X ¯ {\displaystyle {\bar {X}}} is the sample mean and σ ^ 2 {\displaystyle {\hat {\sigma }}^{2}\,\!} is the unbiased sample variance. Since the right hand side of the second equality exactly matches the characterization of a noncentral t-distribution as described above, T has a noncentral t-distribution with n−1 degrees of freedom and noncentrality parameter n θ / σ {\displaystyle {\sqrt {n}}\theta /\sigma \,\!} .

If the test procedure rejects the null hypothesis whenever | T | > t 1 α / 2 {\displaystyle |T|>t_{1-\alpha /2}\,\!} , where t 1 α / 2 {\displaystyle t_{1-\alpha /2}\,\!} is the upper α/2 quantile of the (central) Student's t-distribution for a pre-specified α ∈ (0, 1), then the power of this test is given by

1 F n 1 , n θ / σ ( t 1 α / 2 ) + F n 1 , n θ / σ ( t 1 α / 2 ) . {\displaystyle 1-F_{n-1,{\sqrt {n}}\theta /\sigma }(t_{1-\alpha /2})+F_{n-1,{\sqrt {n}}\theta /\sigma }(-t_{1-\alpha /2}).}

Similar applications of the noncentral t-distribution can be found in the power analysis of the general normal-theory linear models, which includes the above one sample t-test as a special case.

Use in tolerance intervals

One-sided normal tolerance intervals have an exact solution in terms of the sample mean and sample variance based on the noncentral t-distribution. This enables the calculation of a statistical interval within which, with some confidence level, a specified proportion of a sampled population falls.

See also

References

  1. ^ Lenth, Russell V (1989). "Algorithm AS 243: Cumulative Distribution Function of the Non-central t Distribution". Journal of the Royal Statistical Society, Series C. 38 (1): 185–189. JSTOR 2347693.
  2. Scharf, L. (1991). Statistical Signal Processing. Reading: Addison-Wesley. p. 177. ISBN 0-201-19038-9.
  3. Hogben, D; Pinkham, RS; Wilk, MB (1961). "The moments of the non-central t-distribution". Biometrika. 48 (3–4): 465–468. doi:10.1093/biomet/48.3-4.465. hdl:2027/coo.31924001119068. JSTOR 2332772.
  4. Hedges, Larry V. (June 1981). "Distribution Theory for Glass's Estimator of Effect size and Related Estimators". Journal of Educational Statistics. 6 (2): 107–128. doi:10.3102/2F10769986006002107.
  5. Tothfalusi, Laszlo; Endrenyi, Laszlo (1 March 2016). "An Exact Procedure for the Evaluation of Reference-Scaled Average Bioequivalence". The AAPS Journal. 18 (2): 476–489. doi:10.1208/s12248-016-9873-6. PMC 4779113.
  6. van Aubel, A; Gawronski, W (2003). "Analytic properties of noncentral distributions". Applied Mathematics and Computation. 141: 3–12. doi:10.1016/S0096-3003(02)00316-8.
  7. Helena Chmura Kraemer; Minja Paik (1979). "A Central t Approximation to the Noncentral t Distribution". Technometrics. 21 (3): 357–360. doi:10.1080/00401706.1979.10489781. JSTOR 1267759.
  8. Derek S. Young (August 2010). "tolerance: An R Package for Estimating Tolerance Intervals". Journal of Statistical Software. 36 (5): 1–39. ISSN 1548-7660. Retrieved 19 February 2013., p.23

External links

Probability distributions (list)
Discrete
univariate
with finite
support
with infinite
support
Continuous
univariate
supported on a
bounded interval
supported on a
semi-infinite
interval
supported
on the whole
real line
with support
whose type varies
Mixed
univariate
continuous-
discrete
Multivariate
(joint)
Directional
Univariate (circular) directional
Circular uniform
Univariate von Mises
Wrapped normal
Wrapped Cauchy
Wrapped exponential
Wrapped asymmetric Laplace
Wrapped Lévy
Bivariate (spherical)
Kent
Bivariate (toroidal)
Bivariate von Mises
Multivariate
von Mises–Fisher
Bingham
Degenerate
and singular
Degenerate
Dirac delta function
Singular
Cantor
Families
Statistics
Descriptive statistics
Continuous data
Center
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Data collection
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical inference
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical / Multivariate / Time-series / Survival analysis
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Applications
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
Category:
Noncentral t-distribution Add topic