23 Chi-squared Distribution (1 parameter)

The random variate \(X\), defined on the range \(0 \leq X < +\infty\), is said to have a Chi-squared Distribution with 1 parameter, written \(X \sim \chi^2 \left( n \right)\), where the shape parameter \(n \in \mathbb{N}^+\) is the number of degrees of freedom (called df in the R code).

23.1 Probability Density Function

\[ \text{f}(X) = \frac{ X^{\frac{n}{2}-1} e^{- \frac{X}{2} } }{ 2^{\frac{n}{2}} \mathrm{ \Gamma} \left[ \frac{n}{2} \right] } \]

The figure below shows an example of the Chi-squared Probability Density function with \(df = 10\).

curve(dchisq(x, df = 10), from = 0, to = 40, xlab="X", ylab="f(X)", main = "Chi-squared density", sub = "(df = 10)")

Figure 23.1: Example of Chi-squared Probability Density Function (df = 10)
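The density formula of this section can be checked against R's built-in dchisq. The following is a minimal sketch; chisq_pdf is a hypothetical helper name introduced only for this illustration, and the evaluation points are arbitrary.

```r
# Chi-squared density written out from the formula in Section 23.1.
# chisq_pdf is a hypothetical helper name, not part of base R.
chisq_pdf <- function(x, n) {
  x^(n / 2 - 1) * exp(-x / 2) / (2^(n / 2) * gamma(n / 2))
}

x <- c(0.5, 1, 5, 10, 20)  # arbitrary evaluation points
n <- 10

# Largest absolute difference from R's built-in density
max_diff <- max(abs(chisq_pdf(x, n) - dchisq(x, df = n)))
print(max_diff)

# A density should integrate to 1 over [0, Inf)
total_area <- integrate(chisq_pdf, lower = 0, upper = Inf, n = n)$value
print(total_area)
```

The difference should be at the level of floating-point rounding, and the area should be 1 up to the tolerance of integrate.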

23.2 Distribution Function

If \(n\) is odd (\(n/2 \notin \mathbb{N}^+\)), the distribution function has no closed form in elementary functions. If \(n\) is even (\(n/2 \in \mathbb{N}^+\)), then

\[ \text{F}(X) = 1 - e^{-\frac{X}{2}} \sum_{j=0}^{r-1} \frac{\left( \frac{X}{2} \right)^j}{j!} \]

where \(r = \frac{n}{2}\).

The figure below shows an example of the Chi-squared Distribution Function with \(df = 10\).

curve(pchisq(x, df = 10), from = 0, to = 40, xlab="X", ylab="F(X)", main = "Chi-squared distribution", sub = "(df = 10)")

Figure 23.2: Example of Chi-squared Distribution Function (df = 10)
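The closed form for even \(n\) can be compared with pchisq, which evaluates the distribution function for any degrees of freedom. This is a sketch under the assumption that \(n\) is even; chisq_cdf_even is a hypothetical helper name.

```r
# Closed-form distribution function for even n (Section 23.2).
# chisq_cdf_even is a hypothetical helper name, not part of base R.
chisq_cdf_even <- function(x, n) {
  stopifnot(n > 0, n %% 2 == 0)
  r <- n / 2
  j <- 0:(r - 1)
  # sum_{j=0}^{r-1} (x/2)^j / j!, evaluated for each element of x
  partial_sum <- sapply(x, function(xi) sum((xi / 2)^j / factorial(j)))
  1 - exp(-x / 2) * partial_sum
}

x <- c(1, 5, 10, 20, 40)  # arbitrary evaluation points
cdf_diff <- max(abs(chisq_cdf_even(x, 10) - pchisq(x, df = 10)))
print(cdf_diff)
```

For odd \(n\) the helper stops with an error; pchisq should be used instead.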

23.3 Moment Generating Function

\[ \text{M}_X(t) = (1-2t)^{-\frac{n}{2}} \]

for \(t < \frac{1}{2}\).
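The moment generating function is \(\text{E}(e^{tX})\), so the formula can be verified by numerical integration. The sketch below uses the arbitrary values \(n = 10\) and \(t = 0.2\); the integrand is built on the log scale only to avoid overflow of \(e^{tx}\) for large \(x\).

```r
n <- 10
t_val <- 0.2  # any value below 1/2 keeps the integral finite

# E[exp(tX)] by numerical integration; exp() is applied to the sum of
# t * x and the log-density so that large x cannot produce Inf * 0
integrand <- function(x) exp(t_val * x + dchisq(x, df = n, log = TRUE))
mgf_numeric <- integrate(integrand, lower = 0, upper = Inf)$value

# Closed form from Section 23.3
mgf_formula <- (1 - 2 * t_val)^(-n / 2)

print(c(numeric = mgf_numeric, formula = mgf_formula))
```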

23.4 Uncentered Moments

\[ \mu_j' = 2^j \frac{\mathrm{\Gamma}\left[ \frac{n}{2}+j \right]}{\mathrm{\Gamma}\left[ \frac{n}{2} \right]} \]

23.5 Expected Value

\[ \text{E}(X) = n \]

23.6 Variance

\[ \text{V}(X) = 2n \]
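The expected value and the variance both follow from the uncentered moments of Section 23.4 together with the recurrence \(\mathrm{\Gamma}\left[ z+1 \right] = z \, \mathrm{\Gamma}\left[ z \right]\):

\[ \mu_1' = 2 \frac{\mathrm{\Gamma}\left[ \frac{n}{2}+1 \right]}{\mathrm{\Gamma}\left[ \frac{n}{2} \right]} = 2 \cdot \frac{n}{2} = n \]

\[ \mu_2' = 2^2 \frac{\mathrm{\Gamma}\left[ \frac{n}{2}+2 \right]}{\mathrm{\Gamma}\left[ \frac{n}{2} \right]} = 4 \left( \frac{n}{2}+1 \right) \frac{n}{2} = n(n+2) \]

\[ \text{V}(X) = \mu_2' - \left( \mu_1' \right)^2 = n^2 + 2n - n^2 = 2n \]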

23.7 Mode

\[ \text{Mo}(X) = n - 2 \]

for \(n \geq 2\).
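The mode follows from setting the derivative of the log-density to zero: \(\frac{d}{dX} \ln \text{f}(X) = \left( \frac{n}{2} - 1 \right) \frac{1}{X} - \frac{1}{2} = 0\) gives \(X = n - 2\). A numerical check is sketched below, assuming \(n = 10\) and an arbitrary search interval wide enough to contain the maximum.

```r
n <- 10

# Locate the maximum of the density numerically on [0, 50]
mode_numeric <- optimize(function(x) dchisq(x, df = n),
                         interval = c(0, 50), maximum = TRUE)$maximum

print(c(numeric = mode_numeric, formula = n - 2))
```

The numerical result agrees with \(n - 2\) only up to the default tolerance of optimize.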

23.8 Skewness

\[ g_1 = 2 \sqrt{\frac{2}{n}} \]

23.9 Kurtosis

\[ g_2 = 3 + \frac{12}{n} \]

23.10 Coefficient of Variation

\[ VC = \sqrt{\frac{2}{n}} \]
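Skewness, kurtosis, and the coefficient of variation can all be recovered from the uncentered moments of Section 23.4 through the standard identities that convert uncentered moments into central moments. The sketch below does this for \(n = 10\); raw_moment is a hypothetical helper name.

```r
# Uncentered moment of order j, from the formula in Section 23.4.
# raw_moment is a hypothetical helper name, not part of base R.
raw_moment <- function(j, n) 2^j * gamma(n / 2 + j) / gamma(n / 2)

n <- 10
m1 <- raw_moment(1, n)
m2 <- raw_moment(2, n)
m3 <- raw_moment(3, n)
m4 <- raw_moment(4, n)

# Central moments expressed in uncentered moments
mu2 <- m2 - m1^2
mu3 <- m3 - 3 * m1 * m2 + 2 * m1^3
mu4 <- m4 - 4 * m1 * m3 + 6 * m1^2 * m2 - 3 * m1^4

skewness <- mu3 / mu2^1.5
kurtosis <- mu4 / mu2^2      # not excess kurtosis
coef_var <- sqrt(mu2) / m1

# Compare with the closed forms of Sections 23.8 to 23.10
print(rbind(from_moments = c(skewness, kurtosis, coef_var),
            closed_form  = c(2 * sqrt(2 / n), 3 + 12 / n, sqrt(2 / n))))
```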

23.11 R Module

The best fitting Chi-squared Density function can be obtained by estimating the degrees of freedom \(n\) by Maximum Likelihood. An R module for this procedure can be found on the public website:

https://compute.wessa.net/rwasp_fitdistrchisq1.wasp

The Maximum Likelihood Fitting for the Chi-squared Distribution is also available in RFC under the menu “Distributions / ML Fitting” (select the appropriate function in the designated “Density Function” drop-down menu).

If you prefer to compute the Chi-squared ML fitting on your local computer, the following code snippets can be used in the R console:

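library(MASS)  # fitdistr()
library(car)   # qqPlot(), used for the QQ plot
x <- as.numeric(AirPassengers)
# ML estimate of df; 'Brent' is a one-dimensional search restricted to [lower, upper]
chi_df <- fitdistr(x, 'chi-squared', start = list(df = 3), method = 'Brent', lower = 0.1, upper = 10000)
chi_k <- chi_df[[1]][1]  # the estimated df (same value as chi_df$estimate)
cat("estimated df = ", chi_df$estimate, "\n")
# chi_df$sd is the estimated standard error of the df estimate
cat("standard deviation = ", chi_df$sd, "\n")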

estimated df =  256.2321 
standard deviation =  1.882792

The fitted distribution can then be compared with the data in a QQ plot:

xlab <- paste('Chisq(df =', round(chi_df$estimate[[1]],2),')', sep = '')
qqPlot(x, dist = 'chisq', df = chi_df$estimate[[1]], ncp = 0, main = 'QQ plot (Chi-squared 1 param.)', xlab = xlab )

[1] 139 140

Figure 23.3: ML Fitting for Chi-squared Distribution

The main function in this R script is fitdistr (from the MASS library); its search for the degrees of freedom is restricted to the interval between the user-specified lower and upper limits. Instead of displaying a histogram, the script calls the qqPlot function from the car library. The interpretation of this plot is explained in Descriptive Statistics.
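To illustrate what the one-dimensional ML search amounts to, the log-likelihood can also be maximized directly with base R. This is only a sketch under stated assumptions: it uses simulated data with a known df of 5 (not the AirPassengers series), the names neg_loglik, fit, and se_hat are hypothetical, and the standard error is approximated from the Fisher information \(\frac{N}{4} \psi_1 \left( \frac{n}{2} \right)\), where \(\psi_1\) is the trigamma function.

```r
set.seed(1)
x_sim <- rchisq(500, df = 5)  # simulated data, true df = 5 (assumption)

# Negative log-likelihood of the Chi-squared density as a function of df
neg_loglik <- function(df) -sum(dchisq(x_sim, df = df, log = TRUE))

# One-dimensional minimization restricted to [0.1, 10000]
fit <- optimize(neg_loglik, interval = c(0.1, 10000))
df_hat <- fit$minimum

# Approximate standard error from the Fisher information
se_hat <- 2 / sqrt(length(x_sim) * trigamma(df_hat / 2))

cat("estimated df = ", df_hat, "\n")
cat("approx. standard error = ", se_hat, "\n")
```

The estimate can never leave the search interval, so limits that are too narrow would return a boundary value rather than the true maximum.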

23.12 Example

We analyze the time series of monthly divorces (in thousands) and wish to find out whether it can be adequately described by the Chi-squared Distribution. The ML Fitting module can be used to find the best fitting Chi-squared Distribution for the divorces data.

Interactive Shiny app (click to load).

The estimated degrees of freedom is \(n = 3.46\) but the Chi-squared distribution does not fit the data well (as is shown in the Figure). The visual evidence suggests that a Chi-squared density is not appropriate for these data; for formal goodness-of-fit testing, see Section 2, Section 124.1, and Chapter 125.

23.13 Random Number Generator

Let \(\text{U}_i(0,1)\) denote independent Uniform variates on \((0,1)\) and let \(\text{N}(0,1)\) denote a Standard Normal variate that is independent of them. Then

\[ \chi^2(n) \sim -2 \ln \left( \prod_{i=1}^{r} \text{U}_i(0,1) \right) \]

with \(r=\frac{n}{2}\) when \(n\) is even, and

\[ \chi^2(n) \sim -2 \ln \left( \prod_{i=1}^{r} \text{U}_i(0,1) \right) + \left( \text{N}\left( 0, 1 \right) \right)^2 \]

with \(r=\frac{n-1}{2}\) when \(n\) is odd (for \(n = 1\) the product is empty and only the squared Normal term remains).
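The generator described above can be sketched in R as follows. The function name rchisq_uniform and its argument names are hypothetical; in practice the built-in rchisq would be used. The logarithm of the product is computed as a sum of logarithms, which is mathematically identical and avoids numerical underflow when \(r\) is large.

```r
# Chi-squared random numbers from Uniform and Standard Normal variates.
# rchisq_uniform is a hypothetical name; n_draws is the sample size and
# n the degrees of freedom.
rchisq_uniform <- function(n_draws, n) {
  stopifnot(n >= 1, n == round(n))
  r <- n %/% 2  # n/2 when n is even, (n-1)/2 when n is odd
  if (r > 0) {
    u <- matrix(runif(n_draws * r), nrow = n_draws, ncol = r)
    # -2 * ln(prod U_i) = -2 * sum(ln U_i), one row per draw
    draws <- -2 * rowSums(log(u))
  } else {
    draws <- numeric(n_draws)  # empty product for n = 1
  }
  # For odd n, add one squared Standard Normal variate
  if (n %% 2 == 1) draws <- draws + rnorm(n_draws)^2
  draws
}

set.seed(123)
sample_even <- rchisq_uniform(20000, 10)
sample_odd  <- rchisq_uniform(20000, 7)

# Sample mean and variance should be close to n and 2n
print(c(mean = mean(sample_even), var = var(sample_even)))
print(c(mean = mean(sample_odd),  var = var(sample_odd)))
```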

23.14 Related Distributions 1: Gamma Representation

23.15 Related Distributions 2: Equivalent Gamma Forms

23.16 Related Distributions 3: Sum of Squares of Standard Normals

23.17 Related Distributions 4: Large-df Normal Approximation

23.18 Related Distributions 5: Sum of Squares Around the Population Mean

23.19 Related Distributions 6: Sum of Squares Around the Sample Mean

23.20 Related Distributions 7: Pooled Chi-squared for Two Samples (Biased Form)

23.21 Related Distributions 8: Ratio of Independent Chi-squared Variables (F)

23.22 Related Distributions 9: Chi-squared as a Limit of F

23.23 Related Distributions 10: Relation with Student t and Standard Normal

23.24 Related Distributions 11: Link to the Two-parameter Chi-squared Form

23.25 Related Distributions 12: Poisson Tail Identity

23.26 Related Distributions 13: Difference of Two Chi-squared(2) Variables