The Tukey-Lambda PPCC Plot is available in RFC under the “Distributions / Tukey lambda PPCC Plot”.
To compute the Tukey-Lambda PPCC Plot on your local machine, the following script can be used in the R console:
x <-rnorm(500) #should result in lambda = 0.14#x <- runif(500) #should result in lambda = 1gp <-function(lambda, p) { (p^lambda-(1-p)^lambda)/lambda}sortx <-sort(x)c <-array(NA,dim=c(201))for (i in1:201) {if (i !=101) c[i] <-cor(gp(ppoints(x), lambda=(i-101)/100),sortx)}plot((-100:100)/100,c[1:201],xlab='lambda',ylab='correlation',main='PPCC Plot - Tukey lambda')grid()
To compute the Tukey-Lambda PPCC Plot, the R code uses a custom-made function called gp that computes the lambda function. Furthermore, a loop iterates over lambda values between -1 and 1 with a stepsize of 0.01. If a Uniform distribution is used instead of a Normal distribution, the optimal value of lambda changes from 0.14 to 1.
78.5 Purpose
The PPCC Plot is used to find the shape parameter value which produces the best fit (i.e. highest correlation). If the Tukey Lambda PPCC Plot is computed, the value of \(\lambda\) may provide information about the symmetric distribution which fits the data best -- Table 78.1 shows how different values of \(\lambda\) correspond to symmetric distributions.
Warning
The Tukey-Lambda PPCC interpretation table is intended for symmetric distributions only. If the data are clearly skewed, the fitted \(\lambda\) value should not be interpreted as evidence for one of the symmetric families in Table 78.1.
The Tukey-Lambda PPCC Plot has the following advantages:
it provides useful information about the distributional shape of the data under investigation
it is easy to interpret
78.6.2 Cons
The Tukey-Lambda PPCC Plot has the following disadvantages:
there are only few software packages that allow this plot to be generated
most readers are not familiar with this plot
the plot is not suited for asymmetric distributions
78.7 Example
The following analysis shows the Tukey-Lambda PPCC Plot for the monthly marriages time series in Belgium. From this analysis it can be concluded that the Uniform Distribution has the best fit for the data (see also Table 78.1).
Compute the Tukey-Lambda PPCC Plot for the monthly divorces time series and interpret the results. Why does the Divorces time series exhibit a distribution which is completely different from the marriages time series?