The Survey Scores Rank Order Comparison (SSROC) attempts to assess whether or not a series of survey scores for similar questions1 (which are based on a Likert scale2) can be treated as quasi-interval variables (i.e. quantitative variables for which it is possible to compute quantitative statistics, such as the Arithmetic Mean).
In a first step, the SSROC computes the following statistics:
The Arithmetic Average \(\bar{X}\) of the Likert Scores for each item (i.e. question).
The sum of positive scores \(P_s\) for the Likert scores, after subtracting the midpoint of the Likert scale.
The absolute sum of negative scores \(N_s\) for the Likert scores, after subtracting the midpoint of the Likert scale.
The count of positive scores \(P_c\) for the Likert scores, after subtracting the midpoint of the Likert scale.
The count of negative scores \(N_c\) for the Likert scores, after subtracting the midpoint of the Likert scale.
The sum-based statistic \(A_s = \frac{P_s - N_s}{P_s + N_s}\).
The count-based statistic \(A_c = \frac{P_c - N_c}{P_c + N_c}\).
For instance, if the 5-point Likert scale \(X = [4, 3, 1]\) then the following values are obtained:
\(\bar{X} = \frac{4 + 3 + 1}{3} \simeq 2.667\).
\(P_s = 1\) because the sum of positive values in \(X - \text{midpoint of Likert scale} = [4 - 3, 3 - 3, 1 - 3] = [1, 0, -2]\) is 1.
\(N_s = 2\) because the absolute sum of negative values in \(X - \text{midpoint of Likert scale} = [4 - 3, 3 - 3, 1 - 3] = [1, 0, -2]\) is 2.
\(P_c = 1\) because the count of positive values in \([1, 0, -2]\) is 1.
\(N_c = 1\) because the count of negative values in \([1, 0, -2]\) is 1.
The sum-based average \(A_s = \frac{1 - 2}{1 + 2} = \frac{-1}{3}\).
The count-based average \(A_c = \frac{1 - 1}{1 + 1} = 0\).
The Arithmetic Mean should only be used if the underlying Likert scores \(X\) can be interpreted as a (truly) quantitative variable (rather than an ordinal variable). The count-based average \(A_c\), however, can always be used without problems (even if the Likert scores can not be interpreted as a quantitative variable). The sum-based average \(A_s\) is somewhere in-between.
Hence, if all these statistics are computed for similar questions it is possible to assess whether or not the three statistics (i.e. \(\bar{X}\), \(A_s\), and \(A_c\)) preserve their rank orders. If the rank order is preserved (i.e. the Rank Order Correlation is close to +1) then the Likert scores can be interpreted as quantitative variables (because the result of ranking similar questions does not depend on which statistic is used). However, if the rank orders are not strongly correlated then it is certainly not wise to treat the Likert scores as a truly quantitative variable. As a practical guideline, Rank Order Correlations above 0.9 suggest that the three statistics agree and the Arithmetic Mean can be used with reasonable confidence; lower correlations indicate that the ordinal nature of the scale should be respected.
To compute the SSROC, the R code iterates over all columns of the multivariate dataset and computes the various statistics (see column names of myresult). In addition, the Pearson and Kendall correlation matrices are also computed.
84.3 Purpose
The purpose of the SSROC is to determine whether Likert scores of similar items in a survey can be treated as quantitative variables (which allows one to compute the Arithmetic Mean and other quantitative statistics). If this is not the case, the data should be properly categorized and/or treated as purely qualitative data.
84.4 Pros & Cons
84.4.1 Pros
The SSROC has the following advantages:
It allows one to assess the appropriateness of quantitative statistics (such as the Arithmetic Mean) for Likert scores of similar items in a survey.
It is easy to compute the alternative averages \(A_s\) and \(A_c\).
The interpretation of \(A_c\) is very easy and has the potential to be very informative (it can always be applied and it does not depend on the neutral scores).
84.4.2 Cons
The SSROC has the following disadvantages:
Most readers do not know about the SSROC.
Most researchers simply compute quantitative statistics for Likert scores and do not want to be bothered about the validity of doing so.
It can only be used if there are sufficient items that can be compared in the rank order comparison.
84.5 Example
The following analysis shows the three statistics (\(\bar{X}\), \(A_s\), and \(A_c\)) for 10 similar items based on 7-point Likert scores. The statistics \(\bar{X}\) and \(\frac{P_s-N_s}{P_s+N_s}\) are scaled between -3 and 3 (because we use a 7-point Likert score). The statistic \(\frac{P_c-N_c}{P_c+N_c}\) always has a minimum of -1 and a maximum of 1.
Observe how, for instance, the first item has a count-based average score of 0.59 (this is 59% of the maximum) while the Arithmetic Mean is only 0.86 (for a maximum of +3). The count-based score does not only apply to ordinal as well as quantitative variables, it also ignores the neutral scores of the survey!
84.6 Task
In the previous example, change the scale and examine what happens with the scores. Do you see why it is important to set the correct Likert scale?
In this context, similar questions are defined as questions which attempt to measure the same underlying opinion but with different phrases. Surveys often contain similar questions in order to improve the validity of the survey.↩︎
A Likert scale represents an ordinal measurement. For instance, a 5-point Likert scale assigns the values 1, 2, 3, 4, and 5 to represent the degree to which the respondent agrees with a specified statement (i.e. 1 = totally disagree and 5 = totally agree).↩︎