Kendall's Tau: Statistical Background
Kendall's Tau (τ), originally proposed by Maurice Kendall (1938), is a non-parametric rank correlation coefficient that quantifies the strength and direction of the monotonic association between two variables. The coefficient makes no distributional assumptions, does not presuppose linearity, and is computed entirely from the ordinal ranking of paired observations rather than from their raw magnitudes.
The statistic is grounded in a pairwise comparison principle. For every distinct pair of observations \((X_i, Y_i)\) and \((X_j, Y_j)\), one determines whether the two variables co-vary in the same direction (a concordant pair) or in opposite directions (a discordant pair). Kendall's Tau is the normalised difference between these two counts across all \(\binom{n}{2}\) possible pairs.
Concordant Pairs, Discordant Pairs, and Ties
For any two distinct observation pairs \((X_i, Y_i)\) and \((X_j, Y_j)\) with \(i \neq j\), the classification proceeds as follows:
The concordance sum is \(S = C - D\), and the total number of pairs is \(n_0 = \binom{n}{2} = \frac{n(n-1)}{2}\).
The Formulas
Tau-a (no tie correction)
Tau-a does not account for tied pairs and can range from −1 to +1 only in the absence of ties. It underestimates the association when ties are present.
Tau-b: The Tie-Corrected Standard
Here \(n_0 = \frac{n(n-1)}{2}\) denotes the total number of observation pairs; \(n_1 = \sum_k \frac{t_k(t_k-1)}{2}\) is the number of pairs tied on X, summed over all tie groups \(t_k\) in X; and \(n_2 = \sum_k \frac{u_k(u_k-1)}{2}\) is the number of pairs tied on Y. The denominator \(\sqrt{(n_0-n_1)(n_0-n_2)}\) represents the geometric mean of the untied pair counts in each variable, and serves as the maximum value S could take given the observed tie structure. As a consequence, Tau-b is bounded strictly within \([-1, +1]\) regardless of the degree of tying.
Hypothesis Testing
Exact p-Value for Small Samples
Under H₀: \(\tau = 0\), all \(n!\) permutations of the ranks of one variable relative to the other are equally probable. The exact null distribution of the concordance sum S is therefore enumerable in principle, and in practice is computed efficiently via a dynamic programming recurrence. The exact p-value is the proportion of permutations yielding a value of |S| at least as large as the observed |S₀|. This approach is employed when n ≤ 50 and no ties are present in X, conditions under which the exact distribution is well-defined and computationally tractable.
Asymptotic Test for Large Samples or Tied Data
When the sample is large or ties are present in either variable, the exact permutation distribution is replaced by its normal approximation. Under H₀, the concordance sum S is asymptotically normally distributed with mean zero and variance given by the Kendall (1970) formula:
In this expression, \(v_0 = n(n-1)(2n+5)\) is the untied term; \(v_t = \sum_k t_k(t_k-1)(2t_k+5)\) and \(v_u = \sum_k u_k(u_k-1)(2u_k+5)\) are correction terms for tie groups \(t_k\) in X and \(u_k\) in Y respectively; \(v_1 = \sum_k t_k(t_k-1) \cdot \sum_k u_k(u_k-1)\) and \(v_2 = \sum_k t_k(t_k-1)(t_k-2) \cdot \sum_k u_k(u_k-1)(u_k-2)\) account for joint tie structure. The standardised test statistic is \(z = S / \sqrt{\text{Var}(S)}\), referred to the standard normal distribution. The approximation is satisfactory for n ≥ 10 in most practical applications.
Variants of the Kendall Coefficient: Tau-a, Tau-b, and Tau-c
Selection Among Rank Correlation Coefficients
Effect Size Classification
The following thresholds, adapted from Cohen (1988) for rank-based correlation coefficients, provide conventional benchmarks for interpreting the practical magnitude of Kendall's Tau-b. These benchmarks are descriptive guidelines and should be contextualised against domain-specific expectations and prior literature.
Assumptions Underlying the Test
- Independence of observation pairs. Each bivariate observation \((X_i, Y_i)\) must be statistically independent of all other pairs. Note that independence within a pair is not required; X and Y may be correlated, as that is precisely the quantity under investigation. Repeated-measures designs and clustered sampling schemes violate inter-pair independence and require specialised methods.
- Ordinal measurement scale. Both variables must be measured on at least an ordinal scale, meaning a consistent rank ordering of values must be meaningful. Kendall's Tau is applicable to continuous, discrete, and ordered categorical data, but it is not defined for purely nominal categories lacking a natural ordering.
- Random sampling. For inferences to generalise to the target population, observations must constitute a probability sample from that population. Results based on convenience samples may not support population-level conclusions regardless of statistical significance.
Selected Methodological Questions
Bounds of the Kendall's Tau Coefficient
Tau-b is bounded within the closed interval \([-1, +1]\) by construction. Tau-a reaches these bounds only when the data contain no ties. The value +1 is obtained when all pairs are concordant, which occurs when the rankings of X and Y are in perfect agreement. The value −1 arises when all pairs are discordant, corresponding to a perfect reversal of rankings.
Interpreting a Non-Significant Result
A non-significant Kendall's Tau indicates that the observed data do not provide sufficient evidence to reject the null hypothesis of no monotonic association at the chosen significance level. This does not constitute evidence that the two variables are independent. Insufficient statistical power, arising from a small sample size or a small true effect, may produce a non-significant result even when a real association exists. For this reason, the point estimate and confidence interval for Tau-b should always be reported alongside the significance decision.
Computation of the Confidence Interval for Tau-b
The confidence interval is constructed using the asymptotic standard error \(\text{SE}(\tau_b) = \sqrt{\text{Var}(S)} / \sqrt{(n_0 - n_1)(n_0 - n_2)}\), yielding the approximate Wald interval \(\tau_b \pm z_{\alpha/2} \cdot \text{SE}(\tau_b)\), constrained to \([-1, +1]\). For small samples (n < 25) or when Tau-b falls near the boundary of its range, bootstrap-based confidence intervals are preferable to the Wald approximation.
Application to Ordinal Likert-Scale Data
Kendall's Tau-b is well suited to Likert-scale responses. Likert items produce ordered categorical data (e.g., 1 = strongly disagree through 5 = strongly agree) that naturally induce ties, and Tau-b's denominator correction handles this tie structure in a theoretically principled manner. Pearson's r, by contrast, treats Likert responses as interval-level measurements with equal spacing, an assumption that is generally unjustified and that inflates or deflates the apparent magnitude of the association depending on the score distribution.