How is p2 calculated for Mahalanobis distances?

The following table of Mahalanobis distances was obtained from an analysis of data with 73 cases. Only the first five rows of the table are shown here.

Observation number	Mahalanobis d-squared	p1	p2
42	18.7468824	.0046132	.2864768
20	17.2011378	.0085718	.1299040
3	13.2641516	.0390278	.5461262
35	12.9541160	.0437704	.3973690
28	12.7304279	.0475222	.2662369
...	...	...	...

In what follows, I will write d² for d-squared, p₁ for p1 and p₂ for p2.

The meaning of p₁ and p₂

The first row of the table shows that p₁ = .0046132 and p₂ = .2864768 for case 42, which is the one case out of 73 cases that is furthest from the centroid in Mahalanobis d² units. This means that

p₁ = P(d² for case 42 > 18.7468824) = .0046132

and

p₂ = P(The largest d² > 18.7468824) = .2864768

Calculating p₂ for the case with the largest d²

Here is how p₂ was calculated for the case furthest from the centroid:

p₂ = P(The largest d² > 18.7468824)

= 1 - P(The largest d² <= 18.7468824)

= 1 - P(All 73 d² values are <= 18.7468824)

= 1 - (1 - .0046132)⁷³ = 0.28648

Calculating p₂ for the case with the second largest d²

p₂ for the case that is second-furthest from the centroid (the case in the second row of the table) was calculated as follows.

p₂ = P(The second-largest d² > 17.2011378)

= 1 - P(The second-largest d² <= 17.2011378)

= 1 - P(exactly 72 or 73 cases have d² <= 17.2011378)

= 1 - P(exactly 72 cases have d² <= 17.2011378)

- P(exactly 73 cases have d² <= 17.2011378)

= 1 - ₇₃C₇₂(1 - .0085718)⁷²(.0085718)¹ - ₇₃C₇₃(1 - .0085718)⁷³(.0085718)⁰

= .12990

where _NC_k is the number of subsets of k objects in a set of N objects.

Calculating p₂ for the case with the k-th largest d²

In general, for the case that is k-th furthest from the centroid (meaning that there are k-1 cases further from the centroid), p₂ is calculated by first evaluating p₁ for that case and then calculating

p₂ = 1 - _NC_N-0(1-p₁)^N(p₁)⁰

- _NC_N-1(1-p₁)^N-1(p₁)¹

- _NC_N-2(1-p₁)^N-2(p₁)²

...

- _NC_N-k+1(1-p₁)^N-k+1(p₁)^k-1

where N is the number of cases.

How is p2 calculated for Mahalanobis distances?

The meaning of p1 and p2

Calculating p2 for the case with the largest d2

Calculating p2 for the case with the second largest d2

Calculating p2 for the case with the k-th largest d2

The meaning of p₁ and p₂

Calculating p₂ for the case with the largest d²

Calculating p₂ for the case with the second largest d²

Calculating p₂ for the case with the k-th largest d²