
Next: Parameters Up: KSTAT Prev: Input output data
Ties
The K statistic is an "order statistic" - it is based on comparing
the order of the values in the two arrays - and it needs correction
if there are many identical values in either array. This is achieved
by counting the number of "tied pairs", which can be larger than the
number of array elements. For example the 10 element array: 1 2 2 3
7 7 7 7 7 9, contains a total of 11 ties (one tie in 2, and ten in 7).
KSTAT corrects for such ties automatically, and informs the user of
the number of ties in each input array. The correction factor applied
is
N_p/sqrt((N_p-N_t1)*(N_p-N_t2))
where N_p is the total number of pairs ( = 0.5*N*(N-1), where
N is the array length) and N_t1 and N_t2
are the number of tied pairs in each array. So long as the number of ties
is substantially smaller than the total number of pairs this correction
will be small (ie. near unity).

Next: Parameters Up: KSTAT Prev: Input output data