The following article is Open access

Computationally efficient algorithms for the two-dimensional Kolmogorov–Smirnov test

, and

Published under licence by IOP Publishing Ltd
, , Citation R H C Lopes et al 2008 J. Phys.: Conf. Ser. 119 042019 DOI 10.1088/1742-6596/119/4/042019

1742-6596/119/4/042019

Abstract

Goodness-of-fit statistics measure the compatibility of random samples against some theoretical or reference probability distribution function. The classical one-dimensional Kolmogorov-Smirnov test is a non-parametric statistic for comparing two empirical distributions which defines the largest absolute difference between the two cumulative distribution functions as a measure of disagreement. Adapting this test to more than one dimension is a challenge because there are 2d-1 independent ways of ordering a cumulative distribution function in d dimensions. We discuss Peacock's version of the Kolmogorov-Smirnov test for two-dimensional data sets which computes the differences between cumulative distribution functions in 4n2 quadrants. We also examine Fasano and Franceschini's variation of Peacock's test, Cooke's algorithm for Peacock's test, and ROOT's version of the two-dimensional Kolmogorov-Smirnov test. We establish a lower-bound limit on the work for computing Peacock's test of Ω(n2lgn), introducing optimal algorithms for both this and Fasano and Franceschini's test, and show that Cooke's algorithm is not a faithful implementation of Peacock's test. We also discuss and evaluate parallel algorithms for Peacock's test.

Export citation and abstract BibTeX RIS

Please wait… references are loading.
10.1088/1742-6596/119/4/042019