discont.README.txt 3.1 KB

1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950
  1. Discont
  2. Jean-Pierre Grimmeau - Université Libre de Bruxelles <grimmeau@ulb.ac.be>
  3. The discont algorithm systematically searches discontinuities in the slope
  4. of the cumulated frequencies curve, by approximating this curve through
  5. straight line segments whose vertices define the class breaks. This
  6. algorithm is inspired by techniques of automatic line generalization used
  7. in cartography [1]. The first approximation is a straight line which links
  8. the two end nodes of the curve. This line is then replaced by a
  9. two-segmented polyline whose central node is the point on the curve which
  10. is farthest from the preceding straight line. The point on the curve
  11. furthest from this new polyline is then chosen as a new node to create
  12. break up one of the two preceding segments, and so forth. The problem of
  13. the difference in terms of units between the two axes is solved by
  14. rescaling both amplitudes to an interval between 0 and 1. In the original
  15. algorithm, the process is stopped when the difference between the slopes
  16. of the two new segments is no longer significant. As the slope is the
  17. ratio between the frequency and the amplitude of the corresponding
  18. interval, i.e. its density, this effectively tests whether the frequencies
  19. of the two newly proposed classes are different from those obtained by
  20. simply distributing the sum of their frequencies amongst them in
  21. proportion to the class amplitudes.
  22. The algorithm described above creates class breaks which each are
  23. identical to a specific observation. It is thus necessary to decide to
  24. which class these observations should be attributed. It seems logical to
  25. prefer the densest, i.e. the one with the strongest slope. The
  26. automatisation of this method allows to distinguish classes with high
  27. frequencies from those with low frequencies, but also to introduce
  28. subtleties and to delimit transition classes.
  29. This method, inspired by Jenks' algorithm [2], provides a good analysis of
  30. the distribution, but not necessarily cartographically satisfying class
  31. breaks. It is thus up to the cartographer to judge whether all the
  32. identified breaks are cartographically useful (or whether some should be
  33. combined) and whether any of the class amplitudes is too large. In the
  34. latter case, the class should be subdivided into equal intervals
  35. (arithmetic progression) as by definition, the classes resulting from the
  36. discont algorithm have a homogeneous interior distribution. If the general
  37. distribution of the data is close to the normal distribution, it is also
  38. possible to combine equiprobable class breaks [3] , with their advantage
  39. of regularity, with discont class breaks for the extremes which often have
  40. large amplitudes when using equiprobable class breaks.
  41. [1] Douglas, D.H. & Peucker, T.K. (1973) Algorithms for the reduction of the number of points required to represent a digitized line or its caricature, The Canadian Cartographer, 10, pp. 112-122.
  42. [2] Jenks, G.F. (1963) Generalisation in statistical mapping, Annals of the Association of American Geographers, 53, pp.15-26.
  43. [3] Grimmeau, J.P. (1977) Cartographie par plages et discontinuités spatiales, Paris, Espace géographique, VI, pp.49-58.