Abstract
AGGLOMERATIVE hierarchical methods of computer classification all begin by calculating distance-measures between elements. The hierarchy is then generated by subjecting these measures to a sorting-strategy, which depends essentially on the definition of a distance-measure between groups of elements. In nearest-neighbour sorting, this is defined as the distance between the closest pair of elements, one in each group. Macnaughton-Smith has pointed out that much more intense clustering can be produced by taking the most remote pair of elements (furthest-neighbour sorting). In group-average sorting1 the distance is defined as the mean of all between-group inter-element distances; in centroid sorting it is the distance between group centroids, defined by a conventional Euclidean model. In median2 sorting the distance of a third group from two which have just fused depends on the previous three inter-group distances in the manner of Apollonius's theorem. Although the earlier of these strategies have received some comparative assessment1,3–5 no attempt seems to have been made to generalize them into a single system. As a result, quite different computer strategies have commonly been used, necessitating a separate computer program for each.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 51 print issues and online access
$199.00 per year
only $3.90 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Sokal, R. R., and Michener, C. D., Univ. Kansas Sci. Bull., 38, 1409 (1958).
Gower, J. C., Biometrics (in the press).
Sokal, R. R., and Sneath, P. H. A., Principles of Numerical Taxonomy (Freeman, San Francisco and London, 1963).
Williams, W. T., and Dale, M. B., Adv. Bot. Res., 2, 35 (1965).
Williams, W. T., Lambert, J. M., and Lance, G. N., J. Ecol., 54, 427 (1966).
Lance, G. N., and Williams, W. T., Comp. J., 9, 60 (1966).
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
LANCE, G., WILLIAMS, W. A Generalized Sorting Strategy for Computer Classifications. Nature 212, 218 (1966). https://doi.org/10.1038/212218a0
Issue date:
DOI: https://doi.org/10.1038/212218a0
This article is cited by
-
Natural and electro-flocculation of Cr, Cd, Co, and Ni during estuarine mixing
International Journal of Environmental Science and Technology (2023)
-
Clustering: an R library to facilitate the analysis and comparison of cluster algorithms
Progress in Artificial Intelligence (2023)
-
Using Projection-Based Clustering to Find Distance- and Density-Based Clusters in High-Dimensional Data
Journal of Classification (2021)