Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 May 1;75(Pt 3):593-599.
doi: 10.1107/S2053273319002729. Epub 2019 Apr 30.

A space for lattice representation and clustering

Affiliations

A space for lattice representation and clustering

Lawrence C Andrews et al. Acta Crystallogr A Found Adv. .

Abstract

Algorithms for quantifying the differences between two lattices are used for Bravais lattice determination, database lookup for unit cells to select candidates for molecular replacement, and recently for clustering to group together images from serial crystallography. It is particularly desirable for the differences between lattices to be computed as a perturbation-stable metric, i.e. as distances that satisfy the triangle inequality, so that standard tree-based nearest-neighbor algorithms can be used, and for which small changes in the lattices involved produce small changes in the distances computed. A perturbation-stable metric space related to the reduction algorithm of Selling and to the Bravais lattice determination methods of Delone is described. Two ways of representing the space, as six-dimensional real vectors or equivalently as three-dimensional complex vectors, are presented and applications of these metrics are discussed. (Note: in his later publications, Boris Delaunay used the Russian version of his surname, Delone.).

Keywords: Delaunay; Delone; Niggli; Selling; clustering; unit-cell reduction.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Example of a one-boundary virtual Cartesian point distance calculation. Only the VCP operation is shown, no reflections. Both points formula image and formula image are Selling reduced. The image is the three-dimensional, all-negative octant of the three formula image axes, formula image, formula image and formula image; the reduction is done along the formula image axis, and formula image and formula image are the two scalars that will be interchanged. The points are shown above or below the formula image/formula image plane, with their projections onto that plane marked with a +. To compute the minimal distance between formula image and formula image, begin by computing the Euclidean distance between the two. The formula image reduction transforms formula image into formula image, but the metric changes when going from negative formula image to positive formula image, so the simple Euclidean distance may not be minimal. To generate formula image to which the distance may be shorter, project formula image onto the formula image/formula image plane, transform that projected point, and subtract formula image from that point. The distance from formula image to formula image can now be used to decide whether it is shorter than the formula imageformula image Euclidean distance. The best distance for this case is the shorter of the distances between formula image and formula image as opposed to the distance between formula image and formula image.
Figure 2
Figure 2
In this figure, M represents the application of all 24 reflection matrices. V represents the generation of six virtual Cartesian points from an input point.
Figure 3
Figure 3
This an example of a one-boundary tunneled mirrored boundary distance calculation. As with Fig. 1 ▸ the 24 reflections are not shown. Both points formula image and formula image are Selling reduced. The image is the three-dimensional, all-negative octant of the three formula image axes, formula image, formula image, and formula image; the reduction is done along the formula image axis, and formula image and formula image are the two scalars that will be interchanged. The points are shown above the formula image/formula image plane, with their projections onto that plane marked with a circled ‘X’. The Euclidean distance from formula image to formula image is shown as a dotted line. Let mp be the mirror point on the boundary going from formula image to formula image via the boundary. Then the shortest distance from formula image to mp to formula image is also shown as a dotted line. The transformed image of mp is mpx. The distance between formula image and mp is the same as the distance between a transformed formula image and mpx. There is a no-cost tunnel from mp to mpx. So the total alternative distance for this case is the distance between formula image and mp plus the distance from mpx to formula image (shown as a dashed line).
Figure 4
Figure 4
Distance between points using the Follower algorithm. To verify the distance algorithms, the ‘Follower’ algorithm has been developed. Follower chooses two points and determines the distance between one of them and all of the points on a line between the two original points. Here, one unreduced point is chosen and the second point is the reduced point of that point. So the distance between the original point and the final point is zero. Distances are shown for the formula image metric (Andrews & Bernstein, 2014 ▸), the formula image metric (Andrews et al., 1980 ▸), the formula image metric (Andrews et al., 2019 ▸), and the two implementations in formula image. Timing in ms: formula image (NCDist) 4542, formula image 676, formula image 7, S6Dist 394, CS6Dist 14.

References

    1. Andrews, L. C. & Bernstein, H. J. (1988). Acta Cryst. A44, 1009–1018.
    1. Andrews, L. C. & Bernstein, H. J. (1995). Acta Cryst. A51, 413–416.
    1. Andrews, L. C. & Bernstein, H. J. (2014). J. Appl. Cryst. 47, 346–359. - PMC - PubMed
    1. Andrews, L. C., Bernstein, H. J. & Pelletier, G. A. (1980). Acta Cryst. A36, 248–252.
    1. Andrews, L. C., Bernstein, H. J. & Sauter, N. K. (2019). Acta Cryst. A75, 115–120. - PMC - PubMed