Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 1;77(Pt 1):1-10.
doi: 10.1107/S2059798320014746. Epub 2021 Jan 1.

Phasertng: directed acyclic graphs for crystallographic phasing

Affiliations

Phasertng: directed acyclic graphs for crystallographic phasing

Airlie J McCoy et al. Acta Crystallogr D Struct Biol. .

Abstract

Crystallographic phasing strategies increasingly require the exploration and ranking of many hypotheses about the number, types and positions of atoms, molecules and/or molecular fragments in the unit cell, each with only a small chance of being correct. Accelerating this move has been improvements in phasing methods, which are now able to extract phase information from the placement of very small fragments of structure, from weak experimental phasing signal or from combinations of molecular replacement and experimental phasing information. Describing phasing in terms of a directed acyclic graph allows graph-management software to track and manage the path to structure solution. The crystallographic software supporting the graph data structure must be strictly modular so that nodes in the graph are efficiently generated by the encapsulated functionality. To this end, the development of new software, Phasertng, which uses directed acyclic graphs natively for input/output, has been initiated. In Phasertng, the codebase of Phaser has been rebuilt, with an emphasis on modularity, on scripting, on speed and on continuing algorithm development. As a first application of phasertng, its advantages are demonstrated in the context of phasertng.xtricorder, a tool to analyse and triage merged data in preparation for molecular replacement or experimental phasing. The description of the phasing strategy with directed acyclic graphs is a generalization that extends beyond the functionality of Phasertng, as it can incorporate results from bioinformatics and other crystallographic tools, and will facilitate multifaceted search strategies, dynamic ranking of alternative search pathways and the exploitation of machine learning to further improve phasing strategies.

Keywords: Phaser; Phasertng; SAD phasing; directed acyclic graphs; machine learning; maximum likelihood; molecular replacement.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(a) Path, (b) tree, (c) directed acyclic graph. The root node is shown in red, leaf nodes are shown in green and intermediate are shown in blue, except that all nodes with two parents are shown in purple.
Figure 2
Figure 2
(a) Schematic for the DAG resulting from user input and phasertng.xtricorder with a colouring scheme as in Fig. 1 ▸. Nodes outlined in orange are those generated by phasertng.xtricorder. Multiple arrows indicate where the DAG may branch. (b) Schematic of DAG for structure solution of PDB entry 4n3e (Sliwiak et al., 2014 ▸). The crystal was obtained by co-crystallization of Hyp-1 with an eightfold molar excess of the ligand 8-anilino-1-naphthalene sulfonate (ANS). Strong fluorescence under UV illumination confirmed the presence of ANS in the crystals. Data were collected from a single crystal at a wavelength of 1.00 Å. The data extended to 2.4 Å resolution. There were no systematic absences of reflections along the axes and the highest symmetry in which the data merged was space group P422. Significant anisotropy was present. TNCS of order 7 was suspected as a result of analysis of the Patterson map, with the absence of TNCS maintained as a hypothesis. Twinning was detected in the intensity statistics after TNCS corrections for order 7. Since overmerging is possible in the presence of twinning, data were expanded to all subgroups of P422. Twinning was not detected in the absence of TNCS. At the conclusion of phasertng.xtricorder data analysis there are eight hypotheses for the contents of the asymmetric unit; solutions can be obtained for two of these hypotheses.
Figure 3
Figure 3
Number of reflections in each test case for the database of 30 test cases used for the speed tests in Figs. 4 ▸ and 5 ▸. PDB identifiers are shown for data with no TNCS (orange), for data with TNCS of order 2 (light blue) and data with TNCS of order greater than 2 (dark blue).
Figure 4
Figure 4
Comparison of the fold speedup of wall time for Phaser and Phasertng with and without threading over five threads. (a) Data with no TNCS, (b) data with TNCS of order 2 and (c) data with TNCS of order greater than 2. Four times are shown for each PDB identifier: Phaser without threading (blue), Phaser threaded on five cores (red), Phasertng without threading (green) and Phasertng threaded on five cores (purple). The longest runtime for each PDB identifier is shown in seconds above each column group, which is for Phaser without threading in every case.
Figure 5
Figure 5
Comparison of the fold speedup of wall time for Phaser (red) and Phasertng (purple) with threading for between one and five threads. The elapsed wall time is shown above each column in seconds for (a) data with no TNCS (PDB entry 1za7), (b) data with TNCS of order 2 (PDB entry 1hto) and (c) data with TNCS of order greater than 2 (PDB entry 4n3e).

References

    1. Bernstein, B. E., Michels, P. A. M. & Hol, W. G. J. (1997). Nature, 385, 275–278. - PubMed
    1. Bibby, J., Keegan, R. M., Mayans, O., Winn, M. D. & Rigden, D. J. (2012). Acta Cryst. D68, 1622–1631. - PubMed
    1. Blow, D. M. & Crick, F. H. C. (1959). Acta Cryst. 12, 794–802.
    1. Bricogne, G. (1992). Proceedings of the CCP4 Study Weekend. Molecular Replacement, edited by W. Wolf, E. J. Dodson & S. Gover, pp. 62–75. Warrington: Daresbury Laboratory.
    1. Bricogne, G. (1997). Methods Enzymol. 276, 361–423. - PubMed