Abstract
Neutron crystallography offers enormous potential to complement structures from X-ray crystallography by clarifying the positions of low-Z elements, namely hydrogen. Macromolecular neutron crystallography, however, remains limited, in part owing to the challenge of integrating peak shapes from pulsed-source experiments. To advance existing software, this article demonstrates the use of machine learning to refine peak locations, predict peak shapes and yield more accurate integrated intensities when applied to whole data sets from a protein crystal. The artificial neural network, based on the U-Net architecture commonly used for image segmentation, is trained using about 100 000 simulated training peaks derived from strong peaks. After 100 training epochs (a round of training over the whole data set broken into smaller batches), training converges and achieves a Dice coefficient of around 65%, in contrast to just 15% for negative control data sets. Integrating whole peak sets using the neural network yields improved intensity statistics compared with other integration methods, including k-nearest neighbours. These results demonstrate, for the first time, that neural networks can learn peak shapes and be used to integrate Bragg peaks. It is expected that integration using neural networks can be further developed to increase the quality of neutron, electron and X-ray crystallography data.
Original language | English |
---|---|
Pages (from-to) | 854-863 |
Number of pages | 10 |
Journal | Journal of Applied Crystallography |
Volume | 52 |
Issue number | 4 |
DOIs | |
State | Published - Aug 2019 |
Funding
This work was funded through grant R01-GM071939 from the National Institutes of Health. The neutron scattering measurements were carried out on the MaNDi instrument at the Spallation Neutron Source, which is sponsored by the Division of Scientific User Facilities, Office of Basic Energy Sciences, US Department of Energy, under contract No. DEAC05- 00OR22725 with UT-Battelle, LLC. This work used samples grown at Oak Ridge National Laboratory's Center for Structural and Molecular Biology (CSMB) which is funded by the Office of Biological Environment Research in the Department of Energy's Office of Science. This work was funded through grant R01-GM071939 from the National Institutes of Health. The neutron scattering measurements were carried out on the MaNDi instrument at the Spallation Neutron Source, which is sponsored by the Division of Scientific User Facilities, Office of Basic Energy Sciences, US Department of Energy, under contract No. DE-AC05-00OR22725 with UT-Battelle, LLC. This work used samples grown at Oak Ridge National Laboratory’s Center for Structural and Molecular Biology (CSMB) which is funded by the Office of Biological Environment Research in the Department of Energy’s Office of Science.
Funders | Funder number |
---|---|
Division of Scientific User Facilities | |
Oak Ridge National Laboratory | |
Oak Ridge National Laboratory | |
Office of Basic Energy Sciences | |
Office of Biological Environment Research | |
US Department of Energy | DEAC05- 00OR22725 |
National Institutes of Health | |
U.S. Department of Energy | |
Office of Science | |
Canadian Society for Molecular Biosciences |
Keywords
- Computational modelling
- Integration
- Machine learning
- Neural networks
- Neutron crystallography