Review

Applications and Techniques for Fast Machine Learning in Science

Allison McCarn Deiana et al. Front Big Data. 2022 Apr 12;5:787421. doi: 10.3389/fdata.2022.787421. eCollection 2022.

Erratum in

  • Corrigendum: Applications and techniques for fast machine learning in science.
    Deiana AM, Tran N, Agar J, Blott M, Di Guglielmo G, Duarte J, Harris P, Hauck S, Liu M, Neubauer MS, Ngadiuba J, Ogrenci-Memik S, Pierini M, Aarrestad T, Bähr S, Becker J, Berthold AS, Bonventre RJ, Müller Bravo TE, Diefenthaler M, Dong Z, Fritzsche N, Gholami A, Govorkova E, Guo D, Hazelwood KJ, Herwig C, Khan B, Kim S, Klijnsma T, Liu Y, Lo KH, Nguyen T, Pezzullo G, Rasoulinezhad S, Rivera RA, Scholberg K, Selig J, Sen S, Strukov D, Tang W, Thais S, Unger KL, Vilalta R, von Krosigk B, Wang S, Warburton TK. Front Big Data. 2023 Oct 16;6:1301942. doi: 10.3389/fdata.2023.1301942. eCollection 2023. PMID: 37908454.

Abstract

In this community review report, we discuss applications and techniques for fast machine learning (ML) in science: the concept of integrating powerful ML methods into the real-time experimental data processing loop to accelerate scientific discovery. The material for the report builds on two workshops held by the Fast ML for Science community and covers three main areas: applications for fast ML across a number of scientific domains; techniques for training and implementing performant and resource-efficient ML algorithms; and computing architectures, platforms, and technologies for deploying these algorithms. We also present overlapping challenges across the multiple scientific domains where common solutions can be found. This community report is intended to provide many examples of, and inspiration for, scientific discovery through integrated and accelerated ML solutions. It is followed by a high-level overview and organization of technical advances, including an abundance of pointers to source material, which can enable these breakthroughs.

Keywords: big data; codesign; coprocessors; fast machine learning; heterogeneous computing; machine learning for science; particle physics.


Conflict of interest statement

MB was employed by the company Xilinx Inc. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The handling editor EC is currently organizing a Research Topic with the authors JD, ML, and JN.

Figures

Figure 1
The concept behind this review paper is to find the confluence of domain-specific challenges, machine learning, and experiment and computer system architectures to accelerate scientific discovery.
Figure 2
High-level overview of the stages in a GNN-based tracking pipeline. Only a subset of the typical edge weights are shown for illustration purposes. (A) Graph construction, (B) edge classification, and (C) track construction.
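The three stages named in the caption can be sketched in plain NumPy. The distance-based edge scorer below is a hypothetical stand-in for the trained GNN, which would instead produce edge weights from learned node embeddings; function names and thresholds are illustrative, not from the paper.

```python
import numpy as np

def build_graph(hits, max_dist=1.5):
    """(A) Graph construction: connect pairs of detector hits closer than max_dist."""
    edges = []
    for i in range(len(hits)):
        for j in range(i + 1, len(hits)):
            if np.linalg.norm(hits[i] - hits[j]) < max_dist:
                edges.append((i, j))
    return edges

def score_edges(hits, edges):
    """(B) Edge classification: toy weight that decays with hit separation;
    a trained GNN would compute these weights from learned embeddings."""
    return {e: 1.0 / (1.0 + np.linalg.norm(hits[e[0]] - hits[e[1]])) for e in edges}

def build_tracks(edges, weights, threshold=0.5):
    """(C) Track construction: keep high-weight edges and group the
    surviving hits into connected components (track candidates)."""
    keep = [e for e in edges if weights[e] > threshold]
    parent = {}
    def find(x):                      # union-find with path compression
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x
    for a, b in keep:
        parent[find(a)] = find(b)
    tracks = {}
    for a, b in keep:
        tracks.setdefault(find(a), set()).update((a, b))
    return list(tracks.values())
```

For example, three collinear hits spaced one unit apart form a single track candidate, while a distant noise hit is left unconnected.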
Figure 3
Simulated type Ia supernova light-curve and classification. Top: calibrated flux evolution in different DES band-passes as a function of normalized time (the first photometric measurement is set to time equals zero). Bottom: baseline RNN classification probability evolution with respect to time; no host-galaxy redshift information was provided. At each photometric measurement, an updated classification probability is obtained. The maximum light of the simulated supernova is shown as a gray dashed line, and the simulated redshift of the supernova (z = 0.466) is shown at the top. We highlight that redshift is not used for this classification but can improve results. Our baseline RNN classifies this light-curve as a type Ia SN with high accuracy before maximum light, requiring only a handful of photometric epochs (Möller and de Boissiére, 2019).
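The per-epoch probability update described in the caption can be illustrated with a minimal recurrent classifier. The fixed scalar weights below are purely illustrative; the baseline RNN of Möller and de Boissiére learns its parameters from simulated light-curves and processes multi-band flux vectors, not a single flux value.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def rnn_classify(fluxes, w_h=0.5, w_x=1.0, w_out=1.0):
    """Toy recurrent classifier: after each photometric epoch it emits an
    updated probability that the light-curve is a type Ia SN.
    Weights are fixed for illustration; a real model learns them."""
    h = 0.0
    probs = []
    for x in fluxes:
        h = np.tanh(w_h * h + w_x * x)    # recurrent state update per epoch
        probs.append(sigmoid(w_out * h))  # classification probability so far
    return probs
```

The key property mirrored here is that a probability is available after every measurement, so a confident classification can be issued before maximum light.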
Figure 4
A 6 GeV/c electron event in the ProtoDUNE detector. The x-axis shows the wire number. The y-axis shows the time tick in units of 0.5 μs. The color scale represents the charge deposition.
Figure 5
Experimental 4D-STEM measurement of a dichalcogenide 2D material. The atomic map is inferred from the data; each diffraction pattern represents an average of 7 × 7 experimental images. Green STEM probes are labeled for regions of the sample with one layer, vacuum, and two layers (Ophus, 2019).
Figure 6
Illustration of hardware-aware quantization and pruning. A given NN model can be compressed by using low-precision quantization instead of single precision. The extreme case is 0-bit quantization, which is equivalent to removing (pruning) the corresponding neurons. The goal of compression is to find the best bit-precision setting for quantization/pruning that reduces the model footprint/latency on target hardware with minimal generalization loss.
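The precision/pruning trade-off in the caption can be made concrete with uniform symmetric quantization, where 0 bits collapses to pruning and the reconstruction error serves as a crude proxy for generalization loss. This is a generic sketch, not the paper's method; function names are illustrative.

```python
import numpy as np

def quantize(weights, bits):
    """Uniform symmetric quantization of a weight tensor to `bits` bits.
    bits == 0 is treated as pruning: the weights are removed entirely."""
    if bits == 0:
        return np.zeros_like(weights)          # 0-bit == prune the neurons
    if bits == 1:
        raise ValueError("1-bit needs a dedicated sign/scale scheme")
    levels = 2 ** (bits - 1) - 1               # e.g., 127 for 8-bit
    scale = np.abs(weights).max() / levels
    if scale == 0:
        return np.zeros_like(weights)
    q = np.clip(np.round(weights / scale), -levels, levels)
    return q * scale                           # dequantize for comparison

def quantization_error(weights, bits):
    """Proxy for generalization loss: L2 error introduced at a given precision."""
    return float(np.linalg.norm(weights - quantize(weights, bits)))
```

Sweeping `bits` over candidate precisions per layer and picking the lowest precision whose error stays within budget is the essence of the mixed-precision search the figure describes.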
Figure 7
Taxonomy of compute architectures, differentiating CPUs, GPUs, and DPUs.
Figure 8
DPU architectures: a Matrix of Processing Engines (MPE) on the left, and a spatial architecture on the right.
Figure 9
FINN compiler flow.
Figure 10
Analog vector-by-matrix multiplication (VMM) in a crossbar circuit with adjustable crosspoint devices. For clarity, the output signal is shown for just one column of the array, and the sense amplifier circuitry is not shown. Note that other VMM designs, e.g., utilizing the duration of applied voltage pulses rather than their amplitudes for encoding inputs/outputs, are now being actively explored; see, e.g., their brief review in Bavandpour et al. (2018).
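The physics behind the crossbar VMM reduces to two laws: each crosspoint conducts I = G·V (Ohm's law) and each column wire sums its currents (Kirchhoff's current law). A minimal numerical model, with multiplicative conductance noise as a crude stand-in for device-to-device variation (the noise model is an assumption, not from the figure):

```python
import numpy as np

def crossbar_vmm(G, v):
    """Ideal crossbar VMM: input voltages v drive the rows, crosspoint (i, j)
    conducts G[i, j] * v[i], and column j sums its currents, so i_out = G.T @ v."""
    return G.T @ v

def noisy_crossbar_vmm(G, v, sigma=0.05, seed=None):
    """Same computation with multiplicative conductance noise, a simple
    stand-in for analog device variability."""
    rng = np.random.default_rng(seed)
    G_actual = G * (1.0 + sigma * rng.standard_normal(G.shape))
    return G_actual.T @ v
```

The appeal of the scheme is that the whole matrix-vector product happens in one analog step, with the matrix stored in place as device conductances.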
