Paper 13479-8
Bullet proofing Bayesian deep learning with particle flow
14 April 2025 • 11:00 AM - 11:20 AM EDT
Abstract
We derive a new theory of Bayesian deep learning with particle flow that bullet proofs the algorithm against stiffness. Bayesian tools, such as STAN, which uses Hamiltonian Monte Carlo, are plagued by extremely stiff flows. Moreover, if you look under the hood of the famous Adam algorithm for deep learning, you will see that Adam was carefully designed to mitigate stiffness. Nevertheless, it is common to use extremely small learning rates for deep learning (e.g., 0.00001) despite the use of Adam!!! Our new theory avoids such a waste of precious GPU resources. As a result, our new theory speeds up Bayesian deep learning by many orders of magnitude. Furthermore, the new theory allows us to avoid fancy stiff ODE solvers in STAN that require a large amount of computer run time, and which are not parallelizable. Our Bayesian particle flow is embarrassingly parallelizable. Our paper generalizes the recent work in Dai & Daum, IEEE AESS Transactions June 2023.
Presenter
Raytheon (United States)
In May 2024 Fred Daum was awarded the IEEE Gold Medal for research and development of phased array radars. This prize comes with a real gold medal as well as $20,000 (before taxes). Fred has published over 100 technical papers, and he has given invited talks at MIT, Harvard, Caltech, Yale, Technion, and many other schools. His h-index is 31, and his Erdös number is 5. He designed, analyzed and tested essentially all of the real time algorithms for long range phased array radars built by the USA in the last 5 decades. Fred was using cognitive radar in many real world operational systems many decades before Professor Simon Haykin published his famous paper claiming that cognitive radar was a new idea. Fred invented several exact finite dimensional nonlinear filters that generalize the Kalman and Benes filters. Fred's Bayesian particle flow filter is many orders of magnitude faster than standard particle filters.