Impact of Parameter Sparsity on Stochastic Gradient MCMC Methods for Bayesian Deep Learning

Meet P Vadera; Adam D Cobb; Brian Jalaian; Benjamin M Marlin

doi:10.48550/arxiv.2202.03770

Back

Impact of Parameter Sparsity on Stochastic Gradient MCMC Methods for Bayesian Deep Learning

Preprint

Open access

Impact of Parameter Sparsity on Stochastic Gradient MCMC Methods for Bayesian Deep Learning

Meet P Vadera, Adam D Cobb, Brian Jalaian and Benjamin M Marlin

arXiv

02/08/2022

DOI: https://doi.org/10.48550/arxiv.2202.03770

Metrics

1 File views/ downloads

6 Record Views

Abstract

Bayesian methods hold significant promise for improving the uncertainty quantification ability and robustness of deep neural network models. Recent research has seen the investigation of a number of approximate Bayesian inference methods for deep neural networks, building on both the variational Bayesian and Markov chain Monte Carlo (MCMC) frameworks. A fundamental issue with MCMC methods is that the improvements they enable are obtained at the expense of increased computation time and model storage costs. In this paper, we investigate the potential of sparse network structures to flexibly trade-off model storage costs and inference run time against predictive performance and uncertainty quantification ability. We use stochastic gradient MCMC methods as the core Bayesian inference method and consider a variety of approaches for selecting sparse network structures. Surprisingly, our results show that certain classes of randomly selected substructures can perform as well as substructures derived from state-of-the-art iterative pruning methods while drastically reducing model training times.

Files and links (2)

pdf

Impact of Parameter Sparsity on Stochastic Gradient MCMC Methods for Bayesian Deep Learning658.33 kBDownload View

Preprint Article pdfCC BY V4.0, Open Access

url

Impact of Parameter Sparsity on Stochastic Gradient MCMC Methods for Bayesian Deep LearningView

Preprint link to articleCC BY V4.0, Open

Details

Title: Impact of Parameter Sparsity on Stochastic Gradient MCMC Methods for Bayesian Deep Learning
Resource Type: Preprint
Publisher: arXiv; https://arxiv.org/
Format: pdf and link
Number of pages: 19
Identifiers: 99381512041706600
Academic Unit: Intelligent Systems and Robotics; Hal Marcus College of Science and Engineering
Language: English

Impact of Parameter Sparsity on Stochastic Gradient MCMC Methods for Bayesian Deep Learning

Metrics

Abstract

Files and links (2)

Details

University of West Florida Social media