Generalized Bayesian Posterior Expectation Distillation for Deep Neural Networks

Meet P Vadera; Brian Jalaian; Benjamin M Marlin

doi:10.48550/arxiv.2005.08110

Preprint

Open access

arXiv, version 1

05/16/2020

DOI: https://doi.org/10.48550/arxiv.2005.08110

Abstract

In this paper, we present a general framework for distilling expectations with respect to the Bayesian posterior distribution of a deep neural network classifier, extending prior work on the Bayesian Dark Knowledge framework. The proposed framework takes as input "teacher" and "student" model architectures and a general posterior expectation of interest. The distillation method performs an online compression of the selected posterior expectation using iteratively generated Monte Carlo samples. We focus on the posterior predictive distribution and expected entropy as distillation targets. We investigate several aspects of this framework, including the impact of uncertainty and the choice of student model architecture. We study methods for student model architecture search from a speed-storage-accuracy perspective and evaluate downstream tasks that leverage entropy distillation, including uncertainty ranking and out-of-distribution detection.
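The two distillation targets named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it only shows, under simplifying assumptions, how the posterior predictive distribution and the expected entropy for a single input could be accumulated online from Monte Carlo samples. The class `RunningExpectation`, the function `posterior_targets`, and the toy sample values are hypothetical names and data introduced for illustration, and the step that trains a student model to match these targets is omitted.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of one categorical distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


class RunningExpectation:
    """Online mean of a vector-valued quantity over Monte Carlo samples.

    Hypothetical helper: it stands in for the "online compression" idea by
    keeping only a running average rather than storing every sample.
    """

    def __init__(self, dim):
        self.count = 0
        self.mean = [0.0] * dim

    def update(self, value):
        self.count += 1
        self.mean = [m + (v - m) / self.count for m, v in zip(self.mean, value)]


def posterior_targets(sampled_probs):
    """Estimate both targets for a single input.

    sampled_probs: list of class-probability vectors, one per posterior
    sample of the teacher's parameters (assumed to be given).
    Returns (posterior predictive distribution, expected entropy).
    """
    num_classes = len(sampled_probs[0])
    predictive = RunningExpectation(num_classes)
    expected_entropy = RunningExpectation(1)
    for probs in sampled_probs:
        predictive.update(probs)                   # mean of p(y | x, theta_s)
        expected_entropy.update([entropy(probs)])  # mean of H[p(y | x, theta_s)]
    return predictive.mean, expected_entropy.mean[0]


# Toy data (made up): two posterior samples that disagree on a 2-class input.
samples = [[0.9, 0.1], [0.1, 0.9]]
predictive, exp_entropy = posterior_targets(samples)
```

In this toy case the averaged distribution is close to uniform, so its entropy exceeds the averaged per-sample entropy; the gap between the two reflects disagreement among posterior samples.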

Files and links (2)

pdf: Generalized Bayesian Posterior Expectation Distillation for Deep Neural Networks (preprint, 1.16 MB), CC BY V4.0, Open Access

url: Generalized Bayesian Posterior Expectation Distillation for Deep Neural Networks (link to preprint), CC BY V4.0, Open

Details

Title: Generalized Bayesian Posterior Expectation Distillation for Deep Neural Networks
Edition: version 1
Resource Type: Preprint
Publisher: arXiv
Format: pdf and link
Number of pages: 23
Identifiers: 99381512043606600
Academic Unit: Intelligent Systems and Robotics; Hal Marcus College of Science and Engineering
Language: English
