On the Difference between the Information Bottleneck and the Deep Information Bottleneck

Research Database / FORSCHUNGSDATENBANK

Publication 4615456 | Verified

JournalArticle (original research article in a scientific journal)

ID	4615456
Author(s)	Wieczorek, Aleksander; Roth, Volker
Author(s) at UniBasel	Roth, Volker; Wieczorek, Aleksander
Year	2020
Title	On the Difference between the Information Bottleneck and the Deep Information Bottleneck
Journal	Entropy
Volume	22
Number	2
Pages / Article-Number	131
Keywords	Markov assumption; Markov chain; conditional independence; deep variational information bottleneck; information bottleneck; mutual information
Abstract	Combining the information bottleneck model with deep learning by replacing mutual information terms with deep neural nets has proven successful in areas ranging from generative modelling to interpreting deep neural networks. In this paper, we revisit the deep variational information bottleneck and the assumptions needed for its derivation. The two assumed properties of the data, X and Y, and their latent representation T, take the form of two Markov chains T - X - Y and X - T - Y. Requiring both to hold during the optimisation process can be limiting for the set of potential joint distributions P(X, Y, T). We, therefore, show how to circumvent this limitation by optimising a lower bound for the mutual information between T and Y, I(T; Y), for which only the latter Markov chain has to be satisfied. The mutual information I(T; Y) can be split into two non-negative parts. The first part is the lower bound for I(T; Y), which is optimised in deep variational information bottleneck (DVIB) and cognate models in practice. The second part consists of two terms that measure how much the former requirement T - X - Y is violated. Finally, we propose interpreting the family of information bottleneck models as directed graphical models, and show that in this framework, the original and deep information bottlenecks are special cases of a fundamental IB model.
Publisher	MDPI
ISSN/ISBN	1099-4300
URL	http://www.ncbi.nlm.nih.gov/pmc/articles/pmc7516540/
edoc-URL	https://edoc.unibas.ch/81734/
Full Text on edoc	No
Digital Object Identifier DOI	10.3390/e22020131
PubMed ID	http://www.ncbi.nlm.nih.gov/pubmed/33285906
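
Illustrative note (not part of the publication record): the abstract's Markov chain T - X - Y can be made concrete on a small discrete example. The sketch below builds a joint distribution P(X, Y, T) = P(X, Y) P(T | X), so the chain holds by construction, and then checks two standard consequences: the conditional mutual information I(T; Y | X) vanishes, and I(T; Y) does not exceed I(X; Y) (data processing inequality). The toy probabilities and all function names are hypothetical, and the conditional mutual information is used here only as a generic measure of how far the chain is from holding; it is not claimed to be one of the violation terms derived in the paper.

```python
import math
from itertools import product


def mutual_information(joint):
    """I(A;B) in nats for a joint given as {(a, b): probability}."""
    pa, pb = {}, {}
    for (a, b), p in joint.items():
        pa[a] = pa.get(a, 0.0) + p
        pb[b] = pb.get(b, 0.0) + p
    return sum(p * math.log(p / (pa[a] * pb[b]))
               for (a, b), p in joint.items() if p > 0)


def marginal(joint3, keep):
    """Marginalise {(x, y, t): probability} onto the index positions in `keep`."""
    out = {}
    for key, p in joint3.items():
        k = tuple(key[i] for i in keep)
        out[k] = out.get(k, 0.0) + p
    return out


def conditional_mi_t_y_given_x(joint3):
    """I(T;Y|X) = sum_x P(x) * I(T;Y | X=x) for {(x, y, t): probability}."""
    total = 0.0
    for (x,), p_x in marginal(joint3, (0,)).items():
        cond = {(t, y): p / p_x
                for (xx, y, t), p in joint3.items() if xx == x}
        total += p_x * mutual_information(cond)
    return total


# Hypothetical toy distributions (assumptions for illustration only).
p_xy = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.1, (1, 1): 0.4}
p_t_given_x = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.2, 1: 0.8}}

# P(x, y, t) = P(x, y) * P(t | x): T depends on Y only through X.
p_xyt = {(x, y, t): p_xy[(x, y)] * p_t_given_x[x][t]
         for (x, y), t in product(p_xy, (0, 1))}

i_xy = mutual_information(p_xy)
i_ty = mutual_information(marginal(p_xyt, (2, 1)))
i_ty_given_x = conditional_mi_t_y_given_x(p_xyt)

print(f"I(X;Y)   = {i_xy:.4f} nats")
print(f"I(T;Y)   = {i_ty:.4f} nats")
print(f"I(T;Y|X) = {i_ty_given_x:.2e} nats")
```

Replacing P(T | X) with a table that also depends on Y would make I(T; Y | X) strictly positive, which is the kind of chain violation the abstract refers to.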

10/05/2024
