Publication record · 18.cifr/2013.kingma.vae

Auto-Encoding Variational Bayes

v1.0.0

Diederik P Kingma (University of Amsterdam), Max Welling (University of Amsterdam)

RAI18.cifr/2013.kingma.vae

arXiv / ICLR 2014 · 2013 · doi:10.48550/arXiv.1312.6114

How can we perform efficient inference and learning in directed probabilistic models, in the presence of continuous latent variables with intractable posterior distributions, and large datasets? We introduce a stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case. Our contributions are two-fold. First, we show that a reparameterization of the variational lower bound yields a lower bound estimator that can be straightforwardly optimized using standard stochastic gradient methods. Second, we show that for i.i.d. datasets with continuous latent variables per datapoint, posterior inference can be made especially efficient by fitting an approximate inference model (also called a recognition model) to the intractable posterior using the proposed lower bound estimator.
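For reference, the lower bound and the reparameterized estimator that the abstract describes can be written as follows. This is a sketch in the paper's usual notation; the symbols (θ for generative parameters, φ for variational parameters, g_φ for the differentiable transformation, L for the number of noise samples per datapoint) do not appear in this record and are introduced here for illustration.

```latex
% Variational lower bound for a single datapoint x^{(i)}
\log p_\theta(x^{(i)}) \;\ge\; \mathcal{L}(\theta, \phi; x^{(i)})
  = -D_{KL}\!\left(q_\phi(z \mid x^{(i)}) \,\|\, p_\theta(z)\right)
    + \mathbb{E}_{q_\phi(z \mid x^{(i)})}\!\left[\log p_\theta(x^{(i)} \mid z)\right]

% Reparameterization: draw noise, then transform it deterministically
z^{(i,l)} = g_\phi(\epsilon^{(l)}, x^{(i)}), \qquad \epsilon^{(l)} \sim p(\epsilon)

% Resulting Monte Carlo estimator, differentiable in both \theta and \phi
\tilde{\mathcal{L}}(\theta, \phi; x^{(i)})
  = -D_{KL}\!\left(q_\phi(z \mid x^{(i)}) \,\|\, p_\theta(z)\right)
    + \frac{1}{L} \sum_{l=1}^{L} \log p_\theta(x^{(i)} \mid z^{(i,l)})
```

Because the randomness enters only through ε, the estimator can be differentiated with respect to φ and optimized with standard stochastic gradient methods.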

variational inference · generative models · reparameterization trick · latent variables · deep learning

✦ Research context

What this agent contributes to the literature.

Problem solved

Prior deep generative models lacked scalable posterior inference: the true posterior p(z|x) is intractable and per-datapoint optimization was required. The VAE amortizes inference across the dataset using a shared encoder network, making training on large corpora (e.g., image datasets) tractable.
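A minimal NumPy sketch of what amortization means in practice: one set of shared encoder weights maps every datapoint to its own approximate posterior parameters in a single forward pass, rather than optimizing separate parameters per datapoint. The linear encoder, the function names (`encode`, `kl_to_standard_normal`), and all dimensions are assumptions made for illustration; the paper itself uses MLP encoders, and this is not the code behind this record.

```python
import numpy as np

def encode(x, W_mu, b_mu, W_logvar, b_logvar):
    """Hypothetical linear recognition model: maps a batch x of shape (n, d_x)
    to the mean and log-variance of a diagonal Gaussian q(z|x), shape (n, d_z)."""
    mu = x @ W_mu + b_mu
    logvar = x @ W_logvar + b_logvar
    return mu, logvar

def kl_to_standard_normal(mu, logvar):
    """Closed-form KL(N(mu, diag(exp(logvar))) || N(0, I)), one value per datapoint."""
    return 0.5 * np.sum(np.exp(logvar) + mu ** 2 - 1.0 - logvar, axis=-1)

rng = np.random.default_rng(0)
n, d_x, d_z = 8, 5, 2                      # illustrative sizes, not from the source
x = rng.standard_normal((n, d_x))

# The same weights are shared by all n datapoints (this is the amortization).
W_mu = 0.1 * rng.standard_normal((d_x, d_z))
b_mu = np.zeros(d_z)
W_logvar = 0.1 * rng.standard_normal((d_x, d_z))
b_logvar = np.zeros(d_z)

mu, logvar = encode(x, W_mu, b_mu, W_logvar, b_logvar)
kl = kl_to_standard_normal(mu, logvar)
print(mu.shape, logvar.shape, kl.shape)
```

The number of encoder parameters is fixed by `d_x` and `d_z` and does not grow with the dataset size `n`, which is what makes this approach workable on large corpora.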

Novelty

Kingma and Welling introduce the reparameterization trick for the evidence lower bound (ELBO), enabling backpropagation through stochastic latent nodes without high-variance REINFORCE-style estimators. The VAE architecture pairs a learned recognition model (encoder) with a generative model (decoder), both trained end-to-end via mini-batch SGD on the ELBO, making amortized variational inference practical for large datasets for the first time.
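The variance argument can be illustrated on a toy problem. The sketch below estimates the gradient of E[f(z)] with respect to mu for z ~ N(mu, sigma^2), once with the reparameterized (pathwise) estimator and once with a REINFORCE-style score-function estimator. The test function f(z) = z^2, the parameter values, and the function names are assumptions chosen so that the exact gradient (2·mu) is known; this is not the VAE objective itself.

```python
import numpy as np

def pathwise_grad_mu(mu, sigma, eps):
    """Reparameterized per-sample gradients of f(z) = z**2 with respect to mu."""
    z = mu + sigma * eps          # z is a deterministic function of (mu, sigma, eps)
    return 2.0 * z                # df/dz * dz/dmu, where dz/dmu = 1

def score_function_grad_mu(mu, sigma, eps):
    """REINFORCE-style per-sample gradients: f(z) * d/dmu log N(z; mu, sigma^2)."""
    z = mu + sigma * eps
    return (z ** 2) * (z - mu) / sigma ** 2

rng = np.random.default_rng(0)
mu, sigma = 1.5, 1.0              # illustrative values; exact gradient is 2 * mu
eps = rng.standard_normal(100_000)

pw = pathwise_grad_mu(mu, sigma, eps)
sf = score_function_grad_mu(mu, sigma, eps)

print("pathwise:       mean", pw.mean(), "variance", pw.var())
print("score function: mean", sf.mean(), "variance", sf.var())
```

Both estimators are unbiased for the same gradient, but in this toy setting the score-function samples have substantially larger variance, which is the practical motivation for reparameterizing.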

Canvas contract: 1-in / 1-out · unpacked into observations, model_params legacy ports

Image digest

sha256:d77f2d6ab8ddf7307038fe2125b162df4e138cf02deb5091505312c43d0adc60

Invoke command

python main.py

Inputs

input:application/json

Outputs

output:application/json
