Autoencoder - Reducing the Dimensionality of Data with Neural Networks

Last update: Nov 17, 2022

autoencoder

Implementation of the paper "Reducing the Dimensionality of Data with Neural Networks" by G. E. Hinton and R. R. Salakhutdinov.

Notes

The autoencoder is trained with backpropagation to minimise the squared reconstruction error.

If the neuron activation functions are linear, the network learns a compact encoding in the hidden units that is a linear function of the data. This gives the same reconstruction as PCA (though training is probably less efficient). The M hidden units span the same space as the first M principal components, but their weight vectors may not be orthogonal.
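The relationship to PCA can be checked numerically. The following is a minimal sketch, not part of the original implementation: it builds a linear encoder/decoder pair whose hidden units are an invertible mix of the first M principal components (the mixing matrix `A` and the toy data are assumptions for illustration) and shows that the squared reconstruction error matches PCA even though the encoder weight vectors are not orthogonal.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: 200 cases in 5 dimensions, mean-centred (an assumption of this sketch).
X = rng.normal(size=(200, 5)) @ rng.normal(size=(5, 5))
X -= X.mean(axis=0)

M = 2
# First M principal directions are the top right singular vectors of X.
_, _, Vt = np.linalg.svd(X, full_matrices=False)
P = Vt[:M].T                               # shape (5, M), orthonormal columns

# A linear "autoencoder" whose hidden units mix the PCA components.
A = np.array([[1.0, 0.5], [0.0, 2.0]])     # hypothetical invertible mixing matrix
W_enc = P @ A                              # data -> M hidden units
W_dec = np.linalg.inv(A) @ P.T             # M hidden units -> reconstruction

def squared_error(X, X_hat):
    """Summed squared reconstruction error over all cases."""
    return float(np.sum((X - X_hat) ** 2))

err_pca = squared_error(X, X @ P @ P.T)
err_ae = squared_error(X, X @ W_enc @ W_dec)

# Gram matrix of the encoder weight vectors; non-zero off-diagonals
# mean the hidden units are not orthogonal.
gram = W_enc.T @ W_enc
print(err_pca, err_ae)
print(gram)
```

Both errors agree because the hidden units span the same subspace as the principal components; only the basis inside that subspace differs.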

If the activation functions are non-linear, the network can represent data that lies on a non-linear manifold, which makes it more powerful than PCA.

Deep Autoencoders

Learning time is linear (or better) in the number of training cases.

MNIST Digits

Network

Encoding: 784 (pixels) -> 1000 -> 500 -> 250 -> 30 linear units [central code layer]

Decoding: 30 linear units -> 250 -> 500 -> 1000 -> 784 pixels [reconstruction]
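The layer sizes above can be sketched as a plain NumPy forward pass. This is an illustrative sketch only: the random initialisation, the helper names (`init_params`, `encode`, `decode`) and the batch of random "images" are assumptions, and the weights here are untrained.

```python
import numpy as np

LAYER_SIZES = [784, 1000, 500, 250, 30]    # encoder sizes from the notes

def logistic(z):
    return 1.0 / (1.0 + np.exp(-z))

def init_params(sizes, rng, scale=0.1):
    """Random (weight, bias) pairs for consecutive layer sizes (placeholder init)."""
    return [(scale * rng.standard_normal((n_in, n_out)), np.zeros(n_out))
            for n_in, n_out in zip(sizes[:-1], sizes[1:])]

def encode(x, enc_params):
    """Logistic hidden layers, with a linear 30-unit code layer at the end."""
    h = x
    for i, (W, b) in enumerate(enc_params):
        z = h @ W + b
        h = z if i == len(enc_params) - 1 else logistic(z)
    return h

def decode(code, dec_params):
    """Logistic layers all the way back up to the 784 pixels."""
    h = code
    for W, b in dec_params:
        h = logistic(h @ W + b)
    return h

rng = np.random.default_rng(0)
enc = init_params(LAYER_SIZES, rng)
dec = init_params(LAYER_SIZES[::-1], rng)

x = rng.random((4, 784))                   # stand-in for 4 MNIST images in [0, 1]
code = encode(x, enc)
recon = decode(code, dec)
print(code.shape, recon.shape)
```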

The network is first pre-trained as a stack of RBMs, down to the 30 hidden units of the code layer. The transposes of those weights are used for decoding. The whole network is then fine-tuned with backpropagation on a cross-entropy error.

MNIST digits are continuous-valued in [0, 1] and very non-Gaussian. The RBM units are pre-trained as follows:

First layer (784 <-> 1000 units):

hidden units are binary (Bernoulli, with activation probability given by the logistic function).
visible units are linear with Gaussian noise.

Second layer (1000 <-> 500 units) and third layer (500 <-> 250):

hidden units are binary (Bernoulli, with the probability of being 1 given by the logistic function).
visible units are continuous (the activation probabilities of the hidden units in the preceding layer).
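The notes do not name the learning rule used for pre-training. As a minimal sketch, assuming one step of contrastive divergence (CD-1), an update for an RBM with binary hidden units and real-valued visible units in [0, 1] might look like the following. The function name `cd1_step`, the learning rate and the toy layer sizes are all hypothetical.

```python
import numpy as np

def logistic(z):
    return 1.0 / (1.0 + np.exp(-z))

def cd1_step(v0, W, b_vis, b_hid, rng, lr=0.1):
    """One CD-1 update; visibles are probabilities in [0, 1], hiddens are binary."""
    # Positive phase: hidden probabilities, then a stochastic binary sample.
    p_h0 = logistic(v0 @ W + b_hid)
    h0 = (rng.random(p_h0.shape) < p_h0).astype(v0.dtype)
    # Negative phase: reconstruct the visibles as probabilities,
    # then recompute the hidden probabilities from the reconstruction.
    p_v1 = logistic(h0 @ W.T + b_vis)
    p_h1 = logistic(p_v1 @ W + b_hid)
    n = v0.shape[0]
    W = W + lr * (v0.T @ p_h0 - p_v1.T @ p_h1) / n
    b_vis = b_vis + lr * (v0 - p_v1).mean(axis=0)
    b_hid = b_hid + lr * (p_h0 - p_h1).mean(axis=0)
    return W, b_vis, b_hid, p_h0

rng = np.random.default_rng(0)
n_vis, n_hid = 20, 10                      # toy sizes, not the real 1000 <-> 500 layer
W0 = 0.01 * rng.standard_normal((n_vis, n_hid))
v0 = rng.random((8, n_vis))                # stand-in for probabilities from the layer below
W1, b_vis, b_hid, p_h = cd1_step(v0, W0, np.zeros(n_vis), np.zeros(n_hid), rng)
print(W1.shape, p_h.shape)
```

The returned hidden probabilities `p_h` are what the next RBM in the stack would receive as its visible data.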

Top layer (250 <-> 30):

hidden units have stochastic real-valued states drawn from a unit-variance Gaussian whose mean is determined by the input from that RBM's logistic visible units.
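Sampling the top-layer hidden states as described can be sketched in a few lines. The function name `sample_gaussian_hidden` and the random weights are assumptions for illustration; only the 250 and 30 unit counts come from the notes.

```python
import numpy as np

def sample_gaussian_hidden(v, W, b_hid, rng):
    """Real-valued hidden states: mean set by the visible input, unit variance."""
    mean = v @ W + b_hid
    return mean + rng.standard_normal(mean.shape), mean

rng = np.random.default_rng(0)
v = rng.random((1000, 250))                # stand-in for logistic visible activities
W = 0.01 * rng.standard_normal((250, 30))  # placeholder weights
h, mean = sample_gaussian_hidden(v, W, np.zeros(30), rng)
print(h.shape, (h - mean).var())
```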

For fine-tuning, the model is "unfolded" and the stochastic activities are replaced by deterministic real-valued probabilities. The top (code) layer is linear; every other layer is logistic.
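A rough sketch of the unfolding step, under stated assumptions: the helper names (`unfold`, `reconstruct`, `cross_entropy`), the toy layer sizes and the random stand-ins for pre-trained RBM weights are hypothetical. The decoder reuses the transposes of the encoder weights, every pass is deterministic, and the error is the cross-entropy between pixels and reconstruction.

```python
import numpy as np

def logistic(z):
    return 1.0 / (1.0 + np.exp(-z))

def unfold(rbm_weights, hid_biases, vis_biases):
    """Build encoder and decoder parameter lists from a stack of RBMs."""
    encoder = list(zip(rbm_weights, hid_biases))
    # Decoder runs the stack in reverse, using transposed weights and visible biases.
    decoder = [(W.T, b) for W, b in zip(reversed(rbm_weights), reversed(vis_biases))]
    return encoder, decoder

def reconstruct(x, encoder, decoder):
    """Deterministic pass: logistic everywhere except the linear code layer."""
    h = x
    for i, (W, b) in enumerate(encoder):
        z = h @ W + b
        h = z if i == len(encoder) - 1 else logistic(z)
    for W, b in decoder:
        h = logistic(h @ W + b)
    return h

def cross_entropy(x, x_hat, eps=1e-12):
    """Mean per-case cross-entropy between targets in [0, 1] and reconstructions."""
    x_hat = np.clip(x_hat, eps, 1 - eps)
    return float(-np.mean(np.sum(x * np.log(x_hat) + (1 - x) * np.log(1 - x_hat), axis=1)))

rng = np.random.default_rng(0)
sizes = [12, 8, 4]                         # toy sizes, not 784-1000-500-250-30
rbm_weights = [0.1 * rng.standard_normal((a, b)) for a, b in zip(sizes[:-1], sizes[1:])]
hid_biases = [np.zeros(b) for b in sizes[1:]]
vis_biases = [np.zeros(a) for a in sizes[:-1]]

encoder, decoder = unfold(rbm_weights, hid_biases, vis_biases)
x = rng.random((5, sizes[0]))
x_hat = reconstruct(x, encoder, decoder)
loss = cross_entropy(x, x_hat)
print(x_hat.shape, loss)
```

Fine-tuning would then backpropagate the gradient of this loss through all encoder and decoder weights, which are no longer tied once training starts.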

Owner

Jordan Burgess
