PyTorch implementation of the supervised learning experiments from the paper Model-Agnostic Meta-Learning (MAML)

Last update: Jan 05, 2023

Related tags

Overview

pytorch-maml

This is a PyTorch implementation of the supervised learning experiments from the paper Model-Agnostic Meta-Learning (MAML): https://arxiv.org/abs/1703.03400

Important: You will need the latest version of PyTorch, v.0.2.0 to run this code (otherwise you will get errors about double backwards not being supported).

Currently, only the Omniglot experiments have been replicated here. The hyper-parameters are the same as those used in the original Tensorflow implementation, except that only 1 random seed is used here.

5-way 1-shot training, best performance 98.9%

20-way 1-shot training, best performance 92%

Note: the 20-way performance is slightly lower than that reported in the paper (they report 95.8%). If you can see why this might be, please let me know. Also in this experiment, we can see evidence of overfitting to the meta-training set.

The 5-way results are achieved by simply meta-testing the network trained on the 1-shot task on the 5-shot task (e.g. for the 5-way 5-shot result, test the 5-way 1-shot trained network with 5-shots). Again the 20-way result is lower here than reported in the paper.

This repo also contains code for running maml experiments on permuted MNIST (tasks are created by shuffling the labels). This is a nice sanity check task.

license

This software is distributed under the MIT license.

to-do

port to pytorch 0.4 from 0.2 and python 3 from 2
investigate performance difference from TF version
add first-order version

PyTorch implementation of the supervised learning experiments from the paper Model-Agnostic Meta-Learning (MAML)

Related tags

Overview

pytorch-maml

license

to-do

Owner

Kate Rakelly

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

Implementation of SegNet: A Deep Convolutional Encoder-Decoder Architecture for Semantic Pixel-Wise Labelling

The Ludii general game system, developed as part of the ERC-funded Digital Ludeme Project.

I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)

This is the code used in the paper "Entity Embeddings of Categorical Variables".

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

Code for ICML 2021 paper: How could Neural Networks understand Programs?

Implementation supporting the ICCV 2017 paper "GANs for Biological Image Synthesis"

SFD implement with pytorch

A new version of the CIDACS-RL linkage tool suitable to a cluster computing environment.

Architecture Patterns with Python (TDD, DDD, EDM)

Code for the paper One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation, CVPR 2021.

[AAAI 2021] EMLight: Lighting Estimation via Spherical Distribution Approximation and [ICCV 2021] Sparse Needlets for Lighting Estimation with Spherical Transport Loss

House-GAN++: Generative Adversarial Layout Refinement Network towards Intelligent Computational Agent for Professional Architects

A PyTorch implementation of Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks

Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021

Process JSON files for neural recording sessions using Medtronic's BrainSense Percept PC neurostimulator

Official implementation of "A Unified Objective for Novel Class Discovery", ICCV2021 (Oral)

QuALITY: Question Answering with Long Input Texts, Yes!