A simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model

Jun 22, 2022 1 min read

EMA – Pytorch

import torch
from ema_pytorch import EMA

# your neural network as a pytorch module

network = torch.nn.Linear(512, 512)

# wrap your neural network, specify the decay (beta)

ema = EMA(
    network,
    beta = 0.9999,              # exponential moving average factor
    update_after_step = 100,    # only after this number of .update() calls will it start updating
    update_every = 10,          # how often to actually update, to save on compute (updates every 10th .update() call)
)

# mutate your network, with SGD or otherwise

with torch.no_grad():
    net.weight.copy_(torch.randn_like(net.weight))
    net.bias.copy_(torch.randn_like(net.bias))

# you will call the update function on your moving average wrapper

ema.update()

# then, later on, you can invoke the EMA model the same way as your network

data = torch.randn(1, 512)

output     = net(data)
ema_output = ema(data)

# if you want to save your ema model, it is recommended you save the entire wrapper
# as it contains the number of steps taken (there is a warmup logic in there, recommended by @crowsonkb, validated for a number of projects now)
# however, if you wish to access the copy of your model with EMA, then it will live at ema.ema_model

Todo

address the issue of annealing EMA to 1 near the end of training for BYOL lucidrains/byol-pytorch#82

GitHub

View Github

PyTorch

John was the first writer to have joined pythonawesome.com. He has since then inculcated very effective writing and reviewing culture at pythonawesome which rivals have found impossible to imitate.

PyTorch

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning

TorchRL Disclaimer This library is not officially released yet and is subject to change. The features are available before an official release so that users and collaborators can get early access and provide

17 January 2023

PyTorch

A PyTorch Implementation of i-MAE: Linearly Separable Representation in MAE

17 January 2023

PyTorch

Official ECCV 2022 repository for SUPR: A Sparse Unified Part-Based Human Representation

SUPR: A Sparse Unified Part-Based Human Representation (ECCV 2022) TLDR: We release a full suite of state of the art models ( 18 models ): A body model, hand model, head model and a foot

16 January 2023

PyTorch

Yet another PyTorch implementation of Stable Diffusion

stable-diffusion-pytorch Yet another PyTorch implementation of Stable Diffusion. I tried my best to make the codebase minimal, self-contained, consistent, hackable, and easy to read. Features are pruned if not needed in Stable Diffusion

16 January 2023

PyTorch

A parallel ODE solver for PyTorch

A Parallel ODE Solver for PyTorch torchode is a suite of single-step ODE solvers such as dopri5 or tsit5 that are compatible with PyTorch’s JIT compiler and parallelized across a batch.

16 January 2023

PyTorch

PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models

16 January 2023

PyTorch

Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild

TruSThresh Official PyTorch implementation of “Reliable Deicision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild“ (WSDM’23). Installation pip install -e ./ Quick Example import numpy as np

16 January 2023

PyTorch

The official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP

13 January 2023

A simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model

EMA – Pytorch

Todo

GitHub

John

RAW (RISC-V Archbuild Wrapper)

Local Transformer With Spatial Partition Restore for Hyperspectral Image Classification

EMA – Pytorch

Todo

GitHub

RAW (RISC-V Archbuild Wrapper)

Local Transformer With Spatial Partition Restore for Hyperspectral Image Classification

You might also like...