Pytorch implementation of LOLA using DiCE

Jul 23, 2021 1 min read

LOLA_DiCE

Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)

Quick results:

Results on IPD using DiCE

[lr_in=0.3, lr_out=0.2, lr_v=0.1, batch_size=128, len_rollout=150, use_baseline=True]
ipd_dice

Results on IPD using DiCE and opponent modelling

[lr_in=0.3, lr_out=0.2, lr_v=0.1, batch_size=128, len_rollout=150, use_baseline=True]
ipd_dice_om
(It seems that 2 lookaheads is the most stable model with this set of hyperparameters)

Results on IPD using exact gradients

[lr_in=0.3, lr_out=0.2, batch_size=128, len_rollout=150]
ipd_exact

Results on IPD using exact gradients and opponent modelling

[lr_in=0.3, lr_out=0.2, batch_size=128, len_rollout=150]
ipd_exact_om

Authors version:

The authors of the paper have their own version (Tensorflow) available here: https://github.com/alshedivat/lola

GitHub

https://github.com/alexis-jacq/LOLA_DiCE

PyTorch

John was the first writer to have joined pythonawesome.com. He has since then inculcated very effective writing and reviewing culture at pythonawesome which rivals have found impossible to imitate.

PyTorch

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning

TorchRL Disclaimer This library is not officially released yet and is subject to change. The features are available before an official release so that users and collaborators can get early access and provide

17 January 2023

PyTorch

A PyTorch Implementation of i-MAE: Linearly Separable Representation in MAE

17 January 2023

PyTorch

Official ECCV 2022 repository for SUPR: A Sparse Unified Part-Based Human Representation

SUPR: A Sparse Unified Part-Based Human Representation (ECCV 2022) TLDR: We release a full suite of state of the art models ( 18 models ): A body model, hand model, head model and a foot

16 January 2023

PyTorch

Yet another PyTorch implementation of Stable Diffusion

stable-diffusion-pytorch Yet another PyTorch implementation of Stable Diffusion. I tried my best to make the codebase minimal, self-contained, consistent, hackable, and easy to read. Features are pruned if not needed in Stable Diffusion

16 January 2023

PyTorch

A parallel ODE solver for PyTorch

A Parallel ODE Solver for PyTorch torchode is a suite of single-step ODE solvers such as dopri5 or tsit5 that are compatible with PyTorch’s JIT compiler and parallelized across a batch.

16 January 2023

PyTorch

PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models

16 January 2023

PyTorch

Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild

TruSThresh Official PyTorch implementation of “Reliable Deicision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild“ (WSDM’23). Installation pip install -e ./ Quick Example import numpy as np

16 January 2023

PyTorch

The official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP

13 January 2023

Pytorch implementation of LOLA using DiCE

LOLA_DiCE

Quick results:

Results on IPD using DiCE

Results on IPD using DiCE and opponent modelling

Results on IPD using exact gradients

Results on IPD using exact gradients and opponent modelling

Authors version:

GitHub

John

Password spraying and bruteforcing tool for Active Directory Domain Services

Advanced Deep Learning with TensorFlow 2 and Keras

LOLA_DiCE

Quick results:

Results on IPD using DiCE

Results on IPD using DiCE and opponent modelling

Results on IPD using exact gradients

Results on IPD using exact gradients and opponent modelling

Authors version:

GitHub

Password spraying and bruteforcing tool for Active Directory Domain Services

Advanced Deep Learning with TensorFlow 2 and Keras

You might also like...