A PyTorch implementation of Dueling DQN with action branching architectures

Mar 17, 2022 1 min read

This is a PyTorch implementation of Dueling DQN with action branching architectures for Gym environment with Discrete and Box action space.

Action Branching Architectures

Reference paper: https://arxiv.org/abs/1711.08946

The branching architecture is summarized as following.

For action of Box class, certain number of sub-actions need to be sampled from each continuous action dimension by setting parameter sub_act_dim, which could either be an integer or a list/tuple in the size of action dimension indication number of sub-actions for each action dimension. Given sub_act_dim, actions including lowest and highest possible value are equally sample from each action dimension as following.

Train

Requirements

python=3.8.11

torch=1.9.0

gym=0.19.0

tensorboard=2.8.0

Start Training

Simply run python dqn.py.

Results

CartPole-v1 (discrete action) and MountainCarContinuous-v0 (continuous action) of Gym environment are tested, episode return are show in the following respectively. After around 1.6k and 120 episodes for each case, the agent start to gain steady rewards.

GitHub

View Github

PyTorch

John was the first writer to have joined pythonawesome.com. He has since then inculcated very effective writing and reviewing culture at pythonawesome which rivals have found impossible to imitate.

PyTorch

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning

TorchRL Disclaimer This library is not officially released yet and is subject to change. The features are available before an official release so that users and collaborators can get early access and provide

17 January 2023

PyTorch

A PyTorch Implementation of i-MAE: Linearly Separable Representation in MAE

17 January 2023

PyTorch

Official ECCV 2022 repository for SUPR: A Sparse Unified Part-Based Human Representation

SUPR: A Sparse Unified Part-Based Human Representation (ECCV 2022) TLDR: We release a full suite of state of the art models ( 18 models ): A body model, hand model, head model and a foot

16 January 2023

PyTorch

Yet another PyTorch implementation of Stable Diffusion

stable-diffusion-pytorch Yet another PyTorch implementation of Stable Diffusion. I tried my best to make the codebase minimal, self-contained, consistent, hackable, and easy to read. Features are pruned if not needed in Stable Diffusion

16 January 2023

PyTorch

A parallel ODE solver for PyTorch

A Parallel ODE Solver for PyTorch torchode is a suite of single-step ODE solvers such as dopri5 or tsit5 that are compatible with PyTorch’s JIT compiler and parallelized across a batch.

16 January 2023

PyTorch

PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models

16 January 2023

PyTorch

Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild

TruSThresh Official PyTorch implementation of “Reliable Deicision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild“ (WSDM’23). Installation pip install -e ./ Quick Example import numpy as np

16 January 2023

PyTorch

The official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP

13 January 2023

A PyTorch implementation of Dueling DQN with action branching architectures

Action Branching Architectures

Train

Requirements

Start Training

Results

GitHub

John

An IoT Integrated Fully Automatic WIreless PHIshing System

A Telegram Bin Checker Bot made with python for check Bin valid or Invalid

Action Branching Architectures

Train

Requirements

Start Training

Results

GitHub

An IoT Integrated Fully Automatic WIreless PHIshing System

A Telegram Bin Checker Bot made with python for check Bin valid or Invalid

You might also like...