PyTorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

Mar 31, 2022

Make-A-Scene – PyTorch

PyTorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors (https://arxiv.org/pdf/2203.13131.pdf)

Figure 1 from the paper

Note: this is a work in progress. Contributions are welcome.

Paper Description:

Make-A-Scene modifies the VQGAN framework. It relies heavily on semantic segmentation maps as extra conditioning, which gives more control over the generation process. It also conditions on text. The main improvements are the following:

Segmentation conditioning: a separate VQ-VAE (VQ-SEG) is trained, with the loss changed to a weighted binary cross-entropy (3.4).
VQGAN training (VQ-IMG) is extended with a face loss and an object loss (3.3 and 3.5).
Classifier-free guidance for the autoregressive transformer (3.7).
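Two of the items above reduce to short formulas, sketched here in plain Python so the arithmetic is easy to follow. This is an illustration, not code from this repository: the function names `weighted_bce` and `guided_logits` are hypothetical, the per-pixel weights are made-up values (this README does not say how they are chosen), and the guidance step assumes the classifier-free formulation, in which logits computed with and without the text condition are blended. A PyTorch version would do the same arithmetic on tensors.

```python
import math


def weighted_bce(probs, targets, weights, eps=1e-7):
    """Mean binary cross-entropy with one weight per element.

    probs   -- predicted probabilities in [0, 1]
    targets -- ground-truth labels, 0 or 1
    weights -- per-element weights (hypothetical; larger values
               penalise mistakes on those elements more)
    """
    total = 0.0
    for p, t, w in zip(probs, targets, weights):
        # Clamp to avoid log(0) for saturated predictions.
        p = min(max(p, eps), 1.0 - eps)
        total += -w * (t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
    return total / len(probs)


def guided_logits(cond_logits, uncond_logits, scale):
    """Blend conditional and unconditional logits.

    scale = 0 returns the unconditional logits, scale = 1 returns the
    conditional logits, and scale > 1 pushes further toward the condition.
    """
    return [u + scale * (c - u) for c, u in zip(cond_logits, uncond_logits)]


# Example: the first element counts twice as much as the second.
loss = weighted_bce(probs=[0.9, 0.2], targets=[1, 0], weights=[2.0, 1.0])

# Example: next-token logits over a toy vocabulary of three tokens.
logits = guided_logits([2.0, 0.0, -1.0], [1.0, 1.0, 0.0], scale=3.0)
```

With all weights set to 1, `weighted_bce` is the ordinary binary cross-entropy, so the weights only change how much each element contributes to the mean.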

Training Pipeline

Figure 6 from the paper

What needs to be done?

See the individual folders for details.

Citation

@misc{https://doi.org/10.48550/arxiv.2203.13131,
  doi = {10.48550/ARXIV.2203.13131},
  url = {https://arxiv.org/abs/2203.13131},
  author = {Gafni, Oran and Polyak, Adam and Ashual, Oron and Sheynin, Shelly and Parikh, Devi and Taigman, Yaniv},
  title = {Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors},
  publisher = {arXiv},
  year = {2022},
  copyright = {arXiv.org perpetual, non-exclusive license}
}

