A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets

Sep 29, 2021 1 min read

multitask-learning-transformers

A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approaches for achieving this in this repo.

Colab Notebook

Trained Huggingface Model

HF Model

Install depedencies

pip install -r requirements.txt

Run training

python3 main.py \
        --model_name_or_path='roberta-base' \
        --per_device_train_batch_size=8 \
        --output_dir=output --num_train_epochs=1

Single Encoder Multiple Output Heads

A multi-task model in the age of BERT works by having a shared BERT-style encoder transformer, and different task heads for each task.

Shared Encoder

Separate models for each task, but we make them share the same encoder.

References: Multi-task Training with Transformers+NLP

GitHub

https://github.com/shahrukhx01/multitask-learning-transformers

John was the first writer to have joined pythonawesome.com. He has since then inculcated very effective writing and reviewing culture at pythonawesome which rivals have found impossible to imitate.

Transformer

Minimal implementaion of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym

14 February 2022

PyTorch

PyTorch implementation of Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

07 December 2021

Task

Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks

28 November 2021

Transformer

Flower classification model that classifies flowers in 10 classes made using transfer learning

20 November 2021

Transformer

Codebase for training transformers on systematic generalization datasets

The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We significantly improve the systematic generalization of transformer models on a variety of datasets using simple tricks and careful considerations.

02 September 2021

Transformer

A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets

multitask-learning-transformers

Colab Notebook

Trained Huggingface Model

Install depedencies

Run training

Single Encoder Multiple Output Heads

Shared Encoder

GitHub

John

A web app to scan crypto markets based on candlestick pattern recognition from

Python tool for dumping flash via uboot reliably

multitask-learning-transformers

Colab Notebook

Trained Huggingface Model

Install depedencies

Run training

Single Encoder Multiple Output Heads

Shared Encoder

GitHub

A web app to scan crypto markets based on candlestick pattern recognition from

Python tool for dumping flash via uboot reliably

You might also like...