PyTorch implementation of Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

Dec 07, 2021

Simple PyTorch Implementation of “Grokking”

An implementation of the paper “Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets”.

Usage

Running train.py with the default arguments runs my best attempt so far at reproducing the “Grokking” behavior on modular division shown in Figure 1 of the paper.

python train.py
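
For readers unfamiliar with the task, here is a minimal sketch of what a modular division dataset could look like. It is not taken from train.py: the function name modular_division_dataset, the modulus p = 97, the 50% training split, and the seed are all assumptions chosen for illustration. Each example is a triple (x, y, z) with z = x / y (mod p), where division means multiplying by the modular inverse of y.

```python
import random


def modular_division_dataset(p=97, train_fraction=0.5, seed=0):
    """Build every equation x / y = z (mod p) and split it into train/val.

    Hypothetical helper for illustration; p, train_fraction and seed are
    assumed values, not settings read from train.py.
    """
    examples = []
    for x in range(p):
        for y in range(1, p):  # y = 0 is skipped: it has no inverse mod p
            # Fermat's little theorem: y^(p-2) is the inverse of y when p is prime
            y_inv = pow(y, p - 2, p)
            examples.append((x, y, (x * y_inv) % p))

    # Shuffle deterministically, then cut into train and validation sets
    rng = random.Random(seed)
    rng.shuffle(examples)
    n_train = int(train_fraction * len(examples))
    return examples[:n_train], examples[n_train:]


train, val = modular_division_dataset()
```

With these assumed values the full table has 97 * 96 equations, so the training fraction decides how many of them the model sees and how many it must generalize to.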

The results seem highly sensitive to optimizer hyperparameter selection, and I have not yet tried all of the configurations outlined in the paper.

Citations

@inproceedings{power2021grokking,
  title={Grokking: Generalization beyond overfitting on small algorithmic datasets},
  author={Power, Alethea and Burda, Yuri and Edwards, Harri and Babuschkin, Igor and Misra, Vedant},
  booktitle={ICLR MATH-AI Workshop},
  year={2021}
}
