arc-pytorch

PyTorch implementation of Attentive Recurrent Comparators by Shyam et al.

A blog explaining Attentive Recurrent Comparators

Visualizing Attention

On Same characters

On Different Characters

How to run?

Download data

python download_data.py

A one-time 52MB download. Shouldn’t take more than a few minutes.

Train

python train.py --cuda

Let it train until the accuracy rises to at least 80%. Early stopping is not implemented yet. You will have to manually kill the process.

Visualize

python viz.py --cuda --load 0.13591022789478302 --same

Run with exactly the same parameters as train.py and specify the model to load. Specify “–same” if you want to generate a sample with same characters in both images. The script dumps images to a directory in visualization. The name of directory is taken from –name parameter if specified, else name is a function of the parameters of network.

GitHub

https://github.com/sanyam5/arc-pytorch