Transformer-OCR

Implementation of Transformer OCR as described at Scene Text Recognition via Transformer.

model architecture:

art

Results across a number of methods and datasets:

result

Heat map of the source attention (encoder memory) score of the first layer of decoder:

heatmap

GitHub