An implementation of Transformer OCR, as described in the paper "Scene Text Recognition via Transformer".
Results across a number of methods and datasets:
Heat map of the source-attention (encoder memory) scores from the first decoder layer:
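For readers unfamiliar with what such a heat map plots, the following is a minimal sketch of how source-attention scores are typically computed. It is not taken from this repository. It assumes a single attention head with no learned projections, and the function name `source_attention` and the toy inputs are hypothetical. Each row of the result is one decoder step, each column is one encoder memory position, and each row sums to 1.

```python
import math


def source_attention(queries, memory):
    """Scaled dot-product attention weights: softmax(Q K^T / sqrt(d)).

    queries: list of decoder-side vectors (one per output step).
    memory:  list of encoder-side vectors (one per input position).
    Returns a list of rows; row i holds the weights that decoder step i
    assigns to each encoder memory position.
    """
    d = len(memory[0])
    weights = []
    for q in queries:
        # Similarity of this decoder step to every memory position.
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
            for k in memory
        ]
        # Numerically stable softmax over memory positions.
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights


# Toy, made-up inputs: 3 encoder positions, 2 decoder steps, dimension 2.
memory = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
queries = [[2.0, 0.0], [0.0, 2.0]]
attn = source_attention(queries, memory)
```

The resulting matrix can be passed to any image-plotting routine to produce a heat map. In a real model the scores would come from the decoder's attention module rather than from raw vectors like these.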