OCR A turn-key OCR system optimized for historical and non-Latin script material kraken is a turn-key OCR system optimized for historical and non-Latin script material. 28 August 2021
OCR Some bits of javascript to transcribe scanned pages using PageXML Some bits of javascript to transcribe scanned pages using PageXML. Both ltr and rtl languages are supported. 28 August 2021
OCR Visual attention-based OCR model for image recognition with additional tools A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine. 28 August 2021
Django A Vietnamese personal card OCR website built with Django A Vietnamese personal card OCR website built with Django. 16 August 2021
OCR Multi-Aspect Non-local Network for Scene Text Recognition PyTorch reimplementation of "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021). 15 August 2021
OCR A Python wrapper for the tesseract-ocr API A simple, Pillow-friendly, wrapper around the tesseract-ocr API for Optical Character Recognition (OCR). 09 August 2021
Documentation Python-based tools for document analysis and OCR OCRopus is a collection of document analysis programs, not a turn-key OCR system. 06 August 2021
OCR Recognizes the number of people using OCR and kills the zoom Recognizes the number of people using OCR and kills the zoom 02 August 2021
PDF Adds an OCR text layer to scanned PDF files, allowing them to be searched OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted. 06 July 2021
PDF Detect text blocks and OCR poorly scanned PDFs in bulk Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip. 17 June 2021