Doc Transformers

Document processing using transformers. The package is still under development and currently supports only the extraction of form data, i.e. key-value pairs.

pip install -q doc-transformers

Pre-requisites

Please install the following separately:

sudo apt install tesseract-ocr
pip install -q detectron2 -f https://dl.fbaipublicfiles.com/detectron2/wheels/cu101/torch1.8/index.html
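
After installing, it can help to confirm that both prerequisites are visible from the Python environment you intend to use. The snippet below is a minimal sketch using only the standard library; the helper name check_prerequisites is hypothetical and not part of doc-transformers. It assumes the Tesseract binary is on PATH under the name tesseract and that detectron2 is importable under the module name detectron2.

```python
import importlib.util
import shutil


def check_prerequisites():
    """Report whether the two prerequisites are visible from this environment.

    Hypothetical helper, not part of doc-transformers.
    """
    return {
        # Assumption: the tesseract-ocr package puts a binary named "tesseract" on PATH.
        "tesseract": shutil.which("tesseract") is not None,
        # Assumption: detectron2 is importable under the module name "detectron2".
        "detectron2": importlib.util.find_spec("detectron2") is not None,
    }


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print(f"{name}: {'found' if found else 'missing'}")
```

If either entry is reported as missing, rerun the matching install command above before importing doc_transformers.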

Implementation

# the import also loads the pretrained model
from doc_transformers import form_parser

# loads the image
image = form_parser.load_image(input_path_image)

# returns the bounding boxes, the predictions and the processed image
bbox, preds, image = form_parser.process_image(image)

# returns the annotated image as the output
im = form_parser.visualize_image(bbox, preds, image)
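
The three calls above can be bundled into a single helper when several images need to be processed the same way. This is a sketch, not part of the package: the function name parse_form and its parser argument are hypothetical, and the only calls it makes are the three shown above (load_image, process_image, visualize_image). Passing the parser in as an argument also lets the flow be tried with a stand-in object when the model is not installed.

```python
def parse_form(parser, input_path_image):
    """Run the three documented steps in order.

    Hypothetical helper: `parser` is any object exposing load_image,
    process_image and visualize_image, such as doc_transformers.form_parser.
    """
    # Step 1: load the image from disk.
    image = parser.load_image(input_path_image)
    # Step 2: get the bounding boxes, predictions and processed image.
    bbox, preds, image = parser.process_image(image)
    # Step 3: get the output image.
    annotated = parser.visualize_image(bbox, preds, image)
    return bbox, preds, annotated


# Example usage (the file name is a placeholder):
#   from doc_transformers import form_parser
#   bbox, preds, im = parse_form(form_parser, "form.png")
```

The helper returns the boxes and predictions alongside the output image, so callers can inspect the raw results without running the model a second time.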

Results

Input

input image

Output

output image
  • Please note that the package is still in the development phase and will be improved in the near future.

GitHub

https://github.com/Vishnunkumar/doc_transformers