A plugin for ocrmypdf that uses Google Vision as the OCR engine

kkrell2016/ocrmypdf_plugin_GoogleVision

ocrmypdf_plugin_GoogleVision

A very minimalistic ocrmypdf plugin that uses Google Vision as the OCR engine. Tesseract is still used to detect page rotation, so pages whose rotation Tesseract cannot determine correctly may cause problems. This doesn't matter for my particular use case, but it might for yours.

A Google Cloud key is also needed. It must be saved as cloudkey.json in the same directory as the plugin files. See https://cloud.google.com/vision/docs/before-you-begin for how to obtain one.
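Before running the plugin, it can help to confirm that the key file is where it is expected. The following is a minimal sketch, not part of the plugin: `check_cloudkey` and `make_client` are hypothetical helper names, and the field list assumes the key is a Google Cloud service account JSON file.

```python
import json
from pathlib import Path

# Assumption: cloudkey.json is a service account key file. These are the
# fields such files normally contain; adjust if your key type differs.
REQUIRED_FIELDS = ("type", "project_id", "private_key", "client_email")


def check_cloudkey(directory="."):
    """Return the path to cloudkey.json, or raise if it looks unusable."""
    key_path = Path(directory) / "cloudkey.json"
    if not key_path.is_file():
        raise FileNotFoundError(f"{key_path} not found")
    data = json.loads(key_path.read_text(encoding="utf-8"))
    missing = [name for name in REQUIRED_FIELDS if name not in data]
    if missing:
        raise ValueError(f"{key_path} is missing fields: {', '.join(missing)}")
    return key_path


def make_client(key_path):
    """Create a Vision client from the key file (needs google-cloud-vision)."""
    # Imported lazily so check_cloudkey works without the package installed.
    from google.cloud import vision
    return vision.ImageAnnotatorClient.from_service_account_file(str(key_path))
```

Calling `check_cloudkey()` from the plugin directory raises a descriptive error if the file is missing or incomplete, which is easier to diagnose than a failure in the middle of an OCR run.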

Some code is borrowed from https://github.com/dinosauria123/gcv2hocr. Thanks a lot for your work.
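For readers unfamiliar with that kind of conversion, here is a rough sketch of turning one Google Vision text annotation into an hOCR word element. It is an illustration only, not the code used by this plugin or by gcv2hocr. The function name `word_to_hocr` is hypothetical, and the input is assumed to follow the JSON shape of a Vision `textAnnotations` entry (`description` plus `boundingPoly.vertices`).

```python
from html import escape


def word_to_hocr(annotation, word_id):
    """Render one Vision text annotation as an hOCR ocrx_word span."""
    vertices = annotation["boundingPoly"]["vertices"]
    # Vision JSON may omit a coordinate that is zero, so default to 0.
    xs = [v.get("x", 0) for v in vertices]
    ys = [v.get("y", 0) for v in vertices]
    # hOCR expects an axis-aligned box: left, top, right, bottom.
    bbox = f"bbox {min(xs)} {min(ys)} {max(xs)} {max(ys)}"
    text = escape(annotation["description"])
    return f"<span class='ocrx_word' id='word_{word_id}' title='{bbox}'>{text}</span>"


example = {
    "description": "Hello",
    "boundingPoly": {
        "vertices": [
            {"x": 10, "y": 20},
            {"x": 110, "y": 20},
            {"x": 110, "y": 50},
            {"x": 10, "y": 50},
        ]
    },
}
print(word_to_hocr(example, 1))
# <span class='ocrx_word' id='word_1' title='bbox 10 20 110 50'>Hello</span>
```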

  1. Copy all files to a directory.
  2. Get the cloud key and save it as cloudkey.json.
  3. Run pip3 install google-cloud-vision.
  4. Call ocrmypdf from the current directory with --plugin gvision.py.
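The last step can also be scripted. The sketch below is one possible wrapper, assuming `ocrmypdf` is on the PATH and that `plugin_dir` is the directory containing gvision.py and cloudkey.json. The helper names `build_command` and `run_ocr` and the file names in the usage line are hypothetical.

```python
import subprocess
from pathlib import Path


def build_command(input_pdf, output_pdf, plugin="gvision.py"):
    """Build the ocrmypdf command line that loads the plugin."""
    return ["ocrmypdf", "--plugin", plugin, str(input_pdf), str(output_pdf)]


def run_ocr(plugin_dir, input_pdf, output_pdf):
    """Run ocrmypdf with the plugin directory as the working directory."""
    # Resolve the PDF paths first, because the working directory changes.
    cmd = build_command(Path(input_pdf).resolve(), Path(output_pdf).resolve())
    # check=True raises CalledProcessError if ocrmypdf exits with an error.
    return subprocess.run(cmd, cwd=plugin_dir, check=True)
```

For example, `run_ocr(".", "scan.pdf", "scan_ocr.pdf")` runs the equivalent of `ocrmypdf --plugin gvision.py scan.pdf scan_ocr.pdf` from the current directory, with the PDF paths made absolute.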
