ocrmypdf_plugin_GoogleVision

A very minimalistic approach for an ocrmypdf plugin to use Google Vision as OCR engine. Tesseract is currently used for rotation, so if Tesseract is not able to determine the correct rotation, there are some problems. It doesn't matter for my particular use case, but it might for yours.

A cloudkey is also needed and must be in the same directory as cloudkey.json. https://cloud.google.com/vision/docs/before-you-begin

Also borrowed some code from https://github.com/dinosauria123/gcv2hocr. Thanks a lot for your work.

copy all files to a directory
get the cloudkey ans safe it as cloudkey.json
pip3 install google-cloud-vision
call ocrmypdf from the currect diretory with --plugin gvision.py

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
HocrConverter.py		HocrConverter.py
README.md		README.md
gcv2hocr2.py		gcv2hocr2.py
gvision.py		gvision.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ocrmypdf_plugin_GoogleVision

About

Releases

Packages

Languages

kkrell2016/ocrmypdf_plugin_GoogleVision

Folders and files

Latest commit

History

Repository files navigation

ocrmypdf_plugin_GoogleVision

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages