Skip to content

Webservice for extracting text from pdfs on the web

Notifications You must be signed in to change notification settings

FriedrichSal/text-from-pdf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Text-from-pdf

This projects if for fetching text from a url. Content type can be html or pdf.

Getting started

Build the docker image by

docker build -t url2text:latest .

Then start the service by

docker run -p 8080:80 url2text:latest

Now test the service by querying

http://localhost:8080/textforurl?url=<your url to html or pdf content>

for example

http://localhost:8080/textforurl?url=https://arxiv.org/pdf/2003.03384.pdf```

About

Webservice for extracting text from pdfs on the web

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published