-
Notifications
You must be signed in to change notification settings - Fork 449
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
createTRAINING batch command #1149
Comments
Hello ! Normally it means that the PDF is image only (Grobid does not include an OCR, it has to be provided as pre-processing). Other possible explanations: encrypted PDF or corrupted PDF. Finally it's also possible that no header is detected by the segmentation model which is applied first. In the last case, it means the corrected segmentation training file has to be put first in the segmentation training and the segmentation model updated. |
when running this command I noticed that corresponding to a certain PDF present in the 'directory of input files' files for the header model are not generated ?
Why so and generally is there a criteria for generation of output files model wise corresponding to an input pdf?
The text was updated successfully, but these errors were encountered: