Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

refactor(parse_core): improve image and table block handling #785

Merged
merged 2 commits into from
Oct 25, 2024

Commits on Oct 24, 2024

  1. refactor(magic_pdf): adjust confidence threshold for DocLayout_YOLO m…

    …odel
    
    - Changed the confidence threshold from0.15 to 0.25 in the DocLayout_YOLO model prediction
    - This adjustment aims to improve the accuracy of layout detection by filtering out low-confidence predictions
    myhloli committed Oct 24, 2024
    Configuration menu
    Copy the full SHA
    ce72cf0 View commit details
    Browse the repository at this point in the history

Commits on Oct 25, 2024

  1. refactor(ocr): improve image and table block handling

    - Split image and table blocks into separate categories
    - Add group_id to image and table blocks- Update block processing logic to handle new categories
    - Modify layout splitting and span filling to accommodate new block types
    - Adjust block indexing and sorting to consider new structures
    myhloli committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    c34c9d2 View commit details
    Browse the repository at this point in the history