Skip to content

Table and structured fields not detected — large table region skipped entirely #4124

@ramrohit3545

Description

@ramrohit3545

🔎 Search before asking | 提交之前请先搜索

  • I have searched the MinerU Readme and found no similar bug report.
  • I have searched the MinerU Issues and found no similar bug report.
  • I have searched the MinerU Discussions and found no similar bug report.

🤖 Consult the online AI assistant for assistance | 在线 AI 助手咨询

Description of the bug | 错误描述

MinerU is failing to detect a large table block and multiple structured fields in the PDF.
The entire highlighted region in the PDF (routing, flight info, handling info, and goods details) is not extracted, and the JSON output only contains text from other sections.

Image

How to reproduce the bug | 如何复现

not extract all.miss some

Operating System Mode | 操作系统类型

Linux

Operating System Version| 操作系统版本

Ubuntu 22.04

Python version | Python 版本

3.12

Software version | 软件版本 (mineru --version)

>=2.5

Backend name | 解析后端

vlm

Device mode | 设备模式

cuda

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions