    Repositories list

    • MinerU

      Public
      Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
      Python
      4.2k50k1483 Updated Dec 15, 2025
    • A Python package for interacting with the MinerU Vision-Language Model.
      Python
      178422 Updated Dec 15, 2025
    • OmniDocBench

      Public
      [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
      Python
      1181.3k944 Updated Dec 15, 2025
    • MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG applications, and training data generation.
      HTML
      1613560 Updated Dec 12, 2025
    • opendatalab.github.io

      Public
      HTML
      0000 Updated Dec 11, 2025
    • OHR-Bench

      Public
      (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
      Python
      149400 Updated Dec 3, 2025
    • TRivia

      Public
      TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition
      Python
      01110 Updated Dec 3, 2025
    • WebMainBench

      Public
      WebMainBench is a specialized benchmark tool for end-to-end evaluation of web main content extraction quality.
      Python
      7800 Updated Nov 26, 2025
    • Python
      108320 Updated Nov 22, 2025
    • labelU-Kit

      Public
      Data annotation component library, provided as NPM packages
      TypeScript
      4614121 Updated Nov 19, 2025
    • 🕶️ A curated list of awesome things related to MinerU
      Python
      1200 Updated Nov 14, 2025
    • Vis3

      Public
      Data browser based on S3: a visualization tool for data stored in S3 (json / jsonl / parquet / html / md, etc.). 👇 Try online.
      TypeScript
      128100 Updated Nov 11, 2025
    • LEGION

      Public
      [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"
      Python
      67270 Updated Oct 22, 2025
    • skydiffusion

      Public
      [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
      Python
      58170 Updated Oct 17, 2025
    • labelU

      Public
      Data annotation toolbox that supports image, audio, and video data.
      Python
      1591.4k371 Updated Oct 1, 2025
    • UniMERNet

      Public
      UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
      Python
      36438362 Updated Sep 28, 2025
    • FakeVLM

      Public
      [NeurIPS 2025 🔥] FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis
      Python
      79770 Updated Sep 24, 2025
    • [ACL 2025 Best Theme Paper] This is the official implementation for the paper: "Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models"
      Python
      1418500 Updated Aug 29, 2025
    • Python
      0910 Updated Aug 20, 2025
    • opendatalab-python-sdk

      Public
      SDK of OpenDataLab - https://opendatalab.org.cn
      Python
      55822 Updated Jul 31, 2025
    • awesome-markdown-ebooks

      Public
      31310 Updated Jul 17, 2025
    • REST

      Public
      Python
      23210 Updated Jul 15, 2025
    • MLS-BRN

      Public
      [CVPR 2024] 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions
      Python
      68490 Updated Jul 15, 2025
    • datasets resource
      1312730 Updated Jul 1, 2025
    • .github

      Public
      2100 Updated Jul 1, 2025
    • ProverGen

      Public
      [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation"
      Python
      43820 Updated Jun 11, 2025
    • PM4Bench

      Public
      Python
      01400 Updated May 16, 2025
    • GRAIT

      Public
      [NAACL25 findings] Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation
      Python
      0310 Updated Apr 28, 2025
    • DocLayout-YOLO

      Public
      DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
      Python
      1441.9k444 Updated Apr 14, 2025
    • UrBench

      Public
      [AAAI 2025] This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios”
      Python
      63510 Updated Apr 10, 2025