Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
-
Updated
Mar 25, 2026 - Python
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
MinerU免安装部署一键启动整合包
🎨 Display system theme colors and their references easily with this Lua Plugin for GrandMA3, simplifying your color selection process.
🛠 Enhance conditioning for FLUX.2 Klein 9B in ComfyUI, optimizing active text regions for improved prompt adherence and editing behavior.
PDF table extraction for RAG — convert to clean HTML. Fast, local, no GPU.
A small web app that finds relevant documents and produces query-focused summaries using Gemini. Supports PDF upload with one-time multimodal preprocessing into per-page Markdown + metadata.
🔄 Optimize model loading in ComfyUI with flexible node connections and controlled sequences for better performance and memory management.
Extract tables precisely from PDFs and convert them to clean HTML for RAG pipelines, running fast on CPU without external dependencies.
🎨 Enhance video generation by syncing audio to visuals with ComfyUI-PainterAI2V. Create precise lip-syncing and seamless transitions using dual model workflows.
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
🖼️ Segment characters in images with ComfyUI using a Vision LLM agent, enhancing your projects with detailed and high-quality masks.
Parse JSON quickly using a fast, recursive-descent parser designed for lightweight integration in C++ projects.
🎶 Generate multilingual AI music with lyrics in English, Chinese, Japanese, Korean, and Spanish using ComfyUI's HeartMuLa model.
💡 Control 3D lighting angles effortlessly with ComfyUI - adjust direction, intensity, and color while relighting images in real-time.
📝 Manage your projects and notes locally with Ironpad, a file-based system that keeps your data safe in Markdown format without cloud reliance.
🎨 Build interactive Blazor applications with A2UI, a secure and portable protocol for rich UI rendered natively across platforms without code execution risks.
🤖 Process SCAIL-pose data with ComfyUI nodes, utilizing VitPose for accurate face and hand detection in an efficient, streamlined setup.
Implements Unreal Engine 5 network protocol in Python to connect, authenticate, and replicate actors with UE5 Lyra Starter Game servers.
An interactive command-line tool designed to quickly navigate directories and perform various file operations efficiently. Its simple syntax and intuitive commands make it a favorite among developers for streamlining workflow tasks.
Add a description, image, and links to the pdf-extractor-rag topic page so that developers can more easily learn about it.
To associate your repository with the pdf-extractor-rag topic, visit your repo's landing page and select "manage topics."