Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Updated Mar 25, 2026 - Python
Transforms complex documents such as PDFs into LLM-ready Markdown/JSON for your agentic workflows.
A small web app that finds relevant documents and produces query-focused summaries using Gemini. Supports PDF upload with one-time multimodal preprocessing into per-page Markdown + metadata.
🔄 Optimize model loading in ComfyUI with flexible node connections and controlled sequences for better performance and memory management.
🎨 Enhance video generation by syncing audio to visuals with ComfyUI-PainterAI2V. Create precise lip-syncing and seamless transitions using dual model workflows.
A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON.