Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
-
Updated
Mar 19, 2026 - Python
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
天枢 - 企业级 AI 一站式数据预处理平台 | PDF/Office转Markdown | 支持MCP协议AI助手集成 | Vue3+FastAPI全栈方案 | 文档解析 | 多模态信息提取
Native Swift + MLX port of Paddle0CR-VL, a 0.9B doc parsing VLM for OCR, tables, formulas, and charts on Apple Silicon
Minimal script to run PaddleOCR-VL on a PDF and output a MD file with images
Advanced PDF processing, OCR, and AI vision analysis nodes for ComfyUI. Extract images from PDFs, perform multilingual OCR with Surya, detect objects with Florence-2, and analyze document layouts.
Local web UI for PaddleOCR‑VL via Docker (PDF/Image OCR, preview, logs, downloads)
Bob 终极 OCR:支持本地隐私模型 + 大模型API云端 OCR 高精度,一键切换。快、免费、隐私好。识别能力更强,可选更多模型
Python data pipeline for arXiv metadata: SQLAlchemy + Alembic schema, PostgreSQL storage, PDF download tracking, and optional PaddleOCR processing.
Истории автомобилей
Turn your Android device into a low-latency Bluetooth HID gamepad recognized as a native controller by Windows and other systems without extra software.
🔍 Explore top 50 starred Flutter repositories on GitHub with offline support and intuitive sorting in a user-friendly app.
Add a description, image, and links to the paddleocr-vl topic page so that developers can more easily learn about it.
To associate your repository with the paddleocr-vl topic, visit your repo's landing page and select "manage topics."