LiteParse pairs fast text parsing with a two-stage agent pattern, falling back to multimodal models when tables or charts ...
This project provides a lightweight, containerized API for extracting and cleaning text from PDF files using PyMuPDF and serving it with FastAPI. We provide a docker ...
A powerful Model Context Protocol (MCP) server that empowers AI assistants like Claude and GitHub Copilot to intelligently interact with PDF documents. Extract text, metadata, search content, and ...
Abstract: Creating presentation slides from complex or poorly structured PDFs remains a time-consuming process. Existing systems that attempt to automate this process are typically limited, relying on ...