#document
3 APIs with this tag
PDF to Text API
Extract text from PDF documents: the complete document text, the text of every page separately, the document metadata (page count, title, author, subject, keywords, creator, producer, creation and modification dates, PDF version, encryption and outline flags), and a full in-document search that returns the matching pages with hit counts and context snippets. Supply a PDF by public URL, base64 or a raw binary request body; only public http/https URLs are accepted and private or internal hosts are blocked. Pure server-side computation (pdf.js, no third-party upstream, no per-page cloud cost). Ideal for document indexing and search, data extraction and ETL, invoice and contract processing, archiving and content pipelines.
api.oanor.com/pdftext-api
OCR API
Extract text from images with optical character recognition in eight languages (English, German, French, Spanish, Italian, Portuguese, Dutch and Turkish). Get the full recognised text with an average confidence score and word and line counts, or word-level results where every word carries its own confidence and bounding box for layout-aware processing. Supply an image by public URL, base64 or a raw binary request body; large images are downscaled automatically before recognition to keep responses fast, and a warm engine keeps latency low after start-up. Pure server-side computation (Tesseract, no third-party upstream, no per-call cloud cost). Ideal for digitising documents and receipts, reading text from photos and screenshots, accessibility, and data-entry automation.
api.oanor.com/ocr-api
PDF API
Generate PDF documents from text on the fly — with an optional bold title, page size (A4, Letter or Legal) and font size. Returns a base64 PDF or a raw downloadable file. Great for invoices, reports, receipts and tickets.
api.oanor.com/pdf-api