PDF
pdfcpu is great Go lib to parse PDFs. React PDF REPL is interesting.
Notes
Links
- pdf-diff - PDF comparison utility in Python.
- Caradoc - Parser and validator of PDF files written in OCaml.
- pdfsandwich - Tool to make "sandwich" OCR pdf files. (HN)
- Markdown to PDF - Will take a markdown file as input and then create a PDF file with the markdown formatting.
- PDF Reader in Go
- lopdf - Rust library for PDF document manipulation. (Breakdown Of How Lopdf Reads PDFs)
- pdf_render - Experimental PDF viewer.
- pdf-rs - Read, alter and write PDF files.
- PEP - Open Source & Free PDF Editor for Mac. (Code)
- Make any PDF look like scanned (Code) (CLI)
- TPPDF - Fast PDF builder for iOS & macOS using simple commands to create advanced documents.
- PDF-Lib - Create and modify PDF documents in any JavaScript environment. (Docs)
- PDF to CSV converter
- pikepdf - Python library for reading and writing PDF files.
- pdf2svg - Simple PDF to SVG converter using the Poppler and Cairo libraries.
- CamlPDF - OCaml library for reading, writing and modifying PDF files.
- cpdf-binaries - PDF Command Line Tools binaries.
- pdfannots - Extracts and formats text annotations from a PDF file.
- Processing PDFs with Cloud Functions (2020)
- pystitcher - Stitches your PDF files together, generating nice customizable bookmarks for you using a declarative input in the form of a markdown file. (HN)
- QPDF - Command-line tool and C++ library that performs content-preserving transformations on PDF files.
- qpdf-rs - Rust bindings for QPDF C++ library.
- pdfc - PDF compiler for your source code.
- labelmake - Declarative style PDF generation library for Node and the browser.
- Reducing the size of large PDFs (2022) (HN)
- Purdy - Experimental PDF renderer built on top of WebGPU.
- pdftotext - Simple PDF text extraction.
- iLovePDF - Online PDF tools for PDF lovers.
- PDFRip - Fast PDF password cracking utility equipped with commonly encountered password format builders and dictionary attacks.
- ZotFile - Advanced PDF management for Zotero.
- pdfmake - PDF document generation library for server-side and client-side in pure JavaScript.
- x-ray - Python library for finding bad redactions in PDF documents.
- svg2pdf.js - JavaScript-only SVG to PDF conversion utility that runs in the browser leveraging jsPDF.
- PDFMiner.six - Tool for extracting information from PDF documents.
- Keypoints - Annotate PDFs in Markdown.
- Excalibur - PDF Table Extraction for Humans. (Code)
- Tabula - Extract Tables from PDFs. (Code) (HN)
- SlidePilot - PDF Presentation Tool for macOS. (Code)
- giopdf - PDF viewer library for Gio.
- PDFME - TypeScript based PDF generator library, made with React. (Docs)
- PSPDFKit API - Generate, convert, and modify PDF documents. (HN)
- react-pdf-highlighter - React library that provides annotation experience for PDF documents on web.
- WASM-PDF - Generate PDF files with JavaScript and WASM.
- PDFIO - PDF Reader Library for Native Julia.
- gopdf - Simple library for generating PDF written in Go.
- Local PDF Tools - Merge, Optimize, Extract PDFs in your Browser. (Code)
- pdfboxing - Clojure PDF manipulation library & wrapper for PDFBox.
- PDFKit - JavaScript PDF generation library for Node and the browser.
- Express PDF Generator Service
- PDF::Reader - Implements a PDF parser conforming as much as possible to the PDF specification from Adobe.
- PyPDF2 - Pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files.
- Look Scanned - Make your PDFs look scanned. (Code)
- Ask HN: Why is the PDF format so inaccessible? (2022)
- pdfreader - Read text and parse tables from PDF files.
- pdfsandwich - OCR your PDF fast. (Reddit)
- Converts PDFs to dark mode (Code)
- Scholar Reader - User interface, API, and data processing scripts for an augmented PDF reader application.
- Pdfmake-wrapper - Generate PDF documents in an easy and readable way.
- Let's write a PDF file (2015)
- wkhtmltopdf - Convert HTML to PDF using Webkit (QtWebKit).
- img2pdf - Losslessly convert raster images to PDF.
- Boxes and Glue - PDF typesetting library/backend in the spirit of TeX's algorithms.
- TinyWow - Free PDF, Video, Image & Other Online Tools.
- s3-ocr: Extract text from PDF files stored in an S3 bucket (2022)
- Search PDFs with Transformers and Python Notebook (HN)
- PDF-Diff - Tool for visualizing differences between two PDF files. (HN)
- Generate an Invoice PDF using Cloudflare Workers (Code)
- Ask HN: How do you organize or rename PDF files (books, papers, etc)? (2022)
- Extract structured data from PDF invoices
- PDF Grep - Command line utility to search text in PDF files. (HN) (Code)
- WeasyPrint - Turns simple HTML pages into gorgeous statistical reports, invoices, tickets as PDFs.
- lazypress - Convert HTML pages to PDFs looking just like they would render in the browser.
- PDF processing and analysis with open-source tools (2021) (HN)
- How To Create a PDF in Go: A Step-By-Step Tutorial (Reddit)
- go-audio - Offline solution to convert PDFs into audiobooks.
- bagme - PDF rendering library for Go using boxes and glue.
- PDFSyntax - Python PDF parsing library and tool built on top to browse the internal structure of a PDF file. (HN)
- PDF storage with global search
- Hammer PDF - Smart Scientific Reader. (Code)
- Open PDF Sign - Digitally sign PDF files from your command line. (HN)
- pdfcpu - PDF processor written in Go. (HN)
- pdfium-render - Idiomatic high-level Rust interface to Pdfium. Used by Google Chromium.
- PDF.js extract - Extracts text from PDF files.
- Simple PDF Embed - Add a powerful PDF editor directly into your website or React App.
- PDF to Image - Converts PDFs to images in Node with no native dependencies.
- SVG 2 PDF - Converts SVG files to PDF.
- PDF Writer - Step-by-step PDF writer.
- React PDF REPL (Code)
- jendeley - JSON-based PDF paper organizing software.
- PDF Extract - Rust library for extracting content from PDFs.
- pdf-extract - Rust library to extract content from PDF files.
- GPT-4 & LangChain - Create a ChatGPT Chatbot for Your PDF Docs
- Ask Your PDF - Upload, chat and interact with any PDF document. (HN)
- PDF 2.0 specification (Reddit)
- Maroto - Maroto way to create PDFs.
- ChatPDF - Chat with any PDF.
- pdfGPT - Chat with the contents of your PDF file by using GPT capabilities.
- vortex - Tool to extract images from PDF files.
- PDF Annotation Fixer - Fixes macOS Preview garbled annotations.
- PDFEasy - JavaScript Client/Server Side PDF-Generator based in PDFKit.
- ScholarTurbo - Use ChatGPT to chat with PDFs (supports GPT-4). (HN)
- Book Builder - Turns markdown into PDF.
- MultiPDF Chat App - Langchain app that allows you to chat with multiple PDFs.
- PDF GPT Indexer - Build Personal ChatGPT Using Your Data. (HN)
- peepdf - Python tool to analyze PDF documents.
- PDF Sign - Tool to sign PDF files.
- unpdf - Utilities to work with PDFs, like extracting text.
- PDF Tool - Modify PDFs in the browser without uploading. (HN)
- PDF.js Serverless - Serverless build of PDF.js for Deno, workers, and more.