Papermerge DMS Product Information

Papermerge DMS is an open-source Document Management System designed for digital archives. It stores, organizes, and indexes scanned documents (PDF, JPEG, TIFF) and enables fast retrieval using full-text search, tags, and metadata. The platform emphasizes a modern, user-friendly web interface, OCR-powered search, and robust document versioning, making it suitable for personal use and small to medium-sized organizations.


How to Use Papermerge DMS

  1. Store documents by uploading scanned files (PDF/JPEG/TIFF) into the repository.
  2. Index and search using full-text search, metadata, and custom tags to quickly locate documents.
  3. Manage versions: each operation that changes a document creates a new version while preserving the original, so updates are easy to track (e.g., running OCR stores the OCRed result as a separate version).
  4. Annotate with metadata: create custom fields (metadata) and assign them to categories like Receipts, Invoices, or Contracts.
  5. Organize with categories: define document types (Categories) to structure your library (e.g., Receipt, Invoice, Contract).
  6. Fix page order and orientation: use page management to reorder, rotate, or extract pages within documents, which is especially useful for bulk scans with mixed-up pages.
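
The versioning and page management steps above can be illustrated with a small in-memory model. This is only a sketch under stated assumptions, not Papermerge's actual implementation or API: the Document class, its methods, and the (page number, rotation) representation are hypothetical names chosen for illustration. It shows the idea that every page operation appends a new version and leaves the original untouched.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Hypothetical model: a page is (original_page_number, rotation_in_degrees).
Page = Tuple[int, int]


@dataclass
class Document:
    """Toy document in which every change is appended as a new version."""
    title: str
    versions: List[List[Page]] = field(default_factory=list)

    @classmethod
    def from_scan(cls, title: str, page_count: int) -> "Document":
        # Version 1 is the original upload and is never modified afterwards.
        original = [(number, 0) for number in range(1, page_count + 1)]
        return cls(title=title, versions=[original])

    @property
    def latest(self) -> List[Page]:
        return self.versions[-1]

    def _commit(self, pages: List[Page]) -> int:
        # Append instead of overwriting so earlier versions stay intact.
        self.versions.append(pages)
        return len(self.versions)

    def reorder(self, new_order: List[int]) -> int:
        """new_order lists the current 1-based positions in the desired order."""
        current = self.latest
        if sorted(new_order) != list(range(1, len(current) + 1)):
            raise ValueError("new_order must be a permutation of all positions")
        return self._commit([current[i - 1] for i in new_order])

    def rotate(self, position: int, degrees: int) -> int:
        """Rotate the page at a 1-based position; returns the new version number."""
        pages = list(self.latest)
        number, rotation = pages[position - 1]
        pages[position - 1] = (number, (rotation + degrees) % 360)
        return self._commit(pages)

    def extract(self, positions: List[int], title: str) -> "Document":
        """Assumption: extracted pages move into a new document."""
        current = self.latest
        taken = [current[i - 1] for i in positions]
        remaining = [p for i, p in enumerate(current, start=1) if i not in positions]
        self._commit(remaining)
        return Document(title=title, versions=[taken])


doc = Document.from_scan("bulk-scan.pdf", page_count=4)
doc.reorder([2, 1, 3, 4])      # swap the first two pages
doc.rotate(3, 90)              # turn the third page by 90 degrees
receipt = doc.extract([4], title="receipt.pdf")
```

After these three operations the document holds four versions: the original upload plus one per operation. Appending rather than overwriting is what makes it possible to go back to the original scan at any time.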

Disclaimer: This description reflects the features and capabilities as documented for Papermerge DMS.

Core Capabilities

  • Open-source, Apache 2.0 license; source code available on GitHub
  • Web-based, modern, intuitive user interface
  • Supports PDF, JPEG, and TIFF formats; optimized for digital archives
  • Full-text search and metadata-based indexing for rapid information retrieval
  • OCR powered by the open-source Tesseract engine, supporting 100+ languages
  • Versioning: each operation that changes a document creates a new version while the original is preserved
  • Custom Fields (document metadata) to attach attributes like dates, prices, IDs
  • Categories (document types) to classify documents (e.g., Receipt, Invoice, Contract)
  • Page Management: reorder, rotate, and extract pages within documents
  • Easy-to-use tools to visualize and manage documents within a web UI
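
To make the relationship between Categories and Custom Fields more concrete, the sketch below models a category as a named set of typed fields. It is a hedged illustration only: the Category class, the PARSERS table, and the field names (purchased_on, total, shop_id) are hypothetical and do not reflect Papermerge's internal data model.

```python
from dataclasses import dataclass
from datetime import date
from decimal import Decimal
from typing import Any, Callable, Dict

# Hypothetical field types; each parser turns a raw string into a typed value.
PARSERS: Dict[str, Callable[[str], Any]] = {
    "date": date.fromisoformat,
    "price": Decimal,
    "id": str,
}


@dataclass(frozen=True)
class Category:
    """A document type together with the custom fields assigned to it."""
    name: str
    fields: Dict[str, str]  # custom field name -> field type

    def parse(self, raw: Dict[str, str]) -> Dict[str, Any]:
        # Reject fields that were never assigned to this category.
        unknown = set(raw) - set(self.fields)
        if unknown:
            raise KeyError(f"fields not defined for {self.name}: {sorted(unknown)}")
        return {name: PARSERS[self.fields[name]](value) for name, value in raw.items()}


receipt_category = Category(
    "Receipt", {"purchased_on": "date", "total": "price", "shop_id": "id"}
)
metadata = receipt_category.parse({"purchased_on": "2024-03-15", "total": "19.99"})
```

Storing typed values instead of plain strings is one way to make later filtering by date or price well defined.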

How It Works

  • Upload documents (PDF/JPEG/TIFF) to Papermerge DMS.
  • OCR extracts the text from each document and indexes it so the content becomes searchable.
  • Documents and their metadata are stored with version history and category assignments.
  • Users can search using full-text queries or filter by custom fields and categories.
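
The flow above (store text, index it, then search and filter) can be sketched with a simple inverted index. This is a minimal illustration under assumptions, not how Papermerge implements search: the Record and Archive classes are hypothetical, and the text field simply stands in for whatever the OCR step produces.

```python
import re
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class Record:
    doc_id: int
    text: str                       # stands in for the text produced by OCR
    category: str
    custom_fields: Dict[str, str] = field(default_factory=dict)


class Archive:
    """Toy archive with a word -> document-id inverted index."""

    def __init__(self) -> None:
        self.records: Dict[int, Record] = {}
        self.index: Dict[str, Set[int]] = defaultdict(set)

    @staticmethod
    def _tokens(text: str) -> List[str]:
        return re.findall(r"\w+", text.lower())

    def add(self, record: Record) -> None:
        self.records[record.doc_id] = record
        for word in self._tokens(record.text):
            self.index[word].add(record.doc_id)

    def search(self, query: str = "", category: Optional[str] = None,
               **fields: str) -> List[int]:
        # Full-text part: keep only documents containing every query word.
        hits = set(self.records)
        for word in self._tokens(query):
            hits &= self.index.get(word, set())

        # Metadata part: filter by category and exact custom field values.
        def matches(record: Record) -> bool:
            if category is not None and record.category != category:
                return False
            return all(record.custom_fields.get(k) == v for k, v in fields.items())

        return sorted(i for i in hits if matches(self.records[i]))


archive = Archive()
archive.add(Record(1, "Invoice for office chairs, total 240 EUR", "Invoice", {"vendor": "Acme"}))
archive.add(Record(2, "Receipt: coffee and office supplies", "Receipt", {"vendor": "Cornershop"}))
archive.add(Record(3, "Invoice for printer paper", "Invoice", {"vendor": "Cornershop"}))

office_docs = archive.search("office")
cornershop_invoices = archive.search(category="Invoice", vendor="Cornershop")
```

Here office_docs contains documents 1 and 2, while cornershop_invoices contains only document 3, showing how a full-text query and metadata filters narrow the result set independently.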

Safety and Legal Considerations

  • Ensure you have the rights to store and process documents and respect privacy and data protection regulations relevant to your jurisdiction.

Core Features

  • Open-source DMS with permissive Apache 2.0 license
  • Web-based, modern, and intuitive user interface
  • Wide document format support: PDF, JPEG, TIFF
  • Full-text search and metadata tagging for fast retrieval
  • OCR via Tesseract supporting 100+ languages
  • Document Versioning: original retained; new versions created for updates (e.g., OCRed version)
  • Custom Fields (metadata) for flexible data attributes
  • Categories to classify document types (Receipt, Invoice, Contract, etc.)
  • Page Management: reorder, rotate, extract pages within documents
  • User-friendly workflow for indexing, tagging, and organizing archival content