Papermerge DMS is an open-source Document Management System designed for digital archives. It stores, organizes, and indexes scanned documents (PDF, JPEG, TIFF) and enables fast retrieval using full-text search, tags, and metadata. The platform emphasizes a modern, user-friendly web interface, OCR-powered search, and robust document versioning, making it suitable for personal use and small to medium-sized organizations.
How to Use Papermerge DMS
- Store documents by uploading scanned files (PDF/JPEG/TIFF) into the repository.
- Index and search using full-text search, metadata, and custom tags to quickly locate documents.
- Manage versions: each operation creates a new document version while preserving the original, enabling easy tracking of updates (e.g., an OCRed version is stored as a separate version).
- Annotate with metadata: create custom fields (metadata) and assign them to categories like Receipts, Invoices, or Contracts.
- Organize with categories: define document types (Categories) to structure your library (e.g., Receipt, Invoice, Contract).
- Fix page order and quality: use page management to reorder, rotate, or extract pages within documents, a boon for bulk scans with mixed-up pages.
Disclaimer: This description reflects the features and capabilities as documented for Papermerge DMS.
Core Capabilities
- Open-source, Apache 2.0 license; source code available on GitHub
- Web-based, modern, intuitive user interface
- Supports PDF, JPEG, and TIFF formats; optimized for digital archives
- Full-text search and metadata-based indexing for rapid information retrieval
- OCR powered by the open-source Tesseract engine, supporting 100+ languages
- Versioning: each operation creates a new document version while preserving originals
- Custom Fields (document metadata) to attach attributes like dates, prices, IDs
- Categories (document types) to classify documents (e.g., Receipt, Invoice, Contract)
- Page Management: reorder, rotate, and extract pages within documents
- Easy-to-use tools to visualize and manage documents within a web UI
How It Works
- Upload documents (PDF/JPEG/TIFF) to Papermerge DMS.
- OCR is applied to extract and index text for searchable content.
- Documents and their metadata are stored with version history and category assignments.
- Users can search using full-text queries or filter by custom fields and categories.
Safety and Legal Considerations
- Ensure you have the rights to store and process documents and respect privacy and data protection regulations relevant to your jurisdiction.
Core Features
- Open-source DMS with permissive Apache 2.0 license
- Web-based, modern, and intuitive user interface
- Wide document format support: PDF, JPEG, TIFF
- Full-text search and metadata tagging for fast retrieval
- OCR via Tesseract supporting 100+ languages
- Document Versioning: original retained; new versions created for updates (e.g., OCRed version)
- Custom Fields (metadata) for flexible data attributes
- Categories to classify document types (Receipt, Invoice, Contract, etc.)
- Page Management: reorder, rotate, extract pages within documents
- User-friendly workflow for indexing, tagging, and organizing archival content