Documentation

The PDF page deduplication tool detects repeated pages inside one PDF and creates a new PDF that keeps the first occurrence. Processing runs locally in your browser, making it useful for scanned documents, contracts, course files, invoice bundles, and office document cleanup.

Features

  • Exact matching removes pages with identical rendered fingerprints.
  • Fuzzy matching adds visual feature comparison for slightly different scanned duplicates.
  • Render quality can be adjusted to balance speed and detection stability.
  • Duplicate mapping shows which removed page matched which kept page.

Usage Tips

Use exact matching first for important documents, then try fuzzy matching if scanned duplicates remain. A higher fuzzy threshold matches more aggressively, so use lower values and review the page count when handling contracts, invoices, certificates, or other sensitive documents.

Privacy and Limits

PDF reading, rendering, deduplication, and export are completed locally in the browser. Encrypted PDFs, damaged PDFs, or files that cannot be rendered by the browser may not be processed directly.