Duplicate Document Detection

Why:

To prevent data redundancy and double processing, both of which inflate costs.

What:

Identify and flag duplicate document uploads based on the file hash and the following extracted data:

  • Supplier
  • Document ID
  • Subtotal
  • Number of line items
  • Gross Total
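
As a rough illustration of the two signals named above, the sketch below computes a content hash and a fingerprint over the five extracted fields. The names `ExtractedData`, `file_hash`, and `data_fingerprint`, the choice of SHA-256, and the normalization rules (case folding, whitespace collapsing, two decimal places) are all assumptions made for the example, not part of the request.

```python
import hashlib
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class ExtractedData:
    # Hypothetical container for the five extracted fields listed above.
    supplier: str
    document_id: str
    subtotal: Decimal
    line_item_count: int
    gross_total: Decimal


def file_hash(content: bytes) -> str:
    """SHA-256 of the raw bytes; the filename is not part of the input."""
    return hashlib.sha256(content).hexdigest()


def data_fingerprint(data: ExtractedData) -> str:
    """SHA-256 over the normalized extracted fields (assumed normalization)."""
    parts = [
        " ".join(data.supplier.split()).casefold(),  # collapse spaces, ignore case
        data.document_id.strip().casefold(),
        f"{data.subtotal:.2f}",                      # fixed two decimal places
        str(data.line_item_count),
        f"{data.gross_total:.2f}",
    ]
    # Join with a separator that should not occur inside the field values.
    return hashlib.sha256("\x1f".join(parts).encode("utf-8")).hexdigest()
```

Because the file hash covers only the bytes, a renamed copy produces the same hash, while a re-scan would typically change the file hash but could still match on the data fingerprint.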

Add a filter for duplicate documents.

Acceptance Criteria:

  • Unique file hash generated for each document.
  • Duplicate alerts are shown to the user during upload.
  • Option to “View Existing Document” when a duplicate is detected.
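
One way the upload-time check behind these criteria might look is sketched below. `DocumentStore`, `UploadResult`, and the in-memory dictionary are hypothetical stand-ins for whatever storage the product actually uses; the `duplicate_of` field is what a UI could use to drive the alert and the “View Existing Document” option.

```python
import hashlib
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class UploadResult:
    document_id: str
    duplicate_of: Optional[str] = None  # ID of the existing document, if any


class DocumentStore:
    """Hypothetical in-memory index from file hash to stored document ID."""

    def __init__(self) -> None:
        self._by_hash: Dict[str, str] = {}

    def upload(self, document_id: str, content: bytes) -> UploadResult:
        digest = hashlib.sha256(content).hexdigest()
        existing = self._by_hash.get(digest)
        if existing is not None:
            # Same bytes already stored: report it so the user can be alerted.
            return UploadResult(document_id, duplicate_of=existing)
        self._by_hash[digest] = document_id
        return UploadResult(document_id)


store = DocumentStore()
first = store.upload("doc-1", b"invoice bytes")
second = store.upload("doc-2", b"invoice bytes")
print(first.duplicate_of, second.duplicate_of)
```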

Edge Cases:

  • Same document with minor edits or re-scans.
  • Files renamed but identical in content.
  • User orders 40 bricks. The vendor delivers the first 20, then the second 20. The system shouldn’t mark the second delivery note (DN) as a duplicate: although the two DNs will be identical, they cover separate deliveries.
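
The edge cases above suggest that a match is not always a yes/no answer. The sketch below is one possible way to separate them, assuming each document already has a content hash and a fingerprint of its extracted data; the `Match` enum and `classify` function are hypothetical names.

```python
from enum import Enum


class Match(Enum):
    EXACT = "exact"        # identical bytes, e.g. a renamed copy
    POSSIBLE = "possible"  # same extracted data, different bytes
    NONE = "none"


def classify(new_hash: str, new_fingerprint: str,
             existing_hash: str, existing_fingerprint: str) -> Match:
    """Compare a new upload against one existing document."""
    if new_hash == existing_hash:
        return Match.EXACT
    if new_fingerprint == existing_fingerprint:
        # Could be a re-scan, or a second partial delivery with an
        # identical delivery note; the data alone cannot tell them apart.
        return Match.POSSIBLE
    return Match.NONE
```

Under this assumption, a `POSSIBLE` match would be flagged for the user to confirm rather than rejected automatically, which leaves room for the split-delivery case. Distinguishing that case without user input would need an extra signal, such as a delivery date, which is not among the fields listed in this request.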

Status: In Progress

Board: Feature Request

Date: 3 months ago

Author: Linear
