Question 1

How do I find duplicates inside a single CSV file?

Accepted Answer

Drop your file into slot A. To focus only on intra-file duplicates, drop the same file into slot B and run the comparison. The Duplicates tab in the results lists every key that appears more than once inside file A and inside file B separately.

Question 2

What counts as a duplicate row?

Accepted Answer

By default, two rows are duplicates if every cell matches after the cleaning rules you enabled have been applied. In key-column mode, rows are duplicates when their key column value repeats, even if other columns differ.
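The two definitions can be illustrated with a short Python sketch. This is not the tool's actual code: the function name `find_duplicates`, the sample data, and the use of zero-based data-row indexes are assumptions made for the example.

```python
import csv
import io
from collections import defaultdict


def find_duplicates(rows, key_column=None):
    """Group data-row indexes by identity and keep groups seen more than once.

    rows: list of dicts, as produced by csv.DictReader.
    key_column: None compares every cell; a column name compares that column only.
    """
    groups = defaultdict(list)
    for index, row in enumerate(rows):
        if key_column is None:
            identity = tuple(row.values())  # whole-row mode: every cell must match
        else:
            identity = row[key_column]      # key-column mode: only the key must match
        groups[identity].append(index)
    return {identity: found for identity, found in groups.items() if len(found) > 1}


# Hypothetical sample file: id 1 repeats with a different city, id 2 repeats exactly.
sample = """id,name,city
1,Ana,Lisbon
2,Ben,Oslo
1,Ana,Porto
2,Ben,Oslo
"""
rows = list(csv.DictReader(io.StringIO(sample)))

whole_row_duplicates = find_duplicates(rows)
key_duplicates = find_duplicates(rows, key_column="id")
```

In this sample, whole-row mode reports only the two identical `2,Ben,Oslo` rows, while key-column mode on `id` also reports the two rows with id 1 whose cities differ.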

Question 3

Can I ignore casing or trailing spaces when finding duplicates?

Accepted Answer

Yes. Turn on trim whitespace, ignore casing, ignore accents, normalize emails, or normalize phone numbers in the cleaning rules. With trim whitespace, ignore casing, and ignore accents enabled, 'José García', 'jose garcia' and 'JOSE GARCIA ' are grouped as one record.
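As a rough illustration of what trimming, case folding, and accent stripping do to a value before comparison, here is a minimal Python sketch. The function `normalize` and its flags are hypothetical names, the tool's real rules may differ, and email and phone normalization are left out because their exact behavior is not described here.

```python
import unicodedata


def normalize(value, trim=True, ignore_case=True, ignore_accents=True):
    """Return a comparison key for a cell value (illustrative rules only)."""
    if trim:
        value = value.strip()
    if ignore_accents:
        # Decompose accented letters, then drop the combining marks.
        decomposed = unicodedata.normalize("NFD", value)
        value = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    if ignore_case:
        value = value.casefold()
    return value


names = ["José García", "jose garcia", "JOSE GARCIA "]
keys = {normalize(name) for name in names}  # all three collapse to one key
```

Comparing the normalized keys rather than the raw cells is what lets the three spellings fall into the same group.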

Question 4

Can I export the deduped list?

Accepted Answer

Yes. Each result section, including duplicates and the canonical 'in both' rows, has a CSV and an XLSX export button. Download the deduped set or just the duplicates, depending on what you need.
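To make the difference between the two downloads concrete, the sketch below splits rows into a deduped set and a duplicates set and serializes one of them as CSV. It assumes the first occurrence of each key is the one kept, which may not match the tool's behavior. The names `split_first_seen` and `to_csv` are made up for the example.

```python
import csv
import io


def split_first_seen(rows, key_column):
    """Return (deduped, duplicates), keeping the first row seen for each key."""
    seen = set()
    deduped, duplicates = [], []
    for row in rows:
        key = row[key_column]
        if key in seen:
            duplicates.append(row)  # a later repeat of a key already kept
        else:
            seen.add(key)
            deduped.append(row)
    return deduped, duplicates


def to_csv(rows, fieldnames):
    """Serialize dict rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical input rows.
records = [
    {"id": "1", "name": "Ana"},
    {"id": "2", "name": "Ben"},
    {"id": "1", "name": "Ana M."},
]
deduped, duplicates = split_first_seen(records, key_column="id")
deduped_csv = to_csv(deduped, fieldnames=["id", "name"])
```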

Question 5

Are my CSV files uploaded to find the duplicates?

Accepted Answer

No. Parsing and duplicate detection run in a Web Worker inside your browser, so the file contents are not transmitted to our servers.

Question 6

How large a CSV can I dedupe?

Accepted Answer

Anonymous users can run the tool on files up to about 2,000 rows for free. Larger files use the pay-as-you-go tiers (from $3). See the pricing page for the full caps.

Question 7

What about near-duplicates: values that look the same but are not byte-identical?

Accepted Answer

Use the fuzzy matching mode. It finds approximate duplicates ('Acme Corp' vs 'Acme Corporation' vs 'ACME corp') via Jaro-Winkler similarity. They are listed in the Almost matches tab with the reason.
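For readers curious how Jaro-Winkler similarity scores two strings, here is a compact Python sketch of the standard formulation. It is not the tool's implementation. The prefix scale of 0.1, the four-character prefix cap, the case folding step, and the 0.9 threshold in `is_near_duplicate` are assumptions chosen for the example.

```python
def jaro(s1, s2):
    """Jaro similarity in [0, 1]; 1.0 means identical strings."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters count as matching only within this distance of each other.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        start = max(0, i - window)
        stop = min(len2, i + window + 1)
        for j in range(start, stop):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Transpositions: matched characters that appear in a different order.
    seq1 = [c for c, m in zip(s1, matched1) if m]
    seq2 = [c for c, m in zip(s2, matched2) if m]
    transpositions = sum(a != b for a, b in zip(seq1, seq2)) // 2
    return (matches / len1 + matches / len2
            + (matches - transpositions) / matches) / 3


def jaro_winkler(s1, s2, prefix_scale=0.1):
    """Jaro score boosted for a shared prefix of up to four characters."""
    score = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return score + prefix * prefix_scale * (1 - score)


def is_near_duplicate(a, b, threshold=0.9):
    """Hypothetical helper: compare case-folded values against a threshold."""
    return jaro_winkler(a.casefold(), b.casefold()) >= threshold
```

With these assumed settings, `is_near_duplicate("Acme Corp", "Acme Corporation")` returns `True` because the shared prefix lifts the score above 0.9, and `"ACME corp"` matches `"Acme Corp"` exactly once case is folded.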

Find duplicates in a CSV file online.

When do you need to find duplicates in a CSV?

How the duplicate detection works

Ignore the formatting that fakes duplicates

Approximate duplicates: when keys do not match exactly

Export the deduped CSV

Browser-first by design

Related tools

Frequently asked questions