Question 1

How do you calculate the deduplication ratio?

Accepted Answer

Add up the records returned by every database and other source to get the total records identified. Subtract the number of records that remain after you remove duplicates. That difference is the number of duplicates removed. Divide the duplicates removed by the total records identified and multiply by 100 to express the deduplication rate as a percentage.

Question 2

What is the dedup rate?

Accepted Answer

The deduplication rate is the share of your identified records that turned out to be duplicates, written as a percentage. Searches across three or more large databases such as MEDLINE, Embase, and Cochrane CENTRAL commonly land somewhere around 20 to 40 percent because the databases index many of the same journals. The rate is descriptive, not a quality threshold, so report your own figure rather than aiming for a target.

Question 3

What does deduplication mean?

Accepted Answer

Deduplication is the step of merging records that describe the same article so each one is counted and screened only once. When you search several databases the same paper is returned multiple times, and every appearance counts as a separate record at identification. Deduplication resolves that overlap before title and abstract screening begins.

Question 4

Does Covidence automatically remove duplicates?

Accepted Answer

Yes. Covidence runs an automatic deduplication pass as you import references and reports the number it removed, and Rayyan offers a similar detect-and-merge feature. Automated matching catches the clear repeats, but you should still review the borderline pairs the tool is unsure about, since variant titles, accents, or abbreviated journal names can split one article into two records.

Question 5

How do you remove duplicates in EBSCOhost?

Accepted Answer

EBSCOhost has no built-in deduplication across separate database exports, so export your results with full bibliographic fields and deduplicate in a reference manager or screening platform instead. Import every export into one library, run the automated match on title, author, year, and identifiers, then manually adjudicate the near matches before recording the final duplicate count.

Deduplication Rate Calculator for Systematic Reviews

Your search results

Your deduplication figures

What the deduplication rate tells you

How to calculate your deduplication rate

A worked example

Common mistakes when reporting duplicates

Frequently asked questions

How do you calculate the deduplication ratio?

What is the dedup rate?

What does deduplication mean?

Does Covidence automatically remove duplicates?

How do you remove duplicates in EBSCOhost?

Create Your PRISMA 2020 Flow Diagram Free