Question 1

What is Cohen's kappa used for?

Accepted Answer

Cohen's kappa measures how well two reviewers agree when they classify the same items, correcting for the agreement you would expect by chance alone. In a systematic review it is most often used to report inter-rater reliability at the title and abstract screening stage, where two reviewers each decide whether each record should be included or excluded.

Question 2

How do you calculate Cohen's kappa?

Accepted Answer

Build a two-by-two table of how the two reviewers classified each record: both included, only the first included, only the second included, and both excluded. Compute the observed agreement, which is the proportion of records they classified the same way. Then compute the expected agreement from the marginal totals: for each category, multiply the two reviewers' proportions of records in that category, and add the products for include and exclude. Kappa is the observed agreement minus the expected agreement, divided by one minus the expected agreement. The calculator above does this from your four cell counts.
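As a rough illustration of the steps above, here is a minimal Python sketch. The function name `cohen_kappa`, its argument names, and the example counts are made up for illustration; they are not taken from the calculator on this page.

```python
def cohen_kappa(both_include, a_only, b_only, both_exclude):
    """Return (kappa, observed, expected) for a two-by-two agreement table.

    Hypothetical argument names:
      both_include  records both reviewers included
      a_only        records only reviewer A included
      b_only        records only reviewer B included
      both_exclude  records both reviewers excluded
    """
    total = both_include + a_only + b_only + both_exclude
    if total == 0:
        raise ValueError("the table has no records")

    # Observed agreement: share of records classified the same way.
    observed = (both_include + both_exclude) / total

    # Marginal proportions: how often each reviewer said "include".
    a_include = (both_include + a_only) / total
    b_include = (both_include + b_only) / total

    # Expected (chance) agreement: both say include, or both say exclude.
    expected = a_include * b_include + (1 - a_include) * (1 - b_include)

    if expected == 1:
        # Happens when both reviewers put every record in the same category.
        raise ValueError("kappa is undefined when chance agreement is 1")

    kappa = (observed - expected) / (1 - expected)
    return kappa, observed, expected


# Illustrative counts only: 100 records screened by two reviewers.
kappa, observed, expected = cohen_kappa(20, 5, 10, 65)
print(round(kappa, 3), round(observed, 3), round(expected, 3))
```

With these made-up counts the reviewers agree on 85 of 100 records, chance agreement works out to 0.60, and kappa is about 0.625, noticeably lower than the raw 85 percent agreement.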

Question 3

What is an acceptable kappa value?

Accepted Answer

Using the common Landis and Koch benchmarks, a kappa of 0.41 to 0.60 is moderate, 0.61 to 0.80 is substantial, and 0.81 to 1.00 is almost perfect. Values of 0.40 or below indicate fair agreement at best. Many review teams aim for substantial agreement or better before screening continues, and respond to a low kappa by clarifying the eligibility criteria and re-piloting on a fresh sample rather than pressing on.
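The benchmark bands can be turned into a small lookup. The sketch below is one possible reading, not the calculator's own logic: the function name `landis_koch_label` is hypothetical, and because the published bands are stated to two decimals, the choice to push in-between values such as 0.605 into the higher band is an assumption.

```python
def landis_koch_label(kappa):
    """Map a kappa value to its Landis and Koch benchmark label."""
    if not -1 <= kappa <= 1:
        raise ValueError("kappa must lie between -1 and 1")
    if kappa < 0:
        return "poor"
    # Assumption: each band is closed at its upper bound, so a value that
    # falls between two published bands (e.g. 0.605) gets the higher label.
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


for value in (0.35, 0.55, 0.625, 0.90):
    print(value, landis_koch_label(value))
```

The labels are conventions rather than hard thresholds, so a team may reasonably set its own target before screening continues.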

Question 4

Can you calculate Cohen's kappa in Excel?

Accepted Answer

Yes. Excel has no built-in kappa function, but you can compute kappa from a two-by-two agreement table by entering the observed and expected agreement formulas as cell references. The calculator on this page saves that setup: enter the four counts and it returns kappa, the observed agreement, and the chance agreement directly.

Question 5

What is the difference between ICC and Cohen's kappa?

Accepted Answer

Cohen's kappa is for categorical decisions by two raters, such as include or exclude, while the intraclass correlation coefficient (ICC) is for continuous or ordinal ratings, such as a quality score out of ten. For systematic review screening, where the decision is a yes or no inclusion call, Cohen's kappa is the appropriate statistic.

Cohen's Kappa Calculator for Reviewer Agreement

Agreement table

Your agreement

Why kappa, and not just percent agreement

How to read your result

Where agreement fits in the review

