Module 4 — The 2PL Model — Discrimination
The Rasch model places every item's ICC on the same template: same slope, same shape, shifted only left or right by difficulty \( b \). But real test items are not all equally good at telling students apart. A brilliantly written question might sharply separate students who truly understand the concept from those who don't, while a poorly worded item might be nearly random for everyone. The two-parameter logistic model (2PL) captures this difference with one additional number: the discrimination parameter \( a \).
Adding the slope parameter
The 2PL formula simply multiplies \( (\theta - b) \) by \( a \) inside the exponent:
When \( a = 1 \) this reduces to the Rasch model. When \( a \) is large the exponent changes faster — the logistic function switches from near-0 to near-1 over a narrower range of \( \theta \) — producing a steeper S-curve. When \( a \) is small the switch happens slowly, producing a flatter curve.
Formally, \( a \) is proportional to the slope of the ICC at its inflection point \( \theta = b \). The exact slope there is \( a/4 \) (in the logistic metric). So doubling \( a \) doubles the steepness of the curve at the difficulty point.
What does "discriminating" mean in practice?
Imagine two students: Student A has ability \( \theta = -0.5 \) (slightly below average) and Student B has ability \( \theta = +0.5 \) (slightly above average). Both are taking an item with difficulty \( b = 0 \).
- On a high-discrimination item (\( a = 2.0 \)) the probabilities are roughly 27% for A and 73% for B — a gap of about 46 percentage points. The item clearly distinguishes them.
- On a low-discrimination item (\( a = 0.5 \)) the probabilities are roughly 44% for A and 56% for B — a gap of only 12 percentage points. The item barely tells the two students apart.
This probability gap is exactly the measure of discrimination. High \( a \) means the item is doing its job: separating students with different abilities. Low \( a \) means the item is adding noise with little signal.
The 2PL does not change the meaning of b
A common misconception: does adding \( a \) change where the ICC crosses \( P = 0.5 \)? It does not. No matter what \( a \) is, when \( \theta = b \) the exponent is zero and \( P = 0.5 \). The difficulty \( b \) is still the midpoint of the curve. What \( a \) changes is how quickly the curve climbs through that midpoint, not where the midpoint is.
This separation of roles is what makes the 2PL interpretable: \( b \) says "how hard" and \( a \) says "how sharp." They are orthogonal concepts.
Typical values of a
In most well-calibrated item banks, \( a \) ranges from about 0.5 to 2.5. Items below 0.5 are considered poor discriminators and are typically revised or dropped. Items above 2.5 are rare and sometimes suspect — an extremely steep curve may indicate that the item is measuring something other than the target construct, such as familiarity with a specific quirk of the wording. A healthy item typically has \( a \) between 0.8 and 2.0.
This activity needs JavaScript. Adjust the discrimination slider for item A and compare it to a fixed low-discrimination item B to see how the probability gap between two students changes.
Discrimination and item information
There is a deep connection between \( a \) and how much information an item contributes to the ability estimate. Higher discrimination = more information near \( b \). In fact, for a 2PL item, the peak information at \( \theta = b \) equals \( a^2 / 4 \). Doubling \( a \) quadruples the peak information. This is why adaptive testing algorithms strongly prefer high-\( a \) items when precision matters — they give more bang per question. Module 7 unpacks item information fully.
Sort: which item discriminates better?
For each pair, choose whether item A or item B discriminates better near its difficulty point.
This activity needs JavaScript.