Module 7 — Information & Adaptive Testing
Module 6 showed you how to estimate a student's ability \( \theta \) from a set of responses. But here is the obvious follow-up: which items should you ask? A 200-item item bank contains many questions that are nearly useless for a particular student — questions so easy that everyone at that level gets them right, or so hard that everyone gets them wrong. Those items teach you almost nothing new. IRT makes "usefulness" precise through the concept of information.
The information function of one item
The Fisher information of a single item at ability level \( \theta \) measures how sharply that item's likelihood contribution peaks near \( \theta \). For the 3PL model with parameters \( a, b, c \), the information function is:
There is a lot packed into this formula, so let's unpack each piece:
- \( a^2 \) out front. Higher discrimination means the item sorts students more sharply, so it contributes more information. Doubling \( a \) quadruples the information — discrimination is the single most powerful lever for test efficiency.
- The ratio \( (1 - P)/P \). This is largest when \( P \approx 0.5 \) — the 50/50 zone. When \( P \) is near 1 (item is trivially easy) or near 0 (item is impossibly hard), the item gives almost no information. The intuition: an item only reveals something new when the outcome is uncertain.
- The guessing correction \( ((P - c)/(1 - c))^2 \). The lower asymptote \( c \) reduces information — correct answers from guessing do not reflect ability. When \( c = 0 \) (the 2PL case), this term equals \( P^2 \) and simplifies the formula.
The key intuition: an item peaks in information right around its difficulty \( b \), because that is where the student has roughly 50/50 odds. Move far above or below \( b \) and the information falls toward zero.
The standard error of the ability estimate
Fisher information connects directly to estimation precision. The Cramér–Rao lower bound from statistics says that no unbiased estimator can achieve a standard error smaller than \( 1/\sqrt{I(\theta)} \). For IRT, the MLE \( \hat{\theta} \) achieves this bound asymptotically, so:
This formula is elegant and powerful. To halve the standard error you need to quadruple the information. That means asking highly discriminating items near the student's ability level is far more efficient than piling on many mediocre items spread across the ability range.
Activity: explore the information function
Below are three items with different difficulty values \( b \). Use the slider to shift the middle item's difficulty and watch its information peak slide accordingly. Notice how each peak sits directly above its item's \( b \), and how higher \( a \) produces a taller, narrower peak.
This activity needs JavaScript. It plots the information function I(θ) for several items.
Computer-adaptive testing: the big idea
A computer-adaptive test (CAT) uses item information to make testing maximally efficient. The algorithm is a loop that repeats after every response:
- Estimate \( \theta \). Given all responses so far, compute the current ability estimate \( \hat{\theta} \) (e.g., by MLE on a grid, as in Module 6) and the associated standard error \( \text{SE} = 1/\sqrt{I_{\text{total}}(\hat{\theta})} \).
- Select the most informative unused item. Scan every item in the bank that has not yet been administered. For each, compute \( I_i(\hat{\theta}) \). Pick the item with the highest information at the current estimate — this is the item that, if answered, will most tighten the uncertainty around \( \hat{\theta} \).
- Administer and update. The student answers. The response is added to the record, the likelihood is updated, and the cycle repeats.
- Stop when done. The test ends when the standard error falls below a threshold (e.g., 0.3 on the \( \theta \) scale), when a fixed number of items has been asked, or when a pass/fail decision can be made with sufficient confidence.
The result is remarkable efficiency. A well-designed CAT can achieve the same measurement precision as a traditional 100-item fixed-form test using only 20–30 items. Different students see almost entirely different questions, yet the estimates are on the same \( \theta \) scale and directly comparable — the fairness property you learned in Module 1.
Activity: simulate a CAT session
Set a true ability level (hidden from the "test"), then click Administer next item to run the CAT loop. Watch the estimate converge and the standard error shrink as items are added. The algorithm selects items maximizing \( I_i(\hat{\theta}) \) at each step — exactly as a real CAT engine would.
This activity needs JavaScript. It simulates a CAT session with live ability estimation.
Sort: information and adaptive testing
For each statement, decide whether it is True or False about information functions and CAT.
This activity needs JavaScript.