Research Tool

Item Difficulty Estimator

Estimate assessment item difficulty using the AIED 2026 approach. Supports 9 item types and 28 interactive math widgets across grades K-8.

Select an example or enter your own item.

Inputs: Item Stem, Grade, Item Type, Widget, Skill Name (optional), Answer Options

9 Item Types

Assessment formats

Multiple Choice

4 options, 1 correct, misconception-mapped distractors

K-8

Multiple Select

Select all that apply from a set of options

2-8

True / False

Binary judgment on a mathematical statement

K-8

Short Answer

Free-text response, pattern-matched scoring

1-8

Numeric Entry

Exact numeric answer with tolerance range

K-8

Fill in the Blank

Cloze-style with inline blanks in context

1-8

Matching

Connect items across two columns

2-8

Ordering

Drag items into correct sequence

2-8

Essay / Explanation

Extended response, LLM-evaluated with rubric

3-8
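
The grade bands listed above can be captured in a small lookup table. The following is only an illustrative sketch: the names `ITEM_TYPE_GRADES`, `grade_index`, and `supports` are hypothetical and not part of the tool, and treating "K" as grade 0 is an assumption made for easy comparison.

```python
# Grade bands copied from the list above. All names here are hypothetical.
ITEM_TYPE_GRADES = {
    "Multiple Choice": ("K", "8"),
    "Multiple Select": ("2", "8"),
    "True / False": ("K", "8"),
    "Short Answer": ("1", "8"),
    "Numeric Entry": ("K", "8"),
    "Fill in the Blank": ("1", "8"),
    "Matching": ("2", "8"),
    "Ordering": ("2", "8"),
    "Essay / Explanation": ("3", "8"),
}


def grade_index(grade: str) -> int:
    """Map a grade label to an integer; kindergarten ("K") is assumed to be 0."""
    return 0 if grade == "K" else int(grade)


def supports(item_type: str, grade: str) -> bool:
    """Return True if the item type's grade band includes the given grade."""
    low, high = ITEM_TYPE_GRADES[item_type]
    return grade_index(low) <= grade_index(grade) <= grade_index(high)
```

For example, `supports("Essay / Explanation", "2")` is false because that format starts at grade 3.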

28 Interactive Widgets

Math manipulatives

Counting

Ten Frame (K-2)

2x5 grid for addition/subtraction to 20

Counting Scene (K-2)

60+ sprite types in configurable arrangements

Place Value

Base Ten Blocks (1-3)

Hundreds, tens, and ones blocks

Place Value Chart (1-4)

Interactive column chart

Operations

Number Line (K-5)

Jumps, highlights, position markers

Area Model (3-5)

Rectangular grid for multiplication/division

Array (2-4)

Dot array with row/column highlights

Integer Number Line (6-8)

Negative to positive with regions

Fractions

Fraction Bar (3-5)

Bar divided into equal parts

Fraction Circle (3-5)

Pie-chart style sectors

Fraction Comparison (3-5)

Side-by-side bars or circles

Chocolate Bar (3-5)

Grid fraction with eaten parts

Measurement

Clock (1-3)

12-hour face with draggable hands

Measuring Cup (3-5)

Liquid fractions

Data

Dot Plot (3-6)

Stacked dots on number line

Histogram (6-8)

Grouped frequency bars

Box Plot (6-8)

Quartiles and whiskers

Scatter Plot (8)

Correlation visualization

Tape Diagram (3-6)

Bar segments for ratios

Geometry

Coordinate Plane (5-8)

Points, lines, regions

Shape Builder (K-3)

Drag-and-drop geometric shapes

Right Triangle (7-8)

Labeled sides with Pythagorean theorem

Volume Builder (5-6)

3D cube structures (WebGL)

How it works

1. Chain-of-thought analysis

The LLM analyzes 7 difficulty factors before estimating: cognitive steps, prerequisites, misconceptions, transfer distance, working memory, distractor quality, and reading load.
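
One way to picture this step is a prompt that lists the seven factors and asks for the analysis before any estimate. This is a minimal sketch, not the tool's actual prompt: the function name `build_analysis_prompt` and all of the prompt wording are assumptions, and only the factor names come from the text above.

```python
# The seven factors named in the text. The prompt wording below is assumed.
DIFFICULTY_FACTORS = [
    "cognitive steps",
    "prerequisites",
    "misconceptions",
    "transfer distance",
    "working memory",
    "distractor quality",
    "reading load",
]


def build_analysis_prompt(stem: str, grade: str, item_type: str) -> str:
    """Assemble a hypothetical prompt that requests analysis before an estimate."""
    lines = [
        f"Grade: {grade}",
        f"Item type: {item_type}",
        f"Item stem: {stem}",
        "",
        "Before giving a difficulty estimate, analyze each factor in turn:",
    ]
    # Number the factors so the model's analysis can be checked for completeness.
    for number, factor in enumerate(DIFFICULTY_FACTORS, start=1):
        lines.append(f"{number}. {factor}")
    lines.append("")
    lines.append("Only after the analysis, state a difficulty estimate.")
    return "\n".join(lines)
```

Placing the estimate request last is the point of the design: the model has to commit to an analysis of every factor before it produces a number.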

2. Anchored rubric

Instead of asking “how hard is this?”, we provide example items calibrated to the grade level at each difficulty level, grounding estimates in concrete comparisons.
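
A rubric of this kind could be assembled roughly as follows. Everything in this sketch is hypothetical: the `Anchor` class, the 1-to-5 level scale, and the placeholder example items are invented for illustration and are not the tool's calibrated anchors.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Anchor:
    grade: str
    level: int  # assumed scale: 1 (easiest) to 5 (hardest)
    example: str


# Placeholder anchors, invented for illustration only.
ANCHORS = [
    Anchor("3", 5, "Explain why 6 x 7 equals 7 x 6 using an array."),
    Anchor("3", 1, "What is 4 x 2?"),
    Anchor("3", 3, "A tray holds 6 rows of 7 muffins. How many muffins?"),
    Anchor("6", 1, "What is 10% of 50?"),
]


def rubric_for_grade(anchors: list[Anchor], grade: str) -> str:
    """Render the anchors for one grade as rubric text, easiest level first."""
    selected = sorted(
        (a for a in anchors if a.grade == grade), key=lambda a: a.level
    )
    if not selected:
        raise ValueError(f"no anchors available for grade {grade}")
    return "\n".join(f"Level {a.level}: {a.example}" for a in selected)
```

The rendered text would be included in the prompt so the model compares a new item against concrete items from the same grade rather than judging it in isolation.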

3. Bias correction

LLMs exhibit variance collapse (estimates cluster in a narrow range) and systematic overconfidence. We apply corrections derived from our AIED 2026 research, which covered 200 experimental conditions.
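
To illustrate what correcting variance collapse can look like, the sketch below stretches a batch of estimates away from their mean and clamps them to a fixed range. This is a generic linear rescaling chosen for illustration, not the correction from the paper: the function name `expand_variance`, the `stretch` factor, and the 0-to-1 difficulty scale are all assumptions.

```python
from statistics import fmean


def expand_variance(
    estimates: list[float],
    stretch: float = 1.5,  # hypothetical factor; a real value would be fitted to data
    lo: float = 0.0,
    hi: float = 1.0,
) -> list[float]:
    """Spread estimates around their mean, then clamp them to [lo, hi]."""
    if not estimates:
        return []
    center = fmean(estimates)
    # Each estimate moves away from the batch mean by the stretch factor.
    return [min(hi, max(lo, center + stretch * (e - center))) for e in estimates]
```

With `stretch=2.0`, estimates of 0.4, 0.5, and 0.6 move to roughly 0.3, 0.5, and 0.7: the ordering of items is preserved while the spread doubles.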

Open Items → | AIED 2026 Paper →