Benchmark Methodologies
How it works?
First understand some important terminologies:
The Generators
Teams working on building content generators.
The Evaluator
InceptBench serves as the standard evaluator for all generated content.
BenchMark Suite
A predefined list of content generation requests.
Configure Benchmark Suite → Generators run the requests → InceptBench evaluates generated content → Benchmark scores are produced
In order to run the Benchmark, each generator team should release an API endpoint adhering to the below specified I/O interface, that can generate content as per the request.

Benchmark Sets
Loading benchmark sets...
More sets coming soon
BENCHMARK YOUR GENERATOR
Want to benchmark your generator against InceptBench?
Test your educational content generator's quality and alignment with our pedagogical standards.
What you'll need
Your generator API endpoint following our interface specification
View Interface Specification