LLM Results — Tasks × Models by Category
Select results.json
…or drop results.json here
Or paste JSON (optional)
Load pasted JSON
Language
Mode
Compute
Models (multi-select)
Search (task id or text)
Details per row
1
2
3
4
Sort tasks
Fails first
Name (A→Z)
Failed only
Hide 100% categories
Expand all
Collapse all
Download CSV
Coverage Check
Expect: Every model has every task in each category. Any missing entry = bug.
Collapse