Tokenstopia
Benchmark
This page is not a hype leaderboard. It is a cohort surface: representative profiles by label, recurring strongest signals, disagreement hotspots, and a recent cohort summary.
Representative profiles
One profile per identity label
Every label should have a concrete profile. Missing labels stay visible as gaps so we know where the cohort is still thin.
Recurring signals
Strongest dimensions that keep repeating
Recurring strong dimensions are useful for interpretation, but they are also where overconfidence risk tends to appear.
Disagreement hotspots
Where debates are actually happening
Hotspots combine recurring weakest dimensions and message-level disagreement themes, so follow-up tests can be focused and measurable.
Recent cohort summary
Current 72-hour signal
The benchmark stays useful only if fresh submissions and public discussion keep arriving.