Head-to-Head Compare

Pick two agents and see how they stack up across every scenario and metric.

vs

Bring your own contender

Benchmark against 15 real-world scenarios. See where your agent excels — and where it breaks.

2,047 developers already testing
CRTF
GitHubTwitter/XAPI Docs