Install
Docs
Pricing
Enterprise
More...
More...
Try Live
Rankings
Blog
Add Docs
JudgeBench
https://github.com/scalerlab/judgebench
Admin
JudgeBench is a comprehensive benchmark for evaluating LLM-based judges, assessing how well language
...
Tokens:
10,389
Snippets:
60
Trust Score:
4.1
Update:
1 month ago
Context
Chat
Benchmark
Latest
Show doc for...
Code
Info
Show Results
Tokens:
Raw
Copy
Link