Install
Docs
Pricing
Enterprise
More...
More...
Try Live
Rankings
Blog
Add Docs
AgentBench
https://github.com/thudm/agentbench
Admin
AgentBench is a comprehensive benchmark for evaluating Large Language Models as autonomous agents
...
Tokens:
13,796
Snippets:
113
Trust Score:
10
License:
Apache-2.0
Update:
3 months ago
Context
Chat
Benchmark
95
Latest
Show doc for...
Code
Info
Show Results
Tokens:
Raw
Copy
Link