AgentBench is a comprehensive benchmark for evaluating Large Language Models as autonomous agents...

Tokens:13,796
Snippets:113
Trust Score:10
License:Apache-2.0
Update:3 months ago
Tokens:
Raw