AssistantBench is a benchmark dataset and evaluation suite for assessing AI agents' ability to solve...

Tokens:4,689
Snippets:42
Trust Score:5.9
Update:2 months ago
Tokens:
Raw