Arena-Hard-Auto is an automatic evaluation tool for instruction-tuned LLMs, designed to correlate...

Tokens:8,188
Snippets:22
Trust Score:7.8
License:Apache-2.0
Update:2 weeks ago
Tokens:
Raw