ORQA is a new benchmark designed to assess the reasoning capabilities of Large Language Models...

Tokens:476
Snippets:6
Trust Score:6.1
Update:2 weeks ago
Tokens:
Raw