BigO(Bench) is a benchmark of ~300 code problems to be solved in Python, that evaluates whether LLMs...

Tokens:30,932
Snippets:305
Trust Score:9.5
Update:6 days ago
Tokens:
Raw