CRUXEval is a benchmark of 800 Python functions and input-output pairs for evaluating code...

Tokens:3,024
Snippets:49
Trust Score:9.5
License:MIT
Update:1 month ago
Tokens:
Raw