Yet Another Applied LLM Benchmark (carlini/yet-another-applied-llm-benchmark) | Context7

https://github.com/carlini/yet-another-applied-llm-benchmark

This benchmark evaluates how well language models perform on real-world tasks encountered by the...

Tokens: 4,759

Snippets: 52

Trust Score: 9.6

Updated: 2 weeks ago
