ConvBench is a multi-turn conversation evaluation benchmark with hierarchical ablation capability...

Tokens:6,760
Snippets:57
Trust Score:3.7
Update:2 months ago
Tokens:
Raw