AirLLM
https://github.com/lyogavin/airllm
AirLLM optimizes inference memory usage so that large language models, such as 70B-parameter models, can run on a single 4GB GPU
...
Tokens: 136,690
Snippets: 1,336
Trust Score: 9.6
License: Apache-2.0
Update: 2 weeks ago
Benchmark: 89.33