Text-to-text Generation Models (LLMs, Llama, GPT, ...)
Collection • 5163 items
Efficient machine learning for any model and hardware: pruning, quantization, compilation, and more.
pruna (https://docs.pruna.ai/en/latest/).