Jędrzej Grabala
jgitsolutions
AI & ML interests
Local Drive Human Overseered System of Agents, LLMs, Langchains & other useful stuff on mid-to-low-end of commercial hardware.
Recent Activity
Reacted to
yongchanghao's
post
with 👀
6 days ago
We just released a paper (NeuZip) that compresses VRAM in a lossless manner to run larger models. This should be particularly useful when VRAM is insufficient during training/inference. Specifically, we look inside each floating number and find that the exponents are highly compressible (as shown in the figure below).
Read more about the work at https://huggingface.co/papers/2410.20650
Reacted to
yongchanghao's
post
with 🔥
6 days ago
We just released a paper (NeuZip) that compresses VRAM in a lossless manner to run larger models. This should be particularly useful when VRAM is insufficient during training/inference. Specifically, we look inside each floating number and find that the exponents are highly compressible (as shown in the figure below).
Read more about the work at https://huggingface.co/papers/2410.20650
Organizations
Collections
5
models
None public yet
datasets
None public yet