Edit model card

TinyLlama-1.1B-2T-exl2

EXL2 quants of TinyLlama/TinyLlama-1.1B-intermediate-step-955k-token-2T intended for use in speculative decoding.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference API
Unable to determine this model's library. Check the docs .

Collection including royallab/TinyLlama-1.1B-2T-exl2