llamafile
English
GGUF

Commit History

Add TinyLlama-1.1B-Chat-v1.0.Q4_0.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
c2ab9a9

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q3_K_S.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
19ec640

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q3_K_M.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
ae088eb

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q3_K_L.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
de943b0

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q2_K.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
4324a07

jartine commited on

Add README for quantized weights
da72cd3

jartine commited on

Add README for quantized weights
1cc7774

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
18e69e7

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
25ae9a9

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
1757add

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
a09f5ff

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
623bf7e

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
734d0b9

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
1ca064a

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
4bf145d

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
32b626c

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
bc8f9a6

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
4df03a3

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
bb45d9d

jartine commited on

initial commit
1abeb5b

jartine commited on