llamafile
English
GGUF

Commit History

Quantize TinyLlama-1.1B-Chat-v1.0 with llamafile-0.7.3 F32
a26b598
verified

jartine commited on

Quantize TinyLlama-1.1B-Chat-v1.0 with llamafile-0.7 BF16
679b4d6
verified

jartine commited on

Quantize TinyLlama-1.1B-Chat-v1.0 with llamafile-0.7 Q5_K
9b89ed8
verified

jartine commited on

Quantize TinyLlama-1.1B-Chat-v1.0 with llamafile-0.7 Q4_K
9fa0931
verified

jartine commited on

Quantize TinyLlama-1.1B-Chat-v1.0 with llamafile-0.7 Q3_K
7a6f2d0
verified

jartine commited on

Quantize TinyLlama-1.1B-Chat-v1.0 with llamafile-0.7 Q5_1
ecadf61
verified

jartine commited on

Quantize TinyLlama-1.1B-Chat-v1.0 with llamafile-0.7 Q4_1
9ab6669
verified

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q5_0.gguf to repo using llamafile a6d041a3b59582d2a43c5837cf170cccaa511180
783ab7e

jartine commited on

Upload TinyLlama-1.1B-Chat-v1.0.f16.gguf with huggingface_hub
271bc1e

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.f16.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
3cb317f

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q8_0.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
e7b8a51

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q6_K.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
d8f2ce6

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q5_K_S.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
0b0ec4f

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q5_K_M.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
4264cca

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q4_K_S.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
a3803f0

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q4_K_M.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
88ac6ed

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q4_0.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
c2ab9a9

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q3_K_S.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
19ec640

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q3_K_M.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
ae088eb

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q3_K_L.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
de943b0

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q2_K.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
4324a07

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
18e69e7

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
25ae9a9

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
1757add

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
a09f5ff

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
623bf7e

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
734d0b9

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
1ca064a

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
4bf145d

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
32b626c

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
bc8f9a6

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
4df03a3

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
bb45d9d

jartine commited on

initial commit
1abeb5b

jartine commited on