<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type:  -->
<!-- ### tags: nicoboss -->
Weighted/imatrix quants of https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v0.5