mridul3301
/

mistral-7b-finetuned-gguf

Inference Endpoints

Model card Files Files and versions Community

Edit model card

GGUF models of the following model : https://huggingface.co/mridul3301/BioMistral-7B-finetuned

3 format of quantization:

fp8
fp16
fp32

Converted the safetensors to GGUF for inference in CPU using llama_cpp

Downloads last month: 11

GGUF

Model size

7.24B params

Architecture

llama

Inference API

Unable to determine this model's library. Check the docs .