
Do I require config.json to run in OLLAMA with FP16.gguf? If so, where can I find it?

#1 by NeevrajKB - opened

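For context, this is the kind of setup I mean: as far as I understand, Ollama doesn't read config.json at all — it builds a model from a Modelfile that points at the GGUF. A minimal sketch (assuming the weights file is named FP16.gguf and sits in the same directory):

```
# Modelfile -- Ollama reads this instead of config.json
FROM ./FP16.gguf
```

Then something like `ollama create mymodel -f Modelfile` followed by `ollama run mymodel` (where "mymodel" is just a placeholder name) should load it.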
BTW, how much accuracy loss does FP16 have compared to the original non-GGUF model with the same specs?
Also, any idea which model gives the best speed:accuracy:size trade-off on modest hardware?
I'm trying Ollama because of errors with Hugging Face and Unsloth. If I hit errors in Ollama, can I ask you guys about them here?
Also, how can I do RAG with this in Python?
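To be concrete, here's the kind of minimal RAG loop I have in mind (a toy sketch: bag-of-words cosine similarity stands in for real embeddings, and the model name "mymodel" plus the generation call are placeholders, not from this repo):

```python
# Toy RAG sketch: retrieve the most similar docs, stuff them into a
# prompt, then send that prompt to a local model for generation.
import math
import re
from collections import Counter

def vectorize(text):
    # Lowercase word counts as a crude document vector.
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    num = sum(a[t] * b[t] for t in set(a) & set(b))
    den = math.sqrt(sum(v * v for v in a.values())) * \
          math.sqrt(sum(v * v for v in b.values()))
    return num / den if den else 0.0

def retrieve(query, docs, k=2):
    # Rank documents by similarity to the query and keep the top k.
    qv = vectorize(query)
    return sorted(docs, key=lambda d: cosine(qv, vectorize(d)), reverse=True)[:k]

def build_prompt(query, docs):
    # Concatenate retrieved context ahead of the question.
    context = "\n".join(retrieve(query, docs))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

docs = [
    "GGUF is a single-file format for LLM weights.",
    "Ollama builds models from a Modelfile.",
    "Paris is the capital of France.",
]
prompt = build_prompt("What is GGUF?", docs)
# Generation would then be one call to the local model, e.g. with the
# ollama Python package: ollama.generate(model="mymodel", prompt=prompt)
```

For anything real I'd swap the bag-of-words step for proper embeddings, but is this roughly the right shape?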
Thanks!
