Full-text search
1,000+ results
FantasiaFoundry / GGUF-Quantization-Script (README.md, model, 10 matches)
tags: gguf, quantized, text-generation-inference, text-generation, license:cc-by-nc-4.0, region:us
This version converts the model to a BF16 GGUF first, to hopefully generate lossless (or as close to lossless as currently possible) Llama-3 model quantizations and avoid the recently discussed issues on that topic. It is more resource-intensive and generates more writes on the drive, since there is a whole additional conversion step that the previous version does not perform. This should only be necessary until there is GPU support for running BF16 directly, without conversion.
Pull Requests with your own features and improvements to this script are always welcome.
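The two-step workflow described above can be sketched with llama.cpp's conversion and quantization tools. This is a minimal sketch, not the script itself: the model directory, output filenames, and the Q8_0 target here are illustrative, and flags follow recent llama.cpp versions.

```shell
# Step 1: convert the Hugging Face model to a BF16 GGUF.
# This is the extra conversion step that makes the workflow more
# resource-intensive and produces additional drive writes.
python convert_hf_to_gguf.py ./Llama-3-model \
    --outtype bf16 \
    --outfile ./Llama-3-model-BF16.gguf

# Step 2: quantize from the BF16 GGUF rather than an FP16 intermediate,
# staying as close to lossless as currently possible.
./llama-quantize ./Llama-3-model-BF16.gguf ./Llama-3-model-Q8_0.gguf Q8_0
```

Once GPU backends can run BF16 directly, step 1's output could be used as-is and the quantization step skipped where full precision is wanted.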
gguf / Genstruct-7B-GGUF (model, 2 matches)
Felladrin / gguf-Llama-160M-Chat-v1 (README.md, model, 3 matches)
Felladrin / gguf-Tinyllama-616M-Cinder (README.md, model, 3 matches)
Felladrin / gguf-Qwen1.5-0.5B-Chat (README.md, model, 3 matches)
Felladrin / gguf-sharded-gemma-2b-orpo (README.md, model, 3 matches)
Felladrin / gguf-gemma-2b-orpo (README.md, model, 3 matches)
Cebtenzzre / gguf-misc (model, 2 matches)
npvinHnivqn / GGUF-metamath-llemma (README.md, model, 3 matches)
npvinHnivqn / GGUF-openchat (README.md, model, 3 matches)
huodon / gguf-models (model, 2 matches)
Felladrin / gguf-TinyMistral-248M-SFT-v4 (README.md, model, 3 matches)
Felladrin / gguf-llama-160m (README.md, model, 3 matches)
hesha / gguf-collection (model, 2 matches)
IndiaBuild / GGUF_Navarna_v0_1_OpenHermes_Hindi (README.md, model, 3 matches)
gguf / cosmo-1b-GGUF (model, 2 matches)
gguf / Smaug-34B-v0.1-GGUF (model, 2 matches)
gguf / Mistral-7B-Instruct-v0.2-GGUF (model, 2 matches)
gguf / Nous-Hermes-2-Mistral-7B-DPO-GGUF (model, 2 matches)