pantheon-ui-lfm25-emoji-merged (GGUF)

GGUF quantized versions of shreyask/pantheon-ui-lfm25-emoji-merged for use with llama.cpp, Ollama, and LM Studio.

Model Details

  • Architecture: Lfm2ForCausalLM (Liquid Foundation Model 2)
  • Base Model: LiquidAI/LFM2.5-1.2B-Thinking
  • Parameters: ~1.2B
  • Context Length: 128,000 tokens
  • Vocab Size: 65,536

Available Quantizations

File                                          Quant    Size      BPW     Description
pantheon-ui-lfm25-emoji-merged-Q4_K_M.gguf    Q4_K_M   ~697 MB   4.98    Recommended: best balance of size and quality
pantheon-ui-lfm25-emoji-merged-Q8_0.gguf      Q8_0     ~1.2 GB   8.50    Near-lossless, best fidelity
pantheon-ui-lfm25-emoji-merged.f16.gguf       F16      ~2.2 GB   16.00   Full precision, for re-quantization
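
The BPW (bits per weight) column ties file size to parameter count: size in bytes is roughly parameters × BPW / 8. The sketch below is illustrative only. The helper name `estimate_mib` is hypothetical, and `PARAMS` is an assumption taken from the rounded "~1.2B" figure above, so the results should be read as rough estimates rather than exact file sizes.

```shell
# Rough size estimate: bytes ~= parameters * bits-per-weight / 8.
# estimate_mib is a hypothetical helper, not part of any tool named here.
estimate_mib() {
  # $1 = parameter count, $2 = bits per weight; prints whole MiB
  awk -v p="$1" -v bpw="$2" 'BEGIN { printf "%.0f\n", p * bpw / 8 / 1048576 }'
}

# Assumption: the rounded "~1.2B" parameter count from Model Details.
PARAMS=1200000000

for bpw in 4.98 8.50 16.00; do
  echo "BPW $bpw: ~$(estimate_mib "$PARAMS" "$bpw") MiB"
done
```

Because the parameter count is rounded, the estimates can differ from the listed sizes by a few percent.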

Usage

Ollama

A Modelfile with thinking support is included in this repo. Download it alongside the GGUF, then:

ollama create pantheon-emoji -f Modelfile
ollama run pantheon-emoji "your prompt"
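
The commands above expect the Modelfile and a GGUF file in the current directory. One possible way to fetch them is sketched below. The helper names `hf_url` and `fetch_model_files` are hypothetical, the file name `Modelfile` is assumed from the `-f Modelfile` argument, and the URL layout assumes the repo's default branch is `main`.

```shell
# Repo name as shown on this page.
REPO="shreyask/pantheon-ui-lfm25-emoji-merged-GGUF"

# hf_url (hypothetical helper): print the direct-download URL for one file.
# Assumes the files live on the "main" branch of the repo.
hf_url() {
  # $1 = file name inside the repo
  printf 'https://huggingface.co/%s/resolve/main/%s\n' "$REPO" "$1"
}

# fetch_model_files (hypothetical helper): download the Modelfile and the
# Q4_K_M quantization into the current directory. Requires curl and network
# access; it is defined here but not called.
fetch_model_files() {
  for f in Modelfile pantheon-ui-lfm25-emoji-merged-Q4_K_M.gguf; do
    curl -L -O "$(hf_url "$f")" || return 1
  done
}

# Show the URL that would be used for the Modelfile.
hf_url Modelfile
```

Running `fetch_model_files` in an empty directory and then `ollama create pantheon-emoji -f Modelfile` should reproduce the setup described here, provided the Modelfile's FROM line points at the downloaded quantization.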

To see the model's chain-of-thought reasoning, use the --think flag:

ollama run pantheon-emoji --think "your prompt"

llama.cpp

llama-cli -m pantheon-ui-lfm25-emoji-merged-Q4_K_M.gguf -p "Hello" -cnv

LM Studio

Download the GGUF file and load it directly in LM Studio.
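
If a downloaded file fails to load, a quick sanity check is to confirm that it begins with the four ASCII bytes `GGUF`, which is the magic number of the GGUF format. A truncated download or a saved error page will not pass. This is a minimal sketch; `is_gguf` is a hypothetical helper name, and the check says nothing about whether the rest of the file is intact.

```shell
# is_gguf (hypothetical helper): succeed if the file starts with "GGUF".
is_gguf() {
  # $1 = path to the file to check
  [ "$(head -c 4 "$1" 2>/dev/null)" = "GGUF" ]
}

MODEL="pantheon-ui-lfm25-emoji-merged-Q4_K_M.gguf"

# Only report when the file is actually present in the current directory.
if [ -f "$MODEL" ]; then
  if is_gguf "$MODEL"; then
    echo "$MODEL: GGUF header found"
  else
    echo "$MODEL: not a GGUF file, try downloading again"
  fi
fi
```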

Conversion Details

Converted from the original safetensors weights using llama.cpp's convert_hf_to_gguf.py.
