DAN™ (dranger003)
AI & ML interests: None yet
Organizations: None yet

dranger003's activity
Update README.md with license information · #1 opened 6 days ago by Chen-01AI
Update README.md with license information · #2 opened 8 days ago by Chen-01AI
How to enable streaming for the Phi-3 vision model? (6 replies) · #15 opened about 1 month ago by bhimrazy
I'm generating an imatrix using `groups_merged.txt` if you want me to run any tests? (19 replies) · #15 opened 3 months ago by jukofyork
Is the KV cache of these models unusually high? (1 reply) · #6 opened about 1 month ago by Hugsanir
How about a quantized version that fits in 16 GB of memory, like WizardLM? (3 replies) · #19 opened about 2 months ago by Zibri
Update chat templates (2 replies) · #5 opened 3 months ago by CISCai
Will you redo quants after your BPE PR gets merged? (2 replies) · #18 opened 2 months ago by ggnoy
Can't use llama.cpp to load GGUF model (2 replies) · #6 opened 2 months ago by Tianyi000
35B-beta is released (4 replies) · #3 opened 2 months ago by tastypear
Update chat templates (6 replies) · #17 opened 3 months ago by CISCai
ollama failed to create model (3 replies) · #3 opened 3 months ago by edisonzf2020
Can't merge files with GGUF (7 replies) · #16 opened 3 months ago by zedmango
Is it possible to use this model with LM Studio? (2 replies) · #1 opened 3 months ago by michabbb
Can we get a Q4 without the iMat? (2 replies) · #14 opened 3 months ago by yehiaserag
Reuse your `ggml-dbrx-instruct-16x12b-q8_0-imatrix.dat` file? (20 replies) · #1 opened 3 months ago by jukofyork
Prompt eval too slow (2 replies) · #4 opened 3 months ago by lfjmgs
Very sensitive to any repetition penalty! (2 replies) · #2 opened 3 months ago by jukofyork
Can you share the size & perplexity tables? Thanks (1 reply) · #3 opened 3 months ago by habout632
Garbled output in llama.cpp (2 replies) · #13 opened 3 months ago by spanielrassler
Fails on 104b-iq2_xxs.gguf with llama.cpp (4 replies) · #12 opened 3 months ago by telehan
PR #5796 is merged (1 reply) · #1 opened 3 months ago by Joseph717171
Invalid split files? (3 replies) · #11 opened 3 months ago by SabinStargem
Unable to load in ollama built from PR branch (3 replies) · #10 opened 3 months ago by gigq
What does iMat mean? (15 replies) · #2 opened 3 months ago by AS1200
Is IQ1_S broken? If so, why list it here? (1 reply) · #9 opened 3 months ago by stduhpf
Fast work by the people on the llama.cpp team (3 replies) · #8 opened 3 months ago by qaraleza
Add model sizes (1 reply) · #5 opened 3 months ago by nanoflooder
For a context of at least 32K tokens, which version on a 2x16GB GPU config? (1 reply) · #3 opened 3 months ago by Kalemnor
IQ3_XXS request (2 replies) · #1 opened 3 months ago by yamikumods
Support by llama-cpp-python? (7 replies) · #2 opened 3 months ago by madhucharan
5 quants? (5 replies) · #1 opened 4 months ago by Orenguteng
Bigger quants (1 reply) · #1 opened 3 months ago by WeirdObs
Thanks for your quants! (9 replies) · #2 opened 3 months ago by Cran-May
About Q4_K and Q5_K (1 reply) · #2 opened 3 months ago by stduhpf
How did you convert it? (3 replies) · #2 opened 3 months ago by froggeric
Can't download via text-generation-webui (1 reply) · #2 opened 3 months ago by AS1200
The 2-bit compression may still face some performance limitations (2 replies) · #1 opened 4 months ago by DesperateZero
Cannot load model due to invalid format (2 replies) · #1 opened 4 months ago by ABX-AI
More quant types (5 replies) · #2 opened 4 months ago by Wubbbi
Add quants for Q5 (1 reply) · #2 opened 4 months ago by dzupin
New and improved Q1_S quants (2 replies) · #1 opened 4 months ago by LapinMalin
imatrix problem (3 replies) · #1 opened 4 months ago by DataSoul
Corrupt download or bad file? (2 replies) · #1 opened 4 months ago by Terminus-26
Token overrides (added_tokens_decoder) (2 replies) · #1 opened 4 months ago by dranger003
What is going on with this model? (1 reply) · #1 opened 4 months ago by MrVolk
Tokenizer issues? (2 replies) · #3 opened 4 months ago by xhyi
Could you please provide GGUF files? :) (2 replies) · #1 opened 4 months ago by Venkman42
How did you make these quants? (5 replies) · #1 opened 4 months ago by rombodawg
Q4_K_S version, please (1 reply) · #2 opened 5 months ago by Hoioi
A few interesting models (5 replies) · #1 opened 4 months ago by KnutJaegersberg
Quantisation parameters + Q5_K_M version? (2 replies) · #1 opened 4 months ago by smcleod
Any chance of providing an iMatrix? (2 replies) · #2 opened 4 months ago by smcleod
Slow prompt processing (2 replies) · #2 opened 5 months ago by OrangeApples
A request for quantization (3 replies) · #1 opened 5 months ago by Kotokin
iMatrix, IQ2_XS & IQ2_XXS (13 replies) · #2 opened 5 months ago by Nexesenex
A request for quantization (1 reply) · #1 opened 5 months ago by Kotokin