Full name
TobDeBer
AI & ML interests
Diffusion, Causality, LLM, LMM (Large Music Model), Quantization, AI Context Databases
Recent Activity
updated
a model
about 6 hours ago
TobDeBer/SmartQuant
published
a model
2 days ago
TobDeBer/SmartQuant
new activity
3 days ago
microsoft/bitnet-b1.58-2B-4T-gguf:TQ1 quant version
Organizations
None yet
TobDeBer's activity
TQ1 quant version
3
#7 opened 3 days ago
by
TobDeBer
Performance
11
#1 opened 28 days ago
by
robb-0

Which quantized version can run on a Mac computer with 32GB of memory?
4
#2 opened 14 days ago
by
jimpunk
DOA
1
15
#1 opened 19 days ago
by
MrDevolver

Is the 2.51bit model using imatrix?
7
#3 opened about 1 month ago
by
daweiba12
Dynamic bnb-4bit
1
2
#1 opened about 2 months ago
by
iqdddd
RTX 5090 with 600GB of RAM what models?
4
#40 opened about 2 months ago
by
frank-mx
Accuracy of the dynamic quants compared to usual quants?
19
#21 opened 3 months ago
by
inputout

Saving to q5_k_m GGUF
4
#1 opened 2 months ago
by
sasha1234567
8bits quantization
5
#20 opened 3 months ago
by
ramkumarkoppu
Is there a model removing non-shared MoE experts?
4
#17 opened 3 months ago
by
ghostplant
Over 2 tok/sec agg backed by NVMe SSD on 96GB RAM + 24GB VRAM AM5 rig with llama.cpp
4
9
#13 opened 3 months ago
by
ubergarm
Quantizer Tool
2
#14 opened 3 months ago
by
TobDeBer
Where did the BF16 come from?
8
#10 opened 3 months ago
by
gshpychka
vram+ram
4
#7 opened 6 months ago
by
sdyy
11b instruct gguf?
2
3
#1 opened 7 months ago
by
celsowm

torch and llama.cpp integration
3
#1 opened 7 months ago
by
TobDeBer
Fine control for Turbo and Lightning models
1
#1 opened 8 months ago
by
TobDeBer
RuntimeError: cutlassF: no kernel found to launch!
12
#11 opened about 1 year ago
by
mayonaisu