8 1 115

Neo Dim

NeoDim

AI & ML interests

None yet

Recent Activity

liked a model 13 days ago

bartowski/mistralai_Mistral-Small-3.1-24B-Instruct-2503-GGUF

liked a model 19 days ago

NousResearch/DeepHermes-3-Mistral-24B-Preview

liked a model 19 days ago

bartowski/NousResearch_DeepHermes-3-Mistral-24B-Preview-GGUF

View all activity

Organizations

None yet

NeoDim's activity

New activity in bartowski/starchat2-15b-v0.1-GGUF about 1 year ago

What is the prompt format?

#1 opened about 1 year ago by

siddhesh22

New activity in NeoDim/starcoder-GGML almost 2 years ago

how did you convert `transformers.PreTrainedTokenizer` to ggml format?

#2 opened almost 2 years ago by

keunwoochoi

New activity in NeoDim/starchat-alpha-GGML almost 2 years ago

demo space

#4 opened almost 2 years ago by

matthoffner

Looks like the starchat-alpha-ggml-q4_1.bin is broken

#3 opened almost 2 years ago by

xhyi

New activity in NeoDim/starcoderbase-GGML almost 2 years ago

missing tok_embeddings.weight error when trying to run with llama.cpp

#1 opened almost 2 years ago by

ultra2mh

New activity in NeoDim/starcoder-GGML almost 2 years ago

Cannot run on llama.cpp and koboldcpp

#1 opened almost 2 years ago by

FenixInDarkSolo

New activity in NeoDim/starchat-alpha-GGML almost 2 years ago

Which inference repo is this quantized for?

#2 opened almost 2 years ago by

xhyi

Can the quantized model be loaded in gpu to have faster inference ?

#1 opened almost 2 years ago by

MohamedRashad

Can the quantized model be loaded in gpu to have faster inference ?

#1 opened almost 2 years ago by

MohamedRashad

New activity in NeoDim/starcoder-GGML almost 2 years ago

Cannot run on llama.cpp and koboldcpp

#1 opened almost 2 years ago by

FenixInDarkSolo

New activity in NeoDim/starchat-alpha-GGML almost 2 years ago

Can the quantized model be loaded in gpu to have faster inference ?

#1 opened almost 2 years ago by

MohamedRashad