Ben Li

bash99

AI & ML interests

AIGC, Stable Diffusion, ChatGPT

Recent Activity

upvoted a collection 9 days ago

liked a model 17 days ago

ModelCloud/QwQ-32B-gptqmodel-4bit-vortex-v1

liked a model 24 days ago

ds4sd/SmolDocling-256M-preview

Organizations

None yet

bash99's activity

New activity in HuggingFaceH4/ultrachat_200k 24 days ago

Does it contain languages other than English?

#8 opened 24 days ago by

New activity in ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v2 30 days ago

Will you convert DeepSeek-R1-Distill-Qwen-32B?

#2 opened 30 days ago by

New activity in allenai/olmOCR-7B-0225-preview about 2 months ago

Some edge cases are not handled well

#2 opened about 2 months ago by

New activity in rhymes-ai/Aria 6 months ago

llama.cpp support

#1 opened 6 months ago by

New activity in Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int4 7 months ago

Can anyone run this on more than one GPU using vLLM or another engine that supports dynamic batching?

#1 opened 7 months ago by

New activity in Alibaba-NLP/gte-multilingual-base 8 months ago

How do I get token_weights from ONNX inference?

#9 opened 8 months ago by

New activity in Alibaba-NLP/gte-multilingual-reranker-base 8 months ago

Reranking of rhetorical questions seems to perform poorly

#5 opened 8 months ago by

New activity in Alibaba-NLP/gte-multilingual-base 8 months ago

Matching and ranking can be wrong in some special cases

#5 opened 8 months ago by

New activity in OpenBuddy/openbuddy-llama2-13b-v8.1-fp16 over 1 year ago

It would be best to provide an instruction template and examples; also, is the underlying model llama2-base or llama2-chat?

#3 opened over 1 year ago by

New activity in jondurbin/airoboros-13b-gpt4-1.4 almost 2 years ago

4 bit GPTQ

#1 opened almost 2 years ago by

New activity in IDEA-CCNL/Ziya-LLaMA-13B-v1 almost 2 years ago

Could you provide the sha256sum of the three converted PyTorch bin files?

#30 opened almost 2 years ago by

New activity in THUDM/chatglm2-6b-int4 almost 2 years ago

If this was quantized with bitsandbytes NF4, can training be continued directly on top of it with QLoRA?

#1 opened almost 2 years ago by

New activity in coyude/Nous-Hermes-13b-Chinese-plus-GPTQ almost 2 years ago

What is the difference between this Plus version and the one without Plus?

#1 opened almost 2 years ago by

New activity in TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ almost 2 years ago

Gibberish on 'latest', with recent qwopqwop GPTQ/triton and ooba?

#2 opened almost 2 years ago by

New activity in thatname/Ziya-LLaMA-13B-v1-ggml almost 2 years ago

Shell script to convert Ziya to ggml

#1 opened almost 2 years ago by

New activity in anon8231489123/vicuna-13b-GPTQ-4bit-128g about 2 years ago

VRAM usage

#3 opened about 2 years ago by