Zeze Nene

Neman

AI & ML interests

LLM, evolutionary programming, AI

Recent Activity

liked a model 3 days ago

ai-forever/MoVQGAN

liked a model 5 days ago

QuantFactory/orpheus-3b-0.1-ft-GGUF

liked a model 6 days ago

canopylabs/orpheus-3b-0.1-pretrained

Organizations

None yet

Neman's activity

New activity in EuroBERT/EuroBERT-2.1B 27 days ago

Only 9 European languages?

#4 opened 27 days ago by

New activity in google/siglip2-large-patch16-512 about 1 month ago

Problem with demo code using pipeline

#2 opened about 1 month ago by

New activity in unsloth/DeepSeek-R1-Distill-Qwen-14B-GGUF 2 months ago

unknown pre-tokenizer type: 'deepseek-r1-qwen'

#1 opened 3 months ago by

New activity in unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF 2 months ago

unknown pre-tokenizer type: 'deepseek-r1-qwen'

#1 opened 3 months ago by

New activity in srinivasbilla/llasa-3b 2 months ago

safetensors size

#1 opened 3 months ago by

New activity in iiiorg/piiranha-v1-detect-personal-information 7 months ago

Phone number format

#4 opened 7 months ago by

New activity in google/gemma-2-9b-it 7 months ago

Update?

#44 opened 7 months ago by

New activity in OpenGVLab/Mini-InternVL-Chat-4B-V1-5 10 months ago

Flash Attention

#3 opened 10 months ago by

New activity in OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-7B 10 months ago

What ViT?

#2 opened 11 months ago by

New activity in deepseek-ai/deepseek-vl-7b-chat about 1 year ago

4-bit quant?

#3 opened about 1 year ago by

New activity in YaTharThShaRma999/DeepSeek-vl-4bit-7b about 1 year ago

Base or Chat?

#1 opened about 1 year ago by

New activity in ISTA-DASLab/Mixtral-8x7b-AQLM-2Bit-1x16-hf about 1 year ago

NameError: name 'flash_attn_func' is not defined

#4 opened about 1 year ago by

New activity in MMInstruction/Silkie over 1 year ago

'QWenTokenizer' object has no attribute 'IMAGE_ST'

#1 opened over 1 year ago by

New activity in TheBloke/Qwen-14B-Chat-GPTQ over 1 year ago

Will it come?

#2 opened over 1 year ago by

New activity in Qwen/Qwen-VL-Chat-Int4 over 1 year ago

Update of checkpoints?

#1 opened over 1 year ago by

New activity in facebook/hf-seamless-m4t-large over 1 year ago

ImportError: cannot import name 'SeamlessM4TModel' from 'transformers'

#13 opened over 1 year ago by

New activity in adept/fuyu-8b over 1 year ago

Question What are the results for image captioning for fuyu-8b in comparison to other models?

#8 opened over 1 year ago by

What are the memory requirements for running the model?

#6 opened over 1 year ago by

New activity in llm-agents/tora-code-7b-v1.0 over 1 year ago

gguf variant?

#1 opened over 1 year ago by
