hai's picture

hai

cloudyu

·

yu-hai-52a1702a

AI & ML interests

Personal contributor

Recent Activity

new activity about 2 months ago

Qwen/QwQ-32B:It's challenging for QwQ to generate long codes...

updated a model 3 months ago

cloudyu/S1-Llama-3.2-3Bx4-MoE

published a model 3 months ago

cloudyu/S1-Llama-3.2-3Bx4-MoE

View all activity

Organizations

cloudyu's activity

New activity in Qwen/QwQ-32B about 2 months ago

It's challenging for QwQ to generate long codes...

#38 opened about 2 months ago by

New activity in unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF 3 months ago

error when to try this gguf

#3 opened 3 months ago by

New activity in unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF 3 months ago

unknown pre-tokenizer type: 'deepseek-r1-qwen'

#1 opened 3 months ago by

New activity in cloudyu/Mixtral_11Bx2_MoE_19B 6 months ago

Adding Evaluation Results

#3 opened about 1 year ago by

leaderboard-pr-bot

New activity in nvidia/Llama-3.1-Nemotron-70B-Instruct-HF 6 months ago

Templete Prompt

#20 opened 6 months ago by

there are 3 "r"s in the playful "strawrberry"?

#6 opened 6 months ago by

New activity in cloudyu/Mixtral_34Bx2_MoE_60B 7 months ago

Adding Evaluation Results

#16 opened 7 months ago by

leaderboard-pr-bot

New activity in mistralai/Mistral-Large-Instruct-2407 7 months ago

不知道下载哪些内容

#18 opened 9 months ago by

New activity in cloudyu/Mixtral-8x7B-Instruct-v0.1-DPO 8 months ago

Adding Evaluation Results

#1 opened about 1 year ago by

leaderboard-pr-bot

New activity in cloudyu/Mixtral_7Bx2_MoE 8 months ago

Adding Evaluation Results

#7 opened about 1 year ago by

leaderboard-pr-bot

New activity in cloudyu/Mixtral_7Bx4_MOE_24B 8 months ago

Adding Evaluation Results

#3 opened about 1 year ago by

leaderboard-pr-bot

New activity in cloudyu/Mixtral_11Bx2_MoE_19B 8 months ago

Adding Evaluation Results

#4 opened about 1 year ago by

New activity in cloudyu/Yi-34Bx2-MoE-60B-DPO 9 months ago

Adding Evaluation Results

#5 opened 9 months ago by

leaderboard-pr-bot

New activity in cloudyu/Llama-3-70Bx2-MOE 9 months ago

Adding Evaluation Results

#1 opened 9 months ago by

leaderboard-pr-bot

New activity in mistralai/Mistral-Nemo-Instruct-2407 9 months ago

mistral-chat doesn't work

#12 opened 9 months ago by

New activity in Kwai-Kolors/Kolors 10 months ago

加油老铁们

#2 opened 10 months ago by

New activity in mlx-community/gemma-2-27b-it-8bit 10 months ago

example code doesn't work at all

#2 opened 10 months ago by

New activity in bartowski/gemma-2-27b-it-GGUF 10 months ago

llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'gemma2'

#1 opened 10 months ago by

New activity in cloudyu/Yi-34Bx2-MoE-60B-DPO 10 months ago

Update README.md with license information

#4 opened 10 months ago by

New activity in cloudyu/Mixtral_34Bx2_MoE_60B 10 months ago

Update README.md with license information

#15 opened 10 months ago by