hai

cloudyu

AI & ML interests

Personal contributor m2 ultra 192G QQ 206 887 187

Organizations

cloudyu's activity

New activity in cloudyu/Meta-Llama-3-70B-Instruct-DPO 13 days ago

good model

1
#1 opened 13 days ago by gopi87
New activity in mlx-community/c4ai-command-r-plus-4bit about 1 month ago

output is not correct.

1
#7 opened about 1 month ago by flymonk
New activity in mlx-community/c4ai-command-r-plus-4bit about 2 months ago
New activity in CohereForAI/c4ai-command-r-plus about 2 months ago

MMLU is only 25.64, anything wrong?

5
#8 opened about 2 months ago by cloudyu
New activity in wolfram/miquliz-120b-v2.0 about 2 months ago

VRAM Estimates

5
#3 opened 3 months ago by ernestr
New activity in cloudyu/Mixtral_34Bx2_MoE_60B about 2 months ago
New activity in cloudyu/Mixtral_11Bx2_MoE_19B 2 months ago

Hardware requirement

2
#5 opened 2 months ago by Dtree07

Adding Evaluation Results

1
#4 opened 2 months ago by ac-automata
New activity in cloudyu/Yi-34Bx2-MoE-60B 2 months ago

4x version

1
#15 opened 2 months ago by ehartford
New activity in cloudyu/mistral_pretrain_demo 3 months ago

Very interesting

1
#1 opened 3 months ago by ehartford
New activity in LayerDiffusion/layerdiffusion-v1 3 months ago

how to run this model?

3
#1 opened 3 months ago by cloudyu
New activity in yunconglong/MoE_13B_DPO 3 months ago
New activity in cloudyu/Mixtral_7Bx4_MOE_DPO 3 months ago

Train after merging?

2
#1 opened 3 months ago by adi-kmt

how to run this model

#2 opened 3 months ago by cloudyu

Upload tokenizer.model

1
#1 opened 4 months ago by Nexesenex

Upload tokenizer.model

#2 opened 4 months ago by Nexesenex

fp16

4
#1 opened 4 months ago by Nexesenex
New activity in 152334H/miqu-1-70b-sf 4 months ago
New activity in yunconglong/Mixtral_7Bx2_MoE_13B_DPO 4 months ago

Update README.md

#2 opened 4 months ago by cloudyu

Update README.md

#1 opened 4 months ago by cloudyu
New activity in mlabonne/phixtral-4x2_8 4 months ago
New activity in cloudyu/Mixtral_7Bx4_MOE_24B 4 months ago
New activity in jondurbin/truthy-dpo-v0.1 4 months ago

this is really great dataset

1
#2 opened 4 months ago by cloudyu
New activity in cloudyu/Pluto_13B_DPO 4 months ago
New activity in cloudyu/Yi-34Bx2-MoE-60B 4 months ago

vllm

2
#10 opened 4 months ago by regzhang
New activity in moreh/MoMo-72B-lora-1.8.6-DPO 4 months ago

congrat!new SOTA!

4
#1 opened 4 months ago by cloudyu
New activity in cloudyu/Yi-34Bx2-MoE-60B 4 months ago
New activity in cloudyu/Yi-34Bx2-MoE-60B 4 months ago

Multi-langua?

1
#7 opened 4 months ago by oFDz
New activity in cloudyu/Mixtral_7Bx2_MoE 4 months ago
New activity in cloudyu/Yi-34Bx2-MoE-60B 4 months ago
New activity in cloudyu/Mixtral_34Bx2_MoE_60B 4 months ago

Vram

2
#7 opened 4 months ago by DKRacingFan
New activity in cloudyu/Mixtral_7Bx2_MoE 5 months ago

Problem with tokenizer

1
#4 opened 5 months ago by ipechman

Can You Share Your Config

3
#2 opened 5 months ago by Weyaxi

How to merge models into moe?

2
#1 opened 5 months ago by Yhyu13
New activity in breadlicker45/museRWKV-test 9 months ago

how to run this model?

1
#2 opened 9 months ago by cloudyu