Wenbo Zhang

Wenboz

https://onepounchman.github.io/

AI & ML interests

Causal Inference, Out-of-distribution Robustness, NLP

Recent Activity

updated a dataset 8 days ago

Wenboz/llama3-instruct-reward-logps-ultrafeedback

updated a dataset 8 days ago

Wenboz/mistral-instruct-reward-logps-ultrafeedback

updated a dataset 8 days ago

Wenboz/llama3-base-reward-logps-ultrafeedback

Organizations

None yet

Models (16)

Wenboz/mistral-7b-base-p3o

Updated 15 days ago

Wenboz/zephyr-7b-dpo-full

Text Generation • Updated 20 days ago • 227

Wenboz/zephyr-7b-dpo-lora

Updated Oct 20, 2024

Wenboz/llama3-wpo-lora

Updated Sep 22, 2024

Wenboz/llama3-dpo-lora

Updated Sep 20, 2024

Wenboz/zephyr-7b-wpo-lora

Updated Sep 18, 2024

Wenboz/llama3-dpo-full

Updated Sep 10, 2024

Wenboz/FsfairX-LLaMA3-RM-clone

Updated Sep 2, 2024 • 3

Wenboz/aromarm_clone

Updated Sep 1, 2024 • 6

Wenboz/phi3-offline-dpo-lora-noise-0.0-5e-7-thre-1.5-42

Updated Jul 9, 2024

Datasets (10)

Wenboz/llama3-instruct-reward-logps-ultrafeedback

Viewer • Updated 8 days ago • 61.8k • 11

Wenboz/mistral-instruct-reward-logps-ultrafeedback

Viewer • Updated 8 days ago • 62.7k • 15

Wenboz/llama3-base-reward-logps-ultrafeedback

Viewer • Updated 8 days ago • 63.1k • 29

Wenboz/mistral-base-reward-logps-ultrafeedback

Viewer • Updated 8 days ago • 63.1k • 11

Wenboz/mistral-base-proxy-reward-ultrafeedback

Viewer • Updated 19 days ago • 63.1k • 41

Wenboz/hh_clean_test_messages

Updated Jul 13, 2024 • 2

Wenboz/SELM-Phi-3-mini-4k-instruct-dataset

Viewer • Updated Jul 1, 2024 • 6 • 30

Wenboz/hh_sft_messages

Viewer • Updated Jun 20, 2024 • 48.4k • 30

Wenboz/hh_clean

Viewer • Updated Jun 19, 2024 • 48.4k • 31

Wenboz/hh_sft

Viewer • Updated Jun 18, 2024 • 65.5k • 32