Hugging Face
Wenbo Zhang
Wenboz
https://onepounchman.github.io/
AI & ML interests
Causal Inference, Out-of-distribution Robustness, NLP
Organizations
None yet
models
14
Wenboz/llama3-wpo-lora • Updated 4 days ago
Wenboz/llama3-dpo-lora • Updated 7 days ago • 1
Wenboz/zephyr-7b-wpo-lora • Updated 9 days ago • 2
Wenboz/zephyr-7b-dpo-lora • Updated 17 days ago • 2
Wenboz/llama3-dpo-full • Updated 17 days ago
Wenboz/FsfairX-LLaMA3-RM-clone • Updated 25 days ago • 159
Wenboz/aromarm_clone • Updated 26 days ago • 52
Wenboz/phi3-offline-dpo-lora-noise-0.0-5e-7-thre-1.5-42 • Updated Jul 9
Wenboz/phi3-offline-dpo-lora-noise-0.0-5e-7-42 • Updated Jul 9 • 1
Wenboz/phi3-offline-dpo-lora-noise-0.0-5e-6-42 • Updated Jul 9
datasets
5
Wenboz/hh_clean_test_messages • Updated Jul 13 • 1
Wenboz/SELM-Phi-3-mini-4k-instruct-dataset • Viewer • Updated Jul 1 • 6 • 6
Wenboz/hh_sft_messages • Viewer • Updated Jun 20 • 48.4k • 2
Wenboz/hh_clean • Viewer • Updated Jun 19 • 48.4k • 2
Wenboz/hh_sft • Viewer • Updated Jun 18 • 65.5k • 2