arxiv:2412.13670
Mingzhe Du
Elfsong
AI & ML interests
Code Generation / Preference Alignment / Bias Mitigation
Recent Activity
updated
a model
about 3 hours ago
Elfsong/Phi-4-14B-Instruct-sft
published
a model
about 3 hours ago
Elfsong/Phi-4-14B-Instruct-sft
updated
a model
about 3 hours ago
Elfsong/Llama-3.1-8B-Instruct-sft
Organizations
Papers
2
spaces
5
models
17
Elfsong/Phi-4-14B-Instruct-sft
Text Generation
•
Updated
Elfsong/Llama-3.1-8B-Instruct-sft
Text Generation
•
Updated
•
230
Elfsong/Phi-3.5-4B-instruct-sft
Text Generation
•
Updated
Elfsong/Llama-3.3-70B-Instruct-dpo
Text Generation
•
Updated
•
15
Elfsong/Llama-3.3-70B-Instruct-stf
Text Generation
•
Updated
•
67
Elfsong/Llama-3.1-8B-Instruct-dpo
Text Generation
•
Updated
•
38
Elfsong/mouadsfilter
Text2Text Generation
•
Updated
•
2
Elfsong/dpo
Updated
Elfsong/debias_model
Updated
Elfsong/my_awesome_model
Updated
datasets
66
Elfsong/Venus_KTO
Updated
Elfsong/Venus_SFT
Updated
Elfsong/Venus_DPO
Updated
Elfsong/Venus_t
Viewer
•
Updated
•
1.53k
•
368
Elfsong/Llama-3.3-70B-Instruct-sft-response
Viewer
•
Updated
•
256
•
19
Elfsong/Llama-3.3-70B-Instruct-dpo-response
Viewer
•
Updated
•
256
•
17
Elfsong/Llama-3.3-70B-Instruct-response
Viewer
•
Updated
•
256
•
20
Elfsong/Llama-3.1-8B-Instruct-dpo-response
Viewer
•
Updated
•
256
•
18
Elfsong/Llama-3.1-8B-Instruct-response
Viewer
•
Updated
•
256
•
20
Elfsong/gpt-4o-response
Viewer
•
Updated
•
256
•
20