iqwiki-kor/Qwen2.5-7B-distill-SFT-DPO-beta0.01-Iter1-v2-Self-seed42
Stable Language Model Pre-training by Reducing Embedding Variability Paper • 2409.07787 • Published Sep 12, 2024
Cross-lingual Transfer of Reward Models in Multilingual Alignment Paper • 2410.18027 • Published Oct 23, 2024
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference Paper • 2406.06424 • Published Jun 10, 2024 • 12
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models Paper • 2406.05761 • Published Jun 9, 2024 • 2
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12, 2024 • 64
Can Large Language Models Infer and Disagree Like Humans? Paper • 2305.13788 • Published May 23, 2023