Bram Vanroy's picture

Bram Vanroy PRO

BramVanroy

·

https://bramvanroy.github.io/

AI & ML interests

Artificial intelligence, natural language processing, computational linguistics

Organizations

Posts 11

Post

866

The InstructGPT paper mentions that they insert 10% pretraining data during SFT, which they find improves the effect of PPO (IIUC). Has anyone else done later ablations on this? I've only seen the inverse suggested, mixing in SFT data during pretraining.

Post

1593

All my models seem to be plagued by infinite lists. When you ask a question that requires it to write a list, it most often keeps adding bullet points or enumeration. I am wondering whether this is a result of using chatty GPT-4 as DPO preferences. Any thoughts?

Collections 7

Papers 1

arxiv:2312.12852

spaces 7

Fietje

An efficient, open LMM for Dutch

GEITje 7B ultra

Text To AMR

Dutch Simplification

Open Dutch LLM Leaderboard

Steps Calculator

models 38

BramVanroy/fietje-2-chat

Text Generation • Updated 13 days ago • 1.45k • 1

BramVanroy/fietje-2-instruct

Text Generation • Updated 13 days ago • 1.33k • 2

BramVanroy/fietje-2

Text Generation • Updated 13 days ago • 1.36k • 4

BramVanroy/fietje-2-chat-gguf

Updated 23 days ago • 318 • 3

BramVanroy/fietje-2-instruct-gguf

Updated 23 days ago • 193 • 2

BramVanroy/fietje-2-gguf

Updated 23 days ago • 237 • 1

BramVanroy/tweety-7b-dutch-v24a-GGUF

Updated May 9 • 125 • 1

BramVanroy/fietje-3-mini-4k-instruct-GGUF

Updated May 5 • 160 • 2

BramVanroy/GEITje-7B-ultra-GGUF

Updated May 1 • 355 • 4

BramVanroy/GEITje-7B-ultra

Text Generation • Updated Apr 26 • 2.16k • 31

datasets 23

BramVanroy/stack_md_lid

Viewer • Updated 11 days ago • 38 • 4

BramVanroy/fietje-2-data

Viewer • Updated 13 days ago

BramVanroy/occiglot-fineweb-v0.5-nl

Viewer • Updated 14 days ago • 9 • 1

BramVanroy/no_robots_dutch

Viewer • Updated 16 days ago • 59 • 2

BramVanroy/ultra_feedback_dutch_cleaned

Viewer • Updated May 13 • 177 • 3

BramVanroy/WildChat-1M-filtered-gpt-4

Viewer • Updated May 4 • 3

BramVanroy/orca_dpo_pairs_dutch_cleaned

Viewer • Updated Apr 24 • 1

BramVanroy/orca_dpo_pairs_dutch

Viewer • Updated Apr 24 • 17 • 5

BramVanroy/ultrachat_200k_dutch

Viewer • Updated Apr 18 • 258 • 6

BramVanroy/ultra_feedback_dutch

Viewer • Updated Apr 18 • 137 • 2