seongyun_lee

Seongyun

AI & ML interests

None yet

Recent Activity

updated a dataset 11 days ago

Seongyun/human_eval_1

published a dataset 11 days ago

Seongyun/human_eval_1

published a model 20 days ago

Seongyun/exaone_deep_2.4b_non_math_only_mcqa_format

Seongyun's activity

upvoted a paper 3 months ago

How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?

Paper • 2410.07571 • Published Oct 10, 2024 • 2

upvoted an article 3 months ago

Open-source DeepResearch – Freeing our search agents

Article • Feb 4 • 1.22k

upvoted a collection 3 months ago

Reasoning Datasets

Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3 • 24

upvoted a paper 3 months ago

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published Jan 10 • 72

upvoted a collection 4 months ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 151

upvoted 2 papers 4 months ago

OpenAI o1 System Card

Paper • 2412.16720 • Published Dec 21, 2024 • 34

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Paper • 2407.21787 • Published Jul 31, 2024 • 13

upvoted a paper 8 months ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126

upvoted 3 papers 10 months ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 42

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3, 2024 • 104

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Paper • 2306.16527 • Published Jun 21, 2023 • 46

upvoted an article 10 months ago

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Article • Apr 15, 2024 • 177

upvoted 2 collections 11 months ago

System Message Generalization

11 items • Updated Jun 7, 2024 • 4

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12, 2024 • 68

upvoted 2 papers 11 months ago

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Paper • 2406.04770 • Published Jun 7, 2024 • 31

Aligning to Thousands of Preferences via System Message Generalization

Paper • 2405.17977 • Published May 28, 2024 • 7

upvoted 3 papers about 1 year ago

PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models

Paper • 2402.01118 • Published Feb 2, 2024 • 32

LangBridge: Multilingual Reasoning Without Multilingual Supervision

Paper • 2401.10695 • Published Jan 19, 2024 • 5

CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation

Paper • 2401.12208 • Published Jan 22, 2024 • 23