DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models Paper • 2309.03883 • Published Sep 7, 2023 • 31
Better & Faster Large Language Models via Multi-token Prediction Paper • 2404.19737 • Published Apr 30, 2024 • 72
Awesome feedback datasets Collection A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 60
Translated (En->Ko) dataset Collection Datasets translated from English to Korean using llama3-instrucTrans-enko-8b. • 15 items • Updated 5 days ago • 3
Standard-format-preference-dataset Collection Open-source datasets collected and processed into a standard preference format. • 14 items • Updated May 8 • 13
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published Apr 25, 2024 • 56
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Paper • 2401.01335 • Published Jan 2, 2024 • 62
Efficient Streaming Language Models with Attention Sinks Paper • 2309.17453 • Published Sep 29, 2023 • 13
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models Paper • 2309.12307 • Published Sep 21, 2023 • 84
Small-scale proxies for large-scale Transformer training instabilities Paper • 2309.14322 • Published Sep 25, 2023 • 18
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention Paper • 2309.14327 • Published Sep 25, 2023 • 21
SCREWS: A Modular Framework for Reasoning with Revisions Paper • 2309.13075 • Published Sep 20, 2023 • 15