InHo Won

kotmul

AI & ML interests

None yet

Recent Activity

liked a dataset 9 days ago

simplescaling/s1K-claude-3-7-sonnet

liked a dataset 9 days ago

simplescaling/s1K-1.1

liked a dataset 9 days ago

simplescaling/data_ablation_full59K

Organizations

kotmul's activity

upvoted a paper about 1 month ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 111

upvoted a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 346

upvoted a paper 3 months ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

upvoted an article 3 months ago

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

Article • Sep 27, 2024 • 40

upvoted a paper 10 months ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 68

upvoted 5 papers 11 months ago

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25, 2024 • 54

Extending Llama-3's Context Ten-Fold Overnight

Paper • 2404.19553 • Published Apr 30, 2024 • 34

Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean

Paper • 2403.10882 • Published Mar 16, 2024 • 6

X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment

Paper • 2403.11399 • Published Mar 18, 2024 • 6

BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining

Paper • 2401.06443 • Published Jan 12, 2024 • 2

upvoted a collection 11 months ago

Meta Llama 3

Collection • 5 items • Updated Dec 6, 2024 • 719

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases.

upvoted 9 papers about 1 year ago

Can Large Language Models Understand Context?

Paper • 2402.00858 • Published Feb 1, 2024 • 23

Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23, 2024 • 71

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Paper • 2402.15506 • Published Feb 23, 2024 • 16