Seungwoo Ryu PRO

tryumanshow

AI & ML interests

LLM, Agent

Recent Activity

updated a dataset 1 day ago

tryumanshow/glaive-function-calling-v2-ko-refined

published a dataset 1 day ago

tryumanshow/glaive-function-calling-v2-ko-refined

liked a model 3 days ago

Qwen/QwQ-32B

tryumanshow's activity

upvoted a paper 23 days ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 46

upvoted an article about 2 months ago

Article

Fixing Gradient Accumulation

Oct 16, 2024

• 51

upvoted 2 collections 4 months ago

Korean Instruction Dataset

5 items • Updated Jan 24 • 7

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 111

upvoted a collection 5 months ago

Korean Reward Modeling

Korean Datasets, Reward Models for RLHF • 16 items • Updated Nov 19, 2024 • 3

upvoted a paper 5 months ago

DiaSynth -- Synthetic Dialogue Generation Framework

Paper • 2409.19020 • Published Sep 25, 2024 • 21

upvoted an article 6 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024

• 87

upvoted a collection 7 months ago

LLMs

415 items • Updated about 3 hours ago • 30

upvoted a paper 8 months ago

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26, 2024 • 48

upvoted a paper 9 months ago

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Paper • 2406.14563 • Published Jun 20, 2024 • 30

upvoted a collection 9 months ago

Function Calling v3

Models fine-tuned for function-calling • 14 items • Updated Apr 27, 2024 • 21

upvoted 3 collections 10 months ago

Agents

Collection of resources related to Agents. • 73 items • Updated Jan 28 • 6

Miqu-based Models

A collection of creative writing models based on the 'miqu-1-70b' model. • 9 items • Updated Dec 3, 2024 • 2

Agents

63 items • Updated Jan 10 • 5

upvoted a paper 10 months ago

Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3

Paper • 2405.00664 • Published May 1, 2024 • 20

upvoted an article 10 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 438

upvoted a collection 11 months ago

Long context

94 items • Updated Sep 29, 2024 • 31

upvoted an article 11 months ago

Article

Fine-tune Llama 3 with ORPO

Apr 22, 2024

• 234

upvoted a collection 11 months ago

Handbook v0.1 models and datasets

Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24