Jisoo Kim PRO

kuotient

AI & ML interests

NLP

Recent Activity

liked a dataset 10 days ago

Intelligent-Internet/II-Thought-RL-v0

liked a dataset 11 days ago

werty1248/Korea-Related-Reddit-posts

kuotient's activity

upvoted a paper 28 days ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published about 1 month ago • 25

upvoted a paper about 1 month ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66

upvoted a paper 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 364

upvoted 3 papers 7 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 139

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 77

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 41

upvoted an article 9 months ago

How NuminaMath Won the 1st AIMO Progress Prize

Article • Published Jul 11, 2024 • 118

upvoted a paper 9 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 162

upvoted an article 10 months ago

Putting RL back in RLHF

Article • Published Jun 12, 2024 • 85

upvoted a paper 10 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 92

upvoted a collection 10 months ago

Qwen2

Collection

Qwen2 language models: pretrained and instruction-tuned models in 5 sizes (0.5B, 1.5B, 7B, 57B-A14B, and 72B). • 39 items • Updated Nov 28, 2024 • 361

upvoted a collection 11 months ago

Alpha Llama-3 collection

Collection

5 items • Updated Jan 15 • 2

upvoted a paper about 1 year ago

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27, 2024 • 25