2 18 6

DeyangKong

DeyangKong

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 2 days ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

upvoted a paper 6 days ago

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

upvoted a paper 7 days ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

View all activity

Organizations

None yet

DeyangKong's activity

upvoted a paper 2 days ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 3 days ago • 25

upvoted a paper 6 days ago

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Paper • 2503.16419 • Published 7 days ago • 61

upvoted a paper 7 days ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 9 days ago • 107

upvoted a paper 23 days ago

SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity

Paper • 2503.01506 • Published 24 days ago • 9

upvoted a paper 27 days ago

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11 • 37

upvoted a paper 7 months ago

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28, 2024 • 43

upvoted a collection 7 months ago

Code Evaluation

Collection

Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29, 2024 • 15

upvoted 2 papers 8 months ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 114

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 162

upvoted a paper 9 months ago

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Paper • 2406.07522 • Published Jun 11, 2024 • 39

upvoted 4 papers 10 months ago

The Power of Scale for Parameter-Efficient Prompt Tuning

Paper • 2104.08691 • Published Apr 18, 2021 • 10

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31, 2024 • 67

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 88

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 65

upvoted 2 papers 11 months ago

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

Paper • 2401.08417 • Published Jan 16, 2024 • 35

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29, 2024 • 70

upvoted an article 11 months ago

Article

Introducing the Open Leaderboard for Hebrew LLMs!

May 5, 2024

• 39

upvoted a paper about 1 year ago

RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15, 2024 • 71