32 10 22

Wonho Song

wonhosong

SONG-WONHO

AI & ML interests

Data, Evaluation, Large Language Model

Recent Activity

upvoted a paper about 1 month ago

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

upvoted a paper about 1 month ago

System Message Generation for User Preferences using Open-Source Models

liked a model about 1 month ago

mistralai/Mistral-Small-24B-Base-2501

View all activity

Organizations

wonhosong's activity

upvoted 2 papers about 1 month ago

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Paper • 2502.14258 • Published Feb 20 • 26

System Message Generation for User Preferences using Open-Source Models

Paper • 2502.11330 • Published Feb 17 • 15

liked a model about 1 month ago

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated Jan 30 • 19.6k • 240

upvoted a paper 3 months ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 57

liked a model 4 months ago

LGAI-EXAONE/EXAONE-3.5-32B-Instruct

Text Generation • Updated Dec 11, 2024 • 21.9k • 115

liked a dataset 5 months ago

upstage/dp-bench

Updated Oct 24, 2024 • 1.3k • 69

liked 2 models 7 months ago

upstage/solar-pro-preview-pretrained

Text Generation • Updated Sep 9, 2024 • 59

upstage/solar-pro-preview-instruct

Text Generation • Updated Sep 20, 2024 • 7.87k • 446

upvoted 3 papers 10 months ago

An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels

Paper • 2406.09415 • Published Jun 13, 2024 • 51

What If We Recaption Billions of Web Images with LLaMA-3?

Paper • 2406.08478 • Published Jun 12, 2024 • 41

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

Paper • 2406.02900 • Published Jun 5, 2024 • 14

liked a dataset 10 months ago

lawcompany/KLAID

Viewer • Updated Nov 17, 2022 • 161k • 176 • 12

liked a model about 1 year ago

davidkim205/nox-solar-10.7b-v4

Text Generation • Updated Apr 19, 2024 • 1.69k • 10

liked a dataset about 1 year ago

math-ai/AutoMathText

Viewer • Updated Feb 19 • 7.89M • 64.1k • 169

upvoted 2 papers about 1 year ago

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Paper • 2401.01325 • Published Jan 2, 2024 • 27

ChatQA: Building GPT-4 Level Conversational QA Models

Paper • 2401.10225 • Published Jan 18, 2024 • 36

upvoted a paper over 1 year ago

SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Paper • 2312.15166 • Published Dec 23, 2023 • 59

liked 2 models over 1 year ago

upstage/SOLAR-10.7B-Instruct-v1.0

Text Generation • Updated Sep 10, 2024 • 55.4k • 623

upstage/SOLAR-10.7B-v1.0

Text Generation • Updated Sep 10, 2024 • 10.9k • 302