3 13 18

Kaiqiang Song

kqsong

http://i2u.world

KaiQiangSong

AI & ML interests

Summarization and Text Generation

Recent Activity

liked a model 29 days ago

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

liked a model 29 days ago

Nexusflow/Athene-RM-70B

upvoted a paper about 1 month ago

Qwen2.5 Technical Report

View all activity

Organizations

None yet

kqsong's activity

liked 2 models 29 days ago

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

Updated 12 days ago • 1.13k • 84

Nexusflow/Athene-RM-70B

Text Classification • Updated Nov 15, 2024 • 23 • 7

upvoted a paper about 1 month ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 365

upvoted a paper about 2 months ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25 • 57

upvoted 2 papers 2 months ago

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published Feb 6 • 24

MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion

Paper • 2502.04235 • Published Feb 6 • 22

liked a dataset 3 months ago

teknium/OpenHermes-2.5

Viewer • Updated Apr 15, 2024 • 1M • 2.75k • 725

liked a model 4 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 29 days ago • 8.03k • 1.63k

liked a dataset 4 months ago

nvidia/HelpSteer2

Viewer • Updated Dec 18, 2024 • 21.4k • 4.38k • 411

liked 2 datasets 5 months ago

allenai/llama-3.1-tulu-3-70b-preference-mixture

Viewer • Updated Feb 4 • 337k • 3.58k • 18

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 4.97k • 138

upvoted a collection 5 months ago

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 1 day ago • 162

liked a model 5 months ago

mistralai/Pixtral-Large-Instruct-2411

Image-Text-to-Text • Updated Mar 16 • 4 • 409

liked a dataset 7 months ago

huuuyeah/SportsGen

Viewer • Updated Oct 3, 2024 • 70k • 200 • 5

upvoted a paper 7 months ago

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 69

New activity in kqsong/InFoBench 8 months ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

New activity in microsoft/Phi-3.5-MoE-instruct 8 months ago

The provided example doesn't work

#32 opened 8 months ago by

kqsong

upvoted a paper 9 months ago

WPO: Enhancing RLHF with Weighted Preference Optimization

Paper • 2406.11827 • Published Jun 17, 2024 • 15

authored a paper 10 months ago

WPO: Enhancing RLHF with Weighted Preference Optimization

Paper • 2406.11827 • Published Jun 17, 2024 • 15

authored a paper about 1 year ago

InFoBench: Evaluating Instruction Following Ability in Large Language Models

Paper • 2401.03601 • Published Jan 7, 2024 • 7