1 22 132

peng

superpeng

AI & ML interests

None yet

Recent Activity

liked a dataset 7 days ago

BAAI/Infinity-Instruct

liked a dataset 10 days ago

zd21/ReST-MCTS-Llama3-8b-Instruct-Policy-1st

liked a dataset 10 days ago

CCCCCC/SPaR

View all activity

Organizations

None yet

superpeng's activity

upvoted a paper 25 days ago

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published 27 days ago • 21

upvoted an article about 2 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19, 2024

• 76

upvoted a collection 3 months ago

Skywork-Reward-Data-Collection

Collection

Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12, 2024 • 13

upvoted 2 papers 6 months ago

HelpSteer2: Open-source dataset for training top-performing reward models

Paper • 2406.08673 • Published Jun 12, 2024 • 17

Xwin-LM: Strong and Scalable Alignment Practice for LLMs

Paper • 2405.20335 • Published May 30, 2024 • 18

upvoted a collection 6 months ago

Biomedical NLP papers

Collection

Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) • 181 items • Updated 1 day ago • 36

upvoted 2 papers 6 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 161

Inference Performance Optimization for Large Language Models on CPUs

Paper • 2407.07304 • Published Jul 10, 2024 • 52

upvoted a collection 7 months ago

Tulu 2 Llama 3 Update

Collection

Llama 3 models trained on the tulu dataset, following https://arxiv.org/abs/2311.10702 (tulu 2) and https://arxiv.org/abs/2406.09279 (tulu 2.5). • 12 items • Updated Aug 15, 2024 • 2

upvoted 2 papers 8 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 88

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 67

upvoted a paper 9 months ago

Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts

Paper • 2309.07430 • Published Sep 14, 2023 • 27

upvoted 2 articles 9 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19, 2024

• 129

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20, 2024

• 72

upvoted 2 papers 10 months ago

MathScale: Scaling Instruction Tuning for Mathematical Reasoning

Paper • 2403.02884 • Published Mar 5, 2024 • 17

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 184

upvoted a paper 11 months ago

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Paper • 2402.16671 • Published Feb 26, 2024 • 27

upvoted a collection 11 months ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated Dec 13, 2024 • 329

upvoted 2 papers 11 months ago

LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models

Paper • 2402.10524 • Published Feb 16, 2024 • 23

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 77