6 1 4

Keming Lu

keminglu

Lukeming-tsinghua

AI & ML interests

Information Extraction, Large Language Model, Knowledge Graph

Recent Activity

authored a paper 3 months ago

Qwen2.5 Technical Report

authored a paper 5 months ago

Aligning Large Language Models via Self-Steering Optimization

authored a paper 5 months ago

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

View all activity

Organizations

keminglu's activity

authored a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 352

authored 2 papers 5 months ago

Aligning Large Language Models via Self-Steering Optimization

Paper • 2410.17131 • Published Oct 22, 2024 • 23

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

Paper • 2410.13841 • Published Oct 17, 2024 • 17

authored a paper 6 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 72

upvoted a paper 7 months ago

Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model

Paper • 2408.10764 • Published Aug 20, 2024 • 9

commented a paper 7 months ago

Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model

Paper • 2408.10764 • Published Aug 20, 2024 • 9 •

updated a model 7 months ago

Qwen/Qwen2-Math-72B

Text Generation • Updated Aug 8, 2024 • 43 • 30

authored a paper 8 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 161

authored a paper 9 months ago

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 16

New activity in nvidia/Nemotron-4-340B-Reward 9 months ago

Missing model_weights/model.rm_head._extra_state

#1 opened 9 months ago by

keminglu

authored 5 papers 10 months ago

#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models

Paper • 2308.07074 • Published Aug 14, 2023

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

Paper • 2310.05492 • Published Oct 9, 2023 • 2

Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization

Paper • 2310.05506 • Published Oct 9, 2023 • 1

Speculative Contrastive Decoding

Paper • 2311.08981 • Published Nov 15, 2023 • 2

Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

Paper • 2405.17931 • Published May 28, 2024

updated 2 models 10 months ago

OFA-Sys/MuggleMath_13B

Text Generation • Updated May 24, 2024 • 12

OFA-Sys/MuggleMath_7B

Text Generation • Updated May 23, 2024 • 12

liked a Space about 1 year ago

386

Qwen1.5 72B Chat

🚀

Generate chat responses from user input

authored a paper about 1 year ago

Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment

Paper • 2401.12474 • Published Jan 23, 2024 • 36

New activity in moreh/MoMo-72B-lora-1.8.6-DPO about 1 year ago

Qwen compabilitiy

#6 opened about 1 year ago by

keminglu