Kaifeng Lyu's picture

1

Kaifeng Lyu

vfleaking

·

https://kaifeng.ac/

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

authored a paper 10 months ago

Understanding Incremental Learning of Gradient Descent: A Fine-grained Analysis of Matrix Sensing

authored a paper 10 months ago

DistillSpec: Improving Speculative Decoding via Knowledge Distillation

View all activity

Organizations

None yet

vfleaking's activity

authored a paper about 1 month ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published Mar 25 • 34

authored 5 papers 10 months ago

Understanding Incremental Learning of Gradient Descent: A Fine-grained Analysis of Matrix Sensing

Paper • 2301.11500 • Published Jan 27, 2023

DistillSpec: Improving Speculative Decoding via Knowledge Distillation

Paper • 2310.08461 • Published Oct 12, 2023 • 1

On the SDEs and Scaling Rules for Adaptive Gradient Algorithms

Paper • 2205.10287 • Published May 20, 2022

Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking

Paper • 2311.18817 • Published Nov 30, 2023

RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval

Paper • 2402.18510 • Published Feb 28, 2024

authored 2 papers 12 months ago

A Quadratic Synchronization Rule for Distributed Deep Learning

Paper • 2310.14423 • Published Oct 22, 2023

The Marginal Value of Momentum for Small Learning Rate SGD

Paper • 2307.15196 • Published Jul 27, 2023

updated 3 datasets about 1 year ago

vfleaking/hh-redteam-instruction33K

Viewer • Updated Apr 5, 2024 • 33.4k • 32

vfleaking/GSM-Danger

Viewer • Updated Mar 1, 2024 • 100 • 18

vfleaking/DirectHarm4

Viewer • Updated Mar 1, 2024 • 400 • 21 • 6

liked a model about 2 years ago

THUDM/chatglm-6b

Updated Aug 4, 2024 • 4.15k • 2.85k