- How to Train Data-Efficient LLMs
  Paper • 2402.09668 • Published • 38
- Adapting Large Language Models via Reading Comprehension
  Paper • 2309.09530 • Published • 75
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
  Paper • 2403.03507 • Published • 182
- MathScale: Scaling Instruction Tuning for Mathematical Reasoning
  Paper • 2403.02884 • Published • 15
superpeng (peng)

AI & ML interests: None yet
Organizations: None yet

Collections: 5
- BitDelta: Your Fine-Tune May Only Be Worth One Bit
  Paper • 2402.10193 • Published • 17
- StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
  Paper • 2402.16671 • Published • 26
- LoRA Learns Less and Forgets Less
  Paper • 2405.09673 • Published • 87
- NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
  Paper • 2405.17428 • Published • 16
Models: None public yet
Datasets: None public yet