Yaowei Zheng's picture

Yaowei Zheng

hiyouga

·

https://github.com/hiyouga

llamafactory_ai

hiyouga

AI & ML interests

LLM Knowledge Management

Articles

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Organizations

hiyouga's activity

upvoted a paper 11 days ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published 12 days ago • 33

upvoted a collection about 1 month ago

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 16 items • Updated 15 days ago • 181

upvoted an article about 1 month ago

Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Mar 20

• 20

upvoted a paper about 1 month ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 80

upvoted 2 papers about 2 months ago

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 70

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4 • 74

upvoted a paper 2 months ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20 • 57

upvoted a paper 3 months ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 176

upvoted a collection 4 months ago

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 19 days ago • 183