Yaowei Zheng

hiyouga

https://github.com/hiyouga

llamafactory_ai

hiyouga

AI & ML interests

LLM Knowledge Management

Articles

GaLore: Advancing Large Model Training on Consumer-grade Hardware

hiyouga's activity

upvoted a paper 3 days ago

DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints

Paper • 2405.19026 • Published 10 days ago • 6

upvoted a paper 18 days ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published 20 days ago • 33

upvoted a collection about 1 month ago

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 17 items • Updated 2 days ago • 197

upvoted an article about 1 month ago

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Article • Mar 20 • 21

upvoted 2 papers about 2 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 80

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 70

upvoted a paper 2 months ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4 • 74

upvoted 2 papers 3 months ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20 • 57

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 176

upvoted a collection 4 months ago

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 2 days ago • 189