Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
312.6
TFLOPS
56
9
88
Yaowei Zheng
hiyouga
Follow
bkz11's profile picture
PeepDaSlan9's profile picture
brucevanfdm's profile picture
109 followers
·
12 following
https://github.com/hiyouga
llamafactory_ai
hiyouga
AI & ML interests
LLM Knowledge Management
Articles
GaLore: Advancing Large Model Training on Consumer-grade Hardware
Mar 20
•
20
Organizations
hiyouga
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
BUAADreamer/Yi-VL-6B-hf
17 days ago
Update config.json
#3 opened 17 days ago by
hiyouga
Update config.json
#2 opened 17 days ago by
hiyouga
Update config.json
#1 opened 17 days ago by
hiyouga
New activity in
shenzhi-wang/Llama3-70B-Chinese-Chat
22 days ago
Update tokenizer_config.json
#2 opened 22 days ago by
hiyouga
Update config.json
#1 opened 22 days ago by
hiyouga
New activity in
shenzhi-wang/Llama3-8B-Chinese-Chat
about 1 month ago
Update README.md
#20 opened about 1 month ago by
hiyouga
Update README.md
#19 opened about 1 month ago by
hiyouga
Delete trainer_log.jsonl
#18 opened about 1 month ago by
hiyouga
Delete all_results.json
#17 opened about 1 month ago by
hiyouga
BFloat16 is not supported on MPS
5
#13 opened about 1 month ago by
RDY97
New activity in
hiyouga/LLaMA-Board
about 1 month ago
llama3 available on the local demo but is unavailable on the Spaces
1
#9 opened about 1 month ago by
ysharma
New activity in
hiyouga/DPO-En-Zh-20k
about 1 month ago
[bot] Conversion to Parquet
#1 opened about 1 month ago by
parquet-converter
New activity in
shenzhi-wang/Llama3-8B-Chinese-Chat
about 1 month ago
🚀Fix metadata dict bug
#10 opened about 1 month ago by
hiyouga
Delete training_args.bin
#9 opened about 1 month ago by
hiyouga
Update generation_config.json
#6 opened about 1 month ago by
hiyouga
Update generation_config.json
#7 opened about 1 month ago by
hiyouga
Update config.json
#5 opened about 1 month ago by
hiyouga
Update model.safetensors.index.json
#4 opened about 1 month ago by
hiyouga
🚀 Fix the bug of checkpoint files
#3 opened about 1 month ago by
hiyouga
add Usage
#2 opened about 1 month ago by
hiyouga
Update README.md
#1 opened about 1 month ago by
hiyouga
New activity in
hiyouga/Llama-2-70b-AQLM-2Bit-QLoRA-function-calling
about 2 months ago
just for curiosity
9
#1 opened 2 months ago by
prudant
New activity in
llamafactory/adgen_tiny
about 2 months ago
[bot] Conversion to Parquet
#1 opened about 2 months ago by
parquet-converter
New activity in
hiyouga/LLaMA-Board
2 months ago
Add link to paper so it's automatically linked from Arxiv and paper page
#8 opened 2 months ago by
osanseviero
New activity in
hiyouga/LLaMA-Board
3 months ago
Update data/dataset_info.json
1
#3 opened 3 months ago by
tonymds
Upload dev.csv
#4 opened 3 months ago by
zongyang
Upload jd.json
#5 opened 3 months ago by
zongyang
Create a
#6 opened 3 months ago by
zongyang
Create a.json
#7 opened 3 months ago by
zongyang
New activity in
hiyouga/Qwen-14B-Chat-LLaMAfied
3 months ago
Adding Evaluation Results
#2 opened 3 months ago by
leaderboard-pr-bot
New activity in
google/gemma-7b-it
3 months ago
how to extract model response from the output of tokenizer
3
#54 opened 3 months ago by
mans-0987
New activity in
baichuan-inc/Baichuan2-13B-Chat
3 months ago
Missing module: torch.utils.checkpoint
#13 opened 9 months ago by
hiyouga
New activity in
google/gemma-2b-it
3 months ago
Update readme to match chat template
1
#22 opened 3 months ago by
hiyouga
Update chat template
2
#21 opened 3 months ago by
pcuenq
New activity in
google/gemma-7b-it
3 months ago
Fix chat template does not compatible with ConversationalPipeline
5
#42 opened 3 months ago by
hiyouga
New activity in
mistralai/Mistral-7B-v0.1
3 months ago
How to finetune this model mistralai/Mistral-7B-v0.1 and also merge the weights
5
#126 opened 4 months ago by
yeniceriSGK
New activity in
mistralai/Mixtral-8x7B-v0.1
4 months ago
How to fine tune mixtral 8x7B?
3
#30 opened 5 months ago by
tzivi
New activity in
mistralai/Mixtral-8x7B-v0.1
5 months ago
Fine-tuning toolkit for Mixtral 8x7B MoE model
18
#10 opened 6 months ago by
hiyouga
New activity in
THUDM/chatglm3-6b
5 months ago
fix can't set attribute 'eos_token' when loading the saved tokenizer
#27 opened 5 months ago by
hiyouga
New activity in
hiyouga/Qwen-14B-Chat-LLaMAfied
5 months ago
eval error with LLaMA-Factory
4
#1 opened 5 months ago by
charry2000
New activity in
hiyouga/Baichuan2-7B-Chat-LLaMAfied
7 months ago
Adding Evaluation Results
#1 opened 7 months ago by
leaderboard-pr-bot
New activity in
hiyouga/Baichuan2-7B-Base-LLaMAfied
7 months ago
Adding Evaluation Results
#1 opened 7 months ago by
leaderboard-pr-bot
New activity in
hiyouga/LLaMA-Board
7 months ago
Apply for community grant: Personal project (storage)
1
#2 opened 7 months ago by
hiyouga
added duplicate button, title and description
1
#1 opened 7 months ago by
ysharma
New activity in
openchat/openchat_3.5
7 months ago
How to setup system message
13
#5 opened 7 months ago by
fernandofernandes
New activity in
hf-accelerate/model-memory-usage
7 months ago
Determining Minimum GPU Memory and Input Text Length Calculation in Model Training
2
#19 opened 8 months ago by
kobe8-24
New activity in
microsoft/phi-1_5
8 months ago
Adding _set_gradient_checkpointing for compatibility
6
#22 opened 9 months ago by
vriveras
New activity in
hiyouga/Llama-2-Chinese-13b-chat
10 months ago
how to quantize?
1
#1 opened 10 months ago by
miraclezst
New activity in
Qwen/Qwen-7B
10 months ago
The current implementation of tokenizer cannot adopt left-padding
2
#2 opened 10 months ago by
hiyouga
New activity in
baichuan-inc/Baichuan-13B-Base
11 months ago
Take input attention masks to support left-padded sequences
2
#1 opened 11 months ago by
hiyouga
New activity in
hiyouga/Baichuan-7B-sft
11 months ago
模型合并
2
#5 opened 11 months ago by
TJSL1111
为什么回答这么长
4
#4 opened 11 months ago by
sunnima
Not Working. 'BaiChuanForCausalLM' object has no attribute 'generation_config
2
#1 opened 12 months ago by
jackaduma
How to merge lora model to base model
1
#2 opened 12 months ago by
angel113110
New activity in
baichuan-inc/Baichuan-7B
12 months ago
这个模型应该怎么才能正常对话啊?测试一下感觉不对劲啊!
7
#6 opened 12 months ago by
Zomens
New activity in
THUDM/chatglm-6b
about 1 year ago
About the unusual attention_mask of ChatGLM
1
#40 opened about 1 year ago by
hiyouga