Yaowei Zheng

hiyouga

AI & ML interests

LLM Knowledge Management

hiyouga's activity

New activity in BUAADreamer/Yi-VL-6B-hf 17 days ago

Update config.json

#3 opened 17 days ago by hiyouga

Update config.json

#2 opened 17 days ago by hiyouga

Update config.json

#1 opened 17 days ago by hiyouga
New activity in shenzhi-wang/Llama3-70B-Chinese-Chat 22 days ago

Update tokenizer_config.json

#2 opened 22 days ago by hiyouga

Update config.json

#1 opened 22 days ago by hiyouga
New activity in shenzhi-wang/Llama3-8B-Chinese-Chat about 1 month ago

Update README.md

#20 opened about 1 month ago by hiyouga

Update README.md

#19 opened about 1 month ago by hiyouga

Delete trainer_log.jsonl

#18 opened about 1 month ago by hiyouga

Delete all_results.json

#17 opened about 1 month ago by hiyouga

BFloat16 is not supported on MPS

5
#13 opened about 1 month ago by RDY97
New activity in hiyouga/LLaMA-Board about 1 month ago
New activity in hiyouga/DPO-En-Zh-20k about 1 month ago

[bot] Conversion to Parquet

#1 opened about 1 month ago by parquet-converter
New activity in shenzhi-wang/Llama3-8B-Chinese-Chat about 1 month ago

🚀Fix metadata dict bug

#10 opened about 1 month ago by hiyouga

Delete training_args.bin

#9 opened about 1 month ago by hiyouga

Update generation_config.json

#6 opened about 1 month ago by hiyouga

Update generation_config.json

#7 opened about 1 month ago by hiyouga

Update config.json

#5 opened about 1 month ago by hiyouga

Update model.safetensors.index.json

#4 opened about 1 month ago by hiyouga

🚀 Fix the bug of checkpoint files

#3 opened about 1 month ago by hiyouga

add Usage

#2 opened about 1 month ago by hiyouga

Update README.md

#1 opened about 1 month ago by hiyouga
New activity in hiyouga/Llama-2-70b-AQLM-2Bit-QLoRA-function-calling about 2 months ago

just for curiosity

9
#1 opened 2 months ago by prudant
New activity in llamafactory/adgen_tiny about 2 months ago

[bot] Conversion to Parquet

#1 opened about 2 months ago by parquet-converter
New activity in hiyouga/LLaMA-Board 3 months ago

Update data/dataset_info.json

1
#3 opened 3 months ago by tonymds

Upload dev.csv

#4 opened 3 months ago by zongyang

Upload jd.json

#5 opened 3 months ago by zongyang

Create a

#6 opened 3 months ago by zongyang

Create a.json

#7 opened 3 months ago by zongyang
New activity in hiyouga/Qwen-14B-Chat-LLaMAfied 3 months ago
New activity in baichuan-inc/Baichuan2-13B-Chat 3 months ago
New activity in google/gemma-2b-it 3 months ago

Update chat template

2
#21 opened 3 months ago by pcuenq
New activity in mistralai/Mixtral-8x7B-v0.1 4 months ago

How to fine tune mixtral 8x7B?

3
#30 opened 5 months ago by tzivi
New activity in mistralai/Mixtral-8x7B-v0.1 5 months ago
New activity in hiyouga/Qwen-14B-Chat-LLaMAfied 5 months ago

eval error with LLaMA-Factory

4
#1 opened 5 months ago by charry2000
New activity in hiyouga/Baichuan2-7B-Chat-LLaMAfied 7 months ago
New activity in hiyouga/Baichuan2-7B-Base-LLaMAfied 7 months ago
New activity in openchat/openchat_3.5 7 months ago
New activity in microsoft/phi-1_5 8 months ago
New activity in hiyouga/Llama-2-Chinese-13b-chat 10 months ago

how to quantize?

1
#1 opened 10 months ago by miraclezst
New activity in Qwen/Qwen-7B 10 months ago
New activity in THUDM/chatglm-6b about 1 year ago