Shenzhi Wang

shenzhi-wang

AI & ML interests

Large Language Model, Reinforcement Learning, and AI Agents

Organizations

None yet

shenzhi-wang's activity

New activity in shenzhi-wang/Gemma-2-9B-Chinese-Chat 11 days ago

Better formatting for CAUTION

2
#1 opened 13 days ago by mishig
New activity in shenzhi-wang/Gemma-2-27B-Chinese-Chat 12 days ago

Default to eager attention

2
#1 opened 12 days ago by lysandre
New activity in shenzhi-wang/Llama3-70B-Chinese-Chat-GGUF-4bit about 1 month ago

中文理解有点差

1
#2 opened about 2 months ago by chaochaoli
New activity in shenzhi-wang/Llama3-8B-Chinese-Chat about 2 months ago

error

#42 opened about 2 months ago by LuffyDreams
New activity in shenzhi-wang/Llama3-70B-Chinese-Chat about 2 months ago

Request: DOI

4
#9 opened about 2 months ago by luxen1234

没有在线体验的demo吗?

4
#8 opened about 2 months ago by jansen-liu
New activity in shenzhi-wang/Llama3-8B-Chinese-Chat 2 months ago

长上下文版本计划

3
#34 opened 2 months ago by rzzhangtao
New activity in shenzhi-wang/Llama3-70B-Chinese-Chat 2 months ago
New activity in shenzhi-wang/Llama3-70B-Chinese-Chat 2 months ago

perfect!!!

2
#3 opened 2 months ago by bluestarry
New activity in shenzhi-wang/Llama3-70B-Chinese-Chat 2 months ago

Update tokenizer_config.json

#2 opened 2 months ago by hiyouga

Update config.json

#1 opened 2 months ago by hiyouga

中文的效果感觉不是很好

4
#5 opened 2 months ago by daisr

报错了

3
#4 opened 2 months ago by ytcheng
New activity in shenzhi-wang/Llama3-8B-Chinese-Chat 2 months ago

微调参数

1
#30 opened 2 months ago by rzzhangtao

上下文长度只有512?

3
#3 opened 2 months ago by YUCYU
New activity in shenzhi-wang/Llama3-8B-Chinese-Chat 3 months ago

Training environment

4
#15 opened 3 months ago by Leeli1

For fine-tuning

1
#16 opened 3 months ago by svippixel

BFloat16 is not supported on MPS

5
#13 opened 3 months ago by RDY97

Update README.md

#20 opened 3 months ago by hiyouga

Update README.md

#19 opened 3 months ago by hiyouga

Delete all_results.json

#17 opened 3 months ago by hiyouga

Delete trainer_log.jsonl

#18 opened 3 months ago by hiyouga
New activity in shenzhi-wang/Llama3-8B-Chinese-Chat 3 months ago

how to reproduce in colab

1
#14 opened 3 months ago by chenshake
New activity in shenzhi-wang/Llama3-8B-Chinese-Chat 3 months ago

GGUF file

4
#8 opened 3 months ago by BB8-dev

GGUF version

2
#11 opened 3 months ago by zhouzr

🚀Fix metadata dict bug

#10 opened 3 months ago by hiyouga

Update generation_config.json

#7 opened 3 months ago by hiyouga

Delete training_args.bin

#9 opened 3 months ago by hiyouga

Update config.json

#5 opened 3 months ago by hiyouga

add Usage

#2 opened 3 months ago by hiyouga

Update README.md

#1 opened 3 months ago by hiyouga