
Shenzhi Wang

shenzhi-wang

https://shenzhi-wang.netlify.app/

ShenzhiWang_THU

AI & ML interests

Large Language Model, Reinforcement Learning, and AI Agents

Recent Activity

updated a model 4 days ago

xwen-team/Xwen-0.5B-Chat

published a model 4 days ago

xwen-team/Xwen-0.5B-Chat

upvoted a collection 11 days ago

Xwen-Chat



shenzhi-wang's activity

commented a paper 4 months ago

LLM-based Optimization of Compound AI Systems: A Survey

Paper • 2410.16392 • Published Oct 21, 2024 • 15

New activity in shenzhi-wang/Llama3.1-70B-Chinese-Chat 6 months ago

Deployed it for a trial run; many thanks for this work

#2 opened 7 months ago by

WEI21321

New activity in shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit 6 months ago

The Chinese model is really dumb

#10 opened 6 months ago by

Jerry-SDUA

New activity in shenzhi-wang/Llama3.1-8B-Chinese-Chat 6 months ago

Completely unusable; it thinks "你好吗" has 5 Chinese characters

#15 opened 6 months ago by

Jerry-SDUA

Was this specifically dictated by the training vocabulary?

#13 opened 6 months ago by

roamerxv

New activity in shenzhi-wang/Llama3.1-8B-Chinese-Chat 7 months ago

Waiting for the 70B Chinese version

#1 opened 7 months ago by

iwaitu

Ran into an endless-reply problem

#4 opened 7 months ago by

Orion-zhen

What is the training data template?

#6 opened 7 months ago by

Libraone

Feels worse than the earlier ORPO-tuned Llama3

#8 opened 7 months ago by

ztyl-tech

When will the 4-bit version of 3.1 be released?

#9 opened 7 months ago by

shenbushou

Why doesn't the rope_scaling field in config.json include type and factor?

#11 opened 7 months ago by

Alexcccn

Train data?

#5 opened 7 months ago by

yyq90

New activity in shenzhi-wang/Llama3.1-70B-Chinese-Chat 7 months ago

Ran into the same infinite-output problem as the 8B version

#5 opened 7 months ago by

Orion-zhen

Hoping for a quantized version of around 30 GB

#1 opened 7 months ago by

yxh0774

New activity in shenzhi-wang/Llama3-70B-Chinese-Chat 7 months ago

How much GPU is needed to load this model? With 24000+ I get an out-of-memory error

#10 opened 8 months ago by

zyc1128

[AUTOMATED] Model Memory Requirements

#12 opened 7 months ago by

model-sizer-bot

New activity in shenzhi-wang/Gemma-2-9B-Chinese-Chat 8 months ago

Better formatting for CAUTION

#1 opened 8 months ago by

mishig

New activity in shenzhi-wang/Gemma-2-27B-Chinese-Chat 8 months ago

Default to eager attention

#1 opened 8 months ago by

lysandre

New activity in shenzhi-wang/Llama3-70B-Chinese-Chat-GGUF-4bit 8 months ago

Chinese comprehension is a bit poor

#2 opened 9 months ago by

chaochaoli

New activity in shenzhi-wang/Llama3-8B-Chinese-Chat 9 months ago

error

#42 opened 9 months ago by

LuffyDreams