Shenzhi Wang
shenzhi-wang
AI & ML interests
Large Language Model, Reinforcement Learning, and AI Agents
Organizations
None yet
shenzhi-wang's activity
遇到了无穷回复问题
3
#4 opened 1 day ago
by
Orion-zhen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63e2453cf0740bec2bfd3b23/sxRDL7jMKSKOZV3iFpxGs.jpeg)
坐等70b chinese
1
#1 opened 2 days ago
by
iwaitu
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642b37187f152f6e72b5baab/O7Qkun1MRQzm4mY9wA0i4.jpeg)
希望有一个30G左右的量化版本
2
#1 opened 2 days ago
by
yxh0774
请问加载这个模型要多少GPU?我24000+的提示out of memory
1
#10 opened about 1 month ago
by
zyc1128
[AUTOMATED] Model Memory Requirements
#12 opened 16 days ago
by
model-sizer-bot
Better formatting for CAUTION
2
#1 opened 25 days ago
by
mishig
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60a551a34ecc5d054c8ad93e/dhcBFtwNLcKqqASxniyVw.jpeg)
Default to eager attention
2
#1 opened 24 days ago
by
lysandre
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618450692745-5e3aec01f55e2b62848a5217.jpeg)
中文理解有点差
1
#2 opened 2 months ago
by
chaochaoli
error
#42 opened about 2 months ago
by
LuffyDreams
可以提供function calling 更多代码示例吗?
1
#1 opened 2 months ago
by
lbjfish
Request: DOI
4
#9 opened 2 months ago
by
luxen1234
没有在线体验的demo吗?
4
#8 opened 2 months ago
by
jansen-liu
there is no tokenizer.model file
5
#35 opened 2 months ago
by
zhaowei0315
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64c0e5dde8e1818a3687a9ad/ZqeEMTfW2TzU1KwqOqTQt.jpeg)
长上下文版本计划
3
#34 opened 2 months ago
by
rzzhangtao
请问能提供GPTQ-Int8版本吗?
4
#5 opened 2 months ago
by
worldggg
The model does not exist in the repository.
3
#1 opened 3 months ago
by
tiangou0123456
perfect!!!
2
#3 opened 3 months ago
by
bluestarry
70B-Chinese in the future?
5
#27 opened 3 months ago
by
woshimark666
What the template is formatted with for function calls
2
#32 opened 3 months ago
by
Charles99
Adding Evaluation Results
#33 opened 3 months ago
by
leaderboard-pr-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655506df9dc61e22c5f9c732/IZGvup0FdVlioPPIPnzZv.jpeg)
Update tokenizer_config.json
#2 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
Update config.json
#1 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
how to do batch inference for this model?
2
#31 opened 3 months ago
by
Alan42
我想自己拿这个模型部署个聊天的,该怎么整啊
1
#28 opened 3 months ago
by
sunwenzhe
中文的效果感觉不是很好
4
#5 opened 3 months ago
by
daisr
微调参数
1
#30 opened 3 months ago
by
rzzhangtao
上下文长度只有512?
3
#3 opened 3 months ago
by
YUCYU
TypeError: BFloat16 is not supported on MPS
2
#29 opened 3 months ago
by
sunwenzhe
shenzhi-wang/Llama3-8B-Chinese-Chat生成乱码怎么解决
9
#25 opened 3 months ago
by
Terence8Tao
ollama上的q8版本是v1还是v2呀?
1
#24 opened 3 months ago
by
coolcoolcloud
提问"荆轲刺秦王",模型返回与史实相去甚远
3
#23 opened 3 months ago
by
freedenS
Training environment
4
#15 opened 3 months ago
by
Leeli1
For fine-tuning
1
#16 opened 3 months ago
by
svippixel
BFloat16 is not supported on MPS
5
#13 opened 3 months ago
by
RDY97
![](https://cdn-avatars.huggingface.co/v1/production/uploads/660d1e6903a46d81aebd5f65/1W_fDaCx87oj3GTbzQ5Al.jpeg)
Update README.md
#20 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
Update README.md
#19 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
Delete all_results.json
#17 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
Delete trainer_log.jsonl
#18 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
为什么这个包导入ollama用Ollama运行就乱讲一通?
10
#2 opened 3 months ago
by
Kollcn
how to reproduce in colab
1
#14 opened 3 months ago
by
chenshake
我在ollama上下载的这个Q8模型,那个上面不能评论,特地来这里给你点个赞
4
#1 opened 3 months ago
by
SerEzio
GGUF file
4
#8 opened 3 months ago
by
BB8-dev
GGUF version
2
#11 opened 3 months ago
by
zhouzr
Run Infer the fine-tuned model, then display error
1
#12 opened 3 months ago
by
hongbaoai
🚀Fix metadata dict bug
#10 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
Update generation_config.json
#7 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
Delete training_args.bin
#9 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
Update config.json
#5 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
Update model.safetensors.index.json
#4 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
🚀 Fix the bug of checkpoint files
#3 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)
Update README.md
#1 opened 3 months ago
by
hiyouga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642fef28a043f0ac7defa8a9/RwOEkuj3fOnOA54tGR7Ea.png)