fp8转bf16的脚本在A100上无法执行
2
#32 opened 29 minutes ago
by
duanyu
v3的自我认知怎么还不如之前
2
#31 opened about 1 hour ago
by
yyren7
No monthly active user limitation on commercial use?
#30 opened about 4 hours ago
by
DrNicefellow
Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.
#27 opened about 8 hours ago
by
phil111
Converted bf16 Model on Hugging Face
#26 opened about 16 hours ago
by
OpenSourceRonin
Create README.md
#24 opened about 21 hours ago
by
xiaoshuai1234
应该把字节、阿里、百度的钱和显卡都分给deepseek,不然浪费资源啊
6
#23 opened about 22 hours ago
by
eatcosmos
开源了也下不了。。哈哈哈
#21 opened about 23 hours ago
by
hzxx0921
SHU到此一游QAQ
#20 opened 1 day ago
by
gtTeri
Excited to this Open Source LLM!
#19 opened 1 day ago
by
adrisinaga
Create README.md
#17 opened 1 day ago
by
amiramiramirdeh
某些自研代码助手又有饭吃了
2
#16 opened 1 day ago
by
zh20233
I can't wait to see your work
#15 opened 1 day ago
by
jiangchengchengNLP
vllm/sglang deploy script?
1
#14 opened 1 day ago
by
Meteonis
Create README.md
1
#13 opened 1 day ago
by
Jaggz333
When using the web version of DeepSeek v3, it keeps repeating responses without stopping.
#12 opened 1 day ago
by
Nydaym
No model card rn
#11 opened 1 day ago
by
BlackBeenie
where?
#10 opened 1 day ago
by
MrZhanggggg
Create README.md
#9 opened 1 day ago
by
semenionut
Create README.md
#8 opened 1 day ago
by
gavinzhu
Create README.md
#6 opened 1 day ago
by
RIOGOAT
Missing Model Card
#5 opened 1 day ago
by
p3nGu1nZz
Add `library_name` tag for better metadata
#4 opened 1 day ago
by
reach-vb
License
4
#2 opened 1 day ago
by
mrfakename
我嘞个dou,这么大
9
#1 opened 1 day ago
by
mrwkd123