Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
deepseek-ai
/
DeepSeek-V2-Chat-0628
like
172
Follow
DeepSeek
1,381
Text Generation
Transformers
Safetensors
deepseek_v2
conversational
custom_code
text-generation-inference
Inference Endpoints
arxiv:
2405.04434
License:
deepseek
Model card
Files
Files and versions
Community
5
Train
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (1)
MLA的实现代码
#5 opened 3 months ago by
yuyijiong
What is the FSDP value for `fsdp_transformer_layer_cls_to_wrap`?
#4 opened 4 months ago by
migtissera
模型启动依赖问题
#3 opened 4 months ago by
malowking
different between DeepSeek-V2-Chat-0628 and Deepseek-v2-API-0628
1
#2 opened 4 months ago by
xxllp