Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Pricing

  • Log In
  • Sign Up
deepseek-ai 's Collections
DeepSeek-Prover
DeepSeek-V2
DeepSeekCoder-V2
DeepSeek-Math
ESFT
DeepSeek-VL
DeepSeek-Coder
DeepSeek-LLM
DeepSeek-MoE
DeepSeek-V2.5

DeepSeek-MoE

updated Aug 16

DeepSeek MoE series

Upvote
7

  • deepseek-ai/deepseek-moe-16b-base

    Text Generation • Updated Jan 12 • 26k • 79

  • deepseek-ai/deepseek-moe-16b-chat

    Text Generation • Updated Feb 5 • 1.28k • 112

  • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

    Paper • 2401.06066 • Published Jan 11 • 42
Upvote
7
  • Collection guide
  • Browse collections
Company
© Hugging Face
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs