Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
prithivMLmods
/
Segue-Qwen3_DeepScaleR-Preview
like
0
Text Generation
Transformers
Safetensors
agentica-org/DeepScaleR-Preview-Dataset
English
qwen3
Mixture of Experts
text-generation-inference
code
deepscale
math
conversational
arxiv:
2412.15115
arxiv:
2309.00071
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
prithivMLmods
commited on
May 12
Commit
b28f4cc
·
verified
·
1 Parent(s):
dd2512c
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+2
-0
README.md
CHANGED
Viewed
@@ -2,4 +2,6 @@
2
license: apache-2.0
3
datasets:
4
- agentica-org/DeepScaleR-Preview-Dataset
5
---
2
license: apache-2.0
3
datasets:
4
- agentica-org/DeepScaleR-Preview-Dataset
5
+
base_model:
6
+
- Qwen/Qwen3-1.7B
7
---