Hugging Face
Models: 21
Active filters: arxiv:2408.15237
JunxiongWang/mamba_0_5_dpo_ep1 • Text Generation • Updated Sep 2 • 15
JunxiongWang/mamba_0_5_dpo_ep3 • Text Generation • Updated Sep 2 • 13
JunxiongWang/mamba_0_875_dpo_ep3 • Text Generation • Updated Sep 2 • 13 • 1
JunxiongWang/mamba_0_875_dpo_ep1 • Text Generation • Updated Sep 2 • 12
JunxiongWang/mamba_0_75_dpo_ep3 • Text Generation • Updated Sep 2 • 17
JunxiongWang/mamba_0_75_dpo_ep1 • Text Generation • Updated Sep 2 • 9
JunxiongWang/MambaInLlama_0_50 • Updated Sep 2 • 59
JunxiongWang/Mamba2InLlama_0_50 • Updated Sep 2 • 89
JunxiongWang/MambaInLlama_0_75 • Updated Sep 2 • 9
JunxiongWang/Mamba2InLlama_0_75 • Updated Sep 2 • 8
JunxiongWang/Mamba2InLlama_0_875 • Updated Sep 2 • 6
JunxiongWang/MambaInLlama_0_875 • Updated Sep 2 • 8
JunxiongWang/Mamba2InLlama_1 • Updated Sep 2 • 3 • 1
JunxiongWang/Llama3.2-Mamba2-3B-distill • Updated Nov 17 • 523
JunxiongWang/Llama3.2-Mamba2-3B-dpo • Updated Nov 17 • 15
JunxiongWang/Llama3.1-Mamba2-8B-distill • Updated Nov 17 • 36
JunxiongWang/Llama3.2-Mamba-3B-distill • Updated Nov 17 • 59
JunxiongWang/Llama3.1-Mamba-8B-distill • Updated Nov 17 • 26
JunxiongWang/Llama3.1-Mamba2-8B-dpo • Updated Nov 17 • 7
JunxiongWang/Llama3.1-Mamba-8B-dpo • Updated Nov 17 • 7
JunxiongWang/Llama3.2-Mamba-3B-dpo • Updated Nov 17 • 31