Update README.md
README.md (CHANGED)
@@ -152,4 +152,6 @@ The following factors can influence MAI-DS-R1's behavior and performance:
 - **Model Name**: MAI-DS-R1
 - **Architecture**: Based on DeepSeek-R1, a transformer-based autoregressive language model utilizing multi-head self-attention and Mixture-of-Experts (MoE) for scalable and efficient inference.
 - **Objective**: Post-trained to reduce CCP-aligned restrictions and enhance harm protection, while preserving the original model’s strong chain-of-thought reasoning and general-purpose language understanding capabilities.
-- **Pre-trained Model Base**: DeepSeek-R1 (671B)
+- **Pre-trained Model Base**: DeepSeek-R1 (671B)
+### Data Summary
+https://huggingface.co/microsoft/MAI-DS-R1/blob/main/data_summary_card.md
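
The **Architecture** line above describes DeepSeek-R1 as a transformer that combines multi-head self-attention with Mixture-of-Experts (MoE). For readers unfamiliar with MoE, here is a minimal PyTorch sketch of a top-k-routed MoE feed-forward block; the expert count, layer sizes, and router design are illustrative assumptions for this sketch and do not reflect DeepSeek-R1's actual implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoEFeedForward(nn.Module):
    """Toy MoE feed-forward block: each token is processed by its top-k experts.

    Illustrative only -- not DeepSeek-R1's routing scheme or dimensions.
    """

    def __init__(self, d_model: int, d_ff: int, n_experts: int, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        # Router scores every token against every expert.
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, d_model)
        scores = self.router(x)                          # (batch, seq, n_experts)
        weights, idx = scores.topk(self.top_k, dim=-1)   # top-k experts per token
        weights = F.softmax(weights, dim=-1)             # normalize gate weights
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[..., k] == e                  # tokens whose k-th choice is e
                if mask.any():
                    out[mask] += weights[..., k][mask].unsqueeze(-1) * expert(x[mask])
        return out

# 2 sequences of 4 tokens, hidden size 64, 4 experts with top-2 routing
layer = MoEFeedForward(d_model=64, d_ff=256, n_experts=4)
print(layer(torch.randn(2, 4, 64)).shape)  # torch.Size([2, 4, 64])
```

Because only the top-k experts run for each token, a model can carry a very large total parameter count (such as the 671B figure cited above) while activating only a fraction of it per forward pass, which is what the card means by "scalable and efficient inference".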