Text Generation
Transformers
Safetensors
English
mistral
conversational
Eval Results
Inference Endpoints
text-generation-inference
Felladrin commited on
Commit
a43e296
1 Parent(s): 32a710d

Add links for GGUF and ONNX versions

Browse files
Files changed (1) hide show
  1. README.md +21 -2
README.md CHANGED
@@ -25,6 +25,16 @@ datasets:
25
  - Felladrin/ChatML-oasst2_curated
26
  - CohereForAI/aya_dataset
27
  - Felladrin/ChatML-aya_dataset
 
 
 
 
 
 
 
 
 
 
28
  widget:
29
  - text: |-
30
  <|im_start|>system
@@ -70,10 +80,10 @@ inference:
70
  repetition_penalty: 1.176
71
  ---
72
 
73
- # Minueza-32M-Chat: A Chat Model of 32M Parameters
74
 
75
  - Base model: [Felladrin/Minueza-32M-Base](https://huggingface.co/Felladrin/Minueza-32M-Base)
76
- - Datasets:
77
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-databricks-dolly-15k)] [databricks/databricks-dolly-15k](https://huggingface.co/datasets/databricks/databricks-dolly-15k)
78
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-reddit-instruct-curated)] [euclaise/reddit-instruct-curated](https://huggingface.co/datasets/euclaise/reddit-instruct-curated)
79
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-WebGLM-QA)] [THUDM/webglm-qa](https://huggingface.co/datasets/THUDM/webglm-qa)
@@ -84,6 +94,15 @@ inference:
84
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-HelpSteer)] [nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer)
85
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-oasst2_curated)] [sablo/oasst2_curated](https://huggingface.co/datasets/sablo/oasst2_curated)
86
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-aya_dataset)] [CohereForAI/aya_dataset](https://huggingface.co/datasets/CohereForAI/aya_dataset)
 
 
 
 
 
 
 
 
 
87
 
88
  ## Recommended Prompt Format
89
 
 
25
  - Felladrin/ChatML-oasst2_curated
26
  - CohereForAI/aya_dataset
27
  - Felladrin/ChatML-aya_dataset
28
+ - argilla/distilabel-capybara-dpo-7k-binarized
29
+ - Felladrin/ChatML-distilabel-capybara-dpo-7k-binarized
30
+ - argilla/distilabel-intel-orca-dpo-pairs
31
+ - Felladrin/ChatML-distilabel-intel-orca-dpo-pairs
32
+ - argilla/ultrafeedback-binarized-preferences
33
+ - Felladrin/ChatML-ultrafeedback-binarized-preferences
34
+ - sablo/oasst2_dpo_pairs_en
35
+ - Felladrin/ChatML-oasst2_dpo_pairs_en
36
+ - NeuralNovel/Neural-DPO
37
+ - Felladrin/ChatML-Neural-DPO
38
  widget:
39
  - text: |-
40
  <|im_start|>system
 
80
  repetition_penalty: 1.176
81
  ---
82
 
83
+ # Minueza-32M-Chat: A chat model with 32 million parameters
84
 
85
  - Base model: [Felladrin/Minueza-32M-Base](https://huggingface.co/Felladrin/Minueza-32M-Base)
86
+ - Datasets used during SFT:
87
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-databricks-dolly-15k)] [databricks/databricks-dolly-15k](https://huggingface.co/datasets/databricks/databricks-dolly-15k)
88
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-reddit-instruct-curated)] [euclaise/reddit-instruct-curated](https://huggingface.co/datasets/euclaise/reddit-instruct-curated)
89
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-WebGLM-QA)] [THUDM/webglm-qa](https://huggingface.co/datasets/THUDM/webglm-qa)
 
94
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-HelpSteer)] [nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer)
95
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-oasst2_curated)] [sablo/oasst2_curated](https://huggingface.co/datasets/sablo/oasst2_curated)
96
  - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-aya_dataset)] [CohereForAI/aya_dataset](https://huggingface.co/datasets/CohereForAI/aya_dataset)
97
+ - Datasets used during DPO:
98
+ - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-distilabel-capybara-dpo-7k-binarized)] [argilla/distilabel-capybara-dpo-7k-binarized](https://huggingface.co/datasets/argilla/distilabel-capybara-dpo-7k-binarized)
99
+ - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-distilabel-intel-orca-dpo-pairs)] [argilla/distilabel-intel-orca-dpo-pairs](https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs)
100
+ - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-ultrafeedback-binarized-preferences)] [argilla/ultrafeedback-binarized-preferences](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences)
101
+ - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-oasst2_dpo_pairs_en)] [sablo/oasst2_dpo_pairs_en](https://huggingface.co/datasets/sablo/oasst2_dpo_pairs_en)
102
+ - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-Neural-DPO)] [NeuralNovel/Neural-DPO](https://huggingface.co/datasets/NeuralNovel/Neural-DPO)
103
+ - Availability in other ML formats:
104
+ - GGUF: [Felladrin/gguf-Minueza-32M-Chat](https://huggingface.co/Felladrin/gguf-Minueza-32M-Chat)
105
+ - ONNX: [Felladrin/onnx-Minueza-32M-Chat](https://huggingface.co/Felladrin/onnx-Minueza-32M-Chat)
106
 
107
  ## Recommended Prompt Format
108