Prashasst
/

Medical-Reasoning-DeepSeek-8B

Text Generation

text-generation-inference

Model card Files Files and versions Community

Prashasst commited on 15 days ago

Commit

b2d3b2c

·

verified ·

1 Parent(s): ae5f28c

Update README.md

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -6,9 +6,14 @@ tags:
 - unsloth
 - llama
 - trl
 license: apache-2.0
 language:
 - en
 ---
 # Uploaded  model
@@ -17,6 +22,5 @@ language:
 - **License:** apache-2.0
 - **Finetuned from model :** unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 - unsloth
 - llama
 - trl
+- prashasst
 license: apache-2.0
 language:
 - en
+datasets:
+- FreedomIntelligence/medical-o1-reasoning-SFT
+pipeline_tag: text-generation
+library_name: peft
 ---
 # Uploaded  model
 - **License:** apache-2.0
 - **Finetuned from model :** unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit