File size: 1,353 Bytes
2cd6106 349f3a0 2cd6106 349f3a0 2cd6106 d3c2b91 2cd6106 349f3a0 2cd6106 349f3a0 2cd6106 483acbe 349f3a0 2cd6106 349f3a0 2cd6106 349f3a0 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 |
---
base_model: Qwen/Qwen2.5-14B-Instruct
tags:
- fluently-lm
- fluently-sets
- demo
- reasoning
- text-generation-inference
- transformers
- unsloth
- qwen2
- trl
- sft
license: apache-2.0
language:
- en
datasets:
- fluently-sets/reasoning-1-1k
pipeline_tag: text-generation
---
# Reasoning-1 1K Demo (Finetune of Qwen2.5-14B-IT on Reasoning-1-1k dataset)
***Q4_K_M GGUF-quant available [here](https://huggingface.co/fluently-sets/reasoning-1-1k-demo-Q4_K_M-GGUF)***
This is SFT-finetune Qwen2.5-14B-IT on Reasoning-1-1K dataset. This is far from a perfect model, its main purpose is to show an example of using the dataset.
- **Base model**: [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
- **Model type**: [Qwen2ForCausalLM](https://huggingface.co/models?other=qwen2)
- **Number of parameters**: 14.8B
- **Precision**: FP16
- **Training method**: SFT
- **Training dataset**: [fluently-sets/reasoning-1-1k](https://huggingface.co/datasets/fluently-sets/reasoning-1-1k)
- **Languages**: English (mostly)
*Trained by Fluently Team ([@ehristoforu](https://huggingface.co/ehristoforu)) with [Unsloth AI](https://github.com/unslothai/unsloth) with love🥰*
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth) |