metadata
base_model: eyad-silx/Quasar-3.0-Max
datasets: eyad-silx/Quasar-Max-3.3
library_name: transformers
model_name: Quasar-3.0-Max
tags:
- rl
- silx
- trl
- sft
licence: license
Quasar Series of Models
Introducing Quasar-3.3-Max
This model is provided by SILX INC. It was trained with supervised fine-tuning (SFT) using the open-r1 repository. The training data includes sequences of varying lengths (32k, 16k, and 8k) to improve the model's knowledge and adaptability.
Quasar-3.3-Max is the first step in the Quasar project, before Reinforcement Learning (RL) is applied. At this stage, the model's reasoning steps are capped at a maximum length of 8129 tokens for processing efficiency and contextual understanding.
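A minimal sketch of how the reasoning cap could be respected at inference time with the transformers library. It makes several assumptions that this card does not confirm: that the repo id from the `base_model` field is the checkpoint to load, that it works with the `Auto*` classes, and that the cap maps onto `max_new_tokens`. The names `MODEL_ID`, `MAX_REASONING_TOKENS`, `generation_kwargs`, and `generate` are hypothetical helpers introduced only for illustration.

```python
# Sketch only. MODEL_ID is copied from this card's `base_model` field and is
# ASSUMED (not confirmed) to be loadable with transformers' Auto classes.
MODEL_ID = "eyad-silx/Quasar-3.0-Max"

# Cap on reasoning length as stated in this card. Treating it as a limit on
# newly generated tokens is an assumption.
MAX_REASONING_TOKENS = 8129


def generation_kwargs(max_new_tokens: int = MAX_REASONING_TOKENS) -> dict:
    """Return generate() arguments, clamping the budget to the stated cap."""
    return {
        "max_new_tokens": min(max_new_tokens, MAX_REASONING_TOKENS),
        "do_sample": False,  # greedy decoding, chosen here for reproducibility
    }


def generate(prompt: str) -> str:
    """Load the checkpoint and return only the newly generated text."""
    # Imported lazily so generation_kwargs() works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, **generation_kwargs())

    # Slice off the prompt tokens so only the continuation is decoded.
    prompt_length = inputs["input_ids"].shape[1]
    return tokenizer.decode(output_ids[0][prompt_length:], skip_special_tokens=True)
```

Calling `generate("What is 17 * 23?")` would download the weights on first use. If the model expects a chat template, the prompt would need to be formatted with `tokenizer.apply_chat_template` first; this card does not say either way.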
Stay tuned for further updates as we advance the Quasar project with RL enhancements!
Resources
Founders
- Eyad Gomaa
- Gomaa Salah