metadata

base_model: eyad-silx/Quasar-3.0-Max
datasets: eyad-silx/Quasar-Max-3.3
library_name: transformers
model_name: Quasar-3.0-Max
tags:
  - rl
  - silx
  - trl
  - sft
licence: license

Quasar Series of Models

Quasar Model Image

Introducing Quasar-3.3-Max

This model is provided by SILX INC. It has been supervised fine-tuned using the open-r1 repository. The training data includes sequences of varying lengths (32k, 16k, and 8k) to enhance the model's knowledge and adaptability.

Quasar-3.3-Max represents the first step in the Quasar project before Reinforcement Learning (RL). At this stage, the model's reasoning steps are capped at a maximum length of 8129 tokens to optimize processing efficiency and contextual understanding.

Stay tuned for further updates as we advance the Quasar project with RL enhancements!

Resources

Research Paper
Website

Founders

Eyad Gomaa
Gomaa Salah