Description:
This is a ruadapt version of the upstage/SOLAR-10.7B-v1.0 model with a replaced tokenizer. In addition to the previous work, the model was adapted in two stages: 1) vocabulary optimization, and 2) additional fine-tuning of the attention layers using LoRA.
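For readers unfamiliar with the second stage, here is a minimal sketch of the general LoRA idea: a frozen weight matrix W is adjusted by a low-rank product, W + (alpha / r) * B A. The matrices, the rank, and the alpha value are toy assumptions chosen for illustration. They are not this model's actual weights or training configuration, and the helper names are hypothetical.

```python
# Toy illustration of a LoRA update in pure Python (no external libraries).
# All numbers below are made up; they are NOT taken from this model.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def lora_weight(w, lora_a, lora_b, alpha, rank):
    """Return w + (alpha / rank) * (lora_b @ lora_a).

    w:      frozen base weight, shape (d_out, d_in)
    lora_a: trainable matrix, shape (rank, d_in)
    lora_b: trainable matrix, shape (d_out, rank)
    """
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]

# Hypothetical frozen attention projection weight (d_out=2, d_in=3).
W = [[1.0, 0.0, 2.0], [0.0, 1.0, -1.0]]
# Rank-1 adapter matrices.
A = [[0.5, -0.5, 1.0]]
B_zero = [[0.0], [0.0]]      # LoRA conventionally initializes B to zeros
B_trained = [[1.0], [2.0]]   # stand-in for values after some training

unchanged = lora_weight(W, A, B_zero, alpha=2.0, rank=1)
adapted = lora_weight(W, A, B_trained, alpha=2.0, rank=1)
```

With B initialized to zeros the adapted weight equals the base weight, so training starts from the original model's behavior; only the small A and B matrices are updated.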
How to cite:
Tikhomirov M., Chernyshev D. Impact of Tokenization on LLaMa Russian Adaptation // arXiv preprint arXiv:2312.02598. – 2023.