So this isn't SOLAR?

#2
by modster - opened

Borealis-10.7B is a 10.7B model built from 48 Mistral 7B layers, finetuned for 70+ hours on 2x A6000 on a large RP and conversational dataset, using the llama2 configuration of Axolotl, like SOLAR.

Wouldn't it be better to just finetune base SOLAR? I mean, it's not just a normal Mistral frankenmerge; it was also further pretrained on 3T tokens, while this one has the same concept but much less training data.
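For context, a 48-layer depth-up-scaled Mistral (the same trick SOLAR uses to get 10.7B from a 7B base) can be sketched as a mergekit passthrough config. The layer ranges below follow SOLAR's published recipe and are an assumption for illustration, not Borealis's actual merge:

```yaml
# Hypothetical mergekit config (assumed layer ranges, per SOLAR's recipe):
# stack two overlapping copies of Mistral 7B's 32 layers into 48 layers (~10.7B).
slices:
  - sources:
      - model: mistralai/Mistral-7B-v0.1
        layer_range: [0, 24]   # first copy: layers 0-23
  - sources:
      - model: mistralai/Mistral-7B-v0.1
        layer_range: [8, 32]   # second copy: layers 8-31 (16-layer overlap)
merge_method: passthrough
dtype: bfloat16
```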

Yeah, no, it's not SOLAR. Undi is doing experiments here and wanted to see what the model would be like when trained primarily on RP data.
