Is it 30B? In the description it says 13B

#2
opened by rafa9

This is wizard-vicuna-13b trained with a subset of the dataset - responses that contained alignment / moralizing were removed. ....

Cognitive Computations org

Yeah it's 30b

The original wizard-vicuna model was never published in 30b

Hey @ehartford
Appreciate your work here 🙏
Can you please elaborate? Is it a README typo? If Vicuna was never released at 30B, what is the base model used to train Wizard-Vicuna-30B?
And how does it relate to Vicuna (only by dataset, not by weights)?
Thank you in advance

Cognitive Computations org

The README is not a typo.
Vicuna's code was used to fine-tune.
Wizard-Vicuna's dataset (with refusals and bias removed) was used as the training dataset.
llama-30b was used as the base model.
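
For anyone curious what "responses that contained alignment / moralizing were removed" looks like in practice, here is a minimal sketch of that kind of filtering, assuming a ShareGPT-style JSON layout for the Wizard-Vicuna dataset. The phrase list, field names, and file names are illustrative assumptions, not the exact ones used to build this model; the resulting subset would then be fed to Vicuna's training code with llama-30b as the base.

```python
# Hypothetical sketch of filtering out alignment / moralizing responses
# from a ShareGPT-style dataset before fine-tuning. Phrase list and file
# names are illustrative, not the actual ones used for this model.
import json

# Assumed markers of refusal / moralizing responses
FILTER_PHRASES = [
    "as an ai language model",
    "i'm sorry, but",
    "i cannot fulfill",
    "it is not appropriate",
]

def is_clean(conversation):
    """Return True if no assistant turn contains a filtered phrase."""
    for turn in conversation.get("conversations", []):
        if turn.get("from") == "gpt":
            text = turn.get("value", "").lower()
            if any(phrase in text for phrase in FILTER_PHRASES):
                return False
    return True

with open("wizard_vicuna_dataset.json") as f:   # hypothetical input file
    data = json.load(f)

subset = [conv for conv in data if is_clean(conv)]

with open("wizard_vicuna_unfiltered.json", "w") as f:  # hypothetical output file
    json.dump(subset, f, indent=2)
```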
