Differences between 1.0 and 1.1?

by brucethemoose - opened Nov 22, 2023

Nov 22, 2023

•

edited Nov 22, 2023

Any details about the finetuning? What was changed that makes 1.1 better than Tess-M 1.0?

brucethemoose changed discussion title from Differences between 1.0? to Differences between 1.0 and 1.1? Nov 22, 2023

dillfrescott

Nov 22, 2023

I would also like to know!

Yhyu13

Nov 22, 2023

•

edited Nov 22, 2023

Mentioned by https://huggingface.co/migtissera/Tess-M-Creative-v1.0 that V1.1 deprecated v1.0 models, possibly achieving better in both creative and STEM the same time?

rkzed

Nov 22, 2023

@brucethemoose do you have plan to merge this model with Capybara?
Capybara-Tess is my favorite model so far!

brucethemoose

Nov 22, 2023

•

edited Nov 22, 2023

@brucethemoose do you have plan to merge this model with Capybara?
Capybara-Tess is my favorite model so far!

That's good! I havent really gotten much feedback about the merge.

Yeah, probably soon once I can test Tess M 1.1? I am hoping another good 200K finetune comes out to add in.

migtissera

Owner Nov 22, 2023

Hey guys,
Tess-v1.1 has a lot more data. I'm still creating the complete dataset for Tess, but wanted to see how things work out at different levels. Tess-v1.0 was trained according to LIMA paper (~4000 samples), but the mistake was that it was only trained for 1-epoch. Hence why the issues with instruction following. Tess-v1.1 has a lot more data and more training epochs. I'm still doing R&D so things will keep improving as I release newer versions.

migtissera

Owner Nov 22, 2023

Sorry guys, v1.2 will be coming out soon as well -- Probably tomorrow. You have to put up with frequent releases for the next little while. :)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment