About vocab extension

by alielfilali01 - opened Feb 13, 2024

Feb 13, 2024

Have you extend the tokenizer's vocabulary or you used just the original tokenizer ? And also about the training, was it like full model training or you went with adding a LoRA adapter on top then merge it ?
Your answer will be much appreciated and tnx for your contribution to the open science 🤗

jmprcp

Unbabel org Feb 13, 2024

Hi, thank you for your interest in Tower.
We did full model training and used the original tokenizer.
We are going to publish a paper on Tower soon, so stay tuned!

jmprcp changed discussion status to closed Feb 13, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment