
In-progress long-context Japanese-to-English translation model based on TinyLlama. Input should be 500-1000 tokens long. For deterministic outputs, set 'do_sample = False' when running inference with HF transformers; with other backends, set the temperature to 0.

Prompt format

"""Translate this from Japanese to English:\n### JAPANESE: {source text} \n### ENGLISH: """
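A minimal sketch of how the prompt format and the deterministic-decoding advice above might be combined with HF transformers. The repo id is taken from the name shown on this card; the helper names `build_prompt` and `translate` and the `max_new_tokens` value are illustrative assumptions, not part of the model's documented usage.

```python
# Repo id as it appears on this card; adjust if the model lives elsewhere.
MODEL_ID = "NilanE/tinyllama-en_ja-translation-v2"


def build_prompt(source_text: str) -> str:
    """Fill the card's prompt template with the Japanese source text."""
    return (
        "Translate this from Japanese to English:\n"
        f"### JAPANESE: {source_text} \n"
        "### ENGLISH: "
    )


def translate(source_text: str, max_new_tokens: int = 1024) -> str:
    """Greedy (deterministic) translation; max_new_tokens is an assumed budget."""
    # Imported lazily so build_prompt can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(build_prompt(source_text), return_tensors="pt")
    # do_sample=False disables sampling, as the card recommends.
    output_ids = model.generate(
        **inputs, do_sample=False, max_new_tokens=max_new_tokens
    )
    # Keep only the tokens generated after the prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `translate(japanese_text)` downloads the weights on first use. Since the card recommends inputs of 500-1000 tokens, shorter passages may translate less reliably.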

Safetensors
Model size: 1.1B params
Tensor type: BF16

Finetuned from

Dataset used to train NilanE/tinyllama-en_ja-translation-v2