README.md · iocuydi/llama-2-amharic-3784m at afde89c489171f24551d3fc45593e4d6f227f506

metadata

license: llama2

Llama-2-Amharic

Uses Llama-2-7b as base. Does not use chat variant.

You can use the inference demo script provided to run inference. Note that it expects to be executed in context of llama recipes (https://github.com/facebookresearch/llama-recipes/tree/main/src/llama_recipes/inference)

Ensure that you replace the Llama-2 tokenizer with the Llama-2-Amharic tokenizer.

To avoid hallucinations use low top-k.

Use format "$system\nHuman: $prompt\nAssistant [Amharic] : "

Specify Amharic or English in the prompt to control the response language. Prompt can be in either language.

Example:

"Below is an interaction between a human and an AI fluent in English and Amharic, providing reliable and informative answers. The AI is supposed to answer test questions from the human with short responses saying just the answer and nothing else.

Human: አማርኛ መናገር የሚችል ማሽን መማሪያ ሞዴልን ለማስተዋወቅ ረጅም ብሎግ ጻፍ። ቢያንስ 3 አንቀጾች መሆን አለባቸው።

Assistant [Amharic] : "

Cite:

@misc{andersland2024amharic,
      title={Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource Languages}, 
      author={Michael Andersland},
      year={2024},
      eprint={2403.06354},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}