kobkrit's picture
Update README.md
e52107d
|
raw
history blame
3.87 kB
metadata
license: apache-2.0
datasets:
  - kobkrit/rd-taxqa
  - iapp_wiki_qa_squad
  - Thaweewat/alpaca-cleaned-52k-th
  - Thaweewat/instruction-wild-52k-th
  - Thaweewat/databricks-dolly-15k-th
  - Thaweewat/hc3-24k-th
  - Thaweewat/gpteacher-20k-th
  - Thaweewat/onet-m6-social
  - Thaweewat/alpaca-finance-43k-th
language:
  - th
  - en
library_name: transformers
pipeline_tag: text-generation
tags:
  - openthaigpt
  - llama

๐Ÿ‡น๐Ÿ‡ญ OpenThaiGPT 1.0.0-beta

๐Ÿ‡น๐Ÿ‡ญ OpenThaiGPT Version 1.0.0-beta is a Thai language 7B-parameter LLaMA v2 Chat model finetuned to follow Thai translated instructions and extend more than 24,500 most popular Thai words vocabularies into LLM's dictionary for turbo speed.

Upgrade from OpenThaiGPT 1.0.0-alpha

  • Add more than 24,500 most popular Thai words vocabularies into LLM's dictionary and re-pretrain embedding layers which make it generate Thai text 10 times faster than previous version.

Support

License

Source Code: License Apache Software License 2.0.
Weight: Research and Commercial uses.

Code and Weight

Colab Demo: https://colab.research.google.com/drive/1kDQidCtY9lDpk49i7P3JjLAcJM04lawu?usp=sharing
Finetune Code: https://github.com/OpenThaiGPT/openthaigpt-finetune-010beta
Inference Code: https://github.com/OpenThaiGPT/openthaigpt
Weight (Huggingface Checkpoint): https://huggingface.co/openthaigpt/openthaigpt-1.0.0-beta-7b-chat-ckpt-hf

Sponsors

Pantip.com, ThaiSC

Powered by

OpenThaiGPT Volunteers, Artificial Intelligence Entrepreneur Association of Thailand (AIEAT), and Artificial Intelligence Association of Thailand (AIAT)

Authors

Disclaimer: Provided responses are not guaranteed.