LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation Paper • 2402.11485 • Published Feb 18 • 1
Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training Paper • 2404.10555 • Published Apr 16 • 2
Pretraining and Updating Language- and Domain-specific Large Language Model: A Case Study in Japanese Business Domain Paper • 2404.08262 • Published Apr 12 • 1
Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities Paper • 2404.17790 • Published 24 days ago • 2
JMedLoRA:Medical Domain Adaptation on Japanese Large Language Models using Instruction-tuning Paper • 2310.10083 • Published Oct 16, 2023 • 2
JaColBERT and Hard Negatives, Towards Better Japanese-First Embeddings for Retrieval: Early Technical Report Paper • 2312.16144 • Published Dec 26, 2023 • 2
Karasu Collection The models trained under our Karasu and Qarasu project • 9 items • Updated Jan 24 • 1
NTQ AI LM Collection A collection of finely tuned Language Models (LLMs) across diverse datasets. • 3 items • Updated 27 days ago • 1
ELYZA-japanese-CodeLlama-7b Collection CodeLlama models augmented for Japanese usage • 3 items • Updated Dec 27, 2023 • 2
ELYZA-japanese-Llama-2-13b Collection 13b Llama-2 models augmented for Japanese usage • 5 items • Updated Dec 27, 2023 • 5
ELYZA-japanese-Llama-2-7b Collection 7b Llama-2 models augmented for Japanese usage • 6 items • Updated Dec 27, 2023 • 4
nekomata Collection The nekomata model series are based on the Qwen series and have been continually pre-trained on Japanese-specific corpora. • 8 items • Updated Apr 4 • 5
Japanese Multimodal Models Collection Suite of multimodal models focusing on Japan/Japanese-related usage • 4 items • Updated Apr 8 • 4
Japanese Stable LM Collection Suite of LLMs focusing on Japanese usage • 15 items • Updated 14 days ago • 13
youri Collection The youri model series are based on the llama2 series and have been continually pre-trained on Japanese-specific corpora. • 6 items • Updated 21 days ago • 1
bilingual-gpt-neox-4b Collection The bilingual-gpt-neox-4b series are pre-trained from scratch on a mixture of Japanese and English corpora. • 5 items • Updated Apr 3 • 1
japanese-gpt-neox-3.6b Collection The japanese-gpt-neox-3.6b series are pre-trained from scratch on Japanese corpora. • 5 items • Updated Apr 3 • 2