Traditional Chinese **cleaned** corpus collection for LLM pre-training.
Oscar, Li
liswei
AI & ML interests
Multimodal Deep Learning, Natural Language Processing, Parameter-Efficient Fine-Tuning
Organizations
None yet
Collections
2
models
5
liswei/OpenELM-270M-Llama-2-Chinese-7b-hf-30K-sft
Text Generation
•
Updated
•
3
liswei/OpenELM-270M-Llama-2-Chinese-7b-hf-30K-task-vector-sft
Text Generation
•
Updated
•
8
liswei/OpenELM-Chinese-zhtw-270M
Text Generation
•
Updated
•
17
liswei/EmojiLMSeq2SeqLoRAAccumulation
Updated
liswei/EmojiLMSeq2SeqLoRA
Text2Text Generation
•
Updated
•
8
datasets
9
liswei/zhtw-news-and-articles-2B
Viewer
•
Updated
•
1
liswei/wikinews-zhtw-dedup
Viewer
•
Updated
liswei/wikipedia-zhtw-dedup
Viewer
•
Updated
liswei/news-collection-zhtw
Viewer
•
Updated
liswei/common-crawl-zhtw
Viewer
•
Updated
liswei/coct-en-zhtw-dedup
Viewer
•
Updated
liswei/c4-zhtw
Viewer
•
Updated
liswei/rm-static-zhTW
Updated
•
27
liswei/NTU-Tree
Viewer
•
Updated
•
2