收集繁體中文在語言模型上存在多國語言翻譯的資料集,例如:中轉英、中轉越南等。繁體中文與東亞、東南亞關係密切,需考量未來延展性
Heng-Shiou Sheu | 許恆修
Heng666
AI & ML interests
Graph Neural Learning
Recent Activity
liked
a dataset
13 days ago
google/smol
liked
a model
3 months ago
feabries/TaiwanWordTranslator-v0.1
liked
a model
4 months ago
1-800-BAD-CODE/xlm-roberta_punctuation_fullstop_truecase
Organizations
Collections
6
spaces
16
models
31

Heng666/gemma-2b-GGUF
Updated
•
8

Heng666/paligemma_construction_safety
Updated
•
1

Heng666/my_awesome_billsum_model
Updated

Heng666/madlad400-10b-mt-ct2-int8
Updated
•
2

Heng666/madlad400-7b-bt-mt-ct2-int8
Updated
•
3

Heng666/madlad400-7b-mt-ct2-int8
Translation
•
Updated
•
3
•
3

Heng666/madlad400-3b-mt-ct2
Translation
•
Updated
•
3

Heng666/madlad400-3b-mt-ct2-int8
Translation
•
Updated
•
40

Heng666/NeuralPipe-7B-slerp
Text Generation
•
Updated
•
3

Heng666/phi-2-GGUF
Updated
•
9
datasets
11
Heng666/dot_embedding
Viewer
•
Updated
•
152
•
29
Heng666/Taiwan-patent-corpus
Viewer
•
Updated
•
28
•
30
•
1
Heng666/Taiwan-patent-qa
Viewer
•
Updated
•
1.22k
•
132
•
5
Heng666/Taiwan-patent-qa-eval
Viewer
•
Updated
•
192
•
75
•
2
Heng666/OpenSubtitles-TW-Corpus
Viewer
•
Updated
•
7.22M
•
14
•
3
Heng666/Traditional_Chinese-aya_evaluation_suite
Viewer
•
Updated
•
650
•
49
•
3
Heng666/Traditional_Chinese-aya_dataset
Viewer
•
Updated
•
4.91k
•
98
•
3
Heng666/Traditional_Chinese-aya_collection
Viewer
•
Updated
•
2.02M
•
2.98k
•
8
Heng666/MultiCCAligned-TW-Corpus
Viewer
•
Updated
•
3.13M
•
96
•
5
Heng666/Taoyuan-Airport-MRT-MT-Challenge
Viewer
•
Updated
•
1.14k
•
146