メタデータラボ様からの計算資源のご提供により構築したモデルおよびデータセットhttps://prtimes.jp/main/html/rd/p/000000008.000056944.html
kaeru39
ryota39
AI & ML interests
language model
Organizations
Collections
7
models
18
ryota39/Tora-12B
Text Generation
•
Updated
•
35
•
1
ryota39/Tora-7B-v0.1
Text Generation
•
Updated
•
17
•
2
ryota39/mluke-large-lite-reward
Text Classification
•
Updated
•
17
ryota39/bge-m3-preference-classifier
Text Classification
•
Updated
•
20
ryota39/retriva-bert-preference-classifier
Text Classification
•
Updated
•
16
ryota39/Tora-7B-v0.2
Text Generation
•
Updated
•
16
•
1
ryota39/llm-jp-1b-sft-100k-LoRA-dpo-12k
Text Generation
•
Updated
•
19
ryota39/Phi-3-mini-4k-instruct-dpo
Text Generation
•
Updated
•
21
•
3
ryota39/llm-jp-1b-sft-15k
Text Generation
•
Updated
•
19
ryota39/llm-jp-1b-sft-100k-LoRA
Text Generation
•
Updated
•
19
datasets
26
ryota39/hh-rlhf
Viewer
•
Updated
•
169k
•
55
ryota39/preference-en-ja-100k
Viewer
•
Updated
•
101k
•
95
•
1
ryota39/preference_test
Viewer
•
Updated
•
29.6k
•
36
ryota39/preference_test_annotated
Viewer
•
Updated
•
5
•
35
ryota39/open_preference_v0.4
Viewer
•
Updated
•
202k
•
42
•
1
ryota39/webgpt_comparisons-ja
Viewer
•
Updated
•
17.4k
•
54
•
1
ryota39/synthetic-instruct-gptj-pairwise-ja
Viewer
•
Updated
•
33.1k
•
54
•
1
ryota39/self-rewarding_instruct_AIFT_M3_scored
Viewer
•
Updated
•
7.11k
•
34
ryota39/self-rewarding_instruct_AIFT_M2_scored
Viewer
•
Updated
•
7k
•
36
ryota39/self-rewarding_instruct_AIFT_M1_scored
Viewer
•
Updated
•
4k
•
39