-
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Paper • 2312.12742 • Published • 14 -
ProTIP: Progressive Tool Retrieval Improves Planning
Paper • 2312.10332 • Published • 8 -
Paloma: A Benchmark for Evaluating Language Model Fit
Paper • 2312.10523 • Published • 13 -
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Paper • 2406.17557 • Published • 96
daje kang
daje
AI & ML interests
None yet
Recent Activity
updated
a dataset
2 days ago
daje/kotext-to-sql-v1-hard
published
a dataset
2 days ago
daje/kotext-to-sql-v1-hard
updated
a model
11 days ago
daje/Meta-Llama-3.1-8B-Instruct-de-identification
Organizations
None yet
Collections
1
models
39
daje/Meta-Llama-3.1-8B-Instruct-de-identification
Updated
•
1
daje/Qwen2.5-14B-Instruct-tools
Text Generation
•
Updated
daje/model_0.0002_alpha-32_r-64
Updated
•
9
daje/model_0.0002_alpha-8_r-16
Updated
•
7
daje/model_5e-05_alpha-128_r-256
Updated
•
8
daje/model_2e-4_alpha-8_r-16
Updated
•
3
daje/model_Lora
Updated
•
4
daje/model_2e-4
Updated
•
5
daje/model
Updated
•
3
daje/Qwen2-7B-Instruct-harmful_detector_2000-H100_1
Updated
•
4
datasets
14
daje/kotext-to-sql-v1-hard
Viewer
•
Updated
•
2k
•
6
daje/de-identify-chat-ko
Viewer
•
Updated
•
9.92k
•
102
daje/ko-hatefulmemes_train_8500
Viewer
•
Updated
•
8.2k
•
36
daje/ko-hatefulmemes_train_8500_kmhas
Viewer
•
Updated
•
95.3k
•
39
daje/ko-hatefulmemes_train_2000
Viewer
•
Updated
•
1.91k
•
24
daje/Ko-SciecneQA
Viewer
•
Updated
•
12.7k
•
44
daje/keyword_summary
Viewer
•
Updated
•
1k
•
79
daje/kotext-to-sql-v1
Viewer
•
Updated
•
262k
•
68
•
2
daje/mistral_tokenized_en_wiki
Viewer
•
Updated
•
16.1M
•
219
daje/mistral_tokenized_ko_wiki
Viewer
•
Updated
•
1.7M
•
34