arxiv:2309.00071
Jeffrey Quesnelle PRO
emozilla
Papers
1
models
57
emozilla/8B_128K_bs_8M_rope_512K_step_1000_lr_2e-5 • Text Generation • 1
emozilla/llama-1.1b-init • Text Generation • 105
emozilla/LWM-Text-1M-mpe64k • Text Generation • 7
emozilla/LWM-Text-1M-mpe32k • Text Generation • 58
emozilla/LWM-Text-1M-mpe4k • Text Generation • 8
emozilla/LWM-Text-1M-GGUF • 199
emozilla/bt3 • Text Generation • 7 • 1
emozilla/tl-c1_2 • Text Generation • 6
emozilla/oh25 • Text Generation • 5
emozilla/tl-c6 • Text Generation • 6 • 1
datasets
33
emozilla/c4-validation.00000-of-00008 • 16
emozilla/hermes2-tokenized-llama-alpaca
emozilla/yarn-train-tokenized-8k-mistral • 2
emozilla/story-summary-training-mistral-9k-1_4_24 • 1
emozilla/yarn-train-tokenized-8k-llama • 469
emozilla/yarn-train-tokenized-32k-mistral • 1
emozilla/yarn-train-tokenized-16k-mistral • 121 • 13
emozilla/pg19 • 758 • 9
emozilla/Long-Data-Collections-Fine-Tune • 2
emozilla/Long-Data-Collections-Pretrain-Without-Books • 1