arxiv:2309.00071
Jeffrey Quesnelle PRO
emozilla
AI & ML interests
None yet
Organizations
Papers
1
models
60
emozilla/llama2-1.2b-init • Text Generation • 148
emozilla/llama3-1.6b-init • Text Generation • 309
emozilla/llama3-1.3b-gptneox-init • Text Generation • 424
emozilla/8B_128K_bs_8M_rope_512K_step_1000_lr_2e-5 • Text Generation
emozilla/llama-1.1b-init • Text Generation • 52
emozilla/LWM-Text-1M-mpe64k • Text Generation • 2
emozilla/LWM-Text-1M-mpe32k • Text Generation • 55
emozilla/LWM-Text-1M-mpe4k • Text Generation • 2
emozilla/LWM-Text-1M-GGUF • 206
emozilla/bt3 • Text Generation • 3 • 1
datasets
35
emozilla/proofpile-test-tokenized-llama3 • 114
emozilla/PaulGrahamEssays • 111
emozilla/c4-validation.00000-of-00008 • 11
emozilla/hermes2-tokenized-llama-alpaca
emozilla/yarn-train-tokenized-8k-mistral • 2
emozilla/story-summary-training-mistral-9k-1_4_24 • 2
emozilla/yarn-train-tokenized-8k-llama • 249
emozilla/yarn-train-tokenized-32k-mistral • 1
emozilla/yarn-train-tokenized-16k-mistral • 70 • 13
emozilla/pg19 • 185 • 9