Model assets for the first Mixture-of-Lora technique applied to Llama. https://bit.ly/48bqshl
Maxine
crumb
AI & ML interests
im 19, ive had a special interest in artificial intelligence since middle school, i like innovating in compute-restricted environments
Organizations
Collections
5
models
129
crumb/model-a-48.5m
Text Generation
•
Updated
•
2.59k
•
2
crumb/nano-mistral
Text Generation
•
Updated
•
2.51k
•
2
crumb/apricot-wildflower-20
Text Generation
•
Updated
•
3.13k
•
2
crumb/testing-cot-nothing-to-see-here-7b
Updated
crumb/minipile-111m
Text Generation
•
Updated
•
8
crumb/askmistral-2-15-111m
Text Generation
•
Updated
•
11
•
1
crumb/askmistral-2-15-tophalf-111m
Text Generation
•
Updated
•
11
crumb/Llama-p-small
Text Generation
•
Updated
•
3
crumb/GLORT2
Text Generation
•
Updated
•
2
crumb/ParaLlama-p-micro
Text Generation
•
Updated
•
3
datasets
44
crumb/dummy-cot-sampling-dataset-clean-preview
Viewer
•
Updated
crumb/askmistral-pile-2-15
Viewer
•
Updated
•
14
•
5
crumb/dummy-cot-sampling-dataset
Viewer
•
Updated
crumb/deduped-pile-askmistral-shard1-top1-in-4
Viewer
•
Updated
crumb/askmistral-pile-011-filtered
Viewer
•
Updated
crumb/js-free-sites
Viewer
•
Updated
•
13
crumb/tiny-slimpajama-k8-00001
Viewer
•
Updated
crumb/c4-benchfilter-nano
Updated
•
8
•
3
crumb/c4-subset-for-mmlu-approx
Viewer
•
Updated
•
7
crumb/c4-subset-for-hellaswag-approx
Viewer
•
Updated
•
6
•
1