Model assets for the first Mixture-of-Lora technique applied to Llama. https://bit.ly/48bqshl
Maxine
crumb
AI & ML interests
im 19, ive had a special interest in artificial intelligence since middle school, i like innovating in compute-restricted environments
Organizations
Collections
5
models
130
crumb/scd-pythia-70m
Updated
crumb/model-a-48.5m
Text Generation
•
Updated
•
2.68k
•
2
crumb/nano-mistral
Text Generation
•
Updated
•
2.71k
•
3
crumb/apricot-wildflower-20
Text Generation
•
Updated
•
4.28k
•
2
crumb/testing-cot-nothing-to-see-here-7b
Updated
crumb/minipile-111m
Text Generation
•
Updated
•
3
crumb/askmistral-2-15-111m
Text Generation
•
Updated
•
4
•
1
crumb/askmistral-2-15-tophalf-111m
Text Generation
•
Updated
•
3
crumb/Llama-p-small
Text Generation
•
Updated
crumb/GLORT2
Text Generation
•
Updated
•
2
datasets
49
crumb/semantic-corruption-t5-v1_1-small
Viewer
•
Updated
•
5
crumb/semantic-corruption-t5-v1_1-base
Viewer
•
Updated
•
5
crumb/semantic-corruption-t5-v1_1-large
Viewer
•
Updated
•
5
crumb/reup-test
Viewer
•
Updated
•
5
crumb/redbeam-c4-rated-vit-l14-laion
Viewer
•
Updated
•
9
crumb/dummy-cot-sampling-dataset-clean-preview
Viewer
•
Updated
crumb/askmistral-pile-2-15
Viewer
•
Updated
•
51
•
6
crumb/dummy-cot-sampling-dataset
Viewer
•
Updated
crumb/deduped-pile-askmistral-shard1-top1-in-4
Viewer
•
Updated
•
25
crumb/askmistral-pile-011-filtered
Viewer
•
Updated