Model assets for the first Mixture-of-Lora technique applied to Llama. https://bit.ly/48bqshl
cephaloform
crumb
AI & ML interests
im 19, ive had a special interest in artificial intelligence since middle school, i like innovating in compute-restricted environments
Organizations
Collections
5
models
138
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6079949388160e14e4e2e499/oqHKgCTLJJcUmA6NtQg7D.png)
crumb/L3.1-tokenizer
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6079949388160e14e4e2e499/oqHKgCTLJJcUmA6NtQg7D.png)
crumb/utf8-gelu-dec-8.5M-10KB-ctx-3GB
Text Generation
•
Updated
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6079949388160e14e4e2e499/oqHKgCTLJJcUmA6NtQg7D.png)
crumb/utf8-relu-dec-8.5M-10KB-ctx-3GB
Text Generation
•
Updated
•
56
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6079949388160e14e4e2e499/oqHKgCTLJJcUmA6NtQg7D.png)
crumb/92d52f-ame-full-7B
Text Generation
•
Updated
•
28
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6079949388160e14e4e2e499/oqHKgCTLJJcUmA6NtQg7D.png)
crumb/a3843d-augmented-mappings-medium-experimental
Text Generation
•
Updated
•
28
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6079949388160e14e4e2e499/oqHKgCTLJJcUmA6NtQg7D.png)
crumb/13f189-augmented-mappings-medium-control
Text Generation
•
Updated
•
13
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6079949388160e14e4e2e499/oqHKgCTLJJcUmA6NtQg7D.png)
crumb/3e8e6c-augmented-mappings-small-experimental
Text Generation
•
Updated
•
12
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6079949388160e14e4e2e499/oqHKgCTLJJcUmA6NtQg7D.png)
crumb/536137-augmented-mappings-small-control
Text Generation
•
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6079949388160e14e4e2e499/oqHKgCTLJJcUmA6NtQg7D.png)
crumb/gpt2-medium-eb49cc
Text Generation
•
Updated
•
8
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6079949388160e14e4e2e499/oqHKgCTLJJcUmA6NtQg7D.png)
crumb/model-a-48.5m
Text Generation
•
Updated
•
300
•
2
datasets
50
crumb/songbird
Viewer
•
Updated
•
12M
•
12
crumb/semantic-corruption-t5-v1_1-small
Viewer
•
Updated
•
2.28k
crumb/semantic-corruption-t5-v1_1-base
Updated
crumb/semantic-corruption-t5-v1_1-large
Updated
crumb/reup-test
Viewer
•
Updated
•
2k
crumb/redbeam-c4-rated-vit-l14-laion
Updated
crumb/dummy-cot-sampling-dataset-clean-preview
Updated
crumb/askmistral-pile-2-15
Viewer
•
Updated
•
2.34M
•
56
•
6
crumb/dummy-cot-sampling-dataset
Updated
crumb/deduped-pile-askmistral-shard1-top1-in-4
Viewer
•
Updated
•
1M