Models I pre-trained by initialising SMoE models from dense model weights, following the upcycling process used for Qwen1.5-MoE-A2.7B (or something similar)
Gabriel Martín Blázquez
gabrielmbmb
AI & ML interests
ML Engineer
Organizations
Collections
4
spaces
1
models
8
gabrielmbmb/Upcycled-Qwen1.5-MoE2.7B-LoRA-merged-32-2000-steps-adapter • Updated • 1
gabrielmbmb/Upcycled-Qwen1.5-MoE2.7B-LoRA-merged-32-2000-steps • Text Generation • Updated
gabrielmbmb/Upcycled-Qwen1.5-MoE2.7B-LoRA-merged-32 • Text Generation • Updated • 1
gabrielmbmb/Upcycled-Qwen1.5-MoE2.7B • Text Generation • Updated • 2 • 1
gabrielmbmb/Upcycled-Qwen1.5-MoE2.7B-LoRA-merged • Text Generation • Updated • 1
gabrielmbmb/Upcycled-Qwen1.5-MoE2.7B-LoRA • Updated • 1
gabrielmbmb/Genstruct-7B-AWQ • Text Generation • Updated • 15
gabrielmbmb/finbert • Text Classification • Updated • 42
datasets
17
gabrielmbmb/vllm-structured-generation • Viewer • Updated
gabrielmbmb/testing-vllm • Viewer • Updated
gabrielmbmb/test • Viewer • Updated
gabrielmbmb/alpaca-garbage-collected • Viewer • Updated
gabrielmbmb/wikipedia_genstruct_dpo • Viewer • Updated
gabrielmbmb/deitaset • Viewer • Updated
gabrielmbmb/test-complexity-scorer • Viewer • Updated
gabrielmbmb/test-distilabel • Viewer • Updated
gabrielmbmb/wikipedia_es_genstruct_v2_iter_1 • Viewer • Updated
gabrielmbmb/wikipedia_es_genstruct_v2 • Viewer • Updated