š§"raw" pretrained smol_llama checkpoints - WIP š§
-
BEE-spoke-data/smol_llama-101M-GQA
Text Generation ⢠Updated ⢠520 ⢠28 -
BEE-spoke-data/smol_llama-81M-tied
Text Generation ⢠Updated ⢠15 ⢠6 -
BEE-spoke-data/smol_llama-220M-GQA
Text Generation ⢠Updated ⢠159 ⢠12 -
BEE-spoke-data/verysmol_llama-v11-KIx2
Text Generation ⢠Updated ⢠11 ⢠4