meta-llama/Llama-3.3-70B-Instruct Text Generation ā¢ Updated about 11 hours ago ā¢ 47.4k ā¢ ā¢ 821
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models Paper ā¢ 2408.15518 ā¢ Published Aug 28 ā¢ 42
The Mamba in the Llama: Distilling and Accelerating Hybrid Models Paper ā¢ 2408.15237 ā¢ Published Aug 27 ā¢ 37 ā¢ 4