Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections trending this week

Sanskrit Sahitya Embeddings

Embeddings of translated Sanskrit Texts

Mercity/ramayana-embeddings

Viewer • Updated 3 days ago • 18.8k • 34
Mercity/mahabharat-embeddings

Viewer • Updated 3 days ago • 73.8k • 38
Mercity/bhagavad_gita-embeddings

Viewer • Updated 2 days ago • 657 • 32

Kyro-n1.1 is an improved model in the Kyro family with better reasoning than n1. This model outperforms Kyro-n1 in all areas such as STEM: Open-Neo

open-neo/Kyro-n1.1-3B

Text Generation • Updated 1 day ago • 33 • 2
open-neo/Kyro-n1.1-7B

Text Generation • Updated 1 day ago • 2

TokenButler -- Predict token importance for all heads across the transformer in the first layer itself. Enable fine-grained token sparsity!

akhauriyash/DeepSeek-R1-Distill-Llama-8B-Butler

Text Generation • Updated about 21 hours ago • 32
akhauriyash/Llama-3.1-8B-Butler

Text Generation • Updated about 21 hours ago • 27
akhauriyash/Llama-2-7b-hf-Butler

Text Generation • Updated about 21 hours ago • 30
akhauriyash/Llama-3.2-3B-Butler

Text Generation • Updated about 21 hours ago • 18

CardProjector-v2

AlexBefest/CardProjector-14B-v2

Updated 4 days ago • 20 • 6
AlexBefest/CardProjector-7B-v2

Updated 4 days ago • 13 • 4
AlexBefest/CardProjector-14B-v2-GGUF

Updated 4 days ago • 644 • 4
AlexBefest/CardProjector-7B-v2-GGUF

Updated 4 days ago • 390

🤓Small-Thoughts

Distill thinking dataset more compactly and accurately!

SmallDoge/SmallThoughts

Viewer • Updated about 5 hours ago • 51k • 1.51k • 28

Official models for DiffCLIP: Differential Attention Meets CLIP

hammh0a/ViTB16_CC3M

Updated 5 days ago
hammh0a/ViTB16_CC12M

Updated 5 days ago
hammh0a/DiffCLIP_ViTB16_CC3M

Updated 5 days ago
hammh0a/DiffCLIP_ViTB16_CC12M

Updated 5 days ago

Llama-3.3-Swallow

tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4

Text Generation • Updated 5 days ago • 578 • 1
tokyotech-llm/Llama-3.3-Swallow-70B-v0.4

Text Generation • Updated 5 days ago • 59 • 2
tokyotech-llm/edu-classifier

Text Classification • Updated Jan 30 • 1.68k • 10

Romansetu is a collection of models address the challenge of extending Large Language Models (LLMs) to non-English languages using non-Latin scripts

ai4bharat/romansetu-cpt-roman-100m

Updated 7 days ago • 22
ai4bharat/romansetu-cpt-roman-200m

Updated 7 days ago • 28
ai4bharat/romansetu-cpt-native-300m

Updated 7 days ago • 14
ai4bharat/romansetu-cpt-native-400m

Updated 7 days ago • 16

Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs.

amd/Instella-3B-Stage1

Text Generation • Updated 8 days ago • 166 • 12
amd/Instella-3B

Text Generation • Updated 8 days ago • 729 • 31
amd/Instella-3B-SFT

Text Generation • Updated 8 days ago • 167 • 8
amd/Instella-3B-Instruct

Text Generation • Updated 8 days ago • 1.24k • 34

Dataset Collection of Skywork-R1V

about 8 hours ago

Previous
1
...
6
7
8
9
10
...
10,237
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs