Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections trending this week

Llama3-ChatQA-1.5

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG).

nvidia/Llama3-ChatQA-1.5-8B

Text Generation • Updated about 7 hours ago • 21.3k • 436
nvidia/Llama3-ChatQA-1.5-70B

Text Generation • Updated about 7 hours ago • 3.6k • 249
nvidia/ChatRAG-Bench

Updated 1 day ago • 2.93k • 56
nvidia/ChatQA-Training-Data

Viewer • Updated 1 day ago • 2.01k • 106

4bit Instruct Models

unsloth/llama-3-8b-Instruct-bnb-4bit

Text Generation • Updated 1 day ago • 218k • 69
unsloth/mistral-7b-instruct-v0.2-bnb-4bit

Text Generation • Updated Mar 22 • 142k • 26
unsloth/llama-3-70b-Instruct-bnb-4bit

Text Generation • Updated 1 day ago • 16.8k • 28
unsloth/gemma-7b-it-bnb-4bit

Text Generation • Updated 29 days ago • 6.82k • 12

LLaVA-NeXT-Video

Some powerful video models.

lmms-lab/LLaVA-NeXT-Video-7B

Text Generation • Updated 24 days ago • 495 • 15
lmms-lab/LLaVA-NeXT-Video-34B

Text Generation • Updated 24 days ago • 173 • 13
lmms-lab/LLaVA-NeXT-Video-7B-DPO

Text Generation • Updated 8 days ago • 922 • 7
lmms-lab/LLaVA-NeXT-Video-34B-DPO

Text Generation • Updated 13 days ago • 238 • 3

MoEs papers reading list

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

Paper • 1701.06538 • Published Jan 23, 2017 • 4
Sparse Networks from Scratch: Faster Training without Losing Performance

Paper • 1907.04840 • Published Jul 10, 2019 • 3
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models

Paper • 1910.02054 • Published Oct 4, 2019 • 3
A Mixture of h-1 Heads is Better than h Heads

Paper • 2005.06537 • Published May 13, 2020 • 2

Nous' Flagship LLM Series

NousResearch/Hermes-2-Theta-Llama-3-8B

Text Generation • Updated 3 days ago • 915 • 61
NousResearch/Hermes-2-Pro-Llama-3-8B

Text Generation • Updated 5 days ago • 22.7k • 324
NousResearch/Hermes-2-Pro-Mistral-7B

Text Generation • Updated 17 days ago • 37.2k • 458
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF

Updated Mar 28 • 36.6k • 208

Scaling up instruction data from the web for to build better LLMs

TIGER-Lab/MAmmoTH2-7B

Text Generation • Updated 29 minutes ago • 32.3k
TIGER-Lab/MAmmoTH2-7B-Plus

Text Generation • Updated 7 days ago • 456 • 1
TIGER-Lab/MAmmoTH2-8B

Text Generation • Updated 7 days ago • 6 • 1
TIGER-Lab/MAmmoTH2-8B-Plus

Text Generation • Updated 7 days ago • 211 • 11

yentinglin/Llama-3-Taiwan-70B-Instruct-rc2

Text Generation • Updated 3 days ago • 6
yentinglin/Llama-3-Taiwan-70B-Instruct-rc1

Text Generation • Updated 2 days ago • 26 • 2
yentinglin/Llama-3-Taiwan-8B-Instruct-rc1

Text Generation • Updated 4 days ago • 11 • 4
Measuring Taiwanese Mandarin Language Understanding

Paper • 2403.20180 • Published Mar 29 • 3

Collection of models trained on the CommonCatalogue datasets

Running on Zero

24

🌖

CommonCanvas Demo

SD arch trained from scratch on Creative Commons dataset
common-canvas/CommonCanvas-XL-C

Text-to-Image • Updated about 11 hours ago • 120 • 9
common-canvas/CommonCanvas-XL-NC

Text-to-Image • Updated 1 day ago • 108 • 6
common-canvas/CommonCanvas-S-C

Text-to-Image • Updated 1 day ago • 167 • 5

My experiments with Llama-3 models

MaziyarPanahi/Llama-3-70B-Instruct-DPO-v0.1

Text Generation • Updated 9 days ago • 1.49k • 8
MaziyarPanahi/Llama-3-70B-Instruct-DPO-v0.2

Text Generation • Updated 9 days ago • 987 • 2
MaziyarPanahi/Llama-3-70B-Instruct-DPO-v0.3

Text Generation • Updated 9 days ago • 1.38k • 2
MaziyarPanahi/Llama-3-70B-Instruct-DPO-v0.4

Text Generation • Updated 9 days ago • 1.41k • 9

CodeGemma Release

google/codegemma-7b

Text Generation • Updated Apr 16 • 7.11k • 119
google/codegemma-7b-it-GGUF

Text Generation • Updated Apr 9 • 434 • 34
google/codegemma-2b-GGUF

Text Generation • Updated Apr 9 • 171 • 15
google/codegemma-7b-GGUF

Text Generation • Updated Apr 9 • 141 • 14

Previous
1
2
3
4
5
...
3,939
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs