Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2407.11062

The official prequantized EfficientQAT models.

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128

Text Generation • Updated about 10 hours ago • 8
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64

Text Generation • Updated about 10 hours ago • 4
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128

Text Generation • Updated about 11 hours ago • 3
ChenMnZ/Llama-3-8b-EfficientQAT-w4g128

Text Generation • Updated about 11 hours ago • 4

Papers - Quantization - AQLM

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Paper • 2407.11062 • Published 12 days ago • 3

Papers - Quantization - EfficientQAT

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Paper • 2407.11062 • Published 12 days ago • 3

Papers - Quantization

QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 44
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Paper • 2407.11062 • Published 12 days ago • 3
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models

Paper • 2407.12327 • Published 5 days ago • 62

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Paper • 2404.15653 • Published Apr 24 • 25
MoDE: CLIP Data Experts via Clustering

Paper • 2404.16030 • Published Apr 24 • 11
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Paper • 2405.12130 • Published May 20 • 44
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention

Paper • 2405.12981 • Published May 21 • 26

Foundation Models and Tools

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16 • 75
bigcode/starcoder2-15b

Text Generation • Updated Jun 5 • 22.2k • 539
Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 120
mixedbread-ai/mxbai-rerank-large-v1

Text Classification • Updated about 1 hour ago • 40.7k • 83

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs