Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
13.5
TFLOPS
37
36
65
Marc Sun
marcsun13
Follow
mdouglas's profile picture
cataluna84's profile picture
kirch's profile picture
91 followers
·
129 following
_marcsun
SunMarc
AI & ML interests
LLM, Quantization, Training, Inference
Articles
Fine-tuning LLMs to 1.58bit: extreme quantization made easy
Sep 18
•
172
Accelerate 1.0.0
Sep 13
•
48
Llama 3.1 - 405B, 70B & 8B with multilinguality and long context
Jul 23
•
205
quanto: a pytorch quantization toolkit
Mar 18
•
28
Overview of natively supported quantization schemes in 🤗 Transformers
Sep 12, 2023
•
10
Making LLMs lighter with AutoGPTQ and transformers
Aug 23, 2023
•
30
Organizations
models
14
Sort:Â Recently updated
marcsun13/Meta-Llama-3-8B-torchao-int8_weight_only
Updated
4 days ago
•
4
marcsun13/sft_openassistant-guanaco
Text Generation
•
Updated
Jul 5
•
5
marcsun13/gemma-2-27b-it-bnb-colab
Text Generation
•
Updated
Jul 4
•
23
marcsun13/gemma-2-9b-it-GPTQ
Text Generation
•
Updated
Jul 3
•
369
•
2
marcsun13/test_push_checkpoint
Fill-Mask
•
Updated
Jun 28
•
28
marcsun13/Mixtral-8x7B-v0.1-GPTQ
Text Generation
•
Updated
Dec 11, 2023
•
11
marcsun13/Mixtral-tiny-GPTQ
Text Generation
•
Updated
Dec 11, 2023
•
64
marcsun13/Llama-2-13B-AWQ
Text Generation
•
Updated
Nov 6, 2023
•
9
marcsun13/opt-125m-awq
Text Generation
•
Updated
Oct 30, 2023
•
31
marcsun13/opt-350m-gptq-4bit
Text Generation
•
Updated
Jul 31, 2023
•
472
Expand 14 models
datasets
None public yet