Tom Jobbins
PRO
TheBloke
22488 followers · 16 following
TheBlokeAI
TheBloke
AI & ML interests
LLM: quantisation, fine tuning
Articles
Making LLMs lighter with AutoGPTQ and transformers
Aug 23, 2023
•
37
Organizations
TheBloke's activity
New activity in
TheBloke/Llama-2-70B-GGUF
11 months ago
fix for join commands
1
#2 opened 11 months ago by
orbiter
New activity in
TheBloke/CodeLlama-70B-hf-AWQ
11 months ago
not codelama
3
#1 opened 11 months ago by
luckiskind
New activity in
TheBloke/MegaDolphin-120b-GPTQ
12 months ago
Missing files
2
#1 opened 12 months ago by
Danne980
New activity in
TheBloke/Fennec-Mixtral-8x7B-GGUF
12 months ago
Is this the same model with orangetin/OpenHermes-Mixtral-8x7B ?
2
#2 opened 12 months ago by
bingw5
New activity in
Yhyu13/LMCocktail-10.7B-v1
about 1 year ago
Quant pls
4
#1 opened about 1 year ago by
Yhyu13
New activity in
VAGOsolutions/SauerkrautLM-SOLAR-Instruct
about 1 year ago
Quants uploading now
1
#4 opened about 1 year ago by
TheBloke
New activity in
ddh0/OrcaMaidXL-17B-32k
about 1 year ago
Add YaRN modeling code
#1 opened about 1 year ago by
TheBloke
New activity in
TheBloke/openchat-3.5-1210-AWQ
about 1 year ago
Update special_tokens_map.json
#1 opened about 1 year ago by
alpayariyak
New activity in
TheBloke/openchat-3.5-1210-GPTQ
about 1 year ago
Update special_tokens_map.json
#1 opened about 1 year ago by
alpayariyak
New activity in
TheBloke/Llama-2-13B-Chat-Dutch-AWQ
about 1 year ago
Update to new format and include chat template with default system message
#1 opened about 1 year ago by
BramVanroy
New activity in
TheBloke/Llama-2-13B-Chat-Dutch-GPTQ
about 1 year ago
Update to new format and include chat template with default system message
#1 opened about 1 year ago by
BramVanroy
New activity in
ddh0/Norocetacean-20b-10k
about 1 year ago
Update configuration_llama.py, required to get model to Load as `rope_scaling` needs to be None, or else a dictionary
#1 opened about 1 year ago by
TheBloke
New activity in
TheBloke/Rogue-Rose-103b-v0.2-GPTQ
about 1 year ago
Missing Model
1
#1 opened about 1 year ago by
Razzor9000
New activity in
Undi95/Mixtral-8x7B-MoE-RP-Story
about 1 year ago
Model config.json has Mistral params instead of Mixtral, breaking ExLlama quants and maybe affecting others too
#3 opened about 1 year ago by
TheBloke
New activity in
TheBloke/SOLAR-10.7B-Instruct-v1.0-GGUF
about 1 year ago
vocabulary maybe wrong
3
#1 opened about 1 year ago by
limoncc
New activity in
TheBloke/openchat-3.5-1210-GGUF
about 1 year ago
Sorta Broken
6
#1 opened about 1 year ago by
dillfrescott
New activity in
TheBloke/Mixtral-8x7B-v0.1-GPTQ
about 1 year ago
RuntimeError: shape '[32, 8]' is invalid for input of size 0
7
#5 opened about 1 year ago by
woldeM
New activity in
mattshumer/mistral-8x7b-chat
about 1 year ago
Quant pls
1
#5 opened about 1 year ago by
Yhyu13
New activity in
TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ
about 1 year ago
Did anyone get it to run?
11
#1 opened about 1 year ago by
dimaischenko
New activity in
TheBlokeAI/Mixtral-tiny-GPTQ
about 1 year ago
Seems like the GPTQ versions are broken
4
#2 opened about 1 year ago by
NePe