Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
74.8
TFLOPS
32
31
AItomek ;P
altomek
Follow
Aspik101's profile picture
Nexesenex's profile picture
guidevops's profile picture
18 followers
·
113 following
altomek
AI & ML interests
MLOps, Polish LLMs
Recent Activity
replied
to
bartowski
's
post
11 days ago
Looks like Q4_0_N_M file types are going away Before you panic, there's a new "preferred" method which is online (I prefer the term on-the-fly) repacking, so if you download Q4_0 and your setup can benefit from repacking the weights into interleaved rows (what Q4_0_4_4 was doing), it will do that automatically and give you similar performance (minor losses I think due to using intrinsics instead of assembly, but intrinsics are more maintainable) You can see the reference PR here: https://github.com/ggerganov/llama.cpp/pull/10446 So if you update your llama.cpp past that point, you won't be able to run Q4_0_4_4 (unless they add backwards compatibility back), but Q4_0 should be the same speeds (though it may currently be bugged on some platforms) As such, I'll stop making those newer model formats soon, probably end of this week unless something changes, but you should be safe to download and Q4_0 quants and use those ! Also IQ4_NL supports repacking though not in as many shapes yet, but should get a respectable speed up on ARM chips, PR for that can be found here: https://github.com/ggerganov/llama.cpp/pull/10541 Remember, these are not meant for Apple silicon since those use the GPU and don't benefit from the repacking of weights
replied
to
bartowski
's
post
11 days ago
Looks like Q4_0_N_M file types are going away Before you panic, there's a new "preferred" method which is online (I prefer the term on-the-fly) repacking, so if you download Q4_0 and your setup can benefit from repacking the weights into interleaved rows (what Q4_0_4_4 was doing), it will do that automatically and give you similar performance (minor losses I think due to using intrinsics instead of assembly, but intrinsics are more maintainable) You can see the reference PR here: https://github.com/ggerganov/llama.cpp/pull/10446 So if you update your llama.cpp past that point, you won't be able to run Q4_0_4_4 (unless they add backwards compatibility back), but Q4_0 should be the same speeds (though it may currently be bugged on some platforms) As such, I'll stop making those newer model formats soon, probably end of this week unless something changes, but you should be safe to download and Q4_0 quants and use those ! Also IQ4_NL supports repacking though not in as many shapes yet, but should get a respectable speed up on ARM chips, PR for that can be found here: https://github.com/ggerganov/llama.cpp/pull/10541 Remember, these are not meant for Apple silicon since those use the GPU and don't benefit from the repacking of weights
liked
a model
16 days ago
TeeZee/DarkForest-20B-v3.0
View all activity
Organizations
altomek
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a model
16 days ago
TeeZee/DarkForest-20B-v3.0
Text Generation
•
Updated
Jun 15
•
29
•
8
liked
a model
17 days ago
cognitivecomputations/WizardLM-33B-V1.0-Uncensored
Text Generation
•
Updated
Mar 4
•
1.13k
•
60
liked
a dataset
2 months ago
LLM360/TxT360
Preview
•
Updated
Nov 8
•
113k
•
216
liked
a model
4 months ago
FourOhFour/Deedlit_4B
Text Generation
•
Updated
Sep 2
•
25
•
3
liked
a dataset
5 months ago
Alignment-Lab-AI/EmotionDialogue
Viewer
•
Updated
Jul 17
•
23.1k
•
11
•
3
liked
3 models
6 months ago
01-ai/Yi-1.5-34B-Chat
Text Generation
•
Updated
Aug 27
•
9.95k
•
•
258
mistralai/Mistral-7B-Instruct-v0.3
Text Generation
•
Updated
Aug 21
•
3.52M
•
•
1.2k
Sao10K/L3-8B-Stheno-v3.2
Text Generation
•
Updated
Jun 7
•
7.54k
•
253
liked
a model
7 months ago
01-ai/Yi-1.5-34B
Text Generation
•
Updated
Jun 26
•
3.78k
•
47
liked
a model
8 months ago
speakleash/Bielik-7B-Instruct-v0.1
Text Generation
•
Updated
Oct 26
•
5.06k
•
57
liked
a Space
8 months ago
Restarting
on
CPU Upgrade
52
🏆🇵🇱
Open PL LLM Leaderboard
liked
3 models
9 months ago
codellama/CodeLlama-70b-Python-hf
Text Generation
•
Updated
Apr 12
•
301
•
107
w4r10ck/SOLAR-10.7B-Instruct-v1.0-uncensored
Text Generation
•
Updated
Oct 19
•
81
•
30
fblgit/UNA-SOLAR-10.7B-Instruct-v1.0
Text Generation
•
Updated
Dec 22, 2023
•
81
•
16
liked
a Space
9 months ago
Running
391
🏆🏋️
LLM-Perf Leaderboard
liked
5 models
10 months ago
eryk-mazus/polka-1.1b-chat
Text Generation
•
Updated
Sep 20
•
2.6k
•
18
Aspik101/vicuna-13b-v1.5-PL-lora_unload
Text Generation
•
Updated
Aug 3, 2023
•
777
•
2
piotr-ai/polanka-7b-v0.1
Text Generation
•
Updated
Nov 14, 2023
•
1.91k
•
8
upstage/SOLAR-10.7B-Instruct-v1.0
Text Generation
•
Updated
Sep 10
•
61.1k
•
619
oobabooga/CodeBooga-34B-v0.1
Text Generation
•
Updated
Jan 2
•
3.44k
•
144
Load more