Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
3
6
ALex
Nessit
Follow
0 followers
·
2 following
AI & ML interests
None yet
Recent Activity
new
activity
27 days ago
google/gemma-3-4b-it:
VRAM not freed during long generations (Gemma, max_new_tokens=3000)
new
activity
about 2 months ago
RefalMachine/RuadaptQwen2.5-14B-Instruct:
потеря внимания
liked
a model
about 2 months ago
RefalMachine/RuadaptQwen2.5-14B-Instruct
View all activity
Organizations
None yet
Nessit
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
google/gemma-3-4b-it
27 days ago
VRAM not freed during long generations (Gemma, max_new_tokens=3000)
3
#29 opened 28 days ago by
Nessit
New activity in
RefalMachine/RuadaptQwen2.5-14B-Instruct
about 2 months ago
потеря внимания
4
#1 opened about 2 months ago by
Nessit
liked
a model
about 2 months ago
RefalMachine/RuadaptQwen2.5-14B-Instruct
Text Generation
•
Updated
Feb 3
•
11.2k
•
4
New activity in
yandex/YandexGPT-5-Lite-8B-pretrain
about 2 months ago
chat_template будет?
4
#7 opened about 2 months ago by
Nessit
liked
a model
2 months ago
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation
•
Updated
Jan 12
•
390k
•
•
1.79k
liked
4 models
3 months ago
Qwen/Qwen2.5-Coder-14B-Instruct
Text Generation
•
Updated
Jan 12
•
57.8k
•
105
Qwen/Qwen2.5-14B-Instruct
Text Generation
•
Updated
Sep 25, 2024
•
846k
•
227
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
Text Generation
•
Updated
Feb 24
•
914k
•
•
499
microsoft/phi-4
Text Generation
•
Updated
Feb 24
•
422k
•
•
2k