Hugging Face
Burning ray
adarksky
0 followers · 5 following
AI & ML interests
None yet
Recent Activity
liked a model 15 days ago: deepseek-ai/Janus-1.3B
reacted to merve's post with 🔥 about 1 month ago
small but mighty 🔥 you can fine-tune SmolVLM on an L4 with batch size of 4 and it will only take 16.4 GB VRAM 🫰🏻 also with gradient accumulation simulated batch size is 16 ✨ I made a notebook that includes all the goodies: QLoRA, gradient accumulation, gradient checkpointing with explanations on how they work https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
liked a model about 2 months ago: Qwen/Qwen2.5-Coder-32B-Instruct
Organizations
spaces
2
Summer24 Fine Tuning (Sleeping)
What Panda (Sleeping)
models
4
adarksky/pokemon-DDPM • Unconditional Image Generation • Updated Nov 11, 2024 • 5
adarksky/bart-base-rel-therapy • Text2Text Generation • Updated Nov 11, 2024 • 10
adarksky/president-gpt2 • Text Generation • Updated Jul 4, 2024 • 25
adarksky/biden-gpt2 • Text Generation • Updated Jul 3, 2024 • 20
datasets
None public yet