Muche

Truc95

AI & ML interests

None yet

Recent Activity

Organizations

None yet

Truc95's activity

reacted to merve's post with 😎 about 18 hours ago
small but mighty 🔥
you can fine-tune SmolVLM on an L4 with batch size of 4 and it will only take 16.4 GB VRAM 🫰🏻 also with gradient accumulation simulated batch size is 16 ✨
I made a notebook that includes all the goodies: QLoRA, gradient accumulation, gradient checkpointing with explanations on how they work 💝 https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
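To make the "simulated batch size" claim concrete, here is a minimal pure-Python sketch of gradient accumulation on a toy one-parameter model. It is not taken from the linked notebook. The batch size of 4 and the effective batch size of 16 come from the post, while the accumulation step count of 4 is inferred from those two numbers. The function names and toy data are hypothetical.

```python
# Toy illustration of gradient accumulation on the model y_hat = w * x.
# Names and data are illustrative assumptions, not the notebook's code.

def grad_mse(w, xs, ys):
    """Gradient w.r.t. w of the batch-averaged loss 0.5 * (w*x - y)**2."""
    return sum((w * x - y) * x for x, y in zip(xs, ys)) / len(xs)

def accumulated_grad(w, xs, ys, micro_batch_size):
    """Average micro-batch gradients, as gradient accumulation does."""
    n_steps = len(xs) // micro_batch_size
    total = 0.0
    for i in range(n_steps):
        lo, hi = i * micro_batch_size, (i + 1) * micro_batch_size
        # Scale each micro-batch gradient by 1 / n_steps before summing.
        total += grad_mse(w, xs[lo:hi], ys[lo:hi]) / n_steps
    return total

per_device_batch_size = 4   # from the post
effective_batch_size = 16   # from the post
# Inferred, not stated in the post: 16 / 4 = 4 accumulation steps.
accumulation_steps = effective_batch_size // per_device_batch_size

xs = [float(i) for i in range(1, effective_batch_size + 1)]
ys = [2.0 * x + 1.0 for x in xs]
w = 0.5

full_batch_grad = grad_mse(w, xs, ys)
accum_grad = accumulated_grad(w, xs, ys, per_device_batch_size)
```

With equal-sized micro-batches, `accum_grad` matches `full_batch_grad` up to floating-point error. That is why accumulating over several small batches can stand in for one large batch while only a small batch is held in memory at a time.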
New activity in HuggingFaceTB/SmolVLM-Instruct 5 days ago

Best option for DocQVA->JSON

1
#11 opened 5 days ago by Truc95
New activity in microsoft/Florence-2-large 5 months ago

How to increase max token output ?

1
#40 opened 5 months ago by Truc95