Antonio Esparza

Tonioesparza
ยท

AI & ML interests

Creative Technologies

Recent Activity

Organizations

Stable Diffusion concepts library's profile picture CreativesCombined's profile picture

Tonioesparza's activity

reacted to csabakecskemeti's post with ๐Ÿ‘ about 1 month ago
view post
Post
1955
-UPDATED-
4bit inference is working! The blogpost is updated with code snippet and requirements.txt
https://devquasar.com/uncategorized/all-about-amd-and-rocm/
-UPDATED-
I've played around with an MI100 and ROCm and collected my experience in a blogpost:
https://devquasar.com/uncategorized/all-about-amd-and-rocm/
Unfortunately I've could not make inference or training work with model loaded in 8bit or use BnB, but did everything else and documented my findings.
  • 4 replies
ยท
updated a Space 5 months ago
updated a Space 6 months ago
reacted to christopher's post with โค๏ธ 7 months ago
view post
Post
1331
4 million chess puzzles
reacted to soldni's post with โค๏ธ about 1 year ago
view post
Post
release day release day! OLMo 1b + 7b out today ๐Ÿฅณ and 65b coming soon...

With OLMo, we are really focused on advancing the study of LLMs. We release **everything**, from toolkit to create its training dataset (dolma) to training & inference code:

- OLMo paper: https://allenai.org/olmo/olmo-paper.pdf
- OLMo train code: https://github.com/allenai/OLMo
- OLMo eval code: https://github.com/allenai/OLMo-Eval
- OLMo 7b: allenai/OLMo-7B
- OLMo 1b: allenai/OLMo-1B
- Dolma paper: https://allenai.org/olmo/dolma-paper.pdf
- Dolma dataset v1.6: allenai/dolma
- Dolma toolkit v1.0: https://github.com/allenai/dolma
  • 2 replies
ยท
New activity in diffusers/sd-to-diffusers over 1 year ago

Gets stuck in 10% progress

1
#15 opened over 1 year ago by
Tonioesparza