Rodolfo Amorim

Dolfini

AI & ML interests

None yet

Recent Activity

liked a model 18 days ago

reacted to reach-vb's post with 🚀 4 months ago

Smol models ftw! AMD released AMD OLMo 1B - beats OpenELM, tiny llama on MT Bench, Alpaca Eval - Apache 2.0 licensed 🔥 > Trained with 1.3 trillion (dolma 1.7) tokens on 16 nodes, each with 4 MI250 GPUs > Three checkpoints: - AMD OLMo 1B: Pre-trained model - AMD OLMo 1B SFT: Supervised fine-tuned on Tulu V2, OpenHermes-2.5, WebInstructSub, and Code-Feedback datasets - AMD OLMo 1B SFT DPO: Aligned with human preferences using Direct Preference Optimization (DPO) on UltraFeedback dataset Key Insights: > Pre-trained with less than half the tokens of OLMo-1B > Post-training steps include two-phase SFT and DPO alignment > Data for SFT: - Phase 1: Tulu V2 - Phase 2: OpenHermes-2.5, WebInstructSub, and Code-Feedback > Model checkpoints on the Hub & Integrated with Transformers ⚡️ Congratulations & kudos to AMD on a brilliant smol model release! 🤗 https://huggingface.co/collections/amd/amd-olmo-6723e7d04a49116d8ec95070

liked a model 4 months ago

allenai/tulu-2-dpo-70b

View all activity

Organizations

None yet

Dolfini's activity

liked a model 18 days ago

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 8 days ago • 673k • • 574

reacted to reach-vb's post with 🚀 4 months ago

Post

3065

Smol models ftw! AMD released AMD OLMo 1B - beats OpenELM, tiny llama on MT Bench, Alpaca Eval - Apache 2.0 licensed 🔥

> Trained with 1.3 trillion (dolma 1.7) tokens on 16 nodes, each with 4 MI250 GPUs

> Three checkpoints:

- AMD OLMo 1B: Pre-trained model
- AMD OLMo 1B SFT: Supervised fine-tuned on Tulu V2, OpenHermes-2.5, WebInstructSub, and Code-Feedback datasets
- AMD OLMo 1B SFT DPO: Aligned with human preferences using Direct Preference Optimization (DPO) on UltraFeedback dataset

Key Insights:
> Pre-trained with less than half the tokens of OLMo-1B
> Post-training steps include two-phase SFT and DPO alignment
> Data for SFT:
- Phase 1: Tulu V2
- Phase 2: OpenHermes-2.5, WebInstructSub, and Code-Feedback

> Model checkpoints on the Hub & Integrated with Transformers ⚡️

Congratulations & kudos to AMD on a brilliant smol model release! 🤗

amd/amd-olmo-6723e7d04a49116d8ec95070

liked a model 4 months ago

allenai/tulu-2-dpo-70b

Text Generation • Updated Jan 31, 2024 • 3.79k • 156