alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN, etc.

Recent Activity

posted an update about 8 hours ago

Llama 4 is out...

upvoted an article about 8 hours ago

Welcome Llama 4 Maverick & Scout on Hugging Face!

reacted to BestWishYsh's post with 👀 1 day ago

🚨 Hot Take: GPT-4o might NOT be a purely autoregressive model! 🚨 There’s a high chance it has a diffusion head. 🤯 If true, this could be a game-changer for AI architecture. What do you think? 🤔👇 Code: https://github.com/PicoTrex/GPT-ImgEval Paper: https://huggingface.co/papers/2504.02782

AtAndDev's activity

upvoted an article about 8 hours ago

Welcome Llama 4 Maverick & Scout on Hugging Face!

Article • 2 days ago • 86

upvoted a paper 8 days ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 123

upvoted a collection 9 days ago

Llama 3.2

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 27 items • Updated 1 day ago • 60

upvoted a paper 17 days ago

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published 21 days ago • 29

upvoted a collection 17 days ago

Gemma 2 Release

15 items • Updated 4 days ago • 217

upvoted a paper 18 days ago

Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation

Paper • 2503.13070 • Published 21 days ago • 9

upvoted a collection 22 days ago

Gemma 3

All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated 1 day ago • 48

upvoted a paper 22 days ago

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Paper • 2502.04328 • Published Feb 6 • 30

upvoted 2 articles 22 days ago

LeRobot goes to driving school: World’s largest open-source self-driving dataset

Article • 27 days ago • 73

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • 26 days ago • 373

upvoted a paper 23 days ago

Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models

Paper • 2503.09669 • Published 25 days ago • 35

upvoted a collection 25 days ago

Gemma 3 Release

17 items • Updated 4 days ago • 311

upvoted 2 collections 2 months ago

DeepSeek-R1

8 items • Updated Jan 21 • 598

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated Jan 14 • 79

upvoted a paper 2 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 104