alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN...

AtAndDev's activity

reacted to BestWishYsh's post with 👀🔥 about 4 hours ago

Post

1687

🚨 Hot Take: GPT-4o might NOT be a purely autoregressive model! 🚨

There’s a high chance it has a diffusion head. 🤯 If true, this could be a game-changer for AI architecture. What do you think? 🤔👇

Code: https://github.com/PicoTrex/GPT-ImgEval
Paper: GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation (2504.02782)

reacted to clem's post with 🔥 about 4 hours ago

Post

1524

Llama models (arguably the most successful open AI models of all time) represented just 3% of total model downloads on Hugging Face in March.

People and media like stories of winner takes all & one model/company to rule them all but the reality is much more nuanced than this!

Kudos to all the small AI builders out there!

2 replies

updated a model about 5 hours ago

AtAndDev/lora_model

Updated about 5 hours ago

reacted to Jaward's post with 🔥 about 5 hours ago

Post

1715

Amazing work👏
Introduces Dream 7B - a discrete diffusion reasoning model, fully open-sourced with weights on 🤗
- it outperforms existing non-autoregressive models and matches or beats frontier autoregressive models of similar size on reasoning tasks.
Models:
- base: Dream-org/Dream-v0-Base-7B
- SFT: Dream-org/Dream-v0-Instruct-7B
Code: https://github.com/HKUNLP/Dream
Project: https://hkunlp.github.io/blog/2025/dream/

1 reply

published a model about 5 hours ago

AtAndDev/lora_model

Updated about 5 hours ago

liked 3 models 3 days ago

liked 2 datasets 3 days ago

MMInstruction/Clevr_CoGenT_TrainA_R1

Viewer • Updated Feb 13 • 37.8k • 1.27k • 43

MMInstruction/Clevr_CoGenT_TrainA_70K_Complex

Viewer • Updated Feb 5 • 70k • 761 • 4

reacted to openfree's post with ❤️👀🚀🔥 3 days ago

Post

4918

🔥 'Open Meme Studio': Your Creative Meme Factory 🎭✨

Hello everyone! Today I'm introducing 'Open Meme Studio', an amazing space where you can easily create and transform fun and original meme images. 🚀

VIDraft/Open-Meme-Studio

🎯 Taking Meme Creation to the Next Level!
This application leverages the powerful Kolors model and IP-Adapter-Plus to upgrade your meme-making abilities. Go beyond simple image editing and experience a completely new meme world powered by AI!

🛠️ Features You'll Love

📸 Transform and reinterpret existing meme templates
🎭 Freely change expressions and poses
👓 Add props (sunglasses, hats, etc.)
🏞️ Change backgrounds and composite characters
🎨 Apply various artistic styles

💪 Why 'Open Meme Studio' is So Effective

Fast Meme Generation: High-quality memes completed in seconds
Unlimited Creativity: Completely different results just by changing prompts
User-Friendly Interface: Simple prompt input and image upload are all you need
Fine-tuned Control: Adjust how much of the original image characteristics to preserve
Advanced User Options: Freely set seed values, resolution, number of steps, and more

🚀 Streamlined Meme Creation Process
Tasks that previously required complex tools like Photoshop can now be accomplished with just a few simple prompts. Experience intuitive image manipulation through text commands.

🌈 Effective Prompt Examples

😎 "sunglass" - Add cool sunglasses to your character
🏔️ "background alps" - Change the background to Alpine mountains
💃 "dancing" - Transform your character into a dancing pose
😁 "smile" - Change to a smiling expression
🎮 "with Pikachu" - Create a scene with Pikachu
🎨 "3d style" - Convert to 3D style

🔗 Join Our Community
For more meme creation tips and interaction with other users, join our Discord!
https://discord.gg/openfreeai

Start creating unique memes that will shake up social media with 'Open Meme Studio' right now! 🚀💯 It's time for your meme

1 reply

reacted to burtenshaw's post with 🚀❤️🤗 6 days ago

Post

2523

NEW UNIT in the Hugging Face Reasoning course. We dive deep into the algorithm behind DeepSeek R1 with an advanced and hands-on guide to interpreting GRPO.

🔗 reasoning-course

This unit is super useful if you’re tuning models with reinforcement learning. It will help with:

- interpreting loss and reward progression during training runs
- selecting effective parameters for training
- reviewing and defining effective reward functions

This unit also works up smoothly toward the existing practical exercises from @mlabonne and Unsloth.

📣 Shout out to @ShirinYamani who wrote the unit. Follow for more great content.

1 reply

reacted to onekq's post with 👀 6 days ago

Post

2237

Open source models are immutable, and this is a big pain.

When you open source a piece of software, users leave their feedback via issues or PRs. You can merge their feedback in semi-real time, which creates a positive cycle. Then you have a community.

LLMs don't have these nice micro steps. There are no hot fixes. Even a minor version bump is an endeavor. I'm quite confident my model is being used by teams somewhere. But until the next launch, it's awfully quiet.

I don't know the solution. Just a regular lament before the weekend. 🤗

3 replies

reacted to Aurelien-Morgan's post with 🔥 7 days ago

Post

1932

Almost there!
https://test.pypi.org/project/test-010-retrain-pipelines/