132 12 334

Djuunaa

djuna

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

rubenroy/Gilgamesh-72B

liked a model 1 day ago

RLHFlow/Decision-Tree-Reward-Llama-3.1-8B

liked a model 2 days ago

RLHFlow/Decision-Tree-Reward-Gemma-2-27B

View all activity

Organizations

djuna's activity

liked 2 models 1 day ago

rubenroy/Gilgamesh-72B

Text Generation • Updated 1 day ago • 20 • 4

RLHFlow/Decision-Tree-Reward-Llama-3.1-8B

Text Classification • Updated 12 days ago • 292 • 1

liked 2 models 2 days ago

RLHFlow/Decision-Tree-Reward-Gemma-2-27B

Text Classification • Updated 12 days ago • 58 • 2

nicolinho/QRM-Gemma-2-27B

Updated 24 days ago • 1.09k • 3

liked 3 models 3 days ago

liked a model 5 days ago

nbeerbower/Dumpling-Qwen2.5-1.5B

Text Generation • Updated 5 days ago • 34 • 1

reacted to Jaward's post with 🔥 5 days ago

Post

1420

The beauty in GRPO is the fact that it doesn’t care if the rewards are rule-based or learned, the hack: let the data self-normalize— trajectories in a batch compete against their mean, no value model, no extra params, just clean, efficient RL that cuts memory usage by 50%, while maintaining SOTA performance. btw it was introduced 9months prior to R1: arxiv.org/pdf/2402.03300

1 reply

liked 2 models 5 days ago

arcee-ai/Virtuoso-Small-v2

Text Generation • Updated 6 days ago • 512 • 21

BlossomsAI/Qwen2.5-Coder-14B-Instruct-Uncensored

Text Generation • Updated 6 days ago • 25 • 2

New activity in tugstugi/Qwen2.5-Coder-0.5B-QwQ-draft 5 days ago

Tokenizer Details

#2 opened 14 days ago by

qingy2024

liked a Space 5 days ago

DeepseekJanusPro

🚀

Deepseek AI's Janus-Pro-7B: Generate image from text

reacted to Bils's post with 🔥 5 days ago

Post

2006

🚀 We're excited to share major improvements to our Janus-Pro-7B Text-to-Image Generation Space!
🎨What's New:
1-Critical Bug Fixes
2-Enhanced Features
3-UI Improvements
4-Performance Boost
Try It Now:
Bils/DeepseekJanusPro-Image

liked a dataset 6 days ago

cognitivecomputations/dolphin-r1

Viewer • Updated 6 days ago • 814k • 1.47k • 176

reacted to fdaudens's post with 🔥 6 days ago

Post

3159

🎯 Kokoro TTS just hit v1.0! 🚀

Small but mighty: 82M parameters, runs locally, speaks multiple languages. The best part? It's Apache 2.0 licensed!
This could unlock so many possibilities ✨

Check it out: hexgrad/Kokoro-82M

1 reply

liked a model 6 days ago

mistralai/Mistral-Small-24B-Instruct-2501

Text Generation • Updated 3 days ago • 24k • • 606

replied to mkurman's post 7 days ago

You should look up on Unsloth project

New activity in djuna/TEST-Q2.5-Lenned-14B 8 days ago

Update config.json

#1 opened 8 days ago by

djuna

updated a model 8 days ago

djuna/TEST-Q2.5-Lenned-14B

Text Generation • Updated 8 days ago • 19 • 1