Derry Pratama
ibndias
AI & ML interests
None yet
Recent Activity
published
a model
about 17 hours ago
ibndias/Qwen2.5-1.5B-Open-R1-GRPO
updated
a model
about 19 hours ago
ibndias/Qwen2.5-1.5B-Open-R1-Distill
published
a model
about 23 hours ago
ibndias/Qwen2.5-1.5B-Open-R1-Distill
Organizations
Collections
2
Papers
2
models
11
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678957786276-noauth.png)
ibndias/Qwen2.5-1.5B-Open-R1-GRPO
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678957786276-noauth.png)
ibndias/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678957786276-noauth.png)
ibndias/taxi-v3
Reinforcement Learning
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678957786276-noauth.png)
ibndias/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678957786276-noauth.png)
ibndias/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
3
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678957786276-noauth.png)
ibndias/Nous-Hermes-2-MoE-2x34B
Text Generation
•
Updated
•
1.25k
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678957786276-noauth.png)
ibndias/NeuralHermes-MoE-2x7B
Text Generation
•
Updated
•
1.37k
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678957786276-noauth.png)
ibndias/mistral-7b-gtfobins-lora
Text Generation
•
Updated
•
10
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678957786276-noauth.png)
ibndias/llama2-gtfobins-lora-3ep
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678957786276-noauth.png)
ibndias/mistral-gtfobins-lora-3ep
Updated
•
1