Arham
arhamk
·
AI & ML interests
Machine Learning, LLM
Recent Activity
liked
a dataset
about 1 month ago
qingy2024/FineQwQ-142k
liked
a Space
8 months ago
open-llm-leaderboard/open_llm_leaderboard
updated
a collection
8 months ago
Deep RL Course - Hugging Face
Organizations
Collections
2
spaces
1
models
15
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e8d381a198f8520f2d4f3/ju07_OAOsnuRCDYxavm1O.jpeg)
arhamk/ppo-LunarLander-v2-2
Reinforcement Learning
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e8d381a198f8520f2d4f3/ju07_OAOsnuRCDYxavm1O.jpeg)
arhamk/a2c-PandaReachDense-v2
Reinforcement Learning
•
Updated
•
3
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e8d381a198f8520f2d4f3/ju07_OAOsnuRCDYxavm1O.jpeg)
arhamk/llama2-qlora-sft
Updated
•
4
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e8d381a198f8520f2d4f3/ju07_OAOsnuRCDYxavm1O.jpeg)
arhamk/llama2-finance-sft
Text Generation
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e8d381a198f8520f2d4f3/ju07_OAOsnuRCDYxavm1O.jpeg)
arhamk/vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e8d381a198f8520f2d4f3/ju07_OAOsnuRCDYxavm1O.jpeg)
arhamk/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e8d381a198f8520f2d4f3/ju07_OAOsnuRCDYxavm1O.jpeg)
arhamk/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e8d381a198f8520f2d4f3/ju07_OAOsnuRCDYxavm1O.jpeg)
arhamk/q-Taxi-v3
Reinforcement Learning
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e8d381a198f8520f2d4f3/ju07_OAOsnuRCDYxavm1O.jpeg)
arhamk/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e8d381a198f8520f2d4f3/ju07_OAOsnuRCDYxavm1O.jpeg)
arhamk/ppo-Pyramids
Reinforcement Learning
•
Updated
•
3