Reinforcement Learning related models
Davide Buoso
lambdavi
AI & ML interests
PhD Student @ VANDAL (Polytechnic University of Turin).
Interested in the intersection of Robotics and Generative AI.
Organizations
None yet
Collections
2
Papers
1
Models
18

lambdavi/span-marker-luke-legal • Token Classification • 45 • 3

lambdavi/legal-luke-base-ner • Token Classification • 150 • 1

lambdavi/Reinforce-Pixelcopter-PLE-v0 • Reinforcement Learning

lambdavi/ppo-Pyramids • Reinforcement Learning • 19

lambdavi/a2c-PandaReachDense-v3 • Reinforcement Learning • 1

lambdavi/ddpg-PandaReach-v3 • Reinforcement Learning

lambdavi/ppo-SnowballTarget • Reinforcement Learning • 110

lambdavi/Reinforce-CartPole-v1 • Reinforcement Learning

lambdavi/span-marker-luke-base-conll2003 • Token Classification • 12 • 2

lambdavi/luke-base_finetuned_conll2003 • Token Classification • 133
Datasets
None public yet