Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
272
5
6
Edward Beeching
edbeeching
Follow
lunarflu's profile picture
BrigitteTousi's profile picture
dariussingh's profile picture
63 followers
·
0 following
https://edbeeching.github.io/
edbeeching
AI & ML interests
None yet
Articles
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
3 days ago
•
42
Vision Language Models Explained
14 days ago
•
48
Constitutional AI with Open LLMs
Feb 1
•
2
Preference Tuning LLMs with Direct Preference Optimization Methods
Jan 18
•
9
Can foundation models label data like humans?
Jun 12, 2023
Creating a Coding Assistant with StarCoder
May 9, 2023
StackLLaMA: A hands-on guide to train LLaMA with RLHF
Apr 5, 2023
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
Mar 9, 2023
•
6
Train your first Decision Transformer
Sep 8, 2022
Introducing Decision Transformers on Hugging Face 🤗
Mar 28, 2022
Organizations
Papers
3
arxiv:
2402.09844
arxiv:
2310.16944
arxiv:
2112.03636
spaces
3
Sort: Recently updated
Runtime error
1
📚
ShapeNetViz
Runtime error
1
🌍
Scienceworld Demo
Runtime error
1
🐢
Atari_live_model
models
350
Sort: Recently updated
edbeeching/vsft-llava_builder_Meta-Llama-3-8B
Updated
1 day ago
edbeeching/vsft-llava_builder-meta-Llama-3-8B
Updated
1 day ago
edbeeching/vsft-llava_builder_zephyr-7b-beta
Updated
4 days ago
edbeeching/vsft-llava_builder
Updated
6 days ago
edbeeching/atari_2B_atari_stargunner_2222
Reinforcement Learning
•
Updated
8 days ago
•
1
edbeeching/atari_2B_atari_stargunner_1111
Reinforcement Learning
•
Updated
8 days ago
•
1
edbeeching/atari_2B_atari_spaceinvaders_2222
Reinforcement Learning
•
Updated
8 days ago
•
1
edbeeching/atari_2B_atari_spaceinvaders_1111
Reinforcement Learning
•
Updated
8 days ago
•
1
edbeeching/atari_2B_atari_solaris_2222
Reinforcement Learning
•
Updated
8 days ago
•
1
edbeeching/atari_2B_atari_solaris_1111
Reinforcement Learning
•
Updated
8 days ago
•
1
Expand 350 models
datasets
96
Sort: Recently updated
edbeeching/godot_rl_ZombieGame
Updated
Feb 22
•
1
edbeeching/godot_rl_VirtualCamera
Viewer
•
Updated
Feb 22
•
2
•
2
edbeeching/godot_rl_Ships
Updated
Feb 22
•
1
edbeeching/godot_rl_RobotVolleyball
Updated
Feb 22
•
1
edbeeching/godot_rl_Racer
Updated
Feb 22
•
2
•
1
edbeeching/godot_rl_MultiLevelRobot
Updated
Feb 22
•
1
edbeeching/godot_rl_JumperHard
Viewer
•
Updated
Feb 22
•
2
•
1
edbeeching/godot_rl_ItemSortingCart
Updated
Feb 22
•
1
edbeeching/godot_rl_HovercraftRacing
Viewer
•
Updated
Feb 22
•
1
edbeeching/godot_rl_FPS
Updated
Feb 22
•
2
•
1
Expand 96 datasets