Getting Started with Real-World Robots
This tutorial will explain how to train a neural network to control a real robot autonomously.
You’ll learn:
- How to record and visualize your dataset.
- How to train a policy using your data and prepare it for evaluation.
- How to evaluate your policy and visualize the results.
By following these steps, you’ll be able to replicate tasks such as picking up a Lego block and placing it in a bin with a high success rate, as shown in the video below.
Video: pickup lego block task
This tutorial isn’t tied to a specific robot: we walk you through the commands and API snippets you can adapt for any supported platform.
During data collection, you’ll use a “teleoperation” device, such as a leader arm or keyboard, to teleoperate the robot and record its motion trajectories.
Once you’ve gathered enough trajectories, you’ll train a neural network to imitate these trajectories and deploy the trained model so your robot can perform the task autonomously.
If you run into any issues at any point, jump into our Discord community for support.
Set up and Calibrate
If you haven’t yet set up and calibrated your robot and teleop device, please do so by following the robot-specific tutorial.
Teleoperate
In this example, we’ll demonstrate how to teleoperate the SO101 robot. For each command, we also provide a corresponding API example.
Note that the id associated with a robot is used to store its calibration file. It’s important to use the same id for teleoperating, recording, and evaluating with the same setup.
python -m lerobot.teleoperate \
--robot.type=so101_follower \
--robot.port=/dev/tty.usbmodem58760431541 \
--robot.id=my_awesome_follower_arm \
--teleop.type=so101_leader \
--teleop.port=/dev/tty.usbmodem58760431551 \
--teleop.id=my_awesome_leader_arm
The teleoperate command will automatically:
- Identify any missing calibrations and initiate the calibration procedure.
- Connect the robot and teleop device and start teleoperation.
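As mentioned above, each command has a Python API counterpart. Below is a minimal teleoperation sketch; the module paths and class names are taken from recent LeRobot versions and may differ in yours, so verify them against your install:
# Teleoperation via the Python API (module paths may vary across LeRobot versions).
from lerobot.robots.so101_follower import SO101Follower, SO101FollowerConfig
from lerobot.teleoperators.so101_leader import SO101Leader, SO101LeaderConfig

robot_config = SO101FollowerConfig(
    port="/dev/tty.usbmodem58760431541", id="my_awesome_follower_arm"
)
teleop_config = SO101LeaderConfig(
    port="/dev/tty.usbmodem58760431551", id="my_awesome_leader_arm"
)

robot = SO101Follower(robot_config)
teleop_device = SO101Leader(teleop_config)
robot.connect()
teleop_device.connect()

# Read actions from the leader arm and forward them to the follower in a loop.
while True:
    action = teleop_device.get_action()
    robot.send_action(action)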
Cameras
To add cameras to your setup, follow this Guide.
Teleoperate with cameras
With rerun, you can teleoperate again while simultaneously visualizing the camera feeds and joint positions. In this example, we’re using the Koch arm.
python -m lerobot.teleoperate \
--robot.type=koch_follower \
--robot.port=/dev/tty.usbmodem58760431541 \
--robot.id=my_awesome_follower_arm \
--robot.cameras="{ front: {type: opencv, index_or_path: 0, width: 1920, height: 1080, fps: 30}}" \
--teleop.type=koch_leader \
--teleop.port=/dev/tty.usbmodem58760431551 \
--teleop.id=my_awesome_leader_arm \
--display_data=true
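In the Python API, the cameras mapping passed on the command line corresponds to a dictionary of camera configs on the robot config. Here is a sketch, assuming the OpenCVCameraConfig and KochFollowerConfig classes live under the import paths below (check them against your LeRobot version):
# Attach an OpenCV camera named "front" to the follower arm config.
from lerobot.cameras.opencv import OpenCVCameraConfig  # path is an assumption
from lerobot.robots.koch_follower import KochFollowerConfig  # path is an assumption

camera_config = {
    "front": OpenCVCameraConfig(index_or_path=0, width=1920, height=1080, fps=30)
}

robot_config = KochFollowerConfig(
    port="/dev/tty.usbmodem58760431541",
    id="my_awesome_follower_arm",
    cameras=camera_config,
)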
Record a dataset
Once you’re familiar with teleoperation, you can record your first dataset.
We use the Hugging Face Hub to upload your dataset. If you haven’t used the Hub before, make sure you can log in via the CLI using a write-access token; this token can be generated from your Hugging Face settings.
Add your token to the CLI by running this command:
huggingface-cli login --token ${HUGGINGFACE_TOKEN} --add-to-git-credential
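Alternatively, you can log in from Python using the huggingface_hub library:
# Equivalent login via the huggingface_hub Python API.
from huggingface_hub import login

login(token="hf_...", add_to_git_credential=True)  # paste your write-access token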
Then store your Hugging Face repository name in a variable:
HF_USER=$(huggingface-cli whoami | head -n 1)
echo $HF_USER
Now you can record a dataset. To record 2 episodes and upload your dataset to the hub, execute this command, tailored to the SO101:
python -m lerobot.record \
--robot.type=so101_follower \
--robot.port=/dev/tty.usbmodem585A0076841 \
--robot.id=my_awesome_follower_arm \
--robot.cameras="{ front: {type: opencv, index_or_path: 0, width: 1920, height: 1080, fps: 30}}" \
--teleop.type=so101_leader \
--teleop.port=/dev/tty.usbmodem58760431551 \
--teleop.id=my_awesome_leader_arm \
--display_data=true \
--dataset.repo_id=${HF_USER}/record-test \
--dataset.num_episodes=2 \
--dataset.single_task="Grab the black cube"
Dataset upload
Locally, your dataset is stored in this folder: ~/.cache/huggingface/lerobot/{repo-id}. At the end of data recording, your dataset will be uploaded to your Hugging Face page (e.g. https://huggingface.co/datasets/cadene/so101_test), whose URL you can obtain by running:
echo https://huggingface.co/datasets/${HF_USER}/so101_test
Your dataset will be automatically tagged with LeRobot so the community can find it easily, and you can also add custom tags (for example, tutorial). You can look for other LeRobot datasets on the hub by searching for the LeRobot tag.
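You can also load the recorded dataset back in Python to inspect it. A minimal sketch, assuming the LeRobotDataset import path below (it has moved between LeRobot versions):
# Load a recorded dataset from the hub (or the local cache) and inspect it.
from lerobot.datasets.lerobot_dataset import LeRobotDataset  # path is an assumption

dataset = LeRobotDataset("YOUR_HF_USER/record-test")  # use your own repo id
print(dataset.num_episodes, dataset.num_frames, dataset.fps)
print(dataset[0].keys())  # one frame: observation keys, action, timestamps, ...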
Record function
The record function provides a suite of tools for capturing and managing data during robot operation:
1. Data Storage
- Data is stored using the LeRobotDataset format and saved to disk during recording.
- By default, the dataset is pushed to your Hugging Face page after recording.
- To disable uploading, use --dataset.push_to_hub=False.
2. Checkpointing and Resuming
- Checkpoints are automatically created during recording.
- If an issue occurs, you can resume by re-running the same command with --resume=true.
- To start recording from scratch, manually delete the dataset directory.
3. Recording Parameters
Set the flow of data recording using command-line arguments:
- --dataset.episode_time_s=60: Duration of each data recording episode (default: 60 seconds).
- --dataset.reset_time_s=60: Duration for resetting the environment after each episode (default: 60 seconds).
- --dataset.num_episodes=50: Total number of episodes to record (default: 50).
4. Keyboard Controls During Recording
Control the data recording flow using keyboard shortcuts:
- Press Right Arrow (→): End the current episode or reset period early and move on to the next.
- Press Left Arrow (←): Cancel the current episode and re-record it.
- Press Escape (ESC): Immediately stop the session, encode videos, and upload the dataset.
Tips for gathering data
Once you’re comfortable with data recording, you can create a larger dataset for training. A good starting task is grasping an object at different locations and placing it in a bin. We suggest recording at least 50 episodes, with 10 episodes per location. Keep the cameras fixed and maintain consistent grasping behavior throughout the recordings. Also make sure the object you are manipulating is visible in the cameras. A good rule of thumb is that you should be able to do the task yourself by looking only at the camera images.
In the following sections, you’ll train your neural network. After achieving reliable grasping performance, you can start introducing more variations during data collection, such as additional grasp locations, different grasping techniques, and altering camera positions.
Avoid adding too much variation too quickly, as it may hinder your results.
If you want to dive deeper into this important topic, you can check out the blog post we wrote on what makes a good dataset.
Troubleshooting:
- On Linux, if the left and right arrow keys and escape key don’t have any effect during data recording, make sure you’ve set the $DISPLAY environment variable. See pynput limitations.
Visualize a dataset
If you uploaded your dataset to the hub with --dataset.push_to_hub=true, you can visualize your dataset online by copy-pasting your repo id, given by:
echo ${HF_USER}/so101_test
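For a quick local visualization, you can also stream an episode to the rerun viewer yourself. This is a rough sketch, not the project’s official viewer: it assumes image features are channel-first float tensors and that the dataset metadata exposes camera_keys, both of which you should verify against your LeRobot version:
# Stream episode 0 to the rerun viewer (assumptions noted above).
import rerun as rr
from lerobot.datasets.lerobot_dataset import LeRobotDataset

dataset = LeRobotDataset("YOUR_HF_USER/so101_test", episodes=[0])
rr.init("so101_episode_0", spawn=True)  # opens the rerun viewer window

for i in range(dataset.num_frames):
    frame = dataset[i]
    rr.set_time_sequence("frame", i)
    for key in dataset.meta.camera_keys:
        # Convert each (C, H, W) float tensor to (H, W, C) for logging.
        rr.log(key, rr.Image(frame[key].permute(1, 2, 0).numpy()))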
Replay an episode
A useful feature is the replay function, which allows you to replay any episode that you’ve recorded, or episodes from any other dataset. It helps you test the repeatability of your robot’s actions and assess transferability across robots of the same model.
You can replay the first episode on your robot with:
python -m lerobot.replay \
--robot.type=so101_follower \
--robot.port=/dev/tty.usbmodem58760431541 \
--robot.id=my_awesome_follower_arm \
--dataset.repo_id=${HF_USER}/record-test \
--dataset.episode=0 # choose the episode you want to replay
Your robot should replicate movements similar to those you recorded. For example, check out this video where we use replay on an Aloha robot from Trossen Robotics.
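Under the hood, replay streams the recorded actions back to the robot at the dataset’s frame rate. A rough Python equivalent is sketched below; robot is a connected follower arm as in the teleoperation example, and the feature layout (an "action" tensor plus per-motor names) is an assumption to check against your dataset:
# Replay the actions of episode 0 at the recorded frame rate (sketch).
import time
from lerobot.datasets.lerobot_dataset import LeRobotDataset

dataset = LeRobotDataset("YOUR_HF_USER/record-test", episodes=[0])
action_names = dataset.features["action"]["names"]  # per-motor action names

for i in range(dataset.num_frames):
    t0 = time.perf_counter()
    values = dataset[i]["action"]
    # Rebuild the {motor_name: value} dict expected by robot.send_action().
    action = {name: float(values[j]) for j, name in enumerate(action_names)}
    robot.send_action(action)
    # Sleep out the remainder of the control period to match the dataset fps.
    time.sleep(max(0.0, 1.0 / dataset.fps - (time.perf_counter() - t0)))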
Train a policy
To train a policy to control your robot, use the lerobot/scripts/train.py script. A few arguments are required. Here is an example command:
python lerobot/scripts/train.py \
--dataset.repo_id=${HF_USER}/so101_test \
--policy.type=act \
--output_dir=outputs/train/act_so101_test \
--job_name=act_so101_test \
--policy.device=cuda \
--wandb.enable=true
Let’s explain the command:
- We provided the dataset as argument with --dataset.repo_id=${HF_USER}/so101_test.
- We provided the policy with policy.type=act. This loads configurations from configuration_act.py. Importantly, this policy automatically adapts to the number of motor states, motor actions, and cameras of your robot (e.g. laptop and phone) that have been saved in your dataset.
- We provided policy.device=cuda since we are training on an Nvidia GPU, but you could use policy.device=mps to train on Apple silicon.
- We provided wandb.enable=true to use Weights and Biases for visualizing training plots. This is optional, but if you use it, make sure you are logged in by running wandb login.
Training should take several hours. You will find checkpoints in outputs/train/act_so101_test/checkpoints.
To resume training from a checkpoint, below is an example command to resume from the last checkpoint of the act_so101_test policy:
python lerobot/scripts/train.py \
--config_path=outputs/train/act_so101_test/checkpoints/last/pretrained_model/train_config.json \
--resume=true
Upload policy checkpoints
Once training is done, upload the latest checkpoint with:
huggingface-cli upload ${HF_USER}/act_so101_test \
outputs/train/act_so101_test/checkpoints/last/pretrained_model
You can also upload intermediate checkpoints with:
CKPT=010000
huggingface-cli upload ${HF_USER}/act_so101_test${CKPT} \
outputs/train/act_so101_test/checkpoints/${CKPT}/pretrained_model
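The same uploads can be done from Python with the huggingface_hub API:
# Upload a checkpoint folder to the hub with the huggingface_hub Python API.
from huggingface_hub import HfApi

api = HfApi()
api.create_repo("YOUR_HF_USER/act_so101_test", repo_type="model", exist_ok=True)
api.upload_folder(
    folder_path="outputs/train/act_so101_test/checkpoints/last/pretrained_model",
    repo_id="YOUR_HF_USER/act_so101_test",
    repo_type="model",
)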
Evaluate your policy
You can use the record script from lerobot/record.py, but with a policy checkpoint as input. For instance, run this command to record 10 evaluation episodes:
python -m lerobot.record \
--robot.type=so100_follower \
--robot.port=/dev/ttyACM1 \
--robot.cameras="{ up: {type: opencv, index_or_path: /dev/video10, width: 640, height: 480, fps: 30}, side: {type: intelrealsense, serial_number_or_name: 233522074606, width: 640, height: 480, fps: 30}}" \
--robot.id=my_awesome_follower_arm \
--display_data=false \
--dataset.repo_id=$HF_USER/eval_so100 \
--dataset.single_task="Put lego brick into the transparent box" \
--policy.path=${HF_USER}/my_policy
As you can see, it’s almost the same command as the one previously used to record your training dataset. Two things changed:
- There is an additional --policy.path argument which indicates the path to your policy checkpoint (e.g. outputs/train/act_so101_test/checkpoints/last/pretrained_model). You can also use the model repository if you uploaded a model checkpoint to the hub (e.g. ${HF_USER}/act_so101_test); see the loading sketch below.
- The dataset name begins with eval to reflect that you are running inference (e.g. ${HF_USER}/eval_act_so101_test).
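To sanity-check a checkpoint before running it on the robot, you can load it in Python. A minimal sketch, assuming the ACT policy class is importable as below (the module path differs between LeRobot versions):
# Load a trained ACT policy from the hub or from a local checkpoint directory.
from lerobot.policies.act.modeling_act import ACTPolicy  # path is an assumption

policy = ACTPolicy.from_pretrained("YOUR_HF_USER/act_so101_test")
# A local checkpoint path works too:
# policy = ACTPolicy.from_pretrained(
#     "outputs/train/act_so101_test/checkpoints/last/pretrained_model"
# )
print(policy.config)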