lehome/dataset_challenge_merged
Updated • 3.31k • 2
How to use ruali-dev/lehome-smolvla-submit-4types with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("ruali-dev/lehome-smolvla-submit-4types", dtype="auto")How to use ruali-dev/lehome-smolvla-submit-4types with LeRobot:
# See https://github.com/huggingface/lerobot?tab=readme-ov-file#installation for more details git clone https://github.com/huggingface/lerobot.git cd lerobot pip install -e .[smolvla]
# Launch finetuning on your dataset python lerobot/scripts/train.py \ --policy.path=ruali-dev/lehome-smolvla-submit-4types \ --dataset.repo_id=lerobot/svla_so101_pickplace \ --batch_size=64 \ --steps=20000 \ --output_dir=outputs/train/my_smolvla \ --job_name=my_smolvla_training \ --policy.device=cuda \ --wandb.enable=true
# Run the policy using the record function
python -m lerobot.record \
--robot.type=so101_follower \
--robot.port=/dev/ttyACM0 \ # <- Use your port
--robot.id=my_blue_follower_arm \ # <- Use your robot id
--robot.cameras="{ front: {type: opencv, index_or_path: 8, width: 640, height: 480, fps: 30}}" \ # <- Use your cameras
--dataset.single_task="Grasp a lego block and put it in the bin." \ # <- Use the same task description you used in your dataset recording
--dataset.repo_id=HF_USER/dataset_name \ # <- This will be the dataset name on HF Hub
--dataset.episode_time_s=50 \
--dataset.num_episodes=10 \
--policy.path=ruali-dev/lehome-smolvla-submit-4typesWe fine-tune lerobot/smolvla_base on the official LeHome merged four-garment dataset (four_types_merged).
We use a custom evaluation wrapper (custom_smolvla_policy.py) to remap LeHome RGB observation keys:
observation.images.top_rgb -> observation.images.camera1observation.images.left_rgb -> observation.images.camera2observation.images.right_rgb -> observation.images.camera3The policy takes:
and outputs:
pretrained_model/: trained checkpointcustom_smolvla_policy.py: custom evaluation wrapperrollout_results.txt: self-reported resultscustom_smolvla_policy.py under scripts/eval_policy/.scripts/eval_policy/__init__.py.python -m scripts.eval \
--policy_type custom_smolvla \
--policy_path <PATH_TO_PRETRAINED_MODEL> \
--garment_type top_short \
--dataset_root Datasets/example/four_types_merged \
--num_episodes 1 \
--enable_cameras \
--device cpu \
--headless
Replace top_short with:
top_longpant_longpant_shortBase model
lerobot/smolvla_base