Video-Text-to-Text
Transformers
Safetensors
English
Chinese
qwen3_5_moe
image-text-to-text
robotics
vision-language-model
video-understanding
robot-manipulation
Instructions to use xlangai/RoboFine-VLM-397B-A17B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use xlangai/RoboFine-VLM-397B-A17B with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForMultimodalLM processor = AutoProcessor.from_pretrained("xlangai/RoboFine-VLM-397B-A17B") model = AutoModelForMultimodalLM.from_pretrained("xlangai/RoboFine-VLM-397B-A17B") - Notebooks
- Google Colab
- Kaggle
Add library_name and paper authors to model card
#1
by nielsr HF Staff - opened
Hi, I'm Niels from the Hugging Face community science team!
This PR improves the model card by:
- Adding
library_name: transformersto the metadata. This helps users identify how to load the model and enables the "Use in Transformers" button on the Hub. - Adding the
datasetsmetadata to link this model to the RoboFine-bench. - Including the full list of authors from the paper for better attribution.
The rest of the documentation is excellent. Thank you for sharing this work!
EricsXt changed pull request status to merged