Miguel Alonso Jr's picture

1 7 5

Miguel Alonso Jr

miguelalonsojr

·

AI & ML interests

ML, RL, Robotics

Recent Activity

upvoted an article 15 days ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

liked a dataset 5 months ago

Hypersniper/unity_api_2022_3

updated a model 7 months ago

miguelalonsojr/DatacampLlama-3.1-8B

View all activity

Organizations

miguelalonsojr's activity

upvoted an article 15 days ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 128

upvoted 4 papers about 1 year ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 65

Nash Learning from Human Feedback

Paper • 2312.00886 • Published Dec 1, 2023 • 17

Aligning Large Multimodal Models with Factually Augmented RLHF

Paper • 2309.14525 • Published Sep 25, 2023 • 30

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 55

upvoted 2 collections about 1 year ago

Zephyr 7B

Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12, 2024 • 148

Papers about model merging

referenced in the mergekit repo: https://github.com/cg123/mergekit • 4 items • Updated Feb 13, 2024 • 14