cola's picture

13 38

cola

alexchauncy

AI & ML interests

None yet

Recent Activity

liked a dataset 9 days ago

OpenFace-CQUPT/FaceCaption-15M

liked a dataset 15 days ago

unitreerobotics/LAFAN1_Retargeting_Dataset

liked a model 16 days ago

PowerInfer/SmallThinker-3B-Preview

View all activity

Organizations

None yet

alexchauncy's activity

upvoted a paper 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

upvoted a paper 5 months ago

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Paper • 2401.02117 • Published Jan 4, 2024 • 30

upvoted an article 6 months ago

Article

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

By

•

Jun 20, 2024

• 26

upvoted a paper 8 months ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 67

upvoted a paper 10 months ago

Llemma: An Open Language Model For Mathematics

Paper • 2310.10631 • Published Oct 16, 2023 • 51

upvoted 3 papers about 1 year ago

Vision-Language Models as a Source of Rewards

Paper • 2312.09187 • Published Dec 14, 2023 • 11

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 35

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

Paper • 2310.11954 • Published Oct 18, 2023 • 25

upvoted 5 papers over 1 year ago

The Shaky Foundations of Clinical Foundation Models: A Survey of Large Language Models and Foundation Models for EMRs

Paper • 2303.12961 • Published Mar 22, 2023 • 3

SayTap: Language to Quadrupedal Locomotion

Paper • 2306.07580 • Published Jun 13, 2023 • 7

From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought

Paper • 2306.12672 • Published Jun 22, 2023 • 26

h2oGPT: Democratizing Large Language Models

Paper • 2306.08161 • Published Jun 13, 2023 • 18

Language to Rewards for Robotic Skill Synthesis

Paper • 2306.08647 • Published Jun 14, 2023 • 12