Michal Valko

misovalko

https://misovalko.github.io/

AI & ML interests

LLM fine-tuning

Recent Activity

authored a paper 2 months ago

The Llama 3 Herd of Models

new activity 7 months ago

paris-ai-running-club/README:next run wen?

authored a paper 8 months ago

Demonstration-Regularized RL

misovalko's activity

authored a paper 2 months ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 110

New activity in paris-ai-running-club/README 7 months ago

next run wen?

#3 opened 7 months ago by julien-c

authored 5 papers 8 months ago

Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

Paper • 2405.12205 • Published May 20, 2024

New activity in paris-ai-running-club/README 8 months ago

FOMO

#1 opened 8 months ago by osanseviero

authored 7 papers about 1 year ago

Adapting to game trees in zero-sum imperfect information games

Paper • 2212.12567 • Published Dec 23, 2022

Fast Rates for Maximum Entropy Exploration

Paper • 2303.08059 • Published Mar 14, 2023

Understanding Self-Predictive Learning for Reinforcement Learning

Paper • 2212.03319 • Published Dec 6, 2022

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

Paper • 2305.13185 • Published May 22, 2023

A General Theoretical Paradigm to Understand Learning from Human Preferences

Paper • 2310.12036 • Published Oct 18, 2023 • 13

Nash Learning from Human Feedback

Paper • 2312.00886 • Published Dec 1, 2023 • 15

Bootstrap your own latent: A new approach to self-supervised Learning

Paper • 2006.07733 • Published Jun 13, 2020 • 2

liked a Space about 1 year ago

Daily Papers

Complete list of past Daily Papers