Pierre Dulac

dulacp

AI & ML interests

None yet

Recent Activity

liked a model 17 days ago

ds4sd/SmolDocling-256M-preview

liked a model 19 days ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

liked a model about 1 month ago

mistralai/Mistral-Small-24B-Instruct-2501

Organizations

dulacp's activity

upvoted 2 papers about 1 month ago

Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?

Paper • 2502.15657 • Published Feb 21 • 5

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18 • 58

upvoted a paper about 2 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 152

upvoted a collection about 2 months ago

🤖 Agents

Collection

21 items • Updated Dec 31, 2024 • 148

upvoted 2 articles about 2 months ago

Introducing smolagents: simple agents that write actions in code.

Article • Dec 31, 2024 • 956

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 835

upvoted a paper 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 374

upvoted 2 papers 3 months ago

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 84

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 42

upvoted a collection 3 months ago

PixMo

Collection

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 24 days ago • 68

upvoted 2 papers 3 months ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 40

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 39

upvoted 7 papers 4 months ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 63

LLMs Do Not Think Step-by-step In Implicit Reasoning

Paper • 2411.15862 • Published Nov 24, 2024 • 10

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Paper • 2411.16594 • Published Nov 25, 2024 • 40

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 48