- **Paper:** Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences — arXiv 2404.03715, published Apr 4, 2024
- **Collection:** Awesome feedback datasets — a curated list of datasets with human or AI feedback, useful for training reward models or applying techniques like DPO (19 items)
- **Collection:** Awesome SFT datasets — a curated list of interesting datasets for fine-tuning language models (43 items)
- **Paper:** Stack More Layers Differently: High-Rank Training Through Low-Rank Updates — arXiv 2307.05695, published Jul 11, 2023