Frederic Branchaud-Charron

Dref360

AI & ML interests

Bayesian deep learning, uncertainty estimation, and trustworthiness.

Organizations

Glowstick AI, Blog-explorers

Dref360's activity

reacted to lewtun's post with ❤️🔥 17 days ago
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs, built for speed with vLLM

Here are the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!
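
To make the idea of pairing a step-wise scorer with tree search concrete, here is a minimal sketch of a verifier-guided beam search. It is not the DVTS method and it does not use the search-and-learn API: the function `verifier_guided_beam_search` and the toy `toy_expand`, `toy_score` and `toy_is_final` helpers are hypothetical stand-ins for an LLM step generator and a process reward model.

```python
from typing import Callable, List, Tuple

Path = Tuple[int, ...]  # a partial solution: the sequence of steps taken so far


def verifier_guided_beam_search(
    expand: Callable[[Path], List[int]],
    score: Callable[[Path], float],
    is_final: Callable[[Path], bool],
    beam_width: int = 2,
    max_depth: int = 4,
) -> Path:
    """Keep the `beam_width` best-scoring partial solutions after every step."""
    beams: List[Path] = [()]
    for _ in range(max_depth):
        candidates: List[Path] = []
        for path in beams:
            if is_final(path):
                candidates.append(path)  # finished paths are carried over unchanged
            else:
                candidates.extend(path + (step,) for step in expand(path))
        # The step-wise scorer ranks every partial solution, not only finished ones.
        candidates.sort(key=score, reverse=True)
        beams = candidates[:beam_width]
        if all(is_final(path) for path in beams):
            break
    return max(beams, key=score)


# Toy stand-ins (assumptions for illustration): each step adds 1, 2 or 3,
# and a solution is "correct" when the steps sum to TARGET.
TARGET = 7


def toy_expand(path: Path) -> List[int]:
    return [1, 2, 3]


def toy_score(path: Path) -> float:
    return -abs(TARGET - sum(path))  # closer to the target scores higher


def toy_is_final(path: Path) -> bool:
    return sum(path) == TARGET


best = verifier_guided_beam_search(toy_expand, toy_score, toy_is_final)
print(best, sum(best))
```

In a real setup, `expand` would sample candidate reasoning steps from the small model and `score` would call the reward model, so a larger `beam_width` spends more test-time compute.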
reacted to burtenshaw's post with ❤️ about 1 month ago
For anyone looking to boost their LLM fine-tuning and alignment skills this December: we're running a free and open course called smol course. It’s not big like Li Yin and @mlabonne, it’s just smol.

👷 It focuses on practical use cases, so if you’re working on something, bring it along.

👯‍♀️ It’s peer reviewed and open so you can discuss and get feedback.

🤘 If you’re already a smol pro, feel free to drop a star or issue.

> > Part 1 starts now, and it’s on instruction tuning!

https://github.com/huggingface/smol-course
reacted to andito's post with 🔥❤️ about 1 month ago
Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and token throughput.

- SmolVLM generates tokens 7.5 to 16 times faster than Qwen2-VL! 🤯
- Other models at this size crash a laptop, but SmolVLM comfortably generates 17 tokens/sec on a MacBook! 🚀
- SmolVLM can be fine-tuned on a Google Colab! Or process millions of documents with a consumer GPU!
- SmolVLM even outperforms larger models in video benchmarks, despite not even being trained on videos!

Check out more!
Demo: HuggingFaceTB/SmolVLM
Blog: https://huggingface.co/blog/smolvlm
Model: HuggingFaceTB/SmolVLM-Instruct
Fine-tuning script: https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
reacted to merve's post with 🚀🔥 about 1 month ago
Small yet mighty! 💫

We are releasing SmolVLM: a new 2B small vision language model made for on-device use, fine-tunable on consumer GPU, immensely memory efficient 🤠

We release three checkpoints under Apache 2.0: SmolVLM-Instruct, SmolVLM-Synthetic and SmolVLM-Base HuggingFaceTB/smolvlm-6740bd584b2dcbf51ecb1f39

Learn more from our blog here: huggingface.co/blog/smolvlm
This release comes with a demo, fine-tuning code, MLX integration and TRL integration for DPO
Try the demo: HuggingFaceTB/SmolVLM
Fine-tuning Recipe: https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
Also TRL integration for DPO 💗
posted an update about 1 month ago
New week, new #cv Gradio app for human understanding (Dref360/human-interaction-demo) 🥳

This demo highlights when a person touches an object. For instance, it is useful to know if someone is touching a wall, a vase or a door. It works for multiple people too!

Still using nielsr/vitpose-base-simple for pose estimation, very excited to see the PR approved!


reacted to jsulz's post with 🧠❤️🔥 about 1 month ago
When the XetHub crew joined Hugging Face this fall, @erinys and I started brainstorming how to share our work to replace Git LFS on the Hub. Uploading and downloading large models and datasets takes precious time. That’s where our chunk-based approach comes in.

Instead of versioning files (like Git and Git LFS), we version variable-sized chunks of data. For the Hugging Face community, this means:

⏩ Only upload the chunks that changed.
🚀 Download just the updates, not the whole file.
🧠 We store your file as deduplicated chunks

In our benchmarks, we found that using content-defined chunking (CDC) to store iterative model and dataset versions led to transfer speedups of ~2x, but this isn’t just a performance boost. It’s a rethinking of how we manage models and datasets on the Hub.

We're planning to bring our new storage backend to the Hub in early 2025 - check out our blog to dive deeper, and let us know: how could this improve your workflows?

https://huggingface.co/blog/from-files-to-chunks
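
As a rough illustration of the chunk-based idea described in the post above, the sketch below splits bytes into variable-sized chunks with a simple Gear-style rolling hash and stores each chunk once under its SHA-256 digest. It is a toy under stated assumptions, not the Hub's storage backend: the names `cdc_chunks` and `upload`, the mask, and the size limits are all hypothetical choices made for this example.

```python
import hashlib
from typing import Dict, List

# Gear table: one pseudo-random 32-bit value per possible byte, derived deterministically.
GEAR = [int.from_bytes(hashlib.sha256(bytes([b])).digest()[:4], "big") for b in range(256)]


def cdc_chunks(data: bytes, mask: int = 0x3F, min_size: int = 16, max_size: int = 256) -> List[bytes]:
    """Split `data` where the rolling hash's low bits are zero (content-defined cuts)."""
    chunks: List[bytes] = []
    start, h = 0, 0
    for i, byte in enumerate(data):
        h = ((h << 1) + GEAR[byte]) & 0xFFFFFFFF  # older bytes shift out of the hash
        size = i - start + 1
        # Cut on a hash match once the chunk is long enough, or force a cut at max_size.
        if (size >= min_size and (h & mask) == 0) or size >= max_size:
            chunks.append(data[start:i + 1])
            start, h = i + 1, 0
    if start < len(data):
        chunks.append(data[start:])  # trailing bytes form the last chunk
    return chunks


def upload(store: Dict[str, bytes], data: bytes) -> int:
    """Add unseen chunks to `store`; return how many chunks actually had to be stored."""
    new = 0
    for chunk in cdc_chunks(data):
        key = hashlib.sha256(chunk).hexdigest()
        if key not in store:
            store[key] = chunk
            new += 1
    return new


# Two versions of a "file": v2 is v1 with four bytes inserted in the middle.
v1 = b"".join(hashlib.sha256(str(i).encode()).digest() for i in range(200))
v2 = v1[:3000] + b"EDIT" + v1[3000:]

store: Dict[str, bytes] = {}
first_upload = upload(store, v1)
second_upload = upload(store, v2)
print(first_upload, second_upload, len(cdc_chunks(v2)))
```

Because cut points depend on the bytes themselves and not on fixed offsets, chunks that lie entirely before the edit are unchanged, and chunks after it usually line up again once a shared boundary is found, so the second upload stores fewer chunks than the file contains.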
posted an update about 2 months ago
reacted to averoo's post with 👍🔥👀 2 months ago
Hello, researchers! I've tried to make reading HF Daily Papers easier and built a tool that does reviews with LLMs like Claude 3.5, GPT-4o and sometimes FLUX.

📚 Classification by topics
📅 Sorting by publication date and HF addition date
🔄 Syncing every 2 hours
💻 Hosted on GitHub
🌏 English, Russian, and Chinese
📈 Top by week/month (in progress)

👉 https://hfday.ru

Let me know what you think of it.
reacted to merve's post with 🔥 7 months ago
releasing: smol vision 🌼

A repository with notebooks on shrinking, optimizing, speeding-up, customizing large vision models! https://github.com/merveenoyan/smol-vision
reacted to clem's post with 🚀 7 months ago
Who said you couldn't build a big business based on open-source AI? Congrats Mistral team: https://huggingface.co/mistralai
reacted to their post with 🚀 8 months ago
Baal, our Bayesian Active Learning library, is working on a major version and we want to know more about you!

If you use Baal for Active Learning, Uncertainty Estimation or Bayesian Deep Learning, we would **love** to talk to you! 😎

In more detail, we want to understand when our users use our library and how.

You can take a spot in our Calendly: https://calendly.com/baal-org/30min?month=2024-05