HF-Party (Hugging Face Party @ PyTorch Conference)

clem

posted an update about 3 hours ago

Post

86

Llama models (arguably the most successful open AI models of all time) represented just 3% of total model downloads on Hugging Face in March.

People and the media like winner-takes-all stories of one model or company to rule them all, but the reality is much more nuanced than this!

Kudos to all the small AI builders out there!

zamal

posted an update about 9 hours ago

Post

430

🚀 DeepGit Lite is live! 🔍✨

Hey folks!
Just launched DeepGit Lite — a lighter version of DeepGit with fewer components under the hood.
It won’t perform quite like the full powerhouse, but it’s great for a quick peek and first-hand feel! ⚙️👀

Give it a spin and tell us what you think!
👉 Try it here: zamal/DeepGit-lite
#opensource #DeepGit #gradio #githubresearch

clem

posted an update 1 day ago

Post

645

Now in Enterprise Hub organizations, you can centralize your billing not only for HF usage but also for inference through our inference partners.

Will prevent some headaches for your finance & accounting teams haha (so feel free to share that with them).

bstadt

authored 2 papers 3 days ago

CoRNStack: High-Quality Contrastive Data for Better Code Ranking

Paper • 2412.01007 • Published Dec 1, 2024

Training Sparse Mixture Of Experts Text Embedding Models

Paper • 2502.07972 • Published Feb 11 • 5

clem

posted an update 3 days ago

Post

3751

Before 2020, most of the AI field was open and collaborative. For me, that was the key factor that accelerated scientific progress and made the impossible possible—just look at the “T” in ChatGPT, which comes from the Transformer architecture openly shared by Google.

Then came the myth that AI was too dangerous to share, and companies started optimizing for short-term revenue. That led many major AI labs and researchers to stop sharing and collaborating.

With OAI and sama now saying they're willing to share open weights again, we have a real chance to return to a golden age of AI progress and democratization—powered by openness and collaboration, in the US and around the world.

This is incredibly exciting. Let’s go, open science and open-source AI!

5 replies

zamal

posted an update 3 days ago

Post

2364

DeepGit: Your GitHub Gold Digger! 💰🚀
Hey Hugging Face gang! Meet DeepGit—my open-source sidekick that rips through GitHub to snag repos that fit you. Done with dead-end searches? Me too. Built it with LangGraph and some dope tricks:
Embeddings grab the good stuff (HF magic, baby!)

Re-ranking nails the best picks

Snoops docs, code, and buzz in one slick flow

Drops a clean list of hidden gems 💎

Unearth that sneaky ML lib or Python gem—run python app.py or langgraph dev and boom! Peek it at https://github.com/zamalali/DeepGit. Fork it, tweak it, love it—Docker’s in, HF vibes are strong. Drop a 🌟 or a crazy idea—I’m pumped to jam with you all! 🪂

m-ric

posted an update 3 days ago

Post

1690

🚀 DeepSeek R1 moment has come for GUI agents: Rule-based Reinforcement Learning gives better results than SFT with 500x smaller datasets!

Traditionally (by which I mean "in the last few months"), GUI agents have been trained with supervised fine-tuning (SFT). This meant collecting huge datasets of screen captures from people using computers and using them to fine-tune your model. 📚

👉 But last week, a new paper introduced UI-R1, applying DeepSeek's R1-style rule-based reinforcement learning (RL) specifically to GUI action prediction tasks.
This is big news: with RL, maybe we could build good agents without the need for huge datasets.

UI-R1 uses a unified reward function that evaluates multiple responses from models, optimizing via policy algorithms like Group Relative Policy Optimization (GRPO).

Specifically, the reward function assesses:
🎯 Action type accuracy: Does the predicted action match the ground truth?
📍 Coordinate accuracy (specifically for clicks): Is the predicted click within the correct bounding box?
📑 Output format: Does the model clearly articulate both its reasoning and final action?
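As a rough illustration of the three checks above, here is a minimal sketch of a rule-based reward plus the group-relative normalization that GRPO applies to a batch of sampled responses. It is not the paper's code: the `Action` class, the `<think>`/`<answer>` tag format, the equal weighting of the three terms, and the choice to score coordinates only when the action type is correct are all assumptions made for the example.

```python
import re
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import List, Optional, Tuple


@dataclass
class Action:
    """Hypothetical parsed model output: an action type and an optional click point."""
    kind: str
    point: Optional[Tuple[float, float]] = None


# Assumed output format: reasoning in <think> tags, final action in <answer> tags.
FORMAT_RE = re.compile(r"^<think>.+</think>\s*<answer>.+</answer>$", re.DOTALL)


def rule_based_reward(
    response_text: str,
    predicted: Action,
    truth_kind: str,
    truth_box: Tuple[float, float, float, float],
) -> float:
    """Sum of three 0/1 checks: action type, click-in-box, output format."""
    type_ok = predicted.kind == truth_kind
    r_type = 1.0 if type_ok else 0.0

    # Coordinate check applies to clicks only (assumption: and only if the type matched).
    r_coord = 0.0
    if type_ok and predicted.kind == "click" and predicted.point is not None:
        x, y = predicted.point
        x0, y0, x1, y1 = truth_box
        r_coord = 1.0 if (x0 <= x <= x1 and y0 <= y <= y1) else 0.0

    r_format = 1.0 if FORMAT_RE.match(response_text.strip()) else 0.0
    return r_type + r_coord + r_format


def group_relative_advantages(rewards: List[float]) -> List[float]:
    """Normalize rewards within one group of responses to the same prompt."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    if sigma == 0:
        # All responses scored the same, so none is preferred over the others.
        return [0.0] * len(rewards)
    return [(r - mu) / sigma for r in rewards]
```

Each response sampled for the same task would be scored with `rule_based_reward`, and the resulting list passed to `group_relative_advantages` so that responses are reinforced relative to their siblings rather than against a learned value model.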

Using just 136 carefully selected mobile tasks—compared to 76,000 tasks for larger models like OS-Atlas—UI-R1 shows significant efficiency and improved performance:
📈 Boosted action prediction accuracy from 76% to 89% on AndroidControl.
🌐 Outperformed larger, SFT-trained models (e.g., OS-Atlas-7B), demonstrating superior results with vastly fewer data points (136 tasks vs. 76K).
🔍 Enhanced adaptability and generalization, excelling even in out-of-domain scenarios.

The paper tests this RL-based method only in low-level GUI tasks. Could it generalize to more complex interactions? 🧐

Read the full paper here 👉 UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning (2503.21620)

Aurelien-Morgan

posted an update 5 days ago

Post

1918

Almost there!
https://test.pypi.org/project/test-010-retrain-pipelines/

clem

posted an update 6 days ago

Post

2330

What's this cool purple banner haha 😶😶😶

4 replies

osanseviero

authored a paper 6 days ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published 9 days ago • 40

clem

posted an update 7 days ago

Post

2172

Very interesting security section by @yjernite @lvwerra @reach-vb @dvilasuero & the team replicating R1. Broadly applicable to most open-source models & some to APIs (but APIs have a lot more additional risks because you're not in control of the underlying system):

https://huggingface.co/blog/open-r1/update-4#is-it-safe

1 reply

clem

posted an update 8 days ago

Post

1539

A repository is created every ~15 secs on Hugging Face so @kramp added a "Getting Started" to make it easier & a model release checklist: https://huggingface.co/docs/hub/model-release-checklist

What are you uploading today?

1 reply

clem

posted an update 14 days ago

Post

3695

Should we assemble affordable open-source robots at Hugging Face for the community? Would you buy them? At what price?

8 replies

clem

posted an update 15 days ago

Post

2560

Nice new space to see how fast your personal or organization followers are growing on HF:
julien-c/follow-history

As you can see, I still have more followers than @julien-c even if he's trying to change this by building such cool spaces 😝😝😝

m-ric

posted an update 19 days ago

Post

4731

smolagents now supports vLLM! 🥳

vLLM is one of the most popular local inference solutions, and the community had been asking us to integrate it. After a heavy refactoring of our LLM classes, we've just released smolagents 1.11.0, with a brand new VLLMModel class.

Go try it and tell us what you think!

https://github.com/huggingface/smolagents/blob/45b2c86857b7f7657daaa74e4d17d347e9e2c4a4/src/smolagents/models.py#L497
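For readers who want to try it, here is a minimal sketch of wiring the new class into an agent. It assumes smolagents 1.11.0 or later and the vllm package are installed on a machine that can serve the chosen model; the helper name `build_vllm_agent` and the model ID in the commented usage are illustrative choices, not part of the announcement.

```python
def build_vllm_agent(model_id: str):
    """Return a CodeAgent whose LLM is served locally through vLLM."""
    # Imported lazily so this module can be loaded on machines without vLLM installed.
    from smolagents import CodeAgent, VLLMModel

    model = VLLMModel(model_id=model_id)
    # No extra tools here; the agent only writes and runs Python code.
    return CodeAgent(tools=[], model=model)


# Example usage (model ID is an arbitrary assumption; needs suitable hardware):
# agent = build_vllm_agent("Qwen/Qwen2.5-1.5B-Instruct")
# print(agent.run("What is the 10th Fibonacci number?"))
```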

Skylion007

authored a paper 21 days ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published 22 days ago • 67

clem

posted an update 21 days ago

Post

4611

We just crossed 1,500,000 public models on Hugging Face (and 500k spaces, 330k datasets, 50k papers). One new repository is created every 15 seconds. Congratulations all!

3 replies

m-ric

posted an update 24 days ago

Post

995

Our new Agentic leaderboard is now live!💥

If you ever asked which LLM is best for powering agents, we've just made a leaderboard that ranks them all! Built with @albertvillanova, this ranks LLMs powering a smolagents CodeAgent on subsets of various benchmarks. ✅

🏆 GPT-4.5 comes out on top, even beating reasoning models like DeepSeek-R1 or o1. And Claude-3.7-Sonnet is a close second!

The leaderboard also allows you to show the scores of vanilla LLMs (without any agentic setup) on the same benchmarks: this shows the huge improvements brought by agentic setups. 💪

(Note that results will be added manually, so the leaderboard might not always have the latest LLMs)

1 reply

JingzeShi

posted an update 25 days ago

Post

4691

We distilled a more accurate and concise dataset from DeepSeek R1, and also released the code for the distillation pipeline. 🤗

Dataset: SmallDoge/SmallThoughts
Code: https://github.com/SmallDoges/small-thoughts

Team members 184
