Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up

All HF Hub posts

mitkox 
posted an update about 22 hours ago
view post
Post
2958
I’ve built my blocker for AI-generated content. It’s a local AI running on my laptop with a browser extension that classifies and scrubs synthetic content from my eyeballs. I’m too old for this synthetic noise.

TL;DR I’m going full John Connor on the AI content apocalypse

Think of it as an on device AI ad-blocker, but for:
Em-dash overdose. Seriously, why is everything suddenly revolutionary—disruptive—life-changing?
AI influencers’ auto-generated posts and images, auto-posted, all hands-free.
Fake news, fake images, fake people... puff.

Surprisingly, it works. I suppose it will block some human-generated content. However, I would rather read a 2007 Myspace blog than another “10 Growth Hacks Powered By ChatGPT” post.
  • 2 replies
·
prithivMLmods 
posted an update 3 days ago
view post
Post
6633
Introducing Gliese-OCR-7B-Post1.0, a document content-structure retrieval VLM designed for content extraction(OCRs) and summarization. This is the third model in the Camel Doc OCR VLM series, following Camel-Doc-OCR-062825. The new version fixes formal table reconstruction issues in both En and Zh, achieving optimal performance for long-context inferences. This model also shows significant improvements in LaTeX and Markdown rendering for OCR tasks.

🤗 Gliese-OCR-7B-Post1.0 : prithivMLmods/Gliese-OCR-7B-Post1.0
✨ Demo Space/App : prithivMLmods/Multimodal-VLM-v1.0
📌 Gliese-Post1.0 Collection : prithivMLmods/gliese-post10-68c52c4a6ca4935f5259a6d7
⬅️ Previous Versions : prithivMLmods/Camel-Doc-OCR-062825
🧨 Gliese-OCR-7B-Post1.0 (4-bit) Notebook Demo on T4 : prithivMLmods/Gliese-OCR-7B-Post1.0
📖 GitHub [Gliese-OCR-7B-Post1.0(4-bit)-reportlab] : https://tinyurl.com/ys7zuerc

Other Collections:

➔ Multimodal Implementations : prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0
➔ Multimodal VLMs - Aug'25 : prithivMLmods/multimodal-vlms-aug25-68a56aac39fe8084f3c168bd
➔ Multimodal VLMs - July'25 : prithivMLmods/multimodal-vlms-until-july25-688312e6b840e1e156f13027

.
.
.
To know more about it, visit the app page or the respective model page!!
lysandre 
posted an update about 13 hours ago
view post
Post
1865
We're kick-starting the process of Transformers v5, with @ArthurZ and @cyrilvallez !

v5 should be significant: we're using it as a milestone for performance optimizations, saner defaults, and a much cleaner code base worthy of 2025.

Fun fact: v4.0.0-rc-1 came out on Nov 19, 2020, nearly five years ago!
  • 2 replies
·
Jaward 
posted an update 3 days ago
Kseniase 
posted an update 1 day ago
view post
Post
4553
6 Recent & free sources to master Reinforcement Learning

Almost every week new research and resources on RL come out. Knowledge needs to be constantly refreshed and updated with the latest trends. So today, we’re sharing 6 free sources to help you stay on track with RL:

1. A Survey of Continual Reinforcement Learning → https://arxiv.org/abs/2506.21872
Covers continual RL (CRL): how agents can keep learning and adapt to new tasks without forgetting past ones. It analyses methods, benchmarks, evaluation metrics &challenges

2. The Deep Reinforcement Learning course by Hugging Face → https://huggingface.co/learn/deep-rl-course/unit0/introduction
This is a popular free course, regularly updated. Includes community interaction, exercises, leaderboards, etc.

3. Reinforcement Learning Specialization (Coursera, University of Alberta) → https://www.coursera.org/specializations/reinforcement-learning
A 4-course series introducing foundational RL, implementing different algorithms, culminating in a capstone. It's a great structured path

4. A Technical Survey of Reinforcement Learning Techniques for LLMs → A Technical Survey of Reinforcement Learning Techniques for Large Language Models (2507.04136)
Looks at how RL is being used for/with LLMs for alignment, reasoning, preference signals, etc. Covers methods like RLHF, RLAIF, DPO, PPO, GRPO & applications from code gen to tool use

5. A Survey of Reinforcement Learning for Software Engineering → https://arxiv.org/abs/2507.12483
Good if you're interested in RL-applied domains. Examines how RL is used in software engineering tasks: maintenance, development, evaluation. Covering 115 papers since DRL introduction, it summarizes trends, gaps & challenges

6. A Survey of Reinforcement Learning for LRMs → https://arxiv.org/abs/2509.08827
Tracks the way from LLMs to LRMs via RL. Covers reward design, policy optimization, use cases and future approaches like continual, memory, model-based RL and more

If you liked this, subscribe to The Turing Post https://www.turingpost.com/subscribe
  • 1 reply
·
pagezyhf 
posted an update 4 days ago
view post
Post
3544
🤝 Collaborating with AMD to ensure Hugging Face Transformers runs smoothly on AMD GPUs!

We run daily CI on AMD MI325 to track the health of the most important model architectures and we’ve just made our internal dashboard public.

By making this easily accessible, we hope to spark community contributions and improve support for everyone!
  • 2 replies
·
DualityAI-RebekahBogdanoff 
posted an update 3 days ago
view post
Post
3397
🎉 Big congratulations to the winners of the "Synthetic 2 Real Object Detection Challenge 2", the second Kaggle challenge that Duality AI hosted. This competition was more fierce than the last one, but these users managed to clench the win!

🥇 1st place: @sergio-sanz-rodriguez (see the blog he produced with us outlining how he achieved his results: https://tinyurl.com/mreunr98)

🥈 2nd place: Kaggle user Diana Shilova - https://tinyurl.com/yjjz3szm

🥉 3rd place: Kaggle user 眉间尺 - https://tinyurl.com/ycxskfzv

View the entire leaderboard at - https://tinyurl.com/jm2ery7w

Join our current Multi-Class Object Detection Synthetic2Real Kaggle competition here: https://tinyurl.com/y224rttu

And we will soon be launching a new competition in a new domain! Hint: 🌄 🏠 🌳 ✈️
sergiopaniego 
posted an update 5 days ago
view post
Post
3999
gpt-oss was possible thanks to new engineering efforts in 🤗 transformers. We just dropped a blog covering them:

- Kernels from the Hub
- MXFP4 Quantization
- Tensor & Expert Parallelism
- Dynamic Sliding Window & Cache
- Continuous Batching & Paged Attention

Grab a coffee & dive in! ☕️

https://huggingface.co/blog/faster-transformers
hesamation 
posted an update 11 days ago
view post
Post
5785
a senior engineer at google just dropped a 400-page free book on docs for review: agentic design patterns.

the table of contents looks like everything you need to know about agents + code:
> advanced prompt techniques
> multi-agent patterns
> tool use and MCP
> you name it

read it here: https://docs.google.com/document/d/1rsaK53T3Lg5KoGwvf8ukOUvbELRtH-V0LnOIFDxBryE/edit?tab=t.0#heading=h.pxcur8v2qagu

you can also pre-order on Amazon (published by Springer) and the royalties goes to Save the Children: https://www.amazon.com/Agentic-Design-Patterns-Hands-Intelligent/dp/3032014018/
aposadasn 
posted an update about 7 hours ago
view post
Post
83
My team at arclabmit created a robotic teleoperation and learning software for controlling robots, recording datasets, and training physical AI models, which is compatible with lerobot . This work was part of a paper we published to ICCR Kyoto 2025. Check out or code here: https://github.com/ARCLab-MIT/beavr-bot/tree/main

Our work aims to solve two key problems in the world of robotic manipulation:

1. The lack of a well-developed, open-source, accessible teleoperation system that can work out of the box.

2. No performant end-to-end control, recording, and learning platform for robots that is completely hardware agnostic.

If you are curious to learn more or have any questions please feel free to reach out!

Paper: BEAVR: Bimanual, multi-Embodiment, Accessible, Virtual Reality Teleoperation System for Robots (2508.09606)