singhsidhukuldeep posted an update Aug 6, 2024
πŸ—“οΈ Remember when last April, @Meta released Segment Anything Model (SAM) paper and it was too good to be true. 🀯

They have now released Segment Anything Model 2 (SAM 2), and it's mind-blowingly great! 🚀

SAM 2 is the first unified model for segmenting objects across images and videos. You can use a click, box, or mask as the prompt to select an object in any image or any frame of a video. 🖼️📹
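To make the "click selects an object" idea concrete, here is a toy sketch (not SAM 2's actual API; all names are illustrative): given a label map, a click prompt returns the connected region under the clicked pixel via a flood fill.

```python
# Toy illustration of a "click" prompt selecting one object:
# flood-fill the connected region of a label map under the click.
import numpy as np
from collections import deque

def mask_from_click(labels: np.ndarray, click: tuple) -> np.ndarray:
    """Return a boolean mask of the connected region containing the click."""
    target = labels[click]
    mask = np.zeros(labels.shape, dtype=bool)
    queue = deque([click])
    while queue:
        y, x = queue.popleft()
        if mask[y, x] or labels[y, x] != target:
            continue
        mask[y, x] = True
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ny, nx = y + dy, x + dx
            if 0 <= ny < labels.shape[0] and 0 <= nx < labels.shape[1]:
                queue.append((ny, nx))
    return mask

# A 4x4 label map with three regions; clicking (0, 0) selects the "1" region.
labels = np.array([[1, 1, 2, 2],
                   [1, 1, 2, 2],
                   [3, 3, 2, 2],
                   [3, 3, 2, 2]])
print(mask_from_click(labels, (0, 0)).sum())  # → 4
```

The real model of course predicts the mask from pixels rather than a given label map, but the prompt-to-mask contract is the same: one click in, one object mask out.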

SAM consists of an image encoder that encodes the image and a prompt encoder that encodes the prompts; the outputs of both are fed to a mask decoder, which generates the masks. 🎭
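The data flow above can be sketched schematically, with random projections standing in for the real networks (all shapes and function names here are illustrative, not SAM's actual architecture):

```python
# Schematic sketch of the encoder/encoder -> decoder flow described above.
# Random projections stand in for the learned networks.
import numpy as np

rng = np.random.default_rng(0)

def image_encoder(image):
    # image (H, W, 3) -> dense per-pixel features (H, W, 64)
    return image @ rng.standard_normal((3, 64))

def prompt_encoder(points):
    # N point prompts (N, 2) -> prompt embeddings (N, 64)
    return points @ rng.standard_normal((2, 64))

def mask_decoder(img_feats, prompt_embs):
    # Score each pixel's feature against each prompt embedding,
    # then threshold: one binary mask per prompt.
    logits = np.einsum("hwc,nc->nhw", img_feats, prompt_embs)
    return logits > 0

image = rng.standard_normal((32, 32, 3))
points = np.array([[10.0, 12.0]])  # a single click prompt
masks = mask_decoder(image_encoder(image), prompt_encoder(points))
print(masks.shape)  # (1, 32, 32)
```

The design point this illustrates: the heavy image encoding runs once per image, while prompts are cheap to encode, so you can interactively refine clicks without re-encoding the image.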

The biggest jump from SAM to SAM 2 is the use of memory to keep masking consistent across video frames. They call this masklet prediction! 🧠
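The gist of carrying memory across frames can be shown with a toy stand-in (this is purely illustrative; SAM 2's actual mechanism is learned memory attention, not the simple logit averaging used here):

```python
# Toy stand-in for cross-frame mask memory: blend each frame's raw mask
# logits with an average of recent fused logits so predictions stay
# temporally consistent. NOT SAM 2's actual memory-attention mechanism.
import numpy as np
from collections import deque

class MaskletTracker:
    def __init__(self, memory_size: int = 4, memory_weight: float = 0.5):
        self.memory = deque(maxlen=memory_size)  # recent fused logits
        self.memory_weight = memory_weight

    def step(self, frame_logits: np.ndarray) -> np.ndarray:
        """Fuse the current frame's logits with the memory bank."""
        if self.memory:
            past = np.mean(self.memory, axis=0)
            fused = ((1 - self.memory_weight) * frame_logits
                     + self.memory_weight * past)
        else:
            fused = frame_logits  # first frame: nothing to remember yet
        self.memory.append(fused)
        return fused > 0  # binary mask for this frame

tracker = MaskletTracker()
rng = np.random.default_rng(1)
masks = [tracker.step(rng.standard_normal((8, 8))) for _ in range(5)]
print(len(masks), masks[0].shape)  # 5 (8, 8)
```

Even this crude smoothing shows why memory helps: a one-frame glitch gets outvoted by the recent history instead of breaking the track.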

They have also released the dataset, SA-V. It is truly huge, with 190.9K manual annotations and 451.7K automatic ones! 📊

📄 Paper: https://ai.meta.com/research/publications/sam-2-segment-anything-in-images-and-videos/

πŸ“ Blog: https://ai.meta.com/sam2/

🔗 Demo: https://sam2.metademolab.com/demo

💾 Model Weights: https://github.com/facebookresearch/segment-anything-2/blob/main/checkpoints/download_ckpts.sh

πŸ“ Dataset: https://ai.meta.com/datasets/segment-anything-video-downloads/

Tried it on a basic Minecraft video and the tracking was not so good. oof