Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
5
Shreya Mishra
Feluda
Follow
0 followers
ยท
10 following
AI & ML interests
None yet
Recent Activity
reacted
to
alvarobartt
's
post
with ๐ฅ
15 days ago
๐ฅ Agents can do anything! @microsoft Research just announced the release of Magma 8B! Magma is a new Visual Language Model (VLM) with 8B parameters for multi-modal agents designed to handle complex interactions across virtual and real environments; and it's MIT licensed! Magma comes with exciting new features such as: - Introduces the Set-of-Mark and Trace-of-Mark techniques for fine-tuning - Leverages a large amount of unlabeled video data to learn the spatial-temporal grounding and planning - A strong generalization and ability to be fine-tuned for other agentic tasks - SOTA in different multi-modal benchmarks spanning across UI navigation, robotics manipulation, image / video understanding and spatial understanding and reasoning - Generates goal-driven visual plans and actions for agentic use cases Model: https://huggingface.co/microsoft/Magma-8B Technical Report: https://huggingface.co/papers/2502.13130
upvoted
an
article
25 days ago
Open-source DeepResearch โ Freeing our search agents
updated
a model
2 months ago
Feluda/finetuned_llama3.1_try2_1.5epch
View all activity
Organizations
None yet
Feluda
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
an
article
25 days ago
view article
Article
Open-source DeepResearch โ Freeing our search agents
Feb 4
โข
1.16k
upvoted
a
collection
2 months ago
Meta's Llama 3.1 models & evals
Collection
17 items
โข
Updated
Dec 13, 2024
โข
89