Dmitry Ryumin

DmitryRyumin

AI & ML interests

Machine Learning and Applications, Multi-Modal Understanding

Organizations

Gradio-Themes-Party, Gradio-Blocks-Party, Blog-explorers, New Era Artificial Intelligence, ICCV2023, ZeroGPU Explorers, Journalists on Hugging Face, Social Post Explorers, Dev Mode Explorers

DmitryRyumin's activity

upvoted an article 3 days ago

Welcome Llama 4 Maverick & Scout on Hugging Face!

reacted to AdinaY's post with 🔥 10 days ago
AReaL-boba 🔥 a fully open RL framework released by AntGroup, an affiliate of Alibaba.
inclusionAI/areal-boba-67e9f3fa5aeb74b76dcf5f0a
✨ 7B/32B - Apache 2.0
✨ Outperforms on math reasoning
✨ Replicates QwQ-32B with 200 data points, for under $200
✨ All-in-one: weights, datasets, code & tech report (see the loading sketch below)
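
A minimal sketch of how a checkpoint from this collection could be loaded through the standard `transformers` API, assuming the released weights use the usual Hub layout. The model ID below is a hypothetical placeholder; take the real repository name from the collection linked above.

```python
# Minimal sketch, assuming the released weights follow the standard Hub layout.
# The model ID is a hypothetical placeholder -- substitute the actual repo name
# from the inclusionAI/areal-boba collection.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "inclusionAI/AReaL-boba-SFT-7B"  # placeholder, not a confirmed repo name

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

# A small math-reasoning prompt, the benchmark area the release highlights.
prompt = "What is the sum of the first 100 positive integers?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```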
reacted to KaiChen1998's post with 👍 25 days ago
📢 Our EMOVA paper has been accepted by CVPR 2025, and we are glad to release all resources, including code (training & inference), datasets (training & evaluation), and checkpoints (EMOVA-3B/7B/72B)!

🤗 EMOVA is a novel end-to-end omni-modal LLM that can see, hear and speak. Given omni-modal (i.e., textual, visual and speech) inputs, EMOVA can generate both textual and speech responses with vivid emotional controls by utilizing the speech decoder and a style controller.

✨ EMOVA Highlights
✅ State-of-the-art omni-modality: EMOVA simultaneously achieves results comparable to the state of the art on both vision-language and speech benchmarks.
✅ Device adaptation: our codebase supports training and inference on both NVIDIA GPUs (e.g., A800 & H20) and Ascend NPUs (e.g., 910B3)!
✅ Modular design: we integrate multiple implementations of the vision encoder, vision projector, and language model, including even the most recent DeepSeekMoE-tiny!

🔥 You are all welcome to try it out and give the repo a star!
- Project page: https://emova-ollm.github.io/
- GitHub: https://github.com/emova-ollm/EMOVA
- Demo: Emova-ollm/EMOVA-demo
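
For reference, here is a rough sketch of how one of the released checkpoints might be pulled from the Hub. The repository name is a hypothetical placeholder and the custom omni-modal model/processor classes are assumed to ship with the checkpoint (hence trust_remote_code=True); the GitHub README documents the actual usage.

```python
# Minimal sketch, not the project's documented API: the repo ID is a hypothetical
# placeholder, and the omni-modal model/processor code is assumed to be bundled
# with the checkpoint via trust_remote_code.
from transformers import AutoModel, AutoProcessor

model_id = "Emova-ollm/EMOVA-7B"  # placeholder; see the GitHub repo for the released names

processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModel.from_pretrained(
    model_id,
    trust_remote_code=True,
    torch_dtype="auto",
    device_map="auto",
)
```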