Cuiunbo PRO

Cuiunbo

AI & ML interests

Anything

Cuiunbo's activity

New activity in lerobot/pi0fast_base 24 days ago

Hi!

#1 opened 26 days ago by

Cuiunbo

liked a model 26 days ago

lerobot/pi0fast_base

Robotics • Updated 27 days ago • 1.31k • 12

liked a model about 2 months ago

lerobot/pi0

Robotics • Updated Mar 6 • 11.8k • 230

New activity in HKUSTAudio/Llasa-3B 2 months ago

Are There Quantitative Metrics, Such as Simo Compared to Other TTS?

#7 opened 3 months ago by

Cuiunbo

liked a dataset 2 months ago

liboaccn/MIT-10M

Viewer • Updated Mar 13 • 10.9M • 210 • 9

New activity in openbmb/MiniCPM-o-2_6 2 months ago

test

#33 opened 3 months ago by

horpheu

updated a model 2 months ago

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated about 1 month ago • 203k • 1.12k

New activity in openbmb/MiniCPM-o-2_6 2 months ago

`streaming_prefill` can't handle the parameter `omni_input=False`

#31 opened 3 months ago by

yzmyyff

liked a model 3 months ago

HuggingFaceTB/SmolVLM-256M-Instruct

Image-Text-to-Text • Updated 19 days ago • 307k • 217

reacted to merve's post with 🤗❤️ 3 months ago

Post

2635

Everything that happened this week in open AI, a recap 🤠 merve/jan-17-releases-678a673a9de4a4675f215bf5

👀 Multimodal
- MiniCPM-o 2.6 is a new sota any-to-any model by OpenBMB (vision, speech and text!)
- VideoChat-Flash-Qwen2.5-2B is a new family of video multimodal models by OpenGVLab that comes in sizes 2B & 7B and resolutions 224 & 448
- ByteDance released larger SA2VA that comes in 26B parameters
- Dataset: VRC-Bench is a new diverse benchmark for multimodal LLM reasoning performance

💬 LLMs
- MiniMax-Text-01 is a new huge language model (456B total, 45.9B active params) by MiniMaxAI with context length of 4M tokens 🤯
- Dataset: Sky-T1-data-17k is a diverse dataset used to train Sky-T1-32B
- kyutai released Helium-1-Preview-2B, a new small multilingual LM
- Wayfarer-12B is a new LLM able to write D&D 🧙🏻‍♂️
- ReaderLM-v2 is a new HTML parsing model by Jina AI
- Dria released Dria-Agent-a-3B, a new agentic coding model (Pythonic function calling) based on Qwen2.5 Coder
- Unsloth released Phi-4, faster and memory efficient Llama 3.3

🖼️ Vision
- MatchAnything is a new foundation model for matching
- FitDit is a high-fidelity VTON model based on DiT architecture

🗣️ Audio
- OuteTTS-0.3-1B is a new multilingual text-to-speech model with voice cloning and emotion control capabilities

📖 Retrieval
- lightblue released LB-reranker-0.5B-v1.0, a new reranker based on Qwen2.5 that can handle 95+ languages
- cde-small-v2 is a new sota small retrieval model by @jxm

New activity in openbmb/MiniCPM-o-2_6-gguf 3 months ago

add task to the model for better discoverability

#2 opened 3 months ago by

reach-vb

reacted to alibabasglab's post with 🚀 3 months ago

Post

1238

Introducing the open-source ClearerVoice-Studio, a powerful speech processing AI tool that dramatically improves your speech quality. Check out the demo page: alibabasglab/ClearVoice and https://modelscope.cn/studios/iic/ClearerVoice-Studio. Give us a star on GitHub: https://github.com/modelscope/ClearerVoice-Studio!

liked a Space 3 months ago

166

ViTPose Transformers

⚡

Detect and annotate poses in images and videos

replied to mitkox's post 3 months ago

nice! Looking forward to seeing your work!

reacted to mitkox's post with 👀🚀 3 months ago

Post

1432

Training a model to reason in the continuous latent space based on Meta's Coconut.
If it all works, I will apply it to the MiniCPM-o SVD-LR.
The endgame is a multimodal, adaptive, and efficient foundational on-device AI model.

2 replies

New activity in openbmb/MiniCPM-o-2_6 3 months ago

Where are the MiniCPM-o 2.6 int4 weights?

#9 opened 3 months ago by

AiCreatornator

int-4 quantized version is missing [link broken]

#10 opened 3 months ago by

pranay-ar