To Meta AI Research: I would like to fold ylacombe/expresso into the training mix of an Apache-licensed TTS model series. Can you relax the Expresso dataset license to CC-BY or something more permissive?
Barring that, could I have an individual exception to train on the materials and distribute the trained Apache-licensed models, without direct redistribution of the original files? Thanks!
Our EMOVA paper has been accepted by CVPR 2025, and we are glad to release all resources, including code (training & inference), datasets (training & evaluation), and checkpoints (EMOVA-3B/7B/72B)!
EMOVA is a novel end-to-end omni-modal LLM that can see, hear, and speak. Given omni-modal (i.e., textual, visual, and speech) inputs, EMOVA can generate both textual and speech responses with vivid emotional control by using its speech decoder and a style controller (see the loading sketch after the highlights below).
EMOVA Highlights:
- State-of-the-art omni-modality: EMOVA achieves results comparable to the state of the art on both vision-language and speech benchmarks simultaneously.
- Device adaptation: our codebase supports training/inference on both NVIDIA GPUs (e.g., A800 & H20) and Ascend NPUs (e.g., 910B3)!
- Modular design: we integrate multiple implementations of the vision encoder, vision projector, and language model, including the most recent DeepSeekMoE-tiny!
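For anyone who wants to poke at the released checkpoints, below is a minimal loading sketch built on the standard Hugging Face transformers API. The repository id is a placeholder, and the omni-modal (image/speech) interface is defined by EMOVA's own remote code, so treat everything beyond the generic `from_pretrained` / `generate` calls as assumptions rather than the official usage.

```python
# Minimal sketch: loading an EMOVA checkpoint and running a text-only smoke test.
# Assumptions: the repo id below is a placeholder, and the checkpoint's custom code
# (trust_remote_code) exposes a causal-LM-style generate(); the real omni-modal
# inputs (images, speech in/out, emotion/style control) go through the processors
# shipped in the official EMOVA codebase.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "EMOVA-3B"  # placeholder repo id -- use the id from the official release

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,  # EMOVA ships custom modeling code
)

inputs = tokenizer("What can an omni-modal assistant do?", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```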
An assembly of 18 European companies, labs, and universities has banded together to launch EuroBERT! It's a state-of-the-art multilingual encoder covering 15 languages, designed to be finetuned for retrieval, classification, etc. (a quick usage sketch follows the feature list below).
- 15 languages: English, French, German, Spanish, Chinese, Italian, Russian, Polish, Portuguese, Japanese, Vietnamese, Dutch, Arabic, Turkish, Hindi
- 3 model sizes: 210M, 610M, and 2.1B parameters - very useful sizes in my opinion
- Sequence length of 8192 tokens! Nice to see these higher sequence lengths for encoders becoming more common.
- Architecture based on Llama, but with bi-directional (non-causal) attention to turn it into an encoder. Flash Attention 2 is supported.
- A new Pareto frontier (stronger *and* smaller) for multilingual encoder models.
- Evaluated against mDeBERTa, mGTE, and XLM-RoBERTa for retrieval, classification, and regression (after finetuning for each task separately): EuroBERT punches way above its weight.
- Detailed paper with all the details, incl. training data: FineWeb for English and CulturaX for multilingual data, The Stack v2 and Proof-Pile-2 for code.
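As a concrete starting point, here is a minimal sketch of pulling sentence embeddings out of one of the base encoders with transformers. The Hub id and the mean-pooling step are my assumptions (the model card may recommend different pooling, and strong task-specific results come from finetuning), so read it as a quick smoke test rather than a recipe.

```python
# Minimal sketch: encoding text with a EuroBERT base model via transformers.
# Assumptions: the Hub id and the mean-pooling choice; check the model card for
# the officially recommended usage.
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "EuroBERT/EuroBERT-210m"  # assumed Hub id; 610M / 2.1B variants also exist

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModel.from_pretrained(model_id, trust_remote_code=True)  # Llama-style bi-directional encoder

sentences = ["EuroBERT is a multilingual encoder.", "EuroBERT est un encodeur multilingue."]
batch = tokenizer(sentences, padding=True, truncation=True, max_length=8192, return_tensors="pt")

with torch.no_grad():
    hidden = model(**batch).last_hidden_state  # (batch, seq_len, hidden_dim)

# Mean-pool over non-padding tokens to get one vector per sentence.
mask = batch["attention_mask"].unsqueeze(-1).float()
embeddings = (hidden * mask).sum(dim=1) / mask.sum(dim=1)
print(embeddings.shape)
```

Mean pooling is just one common choice here; a finetuned retrieval head or CLS-style pooling may work better depending on the downstream task.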
The next step is for researchers to build upon the 3 EuroBERT base models and publish strong retrieval, zero-shot classification, etc. models for all to use. I'm very much looking forward to it!
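To make that next step concrete, here is one possible fine-tuning sketch using the standard transformers Trainer on a placeholder classification dataset. The Hub id, the dataset (ag_news as a stand-in), and the hyperparameters are illustrative assumptions, and it presumes the checkpoint provides a sequence-classification head; it is not a published recipe.

```python
# Minimal sketch: fine-tuning a EuroBERT base model for text classification.
# Assumptions: Hub id, dataset (ag_news as a stand-in), hyperparameters, and that
# the checkpoint's (remote) code provides a sequence-classification head.
import numpy as np
from datasets import load_dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

model_id = "EuroBERT/EuroBERT-210m"  # assumed Hub id
dataset = load_dataset("ag_news")    # placeholder 4-class news classification dataset

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForSequenceClassification.from_pretrained(
    model_id, num_labels=4, trust_remote_code=True
)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = dataset.map(tokenize, batched=True)

def accuracy(eval_pred):
    logits, labels = eval_pred
    return {"accuracy": float((np.argmax(logits, axis=-1) == labels).mean())}

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="eurobert-classifier",
        per_device_train_batch_size=16,
        num_train_epochs=1,
        learning_rate=2e-5,
    ),
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["test"],
    tokenizer=tokenizer,  # enables dynamic padding via the default data collator
    compute_metrics=accuracy,
)

trainer.train()
print(trainer.evaluate())
```

A retrieval model would follow the same pattern, but with a contrastive objective (e.g., via sentence-transformers) instead of a classification head.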