37 534

Zoroa Strella PRO

ZoroaStrella

AI & ML interests

None yet

Recent Activity

liked a model about 8 hours ago

Skywork/SkyReels-A2

liked a Space 6 days ago

VAST-AI/TripoSG

liked a Space 6 days ago

enzostvs/deepsite

ZoroaStrella's activity

upvoted a collection 20 days ago

YuE

YuE: Open Full-song Generation Foundation Model • 11 items • Updated 17 days ago • 23

upvoted 2 collections 4 months ago

Hymba

A series of Hybrid Small Language Models. • 2 items • Updated about 14 hours ago • 29

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 74

upvoted a collection 6 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 22 days ago • 300

upvoted a collection 7 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 226

upvoted 2 collections 8 months ago

Qwen2-Audio

Audio-language model series based on Qwen2 • 4 items • Updated Nov 28, 2024 • 56

Parler-TTS: fully open-source high-quality TTS

If you want to find out more about how these models were trained and even fine-tune them yourself, check out the Parler-TTS repository on GitHub. • 8 items • Updated Dec 2, 2024 • 50

upvoted 3 collections 9 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated about 14 hours ago • 60

Chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. • 2 items • Updated Jul 9, 2024 • 28

xLAM models

xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 20 items • Updated 3 days ago • 48

upvoted an article 9 months ago

SmolLM - blazingly fast and remarkably powerful

Article • Jul 16, 2024 • 350

upvoted a paper 9 months ago

SEED-Story: Multimodal Long Story Generation with Large Language Model

Paper • 2407.08683 • Published Jul 11, 2024 • 25

upvoted a collection 9 months ago

InternLM2.5

14 items • Updated Feb 11 • 71

upvoted 3 collections 10 months ago

Florence

9 items • Updated Jan 8 • 167

4M Models

Multimodal models from https://4m.epfl.ch/ • 17 items • Updated 28 days ago • 31

GLM-4

GLM-4 Open Models • 14 items • Updated Feb 22 • 117

upvoted a collection 12 months ago

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 91

upvoted a paper 12 months ago

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Paper • 2404.07839 • Published Apr 11, 2024 • 47

upvoted 2 papers about 1 year ago

Video Interpolation with Diffusion Models

Paper • 2404.01203 • Published Apr 1, 2024 • 2

CameraCtrl: Enabling Camera Control for Text-to-Video Generation

Paper • 2404.02101 • Published Apr 2, 2024 • 23