Running on CPU Upgrade 274 274 Serverless ImgGen Hub ♨ Highly hackable hub w/ Flux, SD 3.5, LoRAs, no GPUs required
Ultravox v0.5 Collection Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (fine-tuned) backbone. • 3 items • Updated Feb 10 • 13
D_AU - Dark Planet Series (see "source" coll. for FP) Collection A dark bias collection of models for any creative use such as writing, fiction, storytelling, role play and other uses. Example gens at each repo. • 32 items • Updated 4 days ago • 11
Running on L40S 406 406 Stable Virtual Camera ⚡ Generate videos from multiple input images with 3D camera control
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 8 items • Updated Mar 21 • 22
Wan2.1 14B 480p I2V LoRAs Collection A collection of Remade's Wan2.1 14B 480p I2V LoRAs • 39 items • Updated 22 days ago • 104