Submit media inputs to generate text and speech responses
Create and quantize models on Hugging Face
Generate images based on prompts and LoRA models