radames (Radamés Ajna)

posted an update about 15 hours ago

Post

799

Thanks to @OzzyGT for pushing the new Anyline preprocessor to https://github.com/huggingface/controlnet_aux. Now you can use the TheMistoAI/MistoLine ControlNet with Diffusers completely.

Here's a demo for you: radames/MistoLine-ControlNet-demo
Super resolution version: radames/Enhance-This-HiDiffusion-SDXL

from controlnet_aux import AnylineDetector

anyline = AnylineDetector.from_pretrained(
    "TheMistoAI/MistoLine", filename="MTEED.pth", subfolder="Anyline"
).to("cuda")

source = Image.open("source.png")
result = anyline(source, detect_resolution=1280)

replied to their post 2 days ago

yes that's the idea, you Sign in with HF and it syncs the db with personal dataset , in case you reboot it I'll look back your state. 👀 into it

posted an update 8 days ago

Post

2126

At Google I/O 2024, we're collaborating with the Google Visual Blocks team (https://visualblocks.withgoogle.com) to release custom Hugging Face nodes. Visual Blocks for ML is a browser-based tool that allows users to create machine learning pipelines using a visual interface. We're launching nodes with Transformers.js, running models on the browser, as well as server-side nodes running Transformers pipeline tasks and LLMs using our hosted inference. With @Xenova @JasonMayes

You can learn more about it here https://huggingface.co/blog/radames/hugging-face-google-visual-blocks

Source-code for the custom nodes:
https://github.com/huggingface/visual-blocks-custom-components

replied to their post 8 days ago

Yes that's a great idea, I'm chatting with folks from Convex, and check if the sqlite db is the only file I need to backup, then I'll set a scheduler to push it to a personal dataset. Folks them would be able to pause and restart from that state!

replied to their post 9 days ago

Thanks! It was a fun challenge to put it all together in a single container. I'm excited to try more Convexdb as a vector db and backend.

posted an update 9 days ago

Post

1880

AI-town now runs on Hugging Face Spaces with our API for LLMs and embeddings, including the open-source Convex backend, all in one container. Easy to duplicate and config on your own

Demo: radames/ai-town
Instructions: https://github.com/radames/ai-town-huggingface

6 replies

·

replied to dhruvabansal's post 13 days ago

Just saying that I really like the ability to quickly test your model against the monster ones! It's amazing how well it performs against Claude. 🤯

replied to Xenova's post 14 days ago

Amazing!! Shall we make a VB node for this?

posted an update 15 days ago

Post

2457

HiDiffusion SDXL now supports Image-to-Image, so I've created an "Enhance This" version using the latest ControlNet Line Art model called MistoLine. It's faster than DemoFusion

Demo: radames/Enhance-This-HiDiffusion-SDXL

Older version based on DemoFusion radames/Enhance-This-DemoFusion-SDXL

New Controlnet SDXL Controls Every Line TheMistoAI/MistoLine

HiDiffusion is compatible with diffusers and support many SD models - https://github.com/megvii-research/HiDiffusion

1 reply

·

replied to Sentdex's post 22 days ago

Thanks for the summary, @Sentdex !
For the curious, there are some examples on their GitHub repo https://github.com/KindXiaoming/pykan

replied to renyuxi's post 23 days ago

Hi @renyuxi , thanks for sharing this update! 8 steps with CFG and negative prompts is amazing!

posted an update 24 days ago

Post

2420

I've built a custom component that integrates Rerun web viewer with Gradio, making it easier to share your demos as Gradio apps.

Basic snippet

# pip install gradio_rerun gradio
import gradio as gr
from gradio_rerun import Rerun

gr.Interface(
    inputs=gr.File(file_count="multiple", type="filepath"),
    outputs=Rerun(height=900),
    fn=lambda file_path: file_path,
).launch()

More details here radames/gradio_rerun
Source https://github.com/radames/gradio-rerun-viewer

Follow Rerun here https://huggingface.co/rerun

replied to oliveryanzuolu's post about 1 month ago

amazing work @oliveryanzuolu 👏 Do you have plans to release the training distillation code?

posted an update about 1 month ago

Post

2439

ByteDance released new distillation technique Hyper-SD ( ByteDance/Hyper-SD) for efficient image generation.
Here a few demos:

Official:
Hyper-SDXL-1Step-T2I ByteDance/Hyper-SDXL-1Step-T2I

Hyper-SD15-Scribble ByteDance/Hyper-SD15-Scribble

Unofficial Demos: InstantStyle + Hyper SD1.5 (not great but super fast) radames/InstantStyle-Hyper-SD

InstantStyle + Hyper SDXL radames/InstantStyle-Hyper-SDXL

posted an update about 1 month ago

Post

2172

InstantStyle works with the 2-step SDXL-Lightning distilled model, reducing generation time from ~20s to ~9s!

In a big related update, as of today, Diffusers@main supports InstantStyle. I'm looking forward to playing with it!

https://github.com/huggingface/diffusers/pull/7668

radames/InstantStyle-SDXL-Lightning
ByteDance/SDXL-Lightning

replied to andrewrreed's post about 1 month ago

Very interesting, @andrewrreed , and completely unaware of this feature! Do you know of any other strategies for grounded generation in models like LLaMA or Mistral?

posted an update about 1 month ago

Post

3834

Here's a utility component for integrating your Gradio app with Hugging Face. This custom component enables you to search for models, spaces, datasets, and users.

pip install gradio_huggingfacehub_search

You can see it in action here. arcee-ai/mergekit-config-generator

And learn how to use it here radames/gradio_huggingfacehub_search

posted an update about 2 months ago

Post

2734

Following up on @vikhyatk 's Moondream2 update and @santiagomed 's implementation on Candle, I quickly put togheter the WASM module so that you could try running the ~1.5GB quantized model in the browser. Perhaps the next step is to rewrite it using https://github.com/huggingface/ratchet and run it even faster with WebGPU, @FL33TW00D-HF .

radames/Candle-Moondream-2

ps: I have a collection of all Candle WASM demos here radames/candle-wasm-examples-650898dee13ff96230ce3e1f

replied to freddyaboulton's post 2 months ago

nice!! can you set the jpeg quality as well?

replied to chansung's post 2 months ago

thanks @chansung this is so helpful! btw you could launch a quantize version here https://huggingface.co/spaces/chansung/gradio_together_tgi/blob/main/entrypoint.sh.template#L11 and even try running this on CPU.

replied to Wauplin's post 2 months ago

@Wauplin is doing impressive work here. 👏

posted an update 2 months ago

Post

3502

Testing new pix2pix-Turbo in real-time, very interesting GAN architecture that leverages SD-Turbo model. Here I'm using edge2image LoRA single-step inference 🤯

It's very interesting how ControlNet Canny quality is comparable, but in a single step. Looking forward to when they release the code: https://github.com/GaParmar/img2img-turbo/issues/1

I've been keeping a list of fast diffusion model pipelines together with this real-time websocket app. Have a look if you want to test it locally, or check out the demo here on Spaces.

radames/real-time-pix2pix-turbo

Github app:
https://github.com/radames/Real-Time-Latent-Consistency-Model/

You can also check the authors img2img sketch model here

gparmar/img2img-turbo-sketch

Refs:
One-Step Image Translation with Text-to-Image Models (2403.12036)

cc @gparmar @junyanz

replied to visheratin's post 3 months ago

hi @visheratin , do you have any guides on how to train similar model? Phi-2 + SigLIP vision encoder?

replied to victor's post 4 months ago

very cool! I just ordered RPi5 to run some tests, also this awesome mic hat

replied to victor's post 4 months ago

I know it's possible to run real-time whisper on a rapberrypi with whisper.cpp @ggerganov

replied to victor's post 4 months ago

Are you thinking of running it on a device or in the cloud?

replied to abhishek's post 5 months ago

hello 👋

Radamés Ajna

AI & ML interests

Articles

Hugging Face + Google Visual Blocks

Organizations

radames's activity