keras-dreambooth (Keras Dreambooth Event)

kadirnar

posted an update 30 days ago

Post

3878

Researchers developed Sonic AI enabling precise facial animation from speech cues 🎧 Decouples head/expression control via audio tone analysis + time-aware fusion for natural long-form synthesis

1 reply

·

eienmojiki

posted an update about 1 month ago

Post

2102

🪄 LayerDiffuse - Flux Version (Demo) 🪄

LayerDiffuse - Transparent Image Layer Diffusion using Latent Transparency

Demo: https://huggingface.co/spaces/eienmojiki/Flux-LayerDiffuse

3 replies

·

lucifertrj

posted an update about 1 month ago

Post

541

Bhagavad Gita GPT assistant - Build fast RAG pipeline to index 1000+ pages using Binary Optimization

DeepSeek R-1 and Qdrant Binary Quantization

Check out the latest tutorial where we build a Bhagavad Gita GPT assistant—covering:
- DeepSeek R1 vs OpenAI O1
- Using Qdrant client with Binary Quantization
- Building the RAG pipeline with LlamaIndex
- Running inference with DeepSeek R1 Distill model on Groq
- Develop Streamlit app for the chatbot inference

Watch the full implementation here: https://www.youtube.com/watch?v=NK1wp3YVY4Q

1 reply

·

not-lain

posted an update about 1 month ago

Post

4285

I have just released a new blogpost about kv caching and its role in inference speedup 🚀
🔗 https://huggingface.co/blog/not-lain/kv-caching/
some takeaways :

4 replies

·

kadirnar

posted an update about 2 months ago

Post

3047

I created my own AI image and video from scratch using the fal.ai platform 💫

Workflow: Flux Lora Training + Upscale + Kling AI(1.6)

5 replies

·

not-lain

posted an update about 2 months ago

Post

1649

we now have more than 2000 public AI models using ModelHubMixin🤗

not-lain

posted an update about 2 months ago

Post

4022

Published a new blogpost 📖
In this blogpost I have gone through the transformers' architecture emphasizing how shapes propagate throughout each layer.
🔗 https://huggingface.co/blog/not-lain/tensor-dims
some interesting takeaways :

nielsgl

updated a Space 2 months ago

11

Dreambooth Pug Ace

🐶

asusevski

authored a paper 3 months ago

Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published Dec 10, 2024 • 27

lucifertrj

posted an update 3 months ago

Post

540

Image Prompt Engineering Guide:
➡️ Artistic styling for Image generation
➡️ Prompt weighting using the parentheses method to generate realistic images.
➡️ Advanced features like style and positioning control[experimental].
➡️ Image placement on the generated AI image using Recraft V3 Mockup.

Watch: https://www.youtube.com/watch?v=d3nUG28-jIc

lunarflu

posted an update 3 months ago

Post

1826

great blogpost! 🔥@wolfram
https://huggingface.co/blog/wolfram/llm-comparison-test-2024-12-04

not-lain

posted an update 4 months ago

Post

2323

ever wondered how you can make an API call to a visual-question-answering model without sending an image url 👀

you can do that by converting your local image to base64 and sending it to the API.

recently I made some changes to my library "loadimg" that allows you to make converting images to base64 a breeze.
🔗 https://github.com/not-lain/loadimg

API request example 🛠️:

from loadimg import load_img
from huggingface_hub import InferenceClient

# or load a local image
my_b64_img = load_img(imgPath_url_pillow_or_numpy ,output_type="base64" ) 

client = InferenceClient(api_key="hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx")

messages = [
	{
		"role": "user",
		"content": [
			{
				"type": "text",
				"text": "Describe this image in one sentence."
			},
			{
				"type": "image_url",
				"image_url": {
					"url": my_b64_img # base64 allows using images without uploading them to the web
				}
			}
		]
	}
]

stream = client.chat.completions.create(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct", 
	messages=messages, 
	max_tokens=500,
	stream=True
)

for chunk in stream:
    print(chunk.choices[0].delta.content, end="")

lucifertrj

authored a paper 5 months ago

News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News

Paper • 2410.07520 • Published Oct 10, 2024

lucifertrj

posted an update 5 months ago

Post

1551

AI Agents LlamaIndex in 40 minutes

The video covers code and workflow explanations for:

- Function Calling
- Function Calling Agents + Agent Runner
- Agentic RAG
- REAcT Agent: Build your own Search Assistant Agent

Watch: https://youtu.be/bHn4dLJYIqE

bharat-raghunathan

updated 2 models 6 months ago

keras-dreambooth/dreambooth_dosa_v2

Text-to-Image • Updated Sep 3, 2024 • 23

keras-dreambooth/dreambooth_dosa

Updated Sep 3, 2024 • 16

kadirnar

posted an update 6 months ago

Post

4388

I am training a controlnet model for Flux. And some of my experiences:

Checkpoint-10000:

https://x.com/kadirnar_ai/status/1829831750471606668

Checkpoint-12000:

https://x.com/kadirnar_ai/status/1829889524962640001

Checkpoint-14000:

https://x.com/kadirnar_ai/status/1829989622878744711

Checkpoint (16000-18000):

https://x.com/kadirnar_ai/status/1830179551407665654

Dataset: kadirnar/fluxdev_controlnet_16k
GPU: 1xA100(80GB)
GPU Hours: 65

1 reply

·

tgohblio

updated a model 7 months ago

keras-dreambooth/dreambooth_hogwarts_legacy

Text-to-Image • Updated Aug 23, 2024 • 43

lunarflu

posted an update 7 months ago

Post

1367

@Blane187 could you please modify the title of your blogpost? content is cool, title could be nicer imo https://huggingface.co/blog/Blane187/wtf-is-rvc