1 30

Umberto Cocca

okamirvs

AI & ML interests

None yet

Recent Activity

reacted to hesamation's post with 👍 4 days ago

OpenAI just released a 34-page practical guide to building agents, Here's 10 things it teaches us: 1➜ agents are different from workflows: they are complete autonomous systems that perform tasks on your behalf. many applications use LLMs for workflows, but this is not an agent. 2➜ use them for tricky stuff: complex decision making, dynamic rules, unstructured data 3➜ core recipe: each agent has three main components: Model (the brain), Tools, Instructions on how to behave 4➜ choose the right brain: set up evals to get a baseline performance, use a smart model to see what's possible, gradually downgrade the model for cost and speed 5➜ tools are key: choose well-defined and tested tools. an agent needs tools to retrieve data and context, and take actions. 6➜ instruction matters A LOT: be super clear telling the agent its goals, steps, and rules. Vague instructions = unpredictable agent. Be explicit. 7➜ start simple, then scale: often a single agent with several tools is ok. don't jump to complex multi-agent systems immediately. 8➜ if you use multi-agents: you can have a "manager" agent directing traffic to specialist agents, or have agents hand off tasks to each other. 9➜ gaurdrails are a MUST: check user input for weird stuff, make sure the agent isn't about to do something risky, filter out private info, block harmful content. Don't let it run wild. 10➜ build and plan for humans: start small, test, improve. always have a plan for when the agent gets stuck or is about to do something high-risk. Download: https://t.co/fJaCkgf7ph

liked a model 19 days ago

all-hands/openhands-lm-32b-v0.1

liked a model about 2 months ago

Qwen/QwQ-32B

View all activity

Organizations

None yet

okamirvs's activity

reacted to hesamation's post with 👍 4 days ago

Post

2005

OpenAI just released a 34-page practical guide to building agents,

Here's 10 things it teaches us:

1➜ agents are different from workflows: they are complete autonomous systems that perform tasks on your behalf. many applications use LLMs for workflows, but this is not an agent.

2➜ use them for tricky stuff: complex decision making, dynamic rules, unstructured data

3➜ core recipe: each agent has three main components: Model (the brain), Tools, Instructions on how to behave

4➜ choose the right brain: set up evals to get a baseline performance, use a smart model to see what's possible, gradually downgrade the model for cost and speed

5➜ tools are key: choose well-defined and tested tools. an agent needs tools to retrieve data and context, and take actions.

6➜ instruction matters A LOT: be super clear telling the agent its goals, steps, and rules. Vague instructions = unpredictable agent. Be explicit.

7➜ start simple, then scale: often a single agent with several tools is ok. don't jump to complex multi-agent systems immediately.

8➜ if you use multi-agents: you can have a "manager" agent directing traffic to specialist agents, or have agents hand off tasks to each other.

9➜ gaurdrails are a MUST: check user input for weird stuff, make sure the agent isn't about to do something risky, filter out private info, block harmful content. Don't let it run wild.

10➜ build and plan for humans: start small, test, improve. always have a plan for when the agent gets stuck or is about to do something high-risk.

Download: https://t.co/fJaCkgf7ph

3 replies

liked a model 19 days ago

all-hands/openhands-lm-32b-v0.1

Text Generation • Updated 7 days ago • 223k • 356

liked a model about 2 months ago

Qwen/QwQ-32B

Text Generation • Updated Mar 11 • 667k • • 2.71k

liked a Space 2 months ago

1.42k

LuminaBrush

📈

Execute code from environment variable

liked a model 3 months ago

coqui/XTTS-v2

Text-to-Speech • Updated Dec 11, 2023 • 1.84M • 2.62k

liked a Space 3 months ago

708

TTS Arena

🏆

Vote on the latest TTS models!

liked 2 models 4 months ago

nm-testing/Llama-3.3-70B-Instruct-FP8-dynamic

Updated Dec 6, 2024 • 51k • 10

RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic

Text Generation • Updated 7 days ago • 395 • 4

liked a Space 5 months ago

2.54k

XTTS

🐸

liked 2 models 6 months ago

Qwen/Qwen2.5-3B-Instruct

Text Generation • Updated Sep 25, 2024 • 1.23M • 244

bartowski/Llama-3.2-3B-Instruct-GGUF

Text Generation • Updated Oct 8, 2024 • 79.1k • 125

liked a model 7 months ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated Feb 6 • 1.05M • • 1.18k

liked a Space 7 months ago

Joytag

📊

Tag images with descriptive labels

liked a Space 9 months ago

INVALSIbenchmark

🥇

View and filter LLM benchmark results

reacted to Xenova's post with 👍 9 months ago

Post

8106

Introducing Whisper Diarization: Multilingual speech recognition with word-level timestamps and speaker segmentation, running 100% locally in your browser thanks to 🤗 Transformers.js!

Tested on this iconic Letterman interview w/ Grace Hopper from 1983!
- Demo: Xenova/whisper-speaker-diarization
- Source code: Xenova/whisper-speaker-diarization