ℏεsam's picture

10 4

ℏεsam PRO

hesamation

·

AI & ML interests

post-training / reasonign models / RAG

Recent Activity

reacted to their post with ❤️ about 22 hours ago

The best researchers from Yale, Stanford, Google DeepMind, and Microsoft laid out all we know about Agents in a 264-page paper [book], Here are some of their key findings: They build a mapping of different agent components, such as perception, memory, and world modelling, to different regions of the human brain and compare them: - brain is much more energy-efficient - no genuine experience in agents - brain learns continuously, agent is static An agent is broken down to: - Perception: the agent's input mechanism. can be improved with multi-modality, feedback mechanisms (e.g., human corrections), etc. - Cognition: learning, reasoning, planning, memory. LLMs are key in this part. - Action: agent's output and tool use. Agentic memory is represented as: - Sensory memory or short-term holding of inputs which is not emphasized much in agents. - Short-term memory which is the LLM context window - Long-term memory which is the external storage such as RAG or knowledge graphs. The memory in agents can be improved and researched in terms of: - increasing the amount of stored information - how to retrieve the most relevant info - combining context-window memory with external memory - deciding what to forget or update in memory The agent must simulate or predict the future states of the environment for planning and decision-making. ai world models are much simpler than the humans' with their causal reasoning (cause-and-effect) or physical intuition. LLM world models are mostly implicit and embedded. EMOTIONS are a deep aspect of humans, helping them with social interactions, decision-making, or learning. Agents must understand emotions to better interact with us. But rather than encoding the feeling of emotions, they have a surface-level modelling of emotions. Perception is the process by which an agent receives and interprets raw data from its surroundings. READ PAPER: https://huggingface.co/papers/2504.01990

posted an update 1 day ago

The best researchers from Yale, Stanford, Google DeepMind, and Microsoft laid out all we know about Agents in a 264-page paper [book], Here are some of their key findings: They build a mapping of different agent components, such as perception, memory, and world modelling, to different regions of the human brain and compare them: - brain is much more energy-efficient - no genuine experience in agents - brain learns continuously, agent is static An agent is broken down to: - Perception: the agent's input mechanism. can be improved with multi-modality, feedback mechanisms (e.g., human corrections), etc. - Cognition: learning, reasoning, planning, memory. LLMs are key in this part. - Action: agent's output and tool use. Agentic memory is represented as: - Sensory memory or short-term holding of inputs which is not emphasized much in agents. - Short-term memory which is the LLM context window - Long-term memory which is the external storage such as RAG or knowledge graphs. The memory in agents can be improved and researched in terms of: - increasing the amount of stored information - how to retrieve the most relevant info - combining context-window memory with external memory - deciding what to forget or update in memory The agent must simulate or predict the future states of the environment for planning and decision-making. ai world models are much simpler than the humans' with their causal reasoning (cause-and-effect) or physical intuition. LLM world models are mostly implicit and embedded. EMOTIONS are a deep aspect of humans, helping them with social interactions, decision-making, or learning. Agents must understand emotions to better interact with us. But rather than encoding the feeling of emotions, they have a surface-level modelling of emotions. Perception is the process by which an agent receives and interprets raw data from its surroundings. READ PAPER: https://huggingface.co/papers/2504.01990

upvoted a paper 3 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

View all activity

Organizations

hesamation's activity

liked a Space 4 days ago

Daily Papers Atlas

A demo network of the top ML research papers.

liked a Space 10 days ago

LLM Embeddings Explained: A Visual and Intuitive Guide

How Language Models Turn Text into Meaning, From Traditional

liked a dataset 21 days ago

hesamation/git-prompt

Viewer • Updated Aug 5, 2024 • 322 • 79 • 2

liked a Space 24 days ago

Model Atlas

A demo for exploring and analyzing large-scale model repos