Sambit Mukherjee PRO

sadhaklal

AI & ML interests

None yet

Recent Activity

Organizations

Hugging Face Discord Community's profile picture

sadhaklal's activity

New activity in discord-community/LevelBot 3 days ago
commented on Tool Use, Unified 7 days ago
view reply

Excellent article. Very clearly written.

I have one question though. It seems that that model replies with either (i) a text response or (ii) a tool call. However, in the original ReAct paper, there is a "Thought" -> "Action" -> "Observation" cycle. In other words, in response to the user's query, the model first outputs a "Thought" followed by an "Action". How do I implement this (i.e., make the model "think" before performing a tool call)?

The following are the original ReAct prompts for HotpotQA (from the official ReAct GitHub repo): https://raw.githubusercontent.com/ysymyth/ReAct/refs/heads/master/prompts/prompts_naive.json

If you examine these prompts, you'll notice that the "thoughts" come before the "actions".

upvoted an article 8 days ago
posted an update 8 days ago
view post
Post
1753
What happens when you combine the Chain of Thought (CoT) reasoning capabilities of LLMs with a heuristic-guided tree search algorithm? In the Tree of Thoughts (ToT) paper, the authors (Yao et al.) have coupled GPT-4 with tree search algorithms to attack a few tasks on which left-to-right CoT struggles. And the results are impressive. For example, on the "Game of 24" task, while GPT-4 with CoT prompting only managed to solve 4% of tasks, ToT achieved a success rate of 74%.

I've written a blog post that makes the ToT paper easy to understand and implement by taking you through all the details in a step-by-step manner: https://huggingface.co/blog/sadhaklal/tree-of-thoughts

If you are interested in the topics of algorithmic AI, tree search, reasoning, planning, or "System 2" thinking, then you may find this blog post useful.
published an article 9 days ago
view article
Article

Understanding and Implementing the Tree of Thoughts Paradigm

By sadhaklal
8