Commit c6b7e90 (1 parent: 183f195), committed by LVKinyanjui: Created a nice chatbot UI with Streamlit and gave it a chat history
Files changed:
- examples/Groq_Autogen.ipynb (+444)
- examples/Groq_Langchain.ipynb (+343)
- examples/coding/tmp_code_65e3424c19629fcfd124f7ea31ec17d8.py (+19)
- examples/data/sample.png (binary)
- examples/getting_started.py (+18)
- examples/vison.ipynb (+152)
- requirements.txt (+7)
- st_image_chat.py (+63)
- st_long_context_chat.py (+37)
examples/Groq_Autogen.ipynb
ADDED
@@ -0,0 +1,444 @@
<h1 align=center> GroQ + Autogen </h1>

`Autogen` is an LLM framework developed by Microsoft. It provides a variety of features that may be lacking in other frameworks, such as *multi-agent tasks*, *nested chats*, *code execution*, and so on. It would be interesting to see how agents powered by lightning-fast Groq perform with such a feature-rich framework. Our notebook follows this [guide](https://microsoft.github.io/autogen/0.2/docs/topics/non-openai-models/cloud-groq/).

In [1]:
```python
import os

api_key = os.getenv("GROQ_API_KEY")
if api_key is None:
    raise ValueError("Groq API key environment variable not set")
```

In [16]:
```python
config_list = [
    {
        "model": "llama3-8b-8192",
        "api_key": api_key,
        # "api_type": "groq",  # Removed as explained here https://stackoverflow.com/a/77560277
        "base_url": "https://api.groq.com/openai/v1",
        # "frequency_penalty": 0.5,
        # "max_tokens": 2048,
        # "presence_penalty": 0.2,
        # "seed": 42,
        # "temperature": 0.5,
        # "top_p": 0.2,
    }
]
```

In [17]:
```python
from pathlib import Path

from autogen import AssistantAgent, UserProxyAgent
from autogen.coding import LocalCommandLineCodeExecutor
```

In [18]:
```python
# Setting up the code executor
workdir = Path("coding")
workdir.mkdir(exist_ok=True)
code_executor = LocalCommandLineCodeExecutor(work_dir=workdir)
```

In [19]:
```python
# Setting up the agents

# The UserProxyAgent will execute the code that the AssistantAgent provides
user_proxy_agent = UserProxyAgent(
    name="User",
    code_execution_config={"executor": code_executor},
    is_termination_msg=lambda msg: "FINISH" in msg.get("content"),
)

system_message = """You are a helpful AI assistant who writes code and the user executes it.
Solve tasks using your coding and language skills.
In the following cases, suggest python code (in a python coding block) for the user to execute.
Solve the task step by step if you need to. If a plan is not provided, explain your plan first. Be clear which step uses code, and which step uses your language skill.
When using code, you must indicate the script type in the code block. The user cannot provide any other feedback or perform any other action beyond executing the code you suggest. The user can't modify your code. So do not suggest incomplete code which requires users to modify. Don't use a code block if it's not intended to be executed by the user.
Don't include multiple code blocks in one response. Do not ask users to copy and paste the result. Instead, use 'print' function for the output when relevant. Check the execution result returned by the user.
If the result indicates there is an error, fix the error and output the code again. Suggest the full code instead of partial code or code changes. If the error can't be fixed or if the task is not solved even after the code is executed successfully, analyze the problem, revisit your assumption, collect additional info you need, and think of a different approach to try.
When you find an answer, verify the answer carefully. Include verifiable evidence in your response if possible.
IMPORTANT: Wait for the user to execute your code and then you can reply with the word "FINISH". DO NOT OUTPUT "FINISH" after your code block."""

# The AssistantAgent, using Groq's model, will take the coding request and return code
assistant_agent = AssistantAgent(
    name="Groq Assistant",
    system_message=system_message,
    llm_config={"config_list": config_list},
)
```

In [21]:
```python
# Start the chat, with the UserProxyAgent asking the AssistantAgent the message
chat_result = user_proxy_agent.initiate_chat(
    assistant_agent,
    message="Provide code to count the number of prime numbers from 1 to 10000.",
)
```

Output:
````
User (to Groq Assistant):

Provide code to count the number of prime numbers from 1 to 10000.

--------------------------------------------------------------------------------
Groq Assistant (to User):

Here is a Python script to count the number of prime numbers from 1 to 10000:

```python
def is_prime(n):
    if n <= 1:
        return False
    if n == 2:
        return True
    if n % 2 == 0:
        return False
    max_divisor = int(n**0.5) + 1
    for d in range(3, max_divisor, 2):
        if n % d == 0:
            return False
    return True

count = 0
for num in range(1, 10001):
    if is_prime(num):
        count += 1

print("Number of prime numbers from 1 to 10000:", count)
```

This script defines a helper function `is_prime` to check if a number is prime, and then iterates over the range from 1 to 10000, incrementing a counter for each prime number found. Finally, it prints the total count of prime numbers.

--------------------------------------------------------------------------------
Provide feedback to Groq Assistant. Press enter to skip and use auto-reply, or type 'exit' to end the conversation:

>>>>>>>> NO HUMAN INPUT RECEIVED.
>>>>>>>> USING AUTO REPLY...
>>>>>>>> EXECUTING CODE BLOCK (inferred language is python)...
User (to Groq Assistant):

exitcode: 0 (execution succeeded)
Code output: Number of prime numbers from 1 to 10000: 1229

--------------------------------------------------------------------------------
Groq Assistant (to User):

FINISH

--------------------------------------------------------------------------------
Provide feedback to Groq Assistant. Press enter to skip and use auto-reply, or type 'exit' to end the conversation: exit
````

<h2 align=center> GroQ + Autogen + Tool Use </h2>

We push our knowledge further by trying out function calling.

In [33]:
```python
from autogen import AssistantAgent, UserProxyAgent
from typing import Literal
from typing_extensions import Annotated
import json
```

In [34]:
```python
config_list = [
    {
        "model": "llama3-groq-8b-8192-tool-use-preview",
        "base_url": "https://api.groq.com/openai/v1",
        "api_key": api_key,
    }
]
```

In [35]:
```python
# Create the agent for tool calling
chatbot = AssistantAgent(
    name="chatbot",
    system_message="""For currency exchange and weather forecasting tasks,
    only use the functions you have been provided with.
    Output 'HAVE FUN!' when an answer has been provided.""",
    llm_config={"config_list": config_list},
)

# Note that we have changed the termination string to be "HAVE FUN!"
user_proxy = UserProxyAgent(
    name="user_proxy",
    is_termination_msg=lambda x: x.get("content", "") and "HAVE FUN!" in x.get("content", ""),
    human_input_mode="NEVER",
    max_consecutive_auto_reply=1,
)
```

In [36]:
```python
# Currency Exchange function

CurrencySymbol = Literal["USD", "EUR"]

# Define our function that we expect to call


def exchange_rate(base_currency: CurrencySymbol, quote_currency: CurrencySymbol) -> float:
    if base_currency == quote_currency:
        return 1.0
    elif base_currency == "USD" and quote_currency == "EUR":
        return 1 / 1.1
    elif base_currency == "EUR" and quote_currency == "USD":
        return 1.1
    else:
        raise ValueError(f"Unknown currencies {base_currency}, {quote_currency}")


# Register the function with the agent


@user_proxy.register_for_execution()
@chatbot.register_for_llm(description="Currency exchange calculator.")
def currency_calculator(
    base_amount: Annotated[float, "Amount of currency in base_currency"],
    base_currency: Annotated[CurrencySymbol, "Base currency"] = "USD",
    quote_currency: Annotated[CurrencySymbol, "Quote currency"] = "EUR",
) -> str:
    quote_amount = exchange_rate(base_currency, quote_currency) * base_amount
    return f"{format(quote_amount, '.2f')} {quote_currency}"


# Weather function


# Example function to make available to model
def get_current_weather(location, unit="fahrenheit"):
    """Get the weather for some location"""
    if "chicago" in location.lower():
        return json.dumps({"location": "Chicago", "temperature": "13", "unit": unit})
    elif "san francisco" in location.lower():
        return json.dumps({"location": "San Francisco", "temperature": "55", "unit": unit})
    elif "new york" in location.lower():
        return json.dumps({"location": "New York", "temperature": "11", "unit": unit})
    else:
        return json.dumps({"location": location, "temperature": "unknown"})


# Register the function with the agent


@user_proxy.register_for_execution()
@chatbot.register_for_llm(description="Weather forecast for US cities.")
def weather_forecast(
    location: Annotated[str, "City name"],
) -> str:
    weather_details = get_current_weather(location=location)
    weather = json.loads(weather_details)
    return f"{weather['location']} will be {weather['temperature']} degrees {weather['unit']}"
```

In [37]:
```python
# start the conversation
res = user_proxy.initiate_chat(
    chatbot,
    message="What's the weather in New York and can you tell me how much is 123.45 EUR in USD so I can spend it on my holiday? Throw a few holiday tips in as well.",
    summary_method="reflection_with_llm",
)

print(f"LLM SUMMARY: {res.summary['content']}")
```

Output:
```
user_proxy (to chatbot):

What's the weather in New York and can you tell me how much is 123.45 EUR in USD so I can spend it on my holiday? Throw a few holiday tips in as well.

--------------------------------------------------------------------------------
chatbot (to user_proxy):

***** Suggested tool call (call_fxry): weather_forecast *****
Arguments:
{"location": "New York"}
*************************************************************
***** Suggested tool call (call_6xgr): currency_calculator *****
Arguments:
{"base_amount": 123.45, "base_currency": "EUR", "quote_currency": "USD"}
****************************************************************

--------------------------------------------------------------------------------

>>>>>>>> EXECUTING FUNCTION weather_forecast...

>>>>>>>> EXECUTING FUNCTION currency_calculator...
user_proxy (to chatbot):

user_proxy (to chatbot):

***** Response from calling tool (call_fxry) *****
New York will be 11 degrees fahrenheit
**************************************************

--------------------------------------------------------------------------------
user_proxy (to chatbot):

***** Response from calling tool (call_6xgr) *****
135.80 USD
**************************************************

--------------------------------------------------------------------------------
chatbot (to user_proxy):

***** Suggested tool call (call_ydzn): weather_forecast *****
Arguments:
{"location":"New York"}
*************************************************************

--------------------------------------------------------------------------------

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
Cell In[37], line 8
      1 # start the conversation
      2 res = user_proxy.initiate_chat(
      3     chatbot,
      4     message="What's the weather in New York and can you tell me how much is 123.45 EUR in USD so I can spend it on my holiday? Throw a few holiday tips in as well.",
      5     summary_method="reflection_with_llm",
      6 )
----> 8 print(f"LLM SUMMARY: {res.summary['content']}")

TypeError: string indices must be integers
```
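The traceback comes from the final `print`: with `summary_method="reflection_with_llm"`, this autogen version returns `res.summary` as a plain string, so indexing it with `['content']` raises the `TypeError`. A minimal fix, as a sketch (the summary's exact shape can vary across autogen releases):

```python
# res.summary is a plain string here, not a message dict, so print it directly.
# Guard for both shapes in case another autogen version returns a dict.
summary = res.summary
print(f"LLM SUMMARY: {summary['content'] if isinstance(summary, dict) else summary}")
```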
Notebook kernel: Python 3 (ipykernel), Python 3.10.12
examples/Groq_Langchain.ipynb
ADDED
@@ -0,0 +1,343 @@
<h1 align=center> GroQ </h1>

Groq is a lightning-fast inference API for large language models. Here we intend to explore some of its capabilities and gauge the performance of its various hosted models on different tasks.

[<h2 align=center> GroQ + Langchain </h2>](https://python.langchain.com/docs/integrations/chat/groq/)

It is often useful to combine such an API with an LLM framework that allows us to perform advanced operations. Langchain is one of the more ubiquitous ones, so we will turn to that.

In [6]:
```python
import os

groq_api_key = os.getenv("GROQ_API_KEY")
if groq_api_key is None:
    raise ValueError("Groq API key environment variable not set")
```

In [2]:
```python
from langchain_groq import ChatGroq
```

In [3]:
```python
llm = ChatGroq(
    model="llama3-8b-8192",
    temperature=0,
    max_tokens=None,
    timeout=None,
    max_retries=2,
)
```

In [4]:
```python
messages = [
    (
        "system",
        "You are a helpful assistant that translates English to French. Translate the user sentence.",
    ),
    ("human", "I love programming."),
]
ai_msg = llm.invoke(messages)
ai_msg
```

Out[4]:
```
AIMessage(content='Je adore le programmation.', additional_kwargs={}, response_metadata={'token_usage': {'completion_tokens': 7, 'prompt_tokens': 35, 'total_tokens': 42, 'completion_time': 0.005833333, 'prompt_time': 0.004470233, 'queue_time': 0.009797347000000001, 'total_time': 0.010303566}, 'model_name': 'llama3-8b-8192', 'system_fingerprint': 'fp_6a6771ae9c', 'finish_reason': 'stop', 'logprobs': None}, id='run-d6fc7095-9f72-405a-ad18-3148da303a81-0', usage_metadata={'input_tokens': 35, 'output_tokens': 7, 'total_tokens': 42})
```

In [5]:
```python
from langchain_core.prompts import ChatPromptTemplate

prompt = ChatPromptTemplate.from_messages(
    [
        (
            "system",
            "You are a helpful assistant that translates {input_language} to {output_language}.",
        ),
        ("human", "{input}"),
    ]
)

chain = prompt | llm
chain.invoke(
    {
        "input_language": "English",
        "output_language": "German",
        "input": "I love programming.",
    }
)
```

Out[5]:
```
AIMessage(content='Ich liebe Programmierung!\n\nTranslation: I love programming.', additional_kwargs={}, response_metadata={'token_usage': {'completion_tokens': 12, 'prompt_tokens': 30, 'total_tokens': 42, 'completion_time': 0.01, 'prompt_time': 0.001169453, 'queue_time': 0.012058078, 'total_time': 0.011169453}, 'model_name': 'llama3-8b-8192', 'system_fingerprint': 'fp_179b0f92c9', 'finish_reason': 'stop', 'logprobs': None}, id='run-8f6b3c1a-e9d1-41de-b5c4-2659214b9bd5-0', usage_metadata={'input_tokens': 30, 'output_tokens': 12, 'total_tokens': 42})
```

<h2 align=center> GroQ + Langchain + Tool Use </h2>

Here we test the capabilities of tool-enabled models hosted by Groq (such as **llama3-groq-8b-8192-tool-use-preview**) to perform *function calling*. This involves the LLM returning JSON formatted in a particular way so that it can be used to call functions with arguments.
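For reference, a tool call in an OpenAI-compatible chat response has roughly the shape sketched below. The values are hypothetical (the `id` and arguments are illustrative, not captured from a run); the key point is that arguments arrive as a JSON-encoded string which the client decodes and dispatches to the matching Python function:

```python
import json

# Illustrative shape of an OpenAI-compatible tool call (hypothetical values)
tool_call = {
    "id": "call_abc123",
    "type": "function",
    "function": {
        "name": "calculate_square",
        "arguments": "{\"number\": \"7\"}",  # arguments are a JSON-encoded string
    },
}

args = json.loads(tool_call["function"]["arguments"])
print(args)  # {'number': '7'}
```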
In [9]:
```python
from langchain.agents import Tool, AgentExecutor, create_react_agent
from langchain_core.prompts import ChatPromptTemplate
from langchain_groq import ChatGroq
```

In [15]:
```python
# Create the LLM
llm = ChatGroq(
    model="llama3-groq-70b-8192-tool-use-preview",
    temperature=0,
    max_tokens=None,
    timeout=None,
    max_retries=2,
)
```

In [16]:
```python
def calculate_square(number: str) -> str:
    """Calculates the square of a given number"""
    try:
        num = float(number)
        return str(num * num)
    except ValueError:
        return "Please provide a valid number"

# Define the tool
tools = [
    Tool(
        name="Calculator",
        func=calculate_square,
        description="Useful for calculating the square of a number. Input should be a single number."
    )
]
```

In [17]:
```python
# Create the prompt template
prompt = ChatPromptTemplate.from_messages([
    ("system", """You are a helpful assistant that can use tools to solve problems.

    Available tools:
    {tools}

    {agent_scratchpad}

    Use the following format:
    Question: The user question you must answer
    Thought: Your thought process about what to do
    Action: The action to take, should be one of [{tool_names}]
    Action Input: The input to the action
    Observation: The result of the action
    ... (this Thought/Action/Action Input/Observation can repeat N times)
    Thought: I now know the final answer
    Final Answer: The final answer to the original question
    """),
    ("human", "{input}")
])

# Create the agent
agent = create_react_agent(llm, tools, prompt)

# Create the agent executor
agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)
```

In [18]:
```python
result = agent_executor.invoke({"input": "What is the square of 7?"}, handle_parsing_errors=True)
print("\nFinal Result:", result["output"])
```

Output:
```
> Entering new AgentExecutor chain...
Question: What is the square of 7?
Thought: I can use the Calculator tool to find the square of a number.
Action: Calculator
Action Input: 7
49.0
[... the same Question/Thought/Action/Action Input cycle repeats, 15 times in total, without ever producing a Final Answer ...]

> Finished chain.

Final Result: Agent stopped due to iteration limit or time limit.
```

<h3 align=center> Results </h3>

We managed to get tool use working with the model `llama3-groq-70b-8192-tool-use-preview`. This is a much bigger model than `llama3-groq-8b-8192-tool-use-preview`, which simply output that it did not have the capability to perform such a task (causing the program to error out).
The choice of model clearly remains a primary determinant of whether function calling works or not. In a sense, this creates a single point of failure for such systems.
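The trace above also shows the agent re-issuing the same Calculator call without ever emitting a `Final Answer`, so the executor runs until its default iteration limit. One way to tame this, as a sketch reusing the `agent` and `tools` defined above, is to cap the iterations and stop gracefully rather than looping:

```python
# Cap the ReAct loop so the executor stops after a few steps and returns
# a graceful message instead of spinning on the same action.
agent_executor = AgentExecutor(
    agent=agent,
    tools=tools,
    verbose=True,
    max_iterations=3,
    early_stopping_method="force",   # return a stopped-early message
    handle_parsing_errors=True,      # feed malformed output back to the model
)
```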
Notebook kernel: groq, Python 3.10.12
examples/coding/tmp_code_65e3424c19629fcfd124f7ea31ec17d8.py
ADDED
@@ -0,0 +1,19 @@
```python
def is_prime(n):
    if n <= 1:
        return False
    if n == 2:
        return True
    if n % 2 == 0:
        return False
    max_divisor = int(n**0.5) + 1
    for d in range(3, max_divisor, 2):
        if n % d == 0:
            return False
    return True

count = 0
for num in range(1, 10001):
    if is_prime(num):
        count += 1

print("Number of prime numbers from 1 to 10000:", count)
```
examples/data/sample.png
ADDED
examples/getting_started.py
ADDED
@@ -0,0 +1,18 @@
```python
import os
from groq import Groq

client = Groq(
    api_key=os.environ.get("GROQ_API_KEY"),
)

chat_completion = client.chat.completions.create(
    messages=[
        {
            "role": "user",
            "content": "Explain the importance of fast language models",
        }
    ],
    model="llama3-8b-8192",
)

print(chat_completion.choices[0].message.content)
```
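The same client also supports token streaming, which the Streamlit apps in this commit rely on; a minimal sketch of that pattern:

```python
# Streaming variant of the call above: chunks carry incremental content deltas.
stream = client.chat.completions.create(
    messages=[{"role": "user", "content": "Explain the importance of fast language models"}],
    model="llama3-8b-8192",
    stream=True,
)
for chunk in stream:
    content = chunk.choices[0].delta.content
    if content:
        print(content, end="", flush=True)
```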
examples/vison.ipynb
ADDED
@@ -0,0 +1,152 @@
<h1 align=center> Vision Agents </h1>

The goal of this notebook is to use a visual model to create agentic systems that work to produce visual content that they can (hopefully) iteratively refine.

[<h2 align=center> Groq + Llama Vision </h2>](https://console.groq.com/docs/vision)

In [1]:
```python
from groq import Groq

client = Groq()
completion = client.chat.completions.create(
    model="llama-3.2-11b-vision-preview",
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "What's in this image?"
                },
                {
                    "type": "image_url",
                    "image_url": {
                        "url": "https://upload.wikimedia.org/wikipedia/commons/f/f2/LPU-v1-die.jpg"
                    }
                }
            ]
        }
    ],
    temperature=1,
    max_tokens=1024,
    top_p=1,
    stream=False,
    stop=None,
)

print(completion.choices[0].message)
```

Output:
```
ChatCompletionMessage(content="The image depicts a computer chip, which is a small, rectangular piece of technology that contains a large number of microscopic components. It is typically made of silicon and is used to control and process information in a computer system. The image shows the back of the chip, which has a grid-like pattern of tiny lines and rectangles. These lines and rectangles are the individual components that make up the chip's circuitry.", role='assistant', function_call=None, tool_calls=None)
```

In [4]:
```python
from groq import Groq
import base64

# Function to encode the image
def encode_image(image_path):
    with open(image_path, "rb") as image_file:
        return base64.b64encode(image_file.read()).decode('utf-8')

# Path to your image
image_path = "examples/data/sample.png"

# Getting the base64 string
base64_image = encode_image(image_path)

client = Groq()

chat_completion = client.chat.completions.create(
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What's in this image?"},
                {
                    "type": "image_url",
                    "image_url": {
                        "url": f"data:image/jpeg;base64,{base64_image}",
                    },
                },
            ],
        }
    ],
    model="llama-3.2-11b-vision-preview",
)

print(chat_completion.choices[0].message.content)
```

Output:
```
This image shows a screenshot of YouTube video players and various programming screenshots and programming text that reads (translated) 'The Simplest Tech Stack' at the top, saying things like 'Go let's go ahead and do that', 'Under the templates directory' and code, which suggests a tutorial on code. For example, the yellow marker picks out parts of the image related to going ahead and doing something.

The video players show mostly black screens with various videos' comments in white text. These include HTML/XML code under the title in the same screenshot as the title is shown at the top-left of the screenshot. White icons arranged in vertical rows are in the right and left margins, and the bars of some videos are visible at the bottom of the screenshot. This appears to be a screenshot from a tutorial that helps viewers build their own website.
```
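Note that the data URL above declares `image/jpeg` while `sample.png` is a PNG. The endpoint accepted it here, but `data:image/png;base64,{base64_image}` would be the matching MIME type.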
Notebook kernel: groq, Python 3.10.12
requirements.txt
ADDED
@@ -0,0 +1,7 @@
```
groq==0.11.0
langchain==0.3.7
langchain-core==0.3.15
langchain-groq==0.2.1
langchain-text-splitters==0.3.2
autogen==0.3.1
streamlit==1.40.0
```
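The pinned dependencies can be installed with `pip install -r requirements.txt`.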
st_image_chat.py
ADDED
@@ -0,0 +1,63 @@
```python
import streamlit as st
from groq import Groq
import os

# Initialize Groq client
client = Groq(
    api_key=os.environ.get("GROQ_API_KEY"),
)

# Initialize session state for chat history if it doesn't exist
if 'messages' not in st.session_state:
    st.session_state.messages = [
        {"role": "system", "content": "You are a helpful assistant"}
    ]

def generate_response(messages):
    stream = client.chat.completions.create(
        model="llama-3.2-3b-preview",  # 128K model
        messages=messages,
        temperature=0.1,
        top_p=1,
        stream=True,
        stop=None,
    )

    for chunk in stream:
        content = chunk.choices[0].delta.content
        if content:
            yield content  # Yield content for streaming

st.title("Groq API Response Streaming")

# Display chat history
for message in st.session_state.messages:
    if message["role"] != "system":
        with st.chat_message(message["role"]):
            st.markdown(message["content"])

# Get user input
user_input = st.chat_input('Message to Assistant...', key='prompt_input')

if user_input:
    # Add user message to chat history
    st.session_state.messages.append({"role": "user", "content": user_input})

    # Display user message
    with st.chat_message("user"):
        st.markdown(user_input)

    # Generate and display assistant response
    with st.chat_message("assistant"):
        response_placeholder = st.empty()
        full_response = ""

        # Stream the response
        with st.spinner("Generating response..."):
            for content in generate_response(st.session_state.messages):
                full_response += content
                response_placeholder.markdown(full_response + "▌")
            response_placeholder.markdown(full_response)

    # Add assistant response to chat history
    st.session_state.messages.append({"role": "assistant", "content": full_response})
```
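With `GROQ_API_KEY` exported, the app can be launched locally with `streamlit run st_image_chat.py`. Despite its filename, this script is a text-only chat with history; the image examples live in `examples/vison.ipynb`.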
st_long_context_chat.py
ADDED
@@ -0,0 +1,37 @@
```python
# CREDITS
# https://gist.github.com/truevis/f31706b8af60e8c73d62b281bddb988f

import streamlit as st
from groq import Groq

import os
client = Groq(
    api_key=os.environ.get("GROQ_API_KEY"),
)

def generate_response(user_input):
    stream = client.chat.completions.create(
        model="llama-3.2-3b-preview",  # 128K model
        messages=[
            {"role": "system", "content": "You are a helpful assistant"},
            {"role": "user", "content": user_input},
        ],
        temperature=0.1,
        # max_tokens=128000,
        top_p=1,
        stream=True,
        stop=None,
    )

    for chunk in stream:
        content = chunk.choices[0].delta.content
        if content:
            yield content  # Yield content for streaming

st.title("Groq API Response Streaming")
user_input = st.chat_input('Message to Assistant...', key='prompt_input')
if user_input:  # Get user input
    with st.spinner("Generating response..."):
        st.write_stream(generate_response(user_input))  # Use st.write_stream to display streamed content
    st.markdown("Message: " + user_input)
    st.markdown("---")  # Add a separator after the response
```
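A note on the commented-out `max_tokens=128000`: in OpenAI-compatible APIs, `max_tokens` caps the length of the generated completion, not the context window, so the "128K model" comment refers to the model's context size rather than a sensible completion cap.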