NexaAIDev
/

octopus-v4-gguf

function calling

on-device language model

Model card Files Files and versions Community

Zack Zhiyuan Li commited on May 5

Commit

ad56bbf

•

1 Parent(s): c103f91

wip

Files changed (1) hide show

README.md +22 -10

README.md CHANGED Viewed

@@ -2,22 +2,36 @@
 language:
   - en
 license: apache-2.0
-model_name: Octopus-V2-2B
 base_model: NexaAIDev/Octopus-v4
 inference: false
 model_creator: NexaAIDev
-quantized_by: Second State Inc.
 tags:
   - function calling
   - on-device language model
-  - android
 ---
-# Octopus-v4-GGUF
-## Original Model
-[NexaAIDev/Octopus-v4](https://huggingface.co/NexaAIDev/Octopus-v4)
 ## Run with [Ollama](https://github.com/ollama/ollama)
@@ -30,10 +44,10 @@ Input example:
 ```json
 Query: Tell me the result of derivative of x^3 when x is 2?
-# <nexa_4> represents the math gpt.
 Response: <nexa_4> ('Determine the derivative of the function f(x) = x^3 at the point where x equals 2, and interpret the result within the context of rate of change and tangent slope.')<nexa_end>
 ```
 ### Dataset and Benchmark
@@ -65,6 +79,4 @@ Response: <nexa_4> ('Determine the derivative of the function f(x) = x^3 at the
 | Octopus-v4-Q8_0.gguf   | Q8_0         | 8    | 3.78 GB | 50.10                  | very large, good quality                  |
 | Octopus-v4-f16.gguf    | f16          | 16   | 7.20 GB | 30.61                  | extremely large                           |
-_Quantized with llama.cpp_

 language:
   - en
 license: apache-2.0
+model_name: Octopus-V4-GGUF
 base_model: NexaAIDev/Octopus-v4
 inference: false
 model_creator: NexaAIDev
+quantized_by: Nexa AI, Inc.
 tags:
   - function calling
   - on-device language model
+  - gguf
+  - llama cpp
 ---
+# Octopus V4-GGUF: Graph of language models
+<p align="center">
+- <a href="https://huggingface.co/NexaAIDev/Octopus-v4" target="_blank">Original Model</a>
+- <a href="https://www.nexa4ai.com/" target="_blank">Nexa AI Website</a>
+- <a href="https://github.com/NexaAI/octopus-v4" target="_blank">Octopus-v4 Github</a>
+- <a href="https://arxiv.org/abs/2404.19296" target="_blank">ArXiv</a>
+- <a href="https://huggingface.co/spaces/NexaAIDev/domain_llm_leaderboard" target="_blank">Domain LLM Leaderbaord</a>
+</p>
+<p align="center" width="100%">
+  <a><img src="octopus-v4-logo.png" alt="nexa-octopus" style="width: 40%; min-width: 300px; display: block; margin: auto;"></a>
+</p>
+**Acknowledgement**:
+We sincerely thank our community members, [ThunderBeee](https://huggingface.co/ThunderBeee) and [ZY6](https://huggingface.co/ZY6), for their extraordinary contributions to this quantization effort. Please explore [Octopus-v4](https://huggingface.co/NexaAIDev/Octopus-v4) for our original huggingface model.
 ## Run with [Ollama](https://github.com/ollama/ollama)
 ```json
 Query: Tell me the result of derivative of x^3 when x is 2?
 Response: <nexa_4> ('Determine the derivative of the function f(x) = x^3 at the point where x equals 2, and interpret the result within the context of rate of change and tangent slope.')<nexa_end>
 ```
+Note that `<nexa_4>` represents the math gpt.
 ### Dataset and Benchmark
 | Octopus-v4-Q8_0.gguf   | Q8_0         | 8    | 3.78 GB | 50.10                  | very large, good quality                  |
 | Octopus-v4-f16.gguf    | f16          | 16   | 7.20 GB | 30.61                  | extremely large                           |
+_Quantized with llama.cpp_