limcheekin committed
Commit a5c9525
1 Parent(s): 58a8533

feat: added chat_format and update Q6_K model

Files changed (4):
  1. Dockerfile +1 -1
  2. README.md +2 -2
  3. index.html +2 -2
  4. main.py +2 -1
Dockerfile CHANGED
@@ -15,7 +15,7 @@ RUN pip install -U pip setuptools wheel && \
 
 # Download model
 RUN mkdir model && \
-    curl -L https://huggingface.co/TheBloke/rocket-3B-GGUF/resolve/main/rocket-3b.Q8_0.gguf -o model/gguf-model.bin
+    curl -L https://huggingface.co/TheBloke/rocket-3B-GGUF/resolve/main/rocket-3b.Q6_K.gguf -o model/gguf-model.bin
 
 COPY ./start_server.sh ./
 COPY ./main.py ./
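Swapping the baked-in model from Q8_0 to Q6_K mainly trades a small amount of quantization accuracy for a smaller image. A rough back-of-envelope sketch of the expected download sizes, assuming ~2.8B parameters for Rocket-3B and the approximate effective bits-per-weight of the two llama.cpp quant types (these figures are assumptions, not measured file sizes):

```python
# Rough size estimate for the two quantization schemes.
# PARAMS and the bits-per-weight values are approximations, not exact numbers.
PARAMS = 2.8e9  # Rocket-3B parameter count (approximate)
BITS = {"Q8_0": 8.5, "Q6_K": 6.56}  # approximate effective bits per weight

for name, bpw in BITS.items():
    gib = PARAMS * bpw / 8 / 2**30  # bytes -> GiB
    print(f"{name}: ~{gib:.2f} GiB")
```

On this estimate Q6_K shaves well over half a GiB off the model download while staying close to Q8_0 in quality, which is presumably the motivation for the swap.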
README.md CHANGED
@@ -1,5 +1,5 @@
 ---
-title: rocket-3B-GGUF (Q8_0)
+title: rocket-3B-GGUF (Q6_K)
 colorFrom: purple
 colorTo: blue
 sdk: docker
@@ -15,6 +15,6 @@ tags:
 pinned: false
 ---
 
-# rocket-3B-GGUF (Q8_0)
+# rocket-3B-GGUF (Q6_K)
 
 Please refer to the [index.html](index.html) for more information.
index.html CHANGED
@@ -1,10 +1,10 @@
 <!DOCTYPE html>
 <html>
 <head>
-  <title>rocket-3B-GGUF (Q8_0)</title>
+  <title>rocket-3B-GGUF (Q6_K)</title>
 </head>
 <body>
-  <h1>rocket-3B-GGUF (Q8_0)</h1>
+  <h1>rocket-3B-GGUF (Q6_K)</h1>
 <p>
 With the utilization of the
 <a href="https://github.com/abetlen/llama-cpp-python">llama-cpp-python</a>
main.py CHANGED
@@ -6,7 +6,8 @@ app = create_app(
     Settings(
         n_threads=2,  # set to number of cpu cores
         model="model/gguf-model.bin",
-        embedding=True
+        embedding=True,
+        chat_format="chatml"
     )
 )
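The new `chat_format="chatml"` setting tells llama-cpp-python to wrap incoming chat messages in ChatML delimiters before prompting the model. A minimal sketch of what that layout looks like (this is an illustrative helper, not the library's actual implementation):

```python
# Illustrative sketch of the ChatML prompt layout that chat_format="chatml"
# produces; the real formatting lives inside llama-cpp-python.
def to_chatml(messages):
    """Render OpenAI-style messages into the ChatML prompt layout."""
    parts = [
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>"
        for m in messages
    ]
    parts.append("<|im_start|>assistant\n")  # cue the model to respond
    return "\n".join(parts)

prompt = to_chatml([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
])
print(prompt)
```

Without a matching `chat_format`, the server would fall back to a default prompt template, which can noticeably degrade output from a model fine-tuned on ChatML-style conversations.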