Adding 15M, 42M & 110M models

Browse files

Files changed (5) hide show

README.md +21 -15
stories110Mtok32000.gguf +3 -0
stories15Mtok4096.gguf +3 -0
stories42Mtok32000.gguf +3 -0
stories42Mtok4096.gguf +3 -0

README.md CHANGED Viewed

@@ -12,7 +12,18 @@ language:
 - The models were created with the training procedure outlined in [karpathy/llama2.c](https://github.com/karpathy/llama2.c)
 - You can run them local too, as described in [karpathy/llama2.c](https://github.com/karpathy/llama2.c)
-## Setup git
 See: [Getting Started: set-up](https://huggingface.co/docs/hub/repositories-getting-started#set-up)
@@ -22,26 +33,20 @@ pip install huggingface-hub
 git clone <this-repo>
 cd <this-repo>
 git lfs install
-git lfs track "*.gguf"
 huggingface-cli lfs-enable-largefiles .
-# add & push as usual with git
 git add <file-name>
 git commit -m "Adding <file-name>"
 git push -u origin main
 ```
-## TinyStories models
-| model | notes |
-|-------|-------|
-| stories260Ktok512.guff       | Use this for development & debugging |
-| stories15Mtok4096.guff       | Fits in canister & works well ! |
-| stories42Mtok4096.guff       | As of April 28, hits instruction limit of canister |
-| stories42Mtok32000.guff  (*) | As of April 28, hits instruction limit of canister |
-| stories110Mtok32000.guff (*) | As of April 28, hits instruction limit of canister |
 We used [convert-llama2c-to-ggml](https://github.com/ggerganov/llama.cpp/tree/32c8486e1f0297393cb22ac0a0d26a6b17ad4d54/examples/convert-llama2c-to-ggml) to convert the llama2.c model+tokenizer to llama.cpp gguf format.
 - Good read: [lama : add support for llama2.c models](https://github.com/ggerganov/llama.cpp/issues/2379)
@@ -64,8 +69,9 @@ convert-llama2c-to-ggml --llama2c-model stories42Mtok32000.bin --copy-vocab-from
 main -m stories15Mtok4096.gguf -p "Joe loves writing stories" -n 600 -c 128
 # Quantization
-#
 ```
-(*) Files with asterix behind them were not trained by us, but simply copied from [karpathy/tinyllamas](https://huggingface.co/karpathy/tinyllamas/tree/main) and renamed. We are providing them here under a different name for clarity and ease-of-access.

 - The models were created with the training procedure outlined in [karpathy/llama2.c](https://github.com/karpathy/llama2.c)
 - You can run them local too, as described in [karpathy/llama2.c](https://github.com/karpathy/llama2.c)
+## TinyStories models
+| model | notes |
+|-------|-------|
+| stories260Ktok512.guff       | Use this for development & debugging |
+| stories15Mtok4096.guff       | Fits in canister & works well ! |
+| stories42Mtok4096.guff       | As of April 28, hits instruction limit of canister |
+| stories42Mtok32000.guff   | As of April 28, hits instruction limit of canister |
+| stories110Mtok32000.guff  | As of April 28, hits instruction limit of canister |
+## Setup local git with lfs
 See: [Getting Started: set-up](https://huggingface.co/docs/hub/repositories-getting-started#set-up)
 git clone <this-repo>
 cd <this-repo>
+# configure lfs for local repo
 git lfs install
 huggingface-cli lfs-enable-largefiles .
+# tell lfs what files to track (.gitattributes)
+git lfs track "*.gguf"
+# add, commit & push as usual with git
 git add <file-name>
 git commit -m "Adding <file-name>"
 git push -u origin main
 ```
+## Model creation
 We used [convert-llama2c-to-ggml](https://github.com/ggerganov/llama.cpp/tree/32c8486e1f0297393cb22ac0a0d26a6b17ad4d54/examples/convert-llama2c-to-ggml) to convert the llama2.c model+tokenizer to llama.cpp gguf format.
 - Good read: [lama : add support for llama2.c models](https://github.com/ggerganov/llama.cpp/issues/2379)
 main -m stories15Mtok4096.gguf -p "Joe loves writing stories" -n 600 -c 128
 # Quantization
+# TODO
 ```

stories110Mtok32000.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1e92613ae001f20c05b690a7cb5044058b6f605457c0f0bbb8b5f61e9cc62630
+size 537153728

stories15Mtok4096.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e1d6eb48fa458b38268d484bb667c8b7b25d72f5befda5ea58d7c6d6c9cf7da2
+size 33436768

stories42Mtok32000.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c2d4c26604cebc4d76aaa83874af9cb3b012145494c9d9b28c2ba7d1b7952cf9
+size 233022592

stories42Mtok4096.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b6957cb15eaa94dc7b5d4b6a4badbcd1acb8bd5c7e79bd6907924f2e1995845e
+size 118097408