ValiantLabs
/

Llama3.1-8B-Enigma

sequelbox commited on Sep 4

Commit

5ab53f9

•

1 Parent(s): 5e49be0

46c59c884b6859208f322326f73ca9d97d08eda4563d679b6fe3c5212285aff6

Files changed (4) hide show

README.md CHANGED Viewed

@@ -23,7 +23,7 @@ tags:
 base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
 datasets:
 - sequelbox/Tachibana
-- LDJnr/Pure-Dove
 model_type: llama
 license: llama3.1
 ---
@@ -32,11 +32,12 @@ license: llama3.1
 Enigma is a code-instruct model built on Llama 3.1 8b.
 - High quality code instruct performance within the Llama 3 Instruct chat format
 - Finetuned on synthetic code-instruct data generated with Llama 3.1 405b. [Find the current version of the dataset here!](https://huggingface.co/datasets/sequelbox/Tachibana)
 ## Version
-This is the **2024-08-10** release of Enigma for Llama 3.1 8b.
 Help us and recommend Enigma to your friends! We're excited for more Enigma releases in the future.
@@ -73,9 +74,9 @@ print(outputs[0]["generated_text"][-1])
 ```
 ## The Model
-Enigma is built on top of Llama 3.1 8b Instruct, using code-instruct data to supplement code-instruct performance using Llama 3.1 Instruct prompt style.
-Our current version of the Enigma code-instruct dataset is [sequelbox/Tachibana](https://huggingface.co/datasets/sequelbox/Tachibana), supplemented with a small selection of data from [LDJnr/Pure-Dove](https://huggingface.co/datasets/LDJnr/Pure-Dove) for general chat consistency.
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)

 base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
 datasets:
 - sequelbox/Tachibana
+- sequelbox/Supernova
 model_type: llama
 license: llama3.1
 ---
 Enigma is a code-instruct model built on Llama 3.1 8b.
 - High quality code instruct performance within the Llama 3 Instruct chat format
 - Finetuned on synthetic code-instruct data generated with Llama 3.1 405b. [Find the current version of the dataset here!](https://huggingface.co/datasets/sequelbox/Tachibana)
+- Overall chat performance supplemented with [generalist synthetic data.](https://huggingface.co/datasets/sequelbox/Supernova)
 ## Version
+This is the **2024-09-04** release of Enigma for Llama 3.1 8b, enhancing code-instruct and general chat capabilities.
 Help us and recommend Enigma to your friends! We're excited for more Enigma releases in the future.
 ```
 ## The Model
+Enigma is built on top of Llama 3.1 8b Instruct, using high quality code-instruct data and general chat data in Llama 3.1 Instruct prompt style to supplement overall performance.
+Our current version of Enigma is trained on code-instruct data from [sequelbox/Tachibana](https://huggingface.co/datasets/sequelbox/Tachibana) and general chat data from [sequelbox/Supernova.](https://huggingface.co/datasets/sequelbox/Supernova)
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)

config.json CHANGED Viewed

@@ -33,7 +33,7 @@
   "rope_theta": 500000.0,
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
-  "transformers_version": "4.44.0",
   "use_cache": true,
   "vocab_size": 128256
 }

   "rope_theta": 500000.0,
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
+  "transformers_version": "4.44.2",
   "use_cache": true,
   "vocab_size": 128256
 }

generation_config.json CHANGED Viewed

@@ -8,5 +8,5 @@
   ],
   "temperature": 0.6,
   "top_p": 0.9,
-  "transformers_version": "4.44.0"
 }

   ],
   "temperature": 0.6,
   "top_p": 0.9,
+  "transformers_version": "4.44.2"
 }

tokenizer.json CHANGED Viewed

@@ -1,11 +1,6 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 5450,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": null,
   "padding": null,
   "added_tokens": [
     {